Scipy correlate: how to change data-point lags into time lags? - Цифровое Кемерово


Posted by Anonymous » 06 Mar 2024, 11:21

I have a problem with the correlation of two light curves in my bachelor thesis. I use scipy.signal.correlate to calculate the correlation. The two light curves have different numbers of data points and cover different times. I think the first one has data between 2020 and 2020 and the second one before 2020. So I created data frames with pandas, merged both curves into one frame and filled every "NaN" entry with zero. I correlated and normalized the curves and put the lags and the correlation into one plot, which looks like it could be correct.

Now my problem is that the lags seem to be data-point lags, not time lags. They tell me by how many data points I have to shift one curve to best match the other, not by how many days. I found a conversion, but it needs the "slab rate" (sampling rate?), which I don't have because my data points are not evenly spaced. I found some other modules like Stingray, but it needs bins of equal size, and I think I can't rebin because that would lose data.

My idea was to subtract the time of the point with the biggest lag (would be 312) from every point, but then I would have only 534 data points while the correlation gives me 1067 (I still don't know why the correlation doubles the points...). I am running out of ideas. I think the following is the important code block:
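On the two numeric puzzles above: with mode 'full', scipy.signal.correlate returns len(a) + len(b) - 1 values, one for each possible shift, so two series of 534 points give 534 + 534 - 1 = 1067 values. The lags from scipy.signal.correlation_lags are shifts in samples, and multiplying them by a step size only yields days if both series sit on the same evenly spaced grid. The following is a minimal sketch of that idea, assuming that linear interpolation onto a common grid is acceptable for the data. The helper name `time_lag_ccf`, the step `dt` and the synthetic light curves are illustrative assumptions, not part of the original post.

```python
import numpy as np
from scipy import signal


def time_lag_ccf(t1, y1, t2, y2, dt):
    """Cross-correlate two unevenly sampled curves after resampling.

    Both curves are linearly interpolated onto one uniform grid with
    step dt (an assumed resampling choice), so that a lag of k samples
    corresponds to k * dt time units. t1 and t2 must be sorted.
    """
    # Uniform grid restricted to the time range covered by both curves.
    start = max(t1.min(), t2.min())
    stop = min(t1.max(), t2.max())
    grid = np.arange(start, stop + dt / 2, dt)

    a = np.interp(grid, t1, y1)
    b = np.interp(grid, t2, y2)

    # Same normalization as in the post: values behave like coefficients.
    a = (a - a.mean()) / (a.std() * len(a))
    b = (b - b.mean()) / b.std()

    ccf = signal.correlate(a, b, mode="full")
    lags = signal.correlation_lags(len(a), len(b), mode="full")
    return lags * dt, ccf  # lags are now in the units of t1 and t2


# Synthetic example (hypothetical data): curve 1 is curve 2 delayed by 5 days.
rng = np.random.default_rng(0)
t1 = np.sort(rng.uniform(0, 300, 400))
t2 = np.sort(rng.uniform(0, 300, 350))


def flux(t):
    return np.sin(2 * np.pi * t / 37) + 0.5 * np.sin(2 * np.pi * t / 11)


y1 = flux(t1 - 5.0)
y2 = flux(t2)

lags_days, ccf = time_lag_ccf(t1, y1, t2, y2, dt=1.0)
best_lag_days = lags_days[np.argmax(ccf)]
print("lag with highest correlation (days):", best_lag_days)
```

With this sign convention a positive lag means the first curve is delayed relative to the second. Whether interpolation is appropriate depends on the gaps in the real data, since long gaps get filled with straight lines.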

import numpy as np
import pandas as pd
import matplotlib.pyplot as plt
from scipy import signal

# ztf_lc and fermi_lc are data frames loaded earlier (not shown).


def ccf_values(series1, series2):
    # Normalize both series, then compute the full cross-correlation.
    p = series1
    q = series2
    p = (p - np.mean(p)) / (np.std(p) * len(p))
    q = (q - np.mean(q)) / (np.std(q))
    c = signal.correlate(p, q, 'full')
    return c


#fermi_df=merged_df[merged_df['fermi_data']!='nan']
#print(fermi_df)
#ztf_lc.set_index('filter', inplace=True)
#test = pd.DataFrame({'r-data': [ztf_lc.loc['ZTF_r', 'fluxtot']]})
ztf_r_frame = ztf_lc[ztf_lc['filter'] == 'ZTF_r']
#ccf_ielts = ccf_values(fermi_lc['y'], test.iloc[:,9])

ztf_zeit = ztf_r_frame['mjd']
fermi_zeit = fermi_lc.iloc[:, 0]
ztf_df = pd.DataFrame({'Time': ztf_zeit, 'ztf_data': ztf_r_frame['mjd']})
fermi_df = pd.DataFrame({'Time': fermi_zeit, 'fermi_data': fermi_lc['y']})

# Merge both curves on time and fill the gaps with zero.
merged_df = pd.merge(ztf_df, fermi_df, on='Time', how='outer')
merged_df.sort_values(by='Time', inplace=True)
merged_df['ztf_data'] = merged_df['ztf_data'].fillna(0)
merged_df['fermi_data'] = merged_df['fermi_data'].fillna(0)

zeitdifferenzen_df = merged_df['Time']
# Take the time differences from the 'time' column in sortierter_zeit_df.
zeitdifferenzen_df['Time'] = zeitdifferenzen_df - zeitdifferenzen_df.iloc[312]
#zeitdifferenzen_df = zeitdifferenzen_df[(zeitdifferenzen_df != 0).all(1)]
# zeitdifferenzen_df now contains the time difference of every time point to time point 1.
#print(zeitdifferenzen_df)

ccf_ielts = ccf_values(merged_df['ztf_data'], merged_df['fermi_data'])
#ccf_ielts = ccf_values(merged_df['ztf_data'],merged_df['ztf_data'])
lags = signal.correlation_lags(len(merged_df['fermi_data']), len(merged_df['ztf_data']))
#lags = signal.correlation_lags(len(merged_df['ztf_data']), len(merged_df['ztf_data']))


def ccf_plot(lags, ccf):
    fig, ax = plt.subplots(figsize=(9, 6))
    ax.plot(lags, ccf)
    ax.axhline(-2/np.sqrt(23), color='red', label='5% confidence interval')
    ax.axhline(2/np.sqrt(23), color='red')
    ax.axvline(x=0, color='black', lw=1)
    ax.axhline(y=0, color='black', lw=1)
    ax.axhline(y=np.max(ccf), color='blue', lw=1, linestyle='--', label='highest +/- correlation')
    ax.axhline(y=np.min(ccf), color='blue', lw=1, linestyle='--')
    ax.set(ylim=[-1, 1])
    ax.set_title('Cross Correation IElTS Search and Registeration Count', weight='bold', fontsize=15)
    ax.set_ylabel('Correlation Coefficients', weight='bold', fontsize=12)
    ax.set_xlabel('Time Lags', weight='bold', fontsize=12)
    plt.legend()


#ccf_plot(zeitdifferenzen_df['time'], ccf_ielts)
ccf_plot(zeitdifferenzen_df['Time'], ccf_ielts)

merged_df is a data frame with 534 data points per column. I don't know if I left out any important information; if so, please let me know.
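Since the merge-and-zero-fill step above treats every gap as a real measurement of zero, one alternative that avoids both zero filling and rebinning is a discrete correlation function in the style of Edelson and Krolik: every pair of points from the two curves contributes to the lag bin that contains their time difference. The following is a simplified sketch of that technique, not the code from the post. It leaves out the measurement-error correction of the full method, and the function name, the bin edges and the synthetic data are assumptions made for illustration.

```python
import numpy as np


def discrete_correlation_function(t1, y1, t2, y2, lag_edges):
    """Simplified discrete correlation function for uneven sampling.

    Returns (bin_centers, dcf). A positive lag means that curve 2
    follows curve 1. Bins without any pair are set to NaN.
    """
    t1, y1 = np.asarray(t1, float), np.asarray(y1, float)
    t2, y2 = np.asarray(t2, float), np.asarray(y2, float)

    # Standardize each curve with its own mean and standard deviation.
    a = (y1 - y1.mean()) / y1.std()
    b = (y2 - y2.mean()) / y2.std()

    # All pairwise time differences and the matching products.
    dt = (t2[None, :] - t1[:, None]).ravel()
    products = (a[:, None] * b[None, :]).ravel()

    # Bin index of every pair; pairs outside the edges are ignored.
    which = np.digitize(dt, lag_edges) - 1
    n_bins = len(lag_edges) - 1

    dcf = np.full(n_bins, np.nan)
    for k in range(n_bins):
        in_bin = which == k
        if in_bin.any():
            dcf[k] = products[in_bin].mean()

    centers = 0.5 * (lag_edges[:-1] + lag_edges[1:])
    return centers, dcf


# Synthetic example (hypothetical data): curve 2 follows curve 1 by 5 days.
rng = np.random.default_rng(1)
t1 = np.sort(rng.uniform(0, 300, 400))
t2 = np.sort(rng.uniform(0, 300, 350))


def flux(t):
    return np.sin(2 * np.pi * t / 37) + 0.5 * np.sin(2 * np.pi * t / 11)


y1 = flux(t1)
y2 = flux(t2 - 5.0)

lag_edges = np.arange(-20.0, 21.0, 2.0)  # 2-day bins from -20 to +20 days
centers, dcf = discrete_correlation_function(t1, y1, t2, y2, lag_edges)
peak_lag = centers[np.nanargmax(dcf)]
print("lag bin with highest correlation (days):", peak_lag)
```

The lag axis here is in days from the start, so no sampling rate is needed. The bin width is a trade-off: narrow bins resolve the lag better but contain fewer pairs and are noisier.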

The plot: https://i.stack.imgur.com/U1hQF.png

Thank you very much

Source: https://stackoverflow.com/questions/77111945/scipy-correlate-how-to-change-datapoint-lags-into-time-lags

