Gene Information |
Locus tag | PHATR_43902 |
Symbol | |
ID | 7204351 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011671 |
Strand | - |
Start bp | 377576 |
End bp | 381938 |
Gene Length | 4363 bp |
Protein Length | 1416 aa |
Translation table | |
GC content | 46% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002186333 |
Protein GI | 219113499 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.279249 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CTTTATTGCT ATGGACGCTG ATTCAACAAA AAGACGGGCG AGTCTGAAGC GTTCACGTGT AGCGATTGAT TTTCCTCATG GTCGCACATC AGCAAAGAAA CAAACGGTGG CGAAGCCGGA CAATGCCCAA TGCAGCTATG TTCCTGTATT CGATGAGCCA AATCAGCATC TCAAGGCGCT TTCCTCTCAA AATTTCTGTT CGATGAGAAA ACCAAGCTCT CCCTCCTCGC TCTCCAGGAC GAAAAACAAC GCCTCTTGTC ATCACATGGA AAATGCTCAC GATCGCAAAT GGGCCCTAAG TTTTGAGAAG CTATCGGAGT ACAAATCTCG TCACGGAGAT ACCCTTGTTC CGAAAGATTA TATATCAGGA GGGGTCCCAC TGGGAAATTG GGTAAGAAGG CAACGCTACC TTCAAACAGC AAAGCTCAAG GGATCTAAGA CCCCGCTCAC TCAGAGCCGA ATTGAAAAGC TCAATTCTCT TGGCTTTGAT TGGGATACCG TGTCGAAGAG GGTGTTTGTA CGCTGGGAAG AAAACTTTGA GCGCTTAATC AAGTACAAAG AAACGCACGG TGACACTCGA GTACCGAGCA ACTATCGTAT TGATGACGTC AACTTGGGAA ACTGGGTAAG ATCTCAACGC GCTGCATACT CTATGAAAAC AAGGGGTAAA TCAGTGGTCA TGAACGACGA GCGTATATAT AAGTTGAATT CCATCGGTTT TCAGTGGGAT CTCCAATCGC GATTGTCGGG TTGTTGCATT GGCGGCTTCA CGGAACTTCC GTCCTTAGCC GATCAGACGA TAAAAGAGCT GGAAGTAGTT TTCCTGCAAA ACCTAGATCG AGAAATATTG ACCGCAGAAG AGGTACATTT TATGGAGAAG CTAGAAAAAG CGATAATCAT AGAAGAAGAG AAGTCGCTTC TACAGCGCAT TGACGAAGAG GTCACCAAAG CCGAGGTCTG TTTACTCGAT GAGACTCCAC GTGTTCCAAA ACGAACTTTC ACGAGAGTGG GGAGAAGTTC GGAGCTTATG AACGGTTCTC ACTCGCCAGC CATTCAAAAT GATTCACAAC TTTACACTAC CATGGCGGTT GATAGCGCAA CTCTCTACTT TCCCAAGCAT TTGAAAGGAA ATATATTGAT CTCGAGGGAA GAATTTAATT CTGAACTTTT ATTGGCTTTC AAAAAGGAAT GCACCGAACT TCTAGATGGA TTTACCACAA GCATGCAAGT CTGTCAAGCT CAGTCTGACA CGTGGAAATT GACTGTTCAA GGTCAACCAG AAGCTCTACA GAAGCTCTCC CAGATACTCA AAGCTTGGAT TCACAACAAA ATAAACGCTT TTAACTATTT ACTCCTCGTT TCAGAGGGGA GGGAGGTAGA GGTCCAAATG AAGTGTACAG CGCAGATTGA AGCAAAGCTG GTTGAAACAG CATGGAATAC GAATGGTGTG AAAAAGCAGG GCTTGCTCAT TCAAGCTCTT TCGTCGAAAG GACAGCTGGG AAAGGCTCTA GGAGAGCAAG GAACTTCTTG CGGTGCTGCT GTTGTATCCA TTGAAAAGCA AGAGTGCTCC TCGATTGCCC AATTTAAAGA CTTGGTCAAA AATACAGGTG TGAACAATGC ATCGTATAAA CTGCAACTGA TGCTGCATCC TGAATCTTTT CAGGTTGTGG AGCATGAAGG GGGTATGATG AATACTGCAG AATCCCATTA TGGTATCGTG GAGAACTTCA AGTTTGTAAA TCAAGACACT CCTTCGGGGT CTCGTCGTAA AGCAATATGT CCATCTACTG CTTCAGCCAC 
CTCAGGCTCT TATGAGAGAA AGGAATACGT TTTTAAATTT CCTAGTGGAG AAAAGCTTGG GTTTTATTGC AAAAACGACC GCAGTGGAAA GTCTCCAGTG TGTCGTATCT GTTCCGTTTG TCCGAACAGT ATCACGGCAA AAGACTCAAG AGTCCTCCGA GGCACAATCG TTGTTTGGGC TTCAGTGGGA AACGGAGATC GGTGTGCTGT CAGTAAATGG CAGGACTTAG CGATTCTCTA CACACAAGCA AGTCAGGGAC CTGCTGATTT GAATATCTGG TTTGTAAACC GACATGAGGG AAAATTCGAC CATGACGGCT CAAGGCTGCT GGATAGCAAA GATTGGACAG ACACTGGGAT CTGGAGAGGA AAAGAGAAAG CAGGCTGGGC AGGCGGGGAT CAAACGCACT GCTGTGAAAA TCTTGATCCT CGAAGCGGTA CCGATCGAGA TGCGAATTCA AGCCTCCGTC CCGTCGCATT AGCTTGTAGA CAAGAGGAAT TGGAGAGCAG TGAAGACGAG CAATGGTTCC TTTTAGATCA TATGCCATCA ATTCGTGAGC ATGTGCGCTT GCATCCTGCG CTAAAAAAAG CAGGCTCAGC GAAGAAGGGA AAGACGATCA GCTTTGACCG AAAGCTGTAC CAGGTAAGAA AGTTTGTAAG GGACAGTAGA ACGTGTGAAT TCTACGTCCA GCGTAAGTTC CAAGCCAGTC CTGTTGCGTC CCCTAGCTAC ACAATCCAAG TCGGCTCCAG TATCTCATTT GAAAGCAAGA TTTTACAAGC AATAAAGAAG CACTCGTTTA AAGAACTAAT TCAAGTTCTT CAAAATGGAG CATTTGCTGC CAATCGCTCA GTTGCATTTC TTGACACCGC ACGACGCGAA CTGAGGCTGG CCGTTGAGTC AATGAAATTG GAGAATAGTG AATACTGTGG AAACCCACGG AACTCTCTTG TCCGAAGGAG GGATTTGGAT CTCAAGCACA CAGTGTTGAA GGTGTATATA TCAGCCGCAC ATACATATGG CCATGCGAGA GGTTTGAGGA ACTGGTCCAG GTTCGAGCTT CTGGTGAAAA GTATAGAAAA TTTACGCCTT TCTCCGTCAC TCTTAAGAGG ACAAGGCGAG AGCTTTATTT CAGCCAAGTT GAGTGCAATT TTTCCGGACT CTAGCAGCTC TGAGCAACTT GCGCCCCTTC CCCATCGTCC TCTATCGAAT CTCGTCGAGT ATGGCAATGA GTTCAGCAGT TCTTATTTTT TGAATCACAA CGCTAGTATT GCTGCGGGTC GATGTCTCGT AATTGAAATT CAAATGAATC CTTCTGAGGC TAGCGGATCC TTCCGGCTCG GTTCCATGAA CGTCGAAGTG GCTGAACTAC AGGCTAACTG CCCTCGAGAT GGATCCTGGT GGAATGTATC GAAGCAGATT TTGACTGGGC CCCATCTAGA GGATGGAGTA GTCCATATCA ACGCCCGAAG GGTTGCTGTC GACCCTGAAT ATATCAAAGC GAAACGACGT GACGGTTGTA TACGATTGAA AACAACCTGT GACTGGGCTT GCAAATTCAA CGATAGCCTT TCTCCCATAG AGAGAGCGGG GGGTGTTTTA GATCTACATG TCCCCGTTTT CGGTGGCGCA TCGCTCCTGC ATACTGCGAT TTTGCTTCAA GACTCACATC TGGTGAGGAA GCTTCTTGAT CTTGGCCTAG ATCCGGGCAC CAAGTCCCAA ATTGGATCTC CGATGTCATT GGCCTTCAAT CTAATTGAAA ATATTACACA TGCGTCCAAG AATAATGGAA TGAGCGGCGG TGAGACACAT 
CACAACCACG CCGAACCGGG AGATCGCGCG GAAAACGACA GAGCAGCAGC ACTAGCCAGA ATTAGTAAAA TGCTTTCTTC CAGGAACGAG CAGTCCAACA TACGCAGAGA CGGTGCGGGC CTGTTGCAAA GAGAAGGCGC TTCTTTGGGA TTGCCTTCCT CAACACATTC TACGATTTCC GTCTCCACAT TAACTCCAAA ACTTCCAACG CTACCCGATA CAAATTGGCT TTTAGAGCCT ACGTTCGTGA GGCACATTTG TCGTTACAAC GAGGGGGCGA CATGCAGACT TGGACAACGA TGCGCCTTTA TTCACATCAA AGCTTCTTTA GGCGAGAACC TTGTTAGCAC TTTGGCACGT ATGCAGAAGA GCGGTGCCGA AGATGAAGGC TCTCTACAAT ACTTCCGGAA AAATTTAAAG GTCATTTCGA GACAAGATTC ATCGAACTGC ATCTGGTATA CTGCTGGCTT CTCGACGGTT GCTCGATTTC GTACCTCAAA CCAGCAGATA TTTTATGCGG AAGGAGGACC TGGTGTCCTC AGTCAACAAG GGGTGACATG GTATCGTGAC CGAAAGAGTG CAGTGGAGTC ACTTGCTCGA GTGTTCAAAA TTTTTAGGGA ATCGACCCGT ATAAAAAACC AAGAGAGTTA GTTAAAGAGA CCACTGATAT CATCAGATCG AAAACGTGAC TCCGAGAGGC ATTGCAAAAT GCGGTGACAA AAAATTACCG AGAATATCGA TCGATGCATG CGG
|
Protein sequence | MDADSTKRRA SLKRSRVAID FPHGRTSAKK QTVAKPDNAQ CSYVPVFDEP NQHLKALSSQ NFCSMRKPSS PSSLSRTKNN ASCHHMENAH DRKWALSFEK LSEYKSRHGD TLVPKDYISG GVPLGNWVRR QRYLQTAKLK GSKTPLTQSR IEKLNSLGFD WDTVSKRVFV RWEENFERLI KYKETHGDTR VPSNYRIDDV NLGNWVRSQR AAYSMKTRGK SVVMNDERIY KLNSIGFQWD LQSRLSGCCI GGFTELPSLA DQTIKELEVV FLQNLDREIL TAEEVHFMEK LEKAIIIEEE KSLLQRIDEE VTKAEVCLLD ETPRVPKRTF TRVGRSSELM NGSHSPAIQN DSQLYTTMAV DSATLYFPKH LKGNILISRE EFNSELLLAF KKECTELLDG FTTSMQVCQA QSDTWKLTVQ GQPEALQKLS QILKAWIHNK INAFNYLLLV SEGREVEVQM KCTAQIEAKL VETAWNTNGV KKQGLLIQAL SSKGQLGKAL GEQGTSCGAA VVSIEKQECS SIAQFKDLVK NTGVNNASYK LQLMLHPESF QVVEHEGGMM NTAESHYGIV ENFKFVNQDT PSGSRRKAIC PSTASATSGS YERKEYVFKF PSGEKLGFYC KNDRSGKSPV CRICSVCPNS ITAKDSRVLR GTIVVWASVG NGDRCAVSKW QDLAILYTQA SQGPADLNIW FVNRHEGKFD HDGSRLLDSK DWTDTGIWRG KEKAGWAGGD QTHCCENLDP RSGTDRDANS SLRPVALACR QEELESSEDE QWFLLDHMPS IREHVRLHPA LKKAGSAKKG KTISFDRKLY QVRKFVRDSR TCEFYVQRKF QASPVASPSY TIQVGSSISF ESKILQAIKK HSFKELIQVL QNGAFAANRS VAFLDTARRE LRLAVESMKL ENSEYCGNPR NSLVRRRDLD LKHTVLKVYI SAAHTYGHAR GLRNWSRFEL LVKSIENLRL SPSLLRGQGE SFISAKLSAI FPDSSSSEQL APLPHRPLSN LVEYGNEFSS SYFLNHNASI AAGRCLVIEI QMNPSEASGS FRLGSMNVEV AELQANCPRD GSWWNVSKQI LTGPHLEDGV VHINARRVAV DPEYIKAKRR DGCIRLKTTC DWACKFNDSL SPIERAGGVL DLHVPVFGGA SLLHTAILLQ DSHLVRKLLD LGLDPGTKSQ IGSPMSLAFN LIENITHASK NNGMSGGETH HNHAEPGDRA ENDRAAALAR ISKMLSSRNE QSNIRRDGAG LLQREGASLG LPSSTHSTIS VSTLTPKLPT LPDTNWLLEP TFVRHICRYN EGATCRLGQR CAFIHIKASL GENLVSTLAR MQKSGAEDEG SLQYFRKNLK VISRQDSSNC IWYTAGFSTV ARFRTSNQQI FYAEGGPGVL SQQGVTWYRD RKSAVESLAR VFKIFRESTR IKNQES
|
| |
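The coordinate and length fields in this record can be cross-checked against each other and against the two sequences. The following is a minimal sketch, not part of the source database: it assumes the standard genetic code (the Translation table field above is empty), assumes the gene sequence is already shown in coding orientation even though the strand is "-", and uses hypothetical helper names (`gene_length`, `translate`). Only the first 40 bases and first 10 residues are copied in, to keep the example short.

```python
# Values copied from the record above.
START_BP = 377576
END_BP = 381938
PROTEIN_LENGTH_AA = 1416
GENE_PREFIX = "CTTTATTGCTATGGACGCTGATTCAACAAAAAGACGGGCG"  # first 40 bases
PROTEIN_PREFIX = "MDADSTKRRA"                             # first 10 residues

# Standard genetic code (assumption: NCBI translation table 1),
# built from the conventional T, C, A, G ordering of codons.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates."""
    return end_bp - start_bp + 1


def translate(dna: str) -> str:
    """Translate complete codons of a DNA string; trailing bases are ignored."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# Offset of the first ATG in the listed gene sequence (0-based).
orf_start = GENE_PREFIX.find("ATG")

# Bases needed to encode the protein plus one stop codon.
cds_length = 3 * (PROTEIN_LENGTH_AA + 1)

print(gene_length(START_BP, END_BP))       # span implied by Start bp / End bp
print(orf_start)                           # bases preceding the start codon
print(translate(GENE_PREFIX[orf_start:]))  # compare with PROTEIN_PREFIX
print(cds_length)                          # compare with Gene Length
```

Under these assumptions the coordinate span agrees with the listed Gene Length, and translating from the first ATG reproduces the start of the listed protein. The coding length implied by the protein is shorter than the Gene Length, which is consistent with the gene sequence carrying some flanking bases around the open reading frame.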