Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_19937 |
Symbol | |
ID | 7200572 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011675 |
Strand | - |
Start bp | 298040 |
End bp | 301515 |
Gene Length | 3476 bp |
Protein Length | 1012 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002179824 |
Protein GI | 219118084 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCGTCAA CCAACAAAAC CGAAACTATT CAGTGGTGCT CGGATGCCTT GCACGATTTG TTGGGCTTTG CCGACACGGC GTTGGCTTCG TACCTGGTCA GCGTTGCGAA GAAGGCAACA CAATCGTCGG AAATCGTCCA GATCCTCGTG GATGGAGATG TACGAGACGT GACACCGGAA CGCATGGAAA GATTTGCTGA GCAATTGCTC TCGCACGCTC GACCGACACC GAAGCAAAGC CACGGCGGAC CTGCTTCTCG ACAAGCAAAG GCCATTCACA GTCAAACAAA AACGAACGCG GACTGGGTCA AGGCGGCTTC CAGCTATCAA CTGATTGATG TAGAGATCAG CGAAGAACCG TCTAATCTGA ATAAACCAAG CGACAGACGG AAGGGAAAGA AAGACAGACA GGATAAAAGG GATTCTTCCC TTTCCGAACG TGTTGGCGAA AAGTCGCGTC GAAGAAAACG GCAGTACCGC GATGGTGATT CTGGTAGCAG TGACAGCAGC AATGCAGAAG ACGGAGCTGG GGCTAGAGTA GCTGAACGGT ACCGTCGGAA AGCCGAAGAG CGTCGAGAGC GGAGAAGGCA TCGCCAAGTG GAGTCAGCTC TTACTCCTGC AGAGCGCGTG GAACTAGAAA GGGAAAAGGA TCTGAAGGAA CGAGACGAGC TTGTACAGCG CATGATGGAA CGCGATCAGA CAAAAACAAA ACAAAAGGCA AAGTCAGAAG AAAAATCTGA TTCTGTTCAA AACCTAGCAG AAATCGAGGA GAGGCTAGCA AAGGGAGAAC CAATGTATGA TGATGCTACA GGGAATGAGT TAACGTTGGA GCGACTCCGC GAGGAGAGCC GTCGTGCGTA TCTGAAGAAG CGAGAAGAGC GTGAGCTAGC CCTATTGAAA CAGTCGCTTC AGGATGAAGA GGATCTTTTT AGAGGCGCAA AATTAACGGA AGCAGAAAAA AAGCGAATTC AGATGGGGAA GCAGATCCTT AGCATGGTTG AGGAGAGAGA CGGTGAGGAA GACAAGGATG ATGAATTTTA TCGGTTACCT GGGGACTTTC ATGAGAAACA CTCAAGGGCT AAACAGCAAG AAGCATTGCT GACGTCCCGC TATAACGAGC CTAAACTCGA AAAATCGGAA CAGGATCTGT GGGAAGAGTC GCAAACGCAA AAGGCTGGTG CTATTGGCGG GCGCCAGAAG AAGGCGATTG AATCAGACGG TTACGAGTTG TTGTTCGACG ATCAGATCGA TTTTGTTATG CAGGAAACTA GAGAAGGCTA TGACAAACGT GGCAAGAGAC ACAAGTTGCG AGACCATACT CGTCAAATTA GAGACGAGAG TCTGTCAGAG ATGCGTCCAG CAACTGAGCA TGAAAAGATT CTGGAAGGGC GTACCAAACT TCCTGTTTAC GCTTATCGCG AAGAATTCCT AGCTGCAGTC AAAGAGCATC AGATTCTCAT CTTAGTGGGA GAAACGGGCT CGGGTAGGTC TTAGTCTGTA ACTTTTGGCT TTTGGGAATC TAGTTGCGAA TGTCTCTGAT TCTATATCCG TTCTTTCTTA CTTTTAGGCA AAACGACACA AATTCCTCAA TTTCTCAACG AAGTTGGATA TGGTGAGCTG GGGAAAATTG GTTGCACGCA GCCTCGGCGC GTCGCTGCAA TGAGTGTGGC AGCTCGTGTC GCGCAAGAAA TGAACGTCAG GCTCGGGCAC GAAGTTGGCT ACTCCATTCG ATTCGAGAAT TGCACAAGCC CCAAGACGAT TCTCCAGTAC ATGACGGACG GTATGCTTCT GAGGGAAATT TTGACCCAAC CAGATTTGGC GAGCTACTCA TGCATGGTAA TCGACGAAGC ACATGAGCGC ACGCTACATA CGGATATACT TTTTGGTCTC GTCAAGGACA TTGTGCGTTT CAGAAGTGAT CTTAAACTCA TCGTCAGTAG TGCAACGCTT GATGCCGAAA AATTCTCGAA GTATTTTGAC GATGCCAGCA TTTTCATGAT TCCCGGTCGT ATGTTTCCAG TCGATACATA TTACACAAAA GCCCCGGAAG CTGACTATGT TGACGCGGCG GTTGTCACCG TGCTACAGAT ACATGTATCC CAGCCGCTCA ACGGAGATGT GCTAGTATTT TTGACCGGTC AAGAGGAAAT CGAGACTGCG GCCGAAACCT TGTCCGAGCG TTCGAAAAAC CTTGGCTCTC GCATACCTGA GCTAATCATT TGTCCGATTT ACGCCAACCT TCCCTCAGAG CAGCAAGCGA AAATCTTTGA AAAGACTCCA AGCGGTGCTC GCAAAGTAGT TCTTGCTACA AATATCGCAG AGACAAGCCT TACAATTGAC GGGATCTGTT ACGTGATAGA TACTGGGTTC AATAAACAAA AAACATATAA TGCCAGATCT GGCATGGAAT CTCTGGTCGT AACTCCCATT TCACAAGCAG CCGCTAACCA ACGAGCTGGT CGAGCAGGGC GGACGCAACC AGGCAAGTGT TTTCGGCTCT TTACAGCATG GTCTTTCCAA CATGAACTTG AACCAAACAC CGTGCCGGAG ATATTACGGA CGAACATGGG AAACGTTGTT TTAATGTTGA AGAGTCTCGG AATCAACGAT CTTTTGAATT TTGACTTCAT GGACCGGCCT CCTGCCGATG CTTTGATAAG AGCTCTTGAA CAGCTGTACG CCCTCGGTGC GCTCAATGAT CGGGGAGAAT TGACAAAACT CGGTCGTCGA ATGGCAGAAT TTCCTTTGGA TCCTATGCTA AGTAAATCTG TAATTGTGTC CGAAAAGTAT GAATGCACAT CCGAGGTGCT GTCGACCGTC GCGATGCTTT CTCTAGGTGC ATCGGTTTTC TATCGGCCAA AAGAAAAGGC AGTACATGCC GACACGGCGC GACTTAATTT TGCCCGCGGT GGTGGAGGTG ACCATATCGC TCTGCTTCGA TGTTACTCTG AATGGGCAGC ATCTGACTTC AGTCCTTCTT GGTGCTTCGA AAATTTTGTT CAAGTCAAGA ACATTAAAAA AGCCCGTGAC ATTCGGGAGC AGCTAGCAGG ACTTTGTGAT CGTGTAGAGA TTGATCATAC AGTTTCGAAT TCTGACGATT TCGACGCTAC TCTGAAAACA ATTACTGCTG GTTTCTTTTA CAACATTGCG AAACTTGGTC GTACTGGAGA GTATCAGACA GCGAAGCAGC ACAAGACTGT GTATATTCAT CCTAGCAGCG TAATGGCAAA AGAGGAAGAG CCGCCACCGT GGCTAGTATT TTTTGAGCTT ACCTTTACAA CAAAGGAATT CATGAGACAG GTAGCCCCTA TCAAGCCATC GTGGTTGGTT GAAATTGCAC CGCACTATTA TCAAGAAACT GATATCGAAG ATTCGAAGAC CAAAAAAATG CCGAGAACGA GACGCAATTG ATGATTTGCT GTTTCACCTG CAAATTAGAA TACTGTTTGG AATTCTTGTT TTTTTTTAGG ATGGTG
|
Protein sequence | MPSTNKTETI QWCSDALHDL LGFADTALAS YLVSVAKKAT QSSEIVQILV DGDVRDVTPE RMERFAEQLL SHARPTPKQS HGGPASRQAK AIHSQTKTNA DWVKAASSYQ LIDVEISEEP SNLNKPSDRR KGKKDRQDKR DSSLSEPLTP AERVELEREK DLKERDELVQ RMMERDQTKT KQKAKSEEKS DSLTLERLRE ESRRAYLKKR EERELALLKQ SLQDEEDLFR GAKLTEAEKK RIQMGKQILS MVEERDGEED KDDEFYRLPG DFHEKHSRAK QQEALLTSRY NEPKLEKSEQ DLWEESQTQK AGAIGGRQKK AIESDGYELL FDDQIDFVMQ ETREGYDKHE SLSEMRPATE HEKILEGRTK LPVYAYREEF LAAVKEHQIL ILVGETGSGK TTQIPQFLNE VGYGELGKIG CTQPRRVAAM SVAARVAQEM NVRLGHEVGY SIRFENCTSP KTILQYMTDG MLLREILTQP DLASYSCMVI DEAHERTLHT DILFGLVKDI VRFRSDLKLI VSSATLDAEK FSKYFDDASI FMIPGRMFPV DTYYTKAPEA DYVDAAVVTV LQIHVSQPLN GDVLVFLTGQ EEIETAAETL SERSKNLGSR IPELIICPIY ANLPSEQQAK IFEKTPSGAR KVVLATNIAE TSLTIDGICY VIDTGFNKQK TYNARSGMES LVVTPISQAA ANQRAGRAGR TQPGKCFRLF TAWSFQHELE PNTVPEILRT NMGNVVLMLK SLGINDLLNF DFMDRPPADA LIRALEQLYA LGALNDRGEL TKLGRRMAEF PLDPMLSKSV IVSEKYECTS EVLSTVAMLS LGASVFYRPK EKAVHADTAR LNFARGGGGD HIALLRCYSE WAASDFSPSW CFENFVQVKN IKKARDIREQ LAGLCDRVEI DHTVSNSDDF DATLKTITAG FFYNIAKLGR TGEYQTAKQH KTVYIHPSSV MAKEEEPPPW LVFFELTFTT KEFMRQVAPI KPSWLVEIAP HYYQETDIED SKTKKMPRTR RN
|
| |