Gene Information |
Locus tag | PHATRDRAFT_37916 |
Symbol | |
ID | 7202843 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011682 |
Strand | - |
Start bp | 393196 |
End bp | 396321 |
Gene Length | 3126 bp |
Protein Length | 1041 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002182059 |
Protein GI | 219123495 |
COG category | |
COG ID | |
TIGRFAM ID | |
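The coordinates and lengths listed above can be cross-checked with a short sketch. It assumes 1-based, inclusive coordinates and that the gene length of this unspliced CDS includes one stop codon. The function names are illustrative and not part of any database API.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_from_cds(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes one stop codon (assumption)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the Start bp / End bp rows of this record.
gene_length = span_length(393196, 396321)
protein_length = protein_length_from_cds(gene_length)
print(gene_length, protein_length)  # 3126 1041
```

Under these assumptions the result matches the listed Gene Length (3126 bp) and Protein Length (1041 aa).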
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGAAACGGA GAAGGATATC CCATGCCATC CCGCTGGCTC TAAGCGTTTG TCTCCGGAGT CGGCTCATAG ACTCCTTTCT CGCCCTGCCA TCAAAATTGC ATTCGAATCG GTTGGTCTTT GATTCTCCAT CGTCCACGAA AACAACATGC AAAAGAGATT TTAGAATTTT CGGAGACCTT ACCGGAAAGC TCGAAGACGA CCAGGATGCA CAAGAACTCA CAACGACGAG ACACTCAAGC ACACATCAGT CTCGGCGGAA AGCCATGCAA GCCTTAGGCT TGGCTTCGTT GGCTATCCCA ATGGCCGCCT CAGCGGGAAT CGCCGAACTC GACAAGTCTA CGGGAGCACT GTTCAGTCCC AAATCCGAAA TGCTTTCTGG TGGGAGTGCG GCTGCGCGTG GTATTCCTGT TTCTGGTAGT CGCCGCCAAC AACTCCAACC TGGACAAGCG CTCCAAACAG TATACGAAAC TCGCTTTATT GTGTACCTTG CGCGATTTTT ATTGAATTTT GATCCATCAG CACACGCCTG GTGGCTCCAA CAAGGTTTTG CCGATTCTTG GGAACCTCGC TCTGGCTCGG ACGAGGCATT TGCCGACAAC ACTTTAGCCG AGTTTGCAGA AAGTGTCGAA GTTGGTTTGG CTGATTATTT TGTTGGTCCC TACGGTAGCT ATTCGTCACT GTCCGCTGCC AAGGCTGGAA TTTCAGCGGC GCGGCCAGCA CCCTCCGCGC AACCACAACA AGAAGAAAAC TACTTAAAAG AACTCCTTTT TGGTCGACCG AAACTTTCCG ACGAAAAGAC ACCCAAAGAA AAGGTAGACA GCGCAAAAAA AGGGATACTC AATTTGTACA CTCTGTTGAA AGCTCGCTAT ACATCTGTTG CCGCCAAACG GCATCTTGCT ATTCTGTTTT CTTTTATATC CTCCCCAAGG CTTCAACCAA GTAATGAAAT CCTAGCTTTG CTTGGTGAAT CGGACAATGC AACAATTTCT GAGATTCGGA TCGTCAAACC AACTCATTGG CCAGTCAACG AGGCGGACTC GCGGACGAGT AGCCGACGGG GCGGTGGCTA CTCAATCGAG GAGCCACCCA TTGTTACTAT CGATGAACCA CCGGCGTTAG GCGACAGCTA CGTACCGGCT GAATTAAGAC CCGTTCTCAA GCCGACATCA CGTGTTTTGC GCATTTCTGT GATTGACGGT GGTGAAGGTT ATACATCGGC GCCCCAGGTG ACCGTAGTGC AAAGCGGCTA TCTACGCTTG TGTCAAGCTA CTGCTATTAT AGATCGGAGT GGCAGAGTCG AATCTGTCAT TCTTTTGGAT CCGGGATGGG GTTACGGTGG TCGAAAGCAG GCTCCTCCCA AAGTCAAAAT TGAACCTCCC AGACTTAAAA GTAAAGGAGA ATCGGGTCAG CGAGCGAAGG CTGTTGCTGA ACTTGAGTAC GAAATAGTCG GTGCGAAGGT TGTCCGTGGT GGAAACGGAT ATGTCAAGAC CGAAGTACCA CAAATTACAA TTACCCCGCC GGATGGGAAT CCCGACTGGT TTCTGGCAGT GCAGGAACAG CCAGAGATGC GAATGAAGAA ACCAGCAGAA ATCGAACCTT TACGACTAGA AGTGGCCGAA ATGAAATTCT CGGATGGCAG CGTTGCCTAT TCTATCGATC GTATGCCCGA GAGCAAAGGT GTGGACAATG CATTACTCGA TCGTCTTCAA AGAGATCCAC TCGAAATGCT ACCCCCTTCA ATTCGACCCG AAAGATACAA ATATGGAATC TACGCCATAC CTTCCCTGGC ACGCATTCCA 
CAGTCCGTTC CGAACTTGTC ACCGAGATAC CGTGCATGTG ACCCGGTATT TGGAGGCGTC GGTCGTGTTC CAGTTACAAA AGGGGCTGTG GCCTTGAAAG CTAGCGAATA CGCTCGTCTT GCTTTAAGCG GCGCCGTCTG CACAGTGTTG GTACGCACAG CTTTGAATCC GCTGGAGTTG ATCAAGACCA AGCAGCAATT GCAAAACGAT AACGAATTAC TCTCTTTTGC GAGAGCTCGG GCTCTCCGAA AAGGGATTTC TCCTGATCAC CAAGACGCGC ACGAACATAT GCTGTCTAGC AACAAAGCGG AAATCAATGC CACTGCTGCT GTTGCTCCTC AAGAAACGGA CGGACAAATA AAGCTAGGAA CACTCGATTT GATATCAAGT CTCATCGAGT TACGAGGCCC TTTAGCCTTG TTTCAAAGTG CTGACATTAC CTTTCTTGCT TCCTTGGTAT TTGGCTCACT CGGTTTTGGC GCCACTGAAT TATTTCGTCG TTCGTTCACT GCATTTTTCA TTGCCGGTTC TGGAACGGAT GAAATTGGTT TGGATGTTGT TGCGCTTTTA GCGGCTGCTT CCCTGGCGAC CGTTGTGACA GCAGCCGCTG CCGCGCCCTT TGAGGTCTTG CGTGTAAGGA GTATGGGTCT AATAGAATCG GTGGGTTGGA CAAAGGTTTT GGAGGATTTC ATCGCCGAAA AGTCAAGACC AAGACAAAAA ACATCAAACT CTTTCGGTCT CAATCGTAAA CAACATGGAG GTCATCAAGA ATTTGAATTG CGCAACTTAA AAGCGAGGGA TATCCTGCCT TTATGGGCTG GTTTTGCACC CACCGTCAGT CGTGAACTCC CGTTCGCGGT CGTCAAGTTT TTGACATTTG ACTTTATTAC TGGAACGGTA ATTACCTTCT TGAACACACA GTCCAGTGAT GGCGCGTTGC CTATTCAGGT TGGAACGGGC CCTATTGGAT TGATAGTATC GGCGCTGGCT GGCGCTGTGG CAGGTATCGC AGGTGCTGTT GTCTCGCATC CAGCCGATTT GATTTTGACA AAGACATCAG CTAGTGGCAA TCGCAATGGA ACAGAAACTT CCGCATCGGT AGAGGAGCCA GACTGGAGGG ATGTTGTCAG GGAGTTGATA GCACAGCCCG GCGGAATTGC GAATCTTTAC GTCGGTTTTC CTGCACGTGC TACATTTTTC TTCCTTGTCA TTGGGCTGCA GTTCTTTTTG TACGATTATT TCAAAACGTT GCTAAATGTT GGCTCCGACG ACTTGAGCTT GGTATTGGAT GTGTTTTACG CGGTGCGTGC CGGTCTCGTT GGGTAG
Protein sequence | MKRRRISHAI PLALSVCLRS RLIDSFLALP SKLHSNRLVF DSPSSTKTTC KRDFRIFGDL TGKLEDDQDA QELTTTRHSS THQSRRKAMQ ALGLASLAIP MAASAGIAEL DKSTGALFSP KSEMLSGGSA AARGIPVSGS RRQQLQPGQA LQTVYETRFI VYLARFLLNF DPSAHAWWLQ QGFADSWEPR SGSDEAFADN TLAEFAESVE VGLADYFVGP YGSYSSLSAA KAGISAARPA PSAQPQQEEN YLKELLFGRP KLSDEKTPKE KVDSAKKGIL NLYTLLKARY TSVAAKRHLA ILFSFISSPR LQPSNEILAL LGESDNATIS EIRIVKPTHW PVNEADSRTS SRRGGGYSIE EPPIVTIDEP PALGDSYVPA ELRPVLKPTS RVLRISVIDG GEGYTSAPQV TVVQSGYLRL CQATAIIDRS GRVESVILLD PGWGYGGRKQ APPKVKIEPP RLKSKGESGQ RAKAVAELEY EIVGAKVVRG GNGYVKTEVP QITITPPDGN PDWFLAVQEQ PEMRMKKPAE IEPLRLEVAE MKFSDGSVAY SIDRMPESKG VDNALLDRLQ RDPLEMLPPS IRPERYKYGI YAIPSLARIP QSVPNLSPRY RACDPVFGGV GRVPVTKGAV ALKASEYARL ALSGAVCTVL VRTALNPLEL IKTKQQLQND NELLSFARAR ALRKGISPDH QDAHEHMLSS NKAEINATAA VAPQETDGQI KLGTLDLISS LIELRGPLAL FQSADITFLA SLVFGSLGFG ATELFRRSFT AFFIAGSGTD EIGLDVVALL AAASLATVVT AAAAAPFEVL RVRSMGLIES VGWTKVLEDF IAEKSRPRQK TSNSFGLNRK QHGGHQEFEL RNLKARDILP LWAGFAPTVS RELPFAVVKF LTFDFITGTV ITFLNTQSSD GALPIQVGTG PIGLIVSALA GAVAGIAGAV VSHPADLILT KTSASGNRNG TETSASVEEP DWRDVVRELI AQPGGIANLY VGFPARATFF FLVIGLQFFL YDYFKTLLNV GSDDLSLVLD VFYAVRAGLV G
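The sequences above are printed in space-separated blocks of ten characters. A minimal sketch for turning such a block-formatted string into a plain sequence and computing its GC fraction follows. The helper names are hypothetical, and the short input is only the first three blocks of the gene sequence, not the full record.

```python
def join_blocks(blocked: str) -> str:
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First three ten-base blocks of the gene sequence in this record.
prefix = join_blocks("ATGAAACGGA GAAGGATATC CCATGCCATC")
print(len(prefix), round(gc_fraction(prefix), 2))  # 30 0.47
```

Applying the same helpers to the complete gene sequence gives values that can be compared against the listed Gene Length and GC content rows.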