Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tery_4959 |
Symbol | |
ID | 4246613 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | - |
Start bp | 7559014 |
End bp | 7563648 |
Gene Length | 4635 bp |
Protein Length | 1544 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 638109770 |
Product | TPR repeat-containing protein |
Protein accession | YP_724346 |
Protein GI | 113478285 |
COG category | [S] Function unknown |
COG ID | [COG4995] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.117684 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 33 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACAACA AACGCATAGA AGCATACGTC AACCTCATTA ACCAACTGCT AAATTCCCCC AGCGGTGAGG TAGAAAAAAT CCTCGAAGCC AACCAAGAGT TGGTAGATGA AGGGTTACTG GAAATAATGG AACTTTACGC ACAACAACTG GCAGAAAACG ATGACGAAAA TGCTCAAAAC GCTGCTAATT TTTTACGTCA TCTCAGGAGT CAGCTTGTAG AGTTACTAGA AATTTCAGAA TATTCTCCCT CAACTCATTA CTCATCGGCA GAATATTTTA ATTTCTTAGA AGAGGTATTG GAGGCAACTG CAGAAAGCAA AGGTGACTCG AAGGTTGTCT ACCCATTCCT GCAGCAAAAC TTAGATAAAC TTGATGATAA CTTTGCTGAT ATATTGCGAA ACTGGGCAAC TGCTAAATTT TCTGAAGCGG AGGCGGGTGT AGCAGAATAC ATTGCTATGT GTGTTGGTGA GTTAAGTAAC CTCATTCAGC AATTTCCTCT GGGCAGCCAA GCAAACAACA TGGAAATTAG CATTGCAGGT TATGAAGTAG TGCTAAAGGT TTTTACACAT AAGAGTCACC GGGAAAATTG GGCAACTATT CAAAATAATC TCGGCGCTGC CTACAGAGAC AGAATAAGGG GAGACAAAGC GGAAAATATT GAAGCTGCCA TTGCTGCATA CCAACAAGCT TTGCTGGTGC GCACCCAAAC AGACTTCCCC ATGGACTGGG CAGGAACTCA AAATAATCTC GGCATTGCCT ACAGAAACAG AATAAGGGGA GACAAAGCCG AAAATATTGA AGCCGCCATT GCTGCATTCC AACAAGCTTT GCTGGTGTAC ACCCAAACTG ACTTCCCCAT CAACTGGGCA ATGACTCAAA ATAATCTCGG CGGTGCCTAC TTTTACAGAA TAAGGGGAGA CAAAGCCGAA AATATTGAAG CCGCCATTGC TGCATACCAA CAAGCTTTGC TGGTGTACAC CCAAACTGAC TTCCCCATGG ACTGGGCAAT GACTCAAAAT AATCTCGGCG CTGCCTACAG AAACAGAATA AGGGGAGACA AAGCCGAAAA TATTGAAGCC GCCATTGCTG CATGCCAACA AGCTTTGCTG GTGTACACCC AAACTGACTT CCCCATCGAC TGGGCAATGA CTCAAAATAA TCTCGGCGGT GCCTACTTTT ACAGAATAAG GGGAGACAAA GCCGAAAATA TTGAAGCCGC CATTGCTGCA TACCAACAAG CTTTGCTGGT GTTCACCCAA ACTGACTTCC CCATCGACTG GGCAATGACT CAAAATAATC TCGGCGCTGC CTACAGTGAC AGAATAAGGG GAGAAAAAGC CGAAAATATT GAAGCTGCCA TTGCTGCATG CCAACAAGCT TTGCTGGTGT ACACCCAAAC TGACTTCCCC ATGGACTGGG CAATGACTCA AAATAATCTC GGCGCTGCCT ACAGAGACAG AATAAGGGGA GACAAAGCCG AAAATATTGA AGCCGCCATT GCTGCATTCC AACAAGCTTT GCTGGTGTAC ACCCAAACTG ACTTCCCCAT CAACTGGGCA ATGACTCAAA ATAATCTCGG CATTGCCTAC AGTGACAGAA TAAGGGGAGA CAAAGCCGAA AATATTGAAG CCGCCATTGC TGCATACCAA CAAGCTTTGC TGGTGTACAC CCAAACTGAC TTCCCCATGG ACTGGGCAAT GACTCAAAAT AATCTCGGCG CTGCCTACAG AAACAGAATA AGGGGAGACA AAGCCGAAAA TATTGAAGCC GCCATTGCTG CATGCCAACA AGCTTTGCTG GTGCGCACCC AAACTGACTT CCCCATGGAC TGGGCAAATA CTCAAAATAA TCTCGGCTCT GCCTACGGTA ACAGAATAAA GGGAGACCAA GCCGAAAATA TTGAAGCTGC CATTGCTGCA TTCCAACAAG CTTTGCTGGT GCGCACCCAA ACTGACTTCC CCATGGACTG GGCAAATACT CAAAATAATC TCGGCATTGC CTACAGAGAC AGAATAAGGG GAGACAAAGC CGAAAATATT GAAGCTGCCA TTGCTGCATA CCAACAAGCT TTGCTGGTGT ACACCCAAAC TGACTTCCCC ATCAACTGGG CAAGAACTCA AAATAATCTC GGCATTGCCT ACAGTGACAG AATAAGGGGA GACAAAGCCG AAAATATTGA AGCTGCCATT GCTGCATACC AACAAGCTTT GCTGGTGCGC ACCCAAACTG ACTTCCCCAT GGAATGGGCA ATGACTCAAA ATAATCTCGG CAATGCCTAC AGTGACAGAA TAAGGGGAGA CAAAGCCCAA AATATTAAAG CTGCCATTGC TGCATTCCAA CAAGCTTTGC TGGTGTACAC CCAAACTGAC TTCCCCATCA ACTGGGCAGC AGCTCAAAAT AATCTCGGCC TTGCCTACAG TGACAGAATA AGGGGAGACA AAGCCGAAAA TATTGAAGCC GCCATTGCTG CAAATATTGA AGCCGCCATT GCTGCATACG AACAAGCTTT GCAGGTGCGC ACCCAAACTG ACTTCCCCAT CGACTGGGCA CAAACTCAAA ATAATCTCGG CATTGCCTAC AGTGACAGAA TAAGGGGAGA CAAAGCCGAA AATATTGAAG CTGCCATTGC TGCATACCAA CAAGCTTTGC TGGTGCGCAC CCAAACTGAC TTCCCCATGG AATGGGCACA AACTCAAAAT AATCTCGGCC TTGCCTACAT TTACAGAATA AGGGGAGACA AAGCCGAAAA TATTGAAGCC GCCATTGCTG CATACGAACA AGCTTTGCTG GTGCGCACCC TAGAAGCAGA CCCCATCAAC CACCTCCAAA CCACCAACAA CCTAGGCAAC CTATATTTCG ACAACCAAAA CTGGCAACTC GCCGCCGACA ACTATAAAAA AGCCATAACC GCAGTGGAAC TCAGCCGCAG TTGGTCAAAA GATGACGACC GTCGCCAAGA AATCATCGAA GAGTCTATAG GTGTCTACCG CAAGATAGTA CAAGCTTACG TGAATAGCGA TCAAATAGAA AAAGCCTTAG AATATGTAGA GCGTTCTCGC TCCAAACGGT TAGTAGACAT AATGGCAAGT AACGATCGCT ACTCCCAAGG TGAAATGCCA GGAGAAGTTG AAGAACGGTT AAAAGAACAC GAAGCTATTC AACAGCAAAT TAATCAACTT TGGGAACAAC AACAAAAGGG CAGTCAAATA GAGTCGAAAG ATTTAGCTGT AGCAACTAGG GGGCGAGCTG CGACAGAAGC CAGAAACAAA CGCATTGGGG AATTAGAAGC CAAAAAGCAG GAAGTTTATA AAAAAATCCG CAGCTTCGAC CGGGTATTAG CAGAGGGAAT TCAAGTTGCC CCACTGGAAT TTCCAAAAAT TCAAGCATTG ATCCAGGAGC CCACCACAGC GATATTGAGT TTTTATACTA TCACCGACGA TACCTATTTA TTTGTATTGC GACAAGATGG GGTAAAAGTT CATACTTACG AAGGGTTGGG GCTTAAAGAA CTGCAAAACT GGATTTGGGA AAAATGGTTT GGGTCTTACC TCTCATCCCG GGAAGAATGG CAACAACAGA TGCCTGAATT TTTACAGGAT GTAAGTAAGA AGTTAAAATT AGAGGAATTA TGCAAATCTT ATCTTCAAGA CATTGAAGAA TTGATATTAA TTCCCCATTT ATCCTTGCAC TTTCTCCCTT GGAACGCAAT GCCAGTAGCA GAGTCAGGAG AAGACAAATA TCTAGGTGAC CGTTTCCGTA TCCGCACCCT GGCCAGTTGC CAAATTCTAG ACTTTTGTAC CGAACGGGAA GAAATTCAAG GGGAAGTCAA ACAAGGCATT GTGGAAGACA CTCACAACGA CCTACCTTGT TCCAGTTATG AAGCGCAATA TATAGCCCAA ATGTATGGAG TCCCAGAACA CCAACGCCTG CGAGGTGAAG CCGCAACTAT AGACAGTTAC AAACTATTAT TATCTCAGGT ACAACGGTTA TTGTCAACCC ATCACGCTCA ATCTCGCATA GACAACTGTA TGGAGTCAGC GTTAGTGTTA GCGGACGGCA GACTTACTTT AGGGCAACTA TTATCTCCTG CCTTCCGTTT CCCAGATCTA GATGAAGTAT TCATCGACTG TTGTGAGACC AATTTCGGTC GAGTGCAAAT CTCCGATGAC GTGTTAACAT TAAACACAGG GTTTTTATGT GCAGGTGCCA GGGGTGTGAT CAGCAGTTTG TGGTCTGTAG ATGACTTAGG AACATGTTTA TTTTCGATTT TTTATCACCA ACTGCGCCAA GAAGGAAAAA ATCGTTCTCT GGCATTGCAA CTAGGACAAC GACAGTTACG AGAATTAACG GGCAAAGAAC TCAAGAAGAA ATATAAAAAG GAGTTAGAAA GTTCGTTAGA TGAAAAGTTG GAACCGGCAT ATAAGCAACT TCAGGAAATA GAACGCAGAC GTGATGGTTA TACCAAAAGT TCCGTGGAAT ATCAGGAGTT AGAGGAGGAG CGGGAAAAAC TTGTAGCTAT TTATGAGCGT ATTTTTTATA CTAAGAATAA GTACCTGAAA GCAGCTTGTA AAAAAGAACA TCCCTTTGAG CATCCGGCGT ATTGGAGTGC GTTTATTTGT GCAGGGTTGA GTTAG
|
Protein sequence | MDNKRIEAYV NLINQLLNSP SGEVEKILEA NQELVDEGLL EIMELYAQQL AENDDENAQN AANFLRHLRS QLVELLEISE YSPSTHYSSA EYFNFLEEVL EATAESKGDS KVVYPFLQQN LDKLDDNFAD ILRNWATAKF SEAEAGVAEY IAMCVGELSN LIQQFPLGSQ ANNMEISIAG YEVVLKVFTH KSHRENWATI QNNLGAAYRD RIRGDKAENI EAAIAAYQQA LLVRTQTDFP MDWAGTQNNL GIAYRNRIRG DKAENIEAAI AAFQQALLVY TQTDFPINWA MTQNNLGGAY FYRIRGDKAE NIEAAIAAYQ QALLVYTQTD FPMDWAMTQN NLGAAYRNRI RGDKAENIEA AIAACQQALL VYTQTDFPID WAMTQNNLGG AYFYRIRGDK AENIEAAIAA YQQALLVFTQ TDFPIDWAMT QNNLGAAYSD RIRGEKAENI EAAIAACQQA LLVYTQTDFP MDWAMTQNNL GAAYRDRIRG DKAENIEAAI AAFQQALLVY TQTDFPINWA MTQNNLGIAY SDRIRGDKAE NIEAAIAAYQ QALLVYTQTD FPMDWAMTQN NLGAAYRNRI RGDKAENIEA AIAACQQALL VRTQTDFPMD WANTQNNLGS AYGNRIKGDQ AENIEAAIAA FQQALLVRTQ TDFPMDWANT QNNLGIAYRD RIRGDKAENI EAAIAAYQQA LLVYTQTDFP INWARTQNNL GIAYSDRIRG DKAENIEAAI AAYQQALLVR TQTDFPMEWA MTQNNLGNAY SDRIRGDKAQ NIKAAIAAFQ QALLVYTQTD FPINWAAAQN NLGLAYSDRI RGDKAENIEA AIAANIEAAI AAYEQALQVR TQTDFPIDWA QTQNNLGIAY SDRIRGDKAE NIEAAIAAYQ QALLVRTQTD FPMEWAQTQN NLGLAYIYRI RGDKAENIEA AIAAYEQALL VRTLEADPIN HLQTTNNLGN LYFDNQNWQL AADNYKKAIT AVELSRSWSK DDDRRQEIIE ESIGVYRKIV QAYVNSDQIE KALEYVERSR SKRLVDIMAS NDRYSQGEMP GEVEERLKEH EAIQQQINQL WEQQQKGSQI ESKDLAVATR GRAATEARNK RIGELEAKKQ EVYKKIRSFD RVLAEGIQVA PLEFPKIQAL IQEPTTAILS FYTITDDTYL FVLRQDGVKV HTYEGLGLKE LQNWIWEKWF GSYLSSREEW QQQMPEFLQD VSKKLKLEEL CKSYLQDIEE LILIPHLSLH FLPWNAMPVA ESGEDKYLGD RFRIRTLASC QILDFCTERE EIQGEVKQGI VEDTHNDLPC SSYEAQYIAQ MYGVPEHQRL RGEAATIDSY KLLLSQVQRL LSTHHAQSRI DNCMESALVL ADGRLTLGQL LSPAFRFPDL DEVFIDCCET NFGRVQISDD VLTLNTGFLC AGARGVISSL WSVDDLGTCL FSIFYHQLRQ EGKNRSLALQ LGQRQLRELT GKELKKKYKK ELESSLDEKL EPAYKQLQEI ERRRDGYTKS SVEYQELEEE REKLVAIYER IFYTKNKYLK AACKKEHPFE HPAYWSAFIC AGLS
|
| |