Gene Tery_0529 details

Gene Information

Locus tag: Tery_0529
Symbol:
ID: 4242377
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Trichodesmium erythraeum IMS101
Kingdom: Bacteria
Replicon accession: NC_008312
Strand:
Start bp: 834224
End bp: 838024
Gene Length: 3801 bp
Protein Length: 1266 aa
Translation table: 11
GC content: 44%
IMG OID: 638105840
Product: tetratricopeptide TPR_2
Protein accession: YP_720454
Protein GI: 113474393
COG category: [S] Function unknown
COG ID: [COG4995] Uncharacterized protein conserved in bacteria
TIGRFAM ID:
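
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length; neither convention is stated in the record itself, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 834224
end_bp = 838024
gene_length_bp = 3801
protein_length_aa = 1266

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and does not contribute a residue.
codons = span // 3
coding_codons = codons - 1

print(span, span % 3, coding_codons)
```

Under these assumptions the span equals the listed gene length, is a multiple of three, and leaves 1266 coding codons, which matches the listed protein length.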


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.696941
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAATAATA ATCAACAAAG TTATCATCGT TTAATTGCAC AGTTACGTCA CTGCGCGGGT 
GAGGCAGAAG TTAAGCAGGT TTTAAATAGC TACCTTGAAC TAAATACTTC GGAGTTATTA
GAGGTAATGG CTCAGGAGGT TGAAGGTTTG AGAGATAATA TGGACTGGGA TGGGGCTAAT
TCTCTGTTAA GTCTGGCCCT AGAGGTGGGA GATGTGTTGG GAAGTTCTGA GAGGGAGGTT
TCTCTAAGGA AGGAAATTGG GGATTTTTTG AGCAAAATGG GGCGGCGTGA ATATGAAGTT
AGTCAGTTTT CTTGGGCAGT GCAGTTTTAT GAACTGACTT TAAGGATATA TCGAGAGATT
AAGGACCGTC CAGGGGAAGT TGATGCTATT AATGGTTTGG GTAATAGTTA TAGGTTTTTG
AGAGAAAGGG AAAGGGCAAC TGCTCTTTTT CAAGAGTCTT TGGCTATTGC TAGGGAAATT
GAATATGGCC AGGGGGAGGT TGATGGCCTC TATGGTTTAG GTGCCATGGA GCATTTTTTG
GGAGAATATG AGTCGGCAAA AGCTTATGTT CAAGAGGCTT TAACTATTGC AGAGGCGATC
GGGTATACAA AAGGGAAAGC TAATGCTTTT GATGGTTTGG GTCATGTTTG TGGAGGTTTG
GGAGAATACG AAGAGGCGAT TAATTTCTAT CAACAGTCTT TACCTATCTT CCGAGAAATC
AAAGATCGTA AAGAGGAAGC CAGTGTCCTG AATAATTTGG GTGCTGCTTA CTATTCTTTG
GGGAAATACG AAACGGCGAT TAATTTCTAT CAACAGTCTT TACGTATCTG CCAAGAACTC
AAATATCGTA AAGGGGAAGC TTATGTTCTG AATAATTTGG GTCTTGATTA CAATTCTTTG
GGGCAATATG AAACGGCGAT TAATTTCTTT CAACAGTCTT TACCTATCTA CCGAGAAATC
AAAGATCGTA AAGAGGAAGC CAATGTCCTG AAGAATTTGG ATGATGCTTA CTATTCTTTG
GGGCAACACA AAAGGAGGAT TGATGTCCTT AAACAGTCTT TACCTATCAA GCAAGAACTC
AAAGATCGTA AAGGGGAAGC CAGTGTCCTG AATAATTTGG GTGTTGCTTA CCATCATGTA
GGGCAATACG AAAAGGCGAT TAATTTCTAT CAACAGTCTT TATCTATCAA TCAAGAACTC
AAAGATCGTG AAGGGGAAGC CAATGTCCTG AATAATTTGG GTTATGCTTA CGATAATTGG
GGGCAATACG AAACGGCGAT TAATTTCTAT CAACAGTCTT TACCTATCTC CCGGAAAATC
AAAAATCGTG AATATGAAGT TTGTGTCCTG AATAATTTGG GTGTTGCTTA CTATAAGTTG
GCGCAATACG AAAGGGCGAT TAATTCCTAT CAACGGTCTT TACCTATCTC CCGGGAAATC
AAAAATCGTG AATATGAAGT TTGTGTCCTG AATAATTTAG GGGATGCCAA CTACTATCTA
GGACAACACC AAGAAGCCAT CGACCTTTAC CACAAATCTC TCGCCATCTC CCAAAAATAC
CAATACCGTT GGGCAGAAGC AGACTCCCGC AACGGTCTAG GCCGCATCTA CTACTCCCAG
GAAAAATACC AACCTGCCCT AGAATTTTAT CAAGAATGCT TTGCCATCAA AGCAGAAATA
GGAGACATTC CCGGAGAAGC CAAAGGCCGA ACCAACGTCG CCCTCGCCTA CAAATCCCTC
CAAGAACCCC AACGAGCCAT CGAATTCCTC CAACAGTCCT TAGAAATTTT CCAACAAATC
TCCGACCCAG AAGGACAAGC AGACTGCCAA AATCACTTAG GAGTAGCCTA CCAAGACCTA
AAACAACACC AACAAGCCAG AAATCACTAC CAAGCATCCC TGGAAATAGC CACCCCCGAA
GGAATGCCCA CAATTTGCCT CAATGCCGGC CAAAACTATG GCCACCTAGA ATTTCAAGAA
AACAACTGGT CAGCAGCGAT CGCTCCCTAC CAAAAAGCCA TCGAAGCAGC AGAAATACTG
CGAGTCCGTT CCCTCACCGA TAACCGCCGG CAAAAAGTAA TGATTGATGC CATAGAAATA
TACGCCAACA CCATCCAATG CCACATCAAC CTCCAACAAT ACGACCTCGC CCTCGAATAC
ACCGAACGTT ACCGCTGTAA ACAACTAGTA GACCTAATAG CCAGTAAAGA CCTCTACCAA
GACGAGAAAA TGCCCGAAAA GGTGAGAAAC CTAATGCAAG AATTCGAAAA ACTCCAAGGG
CAAATAGAAA ACATTCGCGA TCGCGACACC AGGGAAGACA GTAACAATAA CCGCAGTAGC
ATCCTCCCCG CAACCAGATA TTGGCGTGCC CAAAACGAAA TTCGCAACAA AAGCATCATC
GAAAAGGAAA TCGCAAAAAG AAAAATCTTG GAACAAATCA GAAAACAAGA CCCAGTCATC
GCCCAAGGCA TCCAAGTAGC CCACCTAAAA TTTAGCCAAC TGCAACAACT CATCAACAAC
GAACATAGTG CCATCCTCAG TTTTTACAGC ACCAAAACCG ACACCCACAT ATTCATCCTG
CGGCAAAACA GCATCCAGCT CCACTCCTGT CCCCAGCAAG GCAGAGACAA CCTGCAACAA
TGGCTCAAAG AAAACTGGTT CAACCTCTAC GTACCCGACA AAAACAAAAA CGACAACGAG
AACAAAGAAA ACTACCAAAA ATGGCAGCAA CAAATGCCCG ACCTTCTCGC AGAACTAGCC
AACCGCTTAA ACATCCAAAA ACTGATCCAG AATCACCTAG AAGATATCGA GGAACTGATC
CTCATCCCCC ACCAGCAACT ACACCTAATC CCCCTAGGCG CCCTGCCCAT CAGTGATAGC
GAATATCTCC ATGACAAATA CCTCATTCGC ACCTTAGCCA GTTGCCAGAT CCTGAGTTTT
TGCCAAGACC GCGTTCAACT CCAAACAGCT CCCACCTACG GCATCGTGGA AAACACACAA
CTCGACCTAC CCTTCAGCCA ATTAGAAGCC GAAACCGTCG CCCAACTATG TCAAGTCAGC
CCCGACAAAC ATCTCCAAGG AGCAGAAGCC ACAGTCCCCA AATACAGAGA ACTCTTAAAA
CAAATCAACC GACTCCTATC GAGTCATCAC GCCGTATCTC GAATAGACAA CAACCTAGAG
TCAGCCCTCA TGCTAGCCGA AAACCAAAAA ATAACCTTAG GAGAACTCCT CACCCCCGCC
TATCGTTTCC CGGAATTAGA AGAAGTATTC CTCTCGTGCT GCGAAACCAA CCTAGGCACC
CCCCAACCCA CCGACGACAT GCTCACCCTG AATACAGGTT TCCTATCTGC AGGTGCCCGT
GGAGTGATCA GCACCCTCTG GGAAGTAGAT GACTTAGCAT CCTGCATTTT TTCCATTATC
TATCACCGAC TGCGGGCAGA AGGTATAGAC AGAGTCAGGG CAGTGCAAAA AACCCAACAA
ACCATGGTCA AGATGACAGA AAAGCAACTC CAACAAGAAA TCAAAACATT CAAAAAGCAA
CAGAAAAAAA GATACAAAAG GGAATTAGAA GCACTCACCG AGGAACAAAG GGACCTCGAA
ACTCAAGAAC CTCAAAGCAA AGATTCTCCA GAATACCAAG CCTGGAAGGA AAAGCTAAAT
AATGCCATCC AAAAATACGT TCGCAAAGAC CAAGAACAAC GGGAGTTTGA ACTTCGCTTA
AAAGAAGCCC GCAAAAGAGA ATATCCATTT TCCCATCCAG TGTATTGGAG TAGTTTTATT
TGTGCAGGTT TGAGAGATTA A
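
The gene sequence is printed in space-separated blocks of ten bases, so it has to be joined before any analysis. The following is a minimal sketch of that clean-up plus a GC-fraction helper; the function names `clean_sequence` and `gc_fraction` are hypothetical, and the two excerpts are only the first and last printed lines of the gene sequence, not the full gene. The stop-codon set is the standard one used by translation table 11.

```python
def clean_sequence(text: str) -> str:
    """Join a block-formatted sequence by removing all whitespace and upper-casing."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# Excerpts only: the first and last printed lines of the gene sequence.
first_line = "ATGAATAATA ATCAACAAAG TTATCATCGT TTAATTGCAC AGTTACGTCA CTGCGCGGGT"
last_line = "TGTGCAGGTT TGAGAGATTA A"

# Stop codons in translation table 11 (bacterial, archaeal and plant plastid code).
STOP_CODONS = {"TAA", "TAG", "TGA"}

starts_with_atg = clean_sequence(first_line).startswith("ATG")
ends_with_stop = clean_sequence(last_line)[-3:] in STOP_CODONS

# Toy input to show the helper; this is not part of the record.
toy_gc = gc_fraction(clean_sequence("ATGC GGCC\nATAT"))
```

Applying `gc_fraction` to the complete cleaned gene sequence would give a value to compare against the GC content field of the record.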
 
Protein sequence
MNNNQQSYHR LIAQLRHCAG EAEVKQVLNS YLELNTSELL EVMAQEVEGL RDNMDWDGAN 
SLLSLALEVG DVLGSSEREV SLRKEIGDFL SKMGRREYEV SQFSWAVQFY ELTLRIYREI
KDRPGEVDAI NGLGNSYRFL RERERATALF QESLAIAREI EYGQGEVDGL YGLGAMEHFL
GEYESAKAYV QEALTIAEAI GYTKGKANAF DGLGHVCGGL GEYEEAINFY QQSLPIFREI
KDRKEEASVL NNLGAAYYSL GKYETAINFY QQSLRICQEL KYRKGEAYVL NNLGLDYNSL
GQYETAINFF QQSLPIYREI KDRKEEANVL KNLDDAYYSL GQHKRRIDVL KQSLPIKQEL
KDRKGEASVL NNLGVAYHHV GQYEKAINFY QQSLSINQEL KDREGEANVL NNLGYAYDNW
GQYETAINFY QQSLPISRKI KNREYEVCVL NNLGVAYYKL AQYERAINSY QRSLPISREI
KNREYEVCVL NNLGDANYYL GQHQEAIDLY HKSLAISQKY QYRWAEADSR NGLGRIYYSQ
EKYQPALEFY QECFAIKAEI GDIPGEAKGR TNVALAYKSL QEPQRAIEFL QQSLEIFQQI
SDPEGQADCQ NHLGVAYQDL KQHQQARNHY QASLEIATPE GMPTICLNAG QNYGHLEFQE
NNWSAAIAPY QKAIEAAEIL RVRSLTDNRR QKVMIDAIEI YANTIQCHIN LQQYDLALEY
TERYRCKQLV DLIASKDLYQ DEKMPEKVRN LMQEFEKLQG QIENIRDRDT REDSNNNRSS
ILPATRYWRA QNEIRNKSII EKEIAKRKIL EQIRKQDPVI AQGIQVAHLK FSQLQQLINN
EHSAILSFYS TKTDTHIFIL RQNSIQLHSC PQQGRDNLQQ WLKENWFNLY VPDKNKNDNE
NKENYQKWQQ QMPDLLAELA NRLNIQKLIQ NHLEDIEELI LIPHQQLHLI PLGALPISDS
EYLHDKYLIR TLASCQILSF CQDRVQLQTA PTYGIVENTQ LDLPFSQLEA ETVAQLCQVS
PDKHLQGAEA TVPKYRELLK QINRLLSSHH AVSRIDNNLE SALMLAENQK ITLGELLTPA
YRFPELEEVF LSCCETNLGT PQPTDDMLTL NTGFLSAGAR GVISTLWEVD DLASCIFSII
YHRLRAEGID RVRAVQKTQQ TMVKMTEKQL QQEIKTFKKQ QKKRYKRELE ALTEEQRDLE
TQEPQSKDSP EYQAWKEKLN NAIQKYVRKD QEQREFELRL KEARKREYPF SHPVYWSSFI
CAGLRD