Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tery_0529 |
Symbol | |
ID | 4242377 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | + |
Start bp | 834224 |
End bp | 838024 |
Gene Length | 3801 bp |
Protein Length | 1266 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 638105840 |
Product | tetratricopeptide TPR_2 |
Protein accession | YP_720454 |
Protein GI | 113474393 |
COG category | [S] Function unknown |
COG ID | [COG4995] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.696941 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATAATA ATCAACAAAG TTATCATCGT TTAATTGCAC AGTTACGTCA CTGCGCGGGT GAGGCAGAAG TTAAGCAGGT TTTAAATAGC TACCTTGAAC TAAATACTTC GGAGTTATTA GAGGTAATGG CTCAGGAGGT TGAAGGTTTG AGAGATAATA TGGACTGGGA TGGGGCTAAT TCTCTGTTAA GTCTGGCCCT AGAGGTGGGA GATGTGTTGG GAAGTTCTGA GAGGGAGGTT TCTCTAAGGA AGGAAATTGG GGATTTTTTG AGCAAAATGG GGCGGCGTGA ATATGAAGTT AGTCAGTTTT CTTGGGCAGT GCAGTTTTAT GAACTGACTT TAAGGATATA TCGAGAGATT AAGGACCGTC CAGGGGAAGT TGATGCTATT AATGGTTTGG GTAATAGTTA TAGGTTTTTG AGAGAAAGGG AAAGGGCAAC TGCTCTTTTT CAAGAGTCTT TGGCTATTGC TAGGGAAATT GAATATGGCC AGGGGGAGGT TGATGGCCTC TATGGTTTAG GTGCCATGGA GCATTTTTTG GGAGAATATG AGTCGGCAAA AGCTTATGTT CAAGAGGCTT TAACTATTGC AGAGGCGATC GGGTATACAA AAGGGAAAGC TAATGCTTTT GATGGTTTGG GTCATGTTTG TGGAGGTTTG GGAGAATACG AAGAGGCGAT TAATTTCTAT CAACAGTCTT TACCTATCTT CCGAGAAATC AAAGATCGTA AAGAGGAAGC CAGTGTCCTG AATAATTTGG GTGCTGCTTA CTATTCTTTG GGGAAATACG AAACGGCGAT TAATTTCTAT CAACAGTCTT TACGTATCTG CCAAGAACTC AAATATCGTA AAGGGGAAGC TTATGTTCTG AATAATTTGG GTCTTGATTA CAATTCTTTG GGGCAATATG AAACGGCGAT TAATTTCTTT CAACAGTCTT TACCTATCTA CCGAGAAATC AAAGATCGTA AAGAGGAAGC CAATGTCCTG AAGAATTTGG ATGATGCTTA CTATTCTTTG GGGCAACACA AAAGGAGGAT TGATGTCCTT AAACAGTCTT TACCTATCAA GCAAGAACTC AAAGATCGTA AAGGGGAAGC CAGTGTCCTG AATAATTTGG GTGTTGCTTA CCATCATGTA GGGCAATACG AAAAGGCGAT TAATTTCTAT CAACAGTCTT TATCTATCAA TCAAGAACTC AAAGATCGTG AAGGGGAAGC CAATGTCCTG AATAATTTGG GTTATGCTTA CGATAATTGG GGGCAATACG AAACGGCGAT TAATTTCTAT CAACAGTCTT TACCTATCTC CCGGAAAATC AAAAATCGTG AATATGAAGT TTGTGTCCTG AATAATTTGG GTGTTGCTTA CTATAAGTTG GCGCAATACG AAAGGGCGAT TAATTCCTAT CAACGGTCTT TACCTATCTC CCGGGAAATC AAAAATCGTG AATATGAAGT TTGTGTCCTG AATAATTTAG GGGATGCCAA CTACTATCTA GGACAACACC AAGAAGCCAT CGACCTTTAC CACAAATCTC TCGCCATCTC CCAAAAATAC CAATACCGTT GGGCAGAAGC AGACTCCCGC AACGGTCTAG GCCGCATCTA CTACTCCCAG GAAAAATACC AACCTGCCCT AGAATTTTAT CAAGAATGCT TTGCCATCAA AGCAGAAATA GGAGACATTC CCGGAGAAGC CAAAGGCCGA ACCAACGTCG CCCTCGCCTA CAAATCCCTC CAAGAACCCC AACGAGCCAT CGAATTCCTC CAACAGTCCT TAGAAATTTT CCAACAAATC TCCGACCCAG AAGGACAAGC AGACTGCCAA AATCACTTAG GAGTAGCCTA CCAAGACCTA AAACAACACC AACAAGCCAG AAATCACTAC CAAGCATCCC TGGAAATAGC CACCCCCGAA GGAATGCCCA CAATTTGCCT CAATGCCGGC CAAAACTATG GCCACCTAGA ATTTCAAGAA AACAACTGGT CAGCAGCGAT CGCTCCCTAC CAAAAAGCCA TCGAAGCAGC AGAAATACTG CGAGTCCGTT CCCTCACCGA TAACCGCCGG CAAAAAGTAA TGATTGATGC CATAGAAATA TACGCCAACA CCATCCAATG CCACATCAAC CTCCAACAAT ACGACCTCGC CCTCGAATAC ACCGAACGTT ACCGCTGTAA ACAACTAGTA GACCTAATAG CCAGTAAAGA CCTCTACCAA GACGAGAAAA TGCCCGAAAA GGTGAGAAAC CTAATGCAAG AATTCGAAAA ACTCCAAGGG CAAATAGAAA ACATTCGCGA TCGCGACACC AGGGAAGACA GTAACAATAA CCGCAGTAGC ATCCTCCCCG CAACCAGATA TTGGCGTGCC CAAAACGAAA TTCGCAACAA AAGCATCATC GAAAAGGAAA TCGCAAAAAG AAAAATCTTG GAACAAATCA GAAAACAAGA CCCAGTCATC GCCCAAGGCA TCCAAGTAGC CCACCTAAAA TTTAGCCAAC TGCAACAACT CATCAACAAC GAACATAGTG CCATCCTCAG TTTTTACAGC ACCAAAACCG ACACCCACAT ATTCATCCTG CGGCAAAACA GCATCCAGCT CCACTCCTGT CCCCAGCAAG GCAGAGACAA CCTGCAACAA TGGCTCAAAG AAAACTGGTT CAACCTCTAC GTACCCGACA AAAACAAAAA CGACAACGAG AACAAAGAAA ACTACCAAAA ATGGCAGCAA CAAATGCCCG ACCTTCTCGC AGAACTAGCC AACCGCTTAA ACATCCAAAA ACTGATCCAG AATCACCTAG AAGATATCGA GGAACTGATC CTCATCCCCC ACCAGCAACT ACACCTAATC CCCCTAGGCG CCCTGCCCAT CAGTGATAGC GAATATCTCC ATGACAAATA CCTCATTCGC ACCTTAGCCA GTTGCCAGAT CCTGAGTTTT TGCCAAGACC GCGTTCAACT CCAAACAGCT CCCACCTACG GCATCGTGGA AAACACACAA CTCGACCTAC CCTTCAGCCA ATTAGAAGCC GAAACCGTCG CCCAACTATG TCAAGTCAGC CCCGACAAAC ATCTCCAAGG AGCAGAAGCC ACAGTCCCCA AATACAGAGA ACTCTTAAAA CAAATCAACC GACTCCTATC GAGTCATCAC GCCGTATCTC GAATAGACAA CAACCTAGAG TCAGCCCTCA TGCTAGCCGA AAACCAAAAA ATAACCTTAG GAGAACTCCT CACCCCCGCC TATCGTTTCC CGGAATTAGA AGAAGTATTC CTCTCGTGCT GCGAAACCAA CCTAGGCACC CCCCAACCCA CCGACGACAT GCTCACCCTG AATACAGGTT TCCTATCTGC AGGTGCCCGT GGAGTGATCA GCACCCTCTG GGAAGTAGAT GACTTAGCAT CCTGCATTTT TTCCATTATC TATCACCGAC TGCGGGCAGA AGGTATAGAC AGAGTCAGGG CAGTGCAAAA AACCCAACAA ACCATGGTCA AGATGACAGA AAAGCAACTC CAACAAGAAA TCAAAACATT CAAAAAGCAA CAGAAAAAAA GATACAAAAG GGAATTAGAA GCACTCACCG AGGAACAAAG GGACCTCGAA ACTCAAGAAC CTCAAAGCAA AGATTCTCCA GAATACCAAG CCTGGAAGGA AAAGCTAAAT AATGCCATCC AAAAATACGT TCGCAAAGAC CAAGAACAAC GGGAGTTTGA ACTTCGCTTA AAAGAAGCCC GCAAAAGAGA ATATCCATTT TCCCATCCAG TGTATTGGAG TAGTTTTATT TGTGCAGGTT TGAGAGATTA A
|
Protein sequence | MNNNQQSYHR LIAQLRHCAG EAEVKQVLNS YLELNTSELL EVMAQEVEGL RDNMDWDGAN SLLSLALEVG DVLGSSEREV SLRKEIGDFL SKMGRREYEV SQFSWAVQFY ELTLRIYREI KDRPGEVDAI NGLGNSYRFL RERERATALF QESLAIAREI EYGQGEVDGL YGLGAMEHFL GEYESAKAYV QEALTIAEAI GYTKGKANAF DGLGHVCGGL GEYEEAINFY QQSLPIFREI KDRKEEASVL NNLGAAYYSL GKYETAINFY QQSLRICQEL KYRKGEAYVL NNLGLDYNSL GQYETAINFF QQSLPIYREI KDRKEEANVL KNLDDAYYSL GQHKRRIDVL KQSLPIKQEL KDRKGEASVL NNLGVAYHHV GQYEKAINFY QQSLSINQEL KDREGEANVL NNLGYAYDNW GQYETAINFY QQSLPISRKI KNREYEVCVL NNLGVAYYKL AQYERAINSY QRSLPISREI KNREYEVCVL NNLGDANYYL GQHQEAIDLY HKSLAISQKY QYRWAEADSR NGLGRIYYSQ EKYQPALEFY QECFAIKAEI GDIPGEAKGR TNVALAYKSL QEPQRAIEFL QQSLEIFQQI SDPEGQADCQ NHLGVAYQDL KQHQQARNHY QASLEIATPE GMPTICLNAG QNYGHLEFQE NNWSAAIAPY QKAIEAAEIL RVRSLTDNRR QKVMIDAIEI YANTIQCHIN LQQYDLALEY TERYRCKQLV DLIASKDLYQ DEKMPEKVRN LMQEFEKLQG QIENIRDRDT REDSNNNRSS ILPATRYWRA QNEIRNKSII EKEIAKRKIL EQIRKQDPVI AQGIQVAHLK FSQLQQLINN EHSAILSFYS TKTDTHIFIL RQNSIQLHSC PQQGRDNLQQ WLKENWFNLY VPDKNKNDNE NKENYQKWQQ QMPDLLAELA NRLNIQKLIQ NHLEDIEELI LIPHQQLHLI PLGALPISDS EYLHDKYLIR TLASCQILSF CQDRVQLQTA PTYGIVENTQ LDLPFSQLEA ETVAQLCQVS PDKHLQGAEA TVPKYRELLK QINRLLSSHH AVSRIDNNLE SALMLAENQK ITLGELLTPA YRFPELEEVF LSCCETNLGT PQPTDDMLTL NTGFLSAGAR GVISTLWEVD DLASCIFSII YHRLRAEGID RVRAVQKTQQ TMVKMTEKQL QQEIKTFKKQ QKKRYKRELE ALTEEQRDLE TQEPQSKDSP EYQAWKEKLN NAIQKYVRKD QEQREFELRL KEARKREYPF SHPVYWSSFI CAGLRD
|
| |