Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tter_2051 |
Symbol | |
ID | 8640080 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermobaculum terrenum ATCC BAA-798 |
Kingdom | Bacteria |
Replicon accession | NC_013526 |
Strand | + |
Start bp | 181473 |
End bp | 183083 |
Gene Length | 1611 bp |
Protein Length | 536 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_003323777 |
Protein GI | 269839085 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 31 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACTCGA TTCCAGGTAG ACTCAAATCC TATCTGTTAC CGGTGCTCAT CCTCGCGCTG CTGCTGGGTG CGTGTGGAGG GCAAGCAGGA CAGACCCCGA CAGTTGGGCA GACTACGTCT CCGTCCCCAT CACCATCGGT TGCTAGTCCG TCTCCTTCGG CCGCACCTTC GGCATCACCC AGTGCATCTC CTACTCCAGC GCAGGCTGGG CAGTCTCCAC AGCCCACGCA GGGAGCAGCG AGCGATCAGT GGTGGCGATC CGCCGCTGAG AAGGCAGCGT GTGTGGGGCA GACCATCAGG GGAGTCACCG AGAGCACTCC GCCCTCCAAG TATGCCGCCG ATGTGTTGGC CAAGCAGTTC GAGCAGGCTA CTGGCATCAA GGTCGAGCTG GAGACCACCT CTTGGGACCA GATGTACGAC AAGGCCATCA AGGACATGGA GGCCAAGACG GGCATCTACG ACTTCGTCTA CACGGAGCAG GATATAGTCT ACGGCTACAT GGCCCGCAAC TTCTTGGTCG ATCTCACCAA GATGATGGAG GAGAACCCCG ACCTGAAGGC TCCTACCTTC GATCTGAACA AGTTCACCTC GTTCATCAAC TACTTCAAGG ACCCCAAGAC CGGGCATCTG TATGGTGTGC CGATGGAGTC GTTCATCAAG ACCTATGTCT ATAGGAAGGA CCTCTTTGAG GACCCAGAGA TAAGGGCTGC CTTCAAGAAG CAGTACGGGC ACGACCTAGC CCCTGCCAAG AACTTCGAAG AATACAGGCA GATCGCGGAG TTCTTCACCA AATGGGGTAA GGACCACAAC ATGCAGCTGT GGGGCACCAC GGTGCAGGCG GCCTCCGGAC ATCCAGCGTC CTTCTACGAG CTAGTGGAGA CGATATTCCC CAGCTGGGGC ATATACAACT GGGGTATCAA CACCAAGACC TACAAGGCCA CGGTAGAGCA CGGTGGCCAA ATGAACAGCG CCAAGGCCAA GCAAGCCCTC AAGTTCTGGC TTGACATGCT CAAGTATGCT CCGCCTGAGT CGACCAACAG CACCTGGGAT GAGGTGGCAG CTACTTTCGC TGCCGGCAGG GCTGCTCAGG GCTGGATCTA CGGAGAGAAC GTGGCTTGGA TCGCCACAGA TCCTTCCAGA TCGAAGGTAG TGGGCAAGGT TGGGGTGGCC CTGCCTCCGA CGGCTCCGGG CGTAATGCAG GATGCCAAGT CCGGCAAGGG CTACATCGGC TACTACGACG GTGGTGCCTT CGCCATACCC TACTCTTCCA AGAAGCAGAA GTGCGCGCTG CTGTGGTTGG AGTACATAGG TCAGCCATCG GTGCAGCCCG AGTGGGCAGC CAAGACGGCC CGCATCACCC TCACCGAGAC CTTCGACGAC CCATTGGTCA AGGAGGTCGA TCAGAAGACC GGCGGCTACT TCACCCTGAT GCGCAAGTAC GGTGACCTGT TCGCTGGTGC TCCTCCGTTC CCGTTCCACG CACAGGTCAG GGAGGTCGTG GCGCCCTTCA TTTATAAGGC GATATCTGGG CAGATGAGCC CTGATCAAGC CCTGGACGAG GCCGCCAAGG CCGCGGAGGA AGAGATGCAG AGACTAGGCT ACGGGAAGTA G
|
Protein sequence | MNSIPGRLKS YLLPVLILAL LLGACGGQAG QTPTVGQTTS PSPSPSVASP SPSAAPSASP SASPTPAQAG QSPQPTQGAA SDQWWRSAAE KAACVGQTIR GVTESTPPSK YAADVLAKQF EQATGIKVEL ETTSWDQMYD KAIKDMEAKT GIYDFVYTEQ DIVYGYMARN FLVDLTKMME ENPDLKAPTF DLNKFTSFIN YFKDPKTGHL YGVPMESFIK TYVYRKDLFE DPEIRAAFKK QYGHDLAPAK NFEEYRQIAE FFTKWGKDHN MQLWGTTVQA ASGHPASFYE LVETIFPSWG IYNWGINTKT YKATVEHGGQ MNSAKAKQAL KFWLDMLKYA PPESTNSTWD EVAATFAAGR AAQGWIYGEN VAWIATDPSR SKVVGKVGVA LPPTAPGVMQ DAKSGKGYIG YYDGGAFAIP YSSKKQKCAL LWLEYIGQPS VQPEWAAKTA RITLTETFDD PLVKEVDQKT GGYFTLMRKY GDLFAGAPPF PFHAQVREVV APFIYKAISG QMSPDQALDE AAKAAEEEMQ RLGYGK
|
| |