Gene Tter_2051 details


Gene Information

Locus tag: Tter_2051
Symbol:
ID: 8640080
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thermobaculum terrenum ATCC BAA-798
Kingdom: Bacteria
Replicon accession: NC_013526
Strand:
Start bp: 181473
End bp: 183083
Gene Length: 1611 bp
Protein Length: 536 aa
Translation table: 11
GC content: 59%
IMG OID:
Product: extracellular solute-binding protein family 1
Protein accession: YP_003323777
Protein GI: 269839085
COG category:
COG ID:
TIGRFAM ID:
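
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it agrees with the listed gene length) and that the final codon is a stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical and used only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


length = gene_length(181473, 183083)
print(length)                  # 1611
print(protein_length(length))  # 536
```

Both values match the Gene Length and Protein Length fields of this record.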


Plasmid Coverage information

Num covering plasmid clones: 31
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAACTCGA TTCCAGGTAG ACTCAAATCC TATCTGTTAC CGGTGCTCAT CCTCGCGCTG 
CTGCTGGGTG CGTGTGGAGG GCAAGCAGGA CAGACCCCGA CAGTTGGGCA GACTACGTCT
CCGTCCCCAT CACCATCGGT TGCTAGTCCG TCTCCTTCGG CCGCACCTTC GGCATCACCC
AGTGCATCTC CTACTCCAGC GCAGGCTGGG CAGTCTCCAC AGCCCACGCA GGGAGCAGCG
AGCGATCAGT GGTGGCGATC CGCCGCTGAG AAGGCAGCGT GTGTGGGGCA GACCATCAGG
GGAGTCACCG AGAGCACTCC GCCCTCCAAG TATGCCGCCG ATGTGTTGGC CAAGCAGTTC
GAGCAGGCTA CTGGCATCAA GGTCGAGCTG GAGACCACCT CTTGGGACCA GATGTACGAC
AAGGCCATCA AGGACATGGA GGCCAAGACG GGCATCTACG ACTTCGTCTA CACGGAGCAG
GATATAGTCT ACGGCTACAT GGCCCGCAAC TTCTTGGTCG ATCTCACCAA GATGATGGAG
GAGAACCCCG ACCTGAAGGC TCCTACCTTC GATCTGAACA AGTTCACCTC GTTCATCAAC
TACTTCAAGG ACCCCAAGAC CGGGCATCTG TATGGTGTGC CGATGGAGTC GTTCATCAAG
ACCTATGTCT ATAGGAAGGA CCTCTTTGAG GACCCAGAGA TAAGGGCTGC CTTCAAGAAG
CAGTACGGGC ACGACCTAGC CCCTGCCAAG AACTTCGAAG AATACAGGCA GATCGCGGAG
TTCTTCACCA AATGGGGTAA GGACCACAAC ATGCAGCTGT GGGGCACCAC GGTGCAGGCG
GCCTCCGGAC ATCCAGCGTC CTTCTACGAG CTAGTGGAGA CGATATTCCC CAGCTGGGGC
ATATACAACT GGGGTATCAA CACCAAGACC TACAAGGCCA CGGTAGAGCA CGGTGGCCAA
ATGAACAGCG CCAAGGCCAA GCAAGCCCTC AAGTTCTGGC TTGACATGCT CAAGTATGCT
CCGCCTGAGT CGACCAACAG CACCTGGGAT GAGGTGGCAG CTACTTTCGC TGCCGGCAGG
GCTGCTCAGG GCTGGATCTA CGGAGAGAAC GTGGCTTGGA TCGCCACAGA TCCTTCCAGA
TCGAAGGTAG TGGGCAAGGT TGGGGTGGCC CTGCCTCCGA CGGCTCCGGG CGTAATGCAG
GATGCCAAGT CCGGCAAGGG CTACATCGGC TACTACGACG GTGGTGCCTT CGCCATACCC
TACTCTTCCA AGAAGCAGAA GTGCGCGCTG CTGTGGTTGG AGTACATAGG TCAGCCATCG
GTGCAGCCCG AGTGGGCAGC CAAGACGGCC CGCATCACCC TCACCGAGAC CTTCGACGAC
CCATTGGTCA AGGAGGTCGA TCAGAAGACC GGCGGCTACT TCACCCTGAT GCGCAAGTAC
GGTGACCTGT TCGCTGGTGC TCCTCCGTTC CCGTTCCACG CACAGGTCAG GGAGGTCGTG
GCGCCCTTCA TTTATAAGGC GATATCTGGG CAGATGAGCC CTGATCAAGC CCTGGACGAG
GCCGCCAAGG CCGCGGAGGA AGAGATGCAG AGACTAGGCT ACGGGAAGTA G
 
Protein sequence
MNSIPGRLKS YLLPVLILAL LLGACGGQAG QTPTVGQTTS PSPSPSVASP SPSAAPSASP 
SASPTPAQAG QSPQPTQGAA SDQWWRSAAE KAACVGQTIR GVTESTPPSK YAADVLAKQF
EQATGIKVEL ETTSWDQMYD KAIKDMEAKT GIYDFVYTEQ DIVYGYMARN FLVDLTKMME
ENPDLKAPTF DLNKFTSFIN YFKDPKTGHL YGVPMESFIK TYVYRKDLFE DPEIRAAFKK
QYGHDLAPAK NFEEYRQIAE FFTKWGKDHN MQLWGTTVQA ASGHPASFYE LVETIFPSWG
IYNWGINTKT YKATVEHGGQ MNSAKAKQAL KFWLDMLKYA PPESTNSTWD EVAATFAAGR
AAQGWIYGEN VAWIATDPSR SKVVGKVGVA LPPTAPGVMQ DAKSGKGYIG YYDGGAFAIP
YSSKKQKCAL LWLEYIGQPS VQPEWAAKTA RITLTETFDD PLVKEVDQKT GGYFTLMRKY
GDLFAGAPPF PFHAQVREVV APFIYKAISG QMSPDQALDE AAKAAEEEMQ RLGYGK
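
The relationship between the gene sequence and the protein sequence can be illustrated by translating a few codons by hand. The sketch below is a minimal translator, not the database's own method. It builds the standard codon table, which to my knowledge has the same codon-to-amino-acid assignments as translation table 11 (the two differ in which start codons are permitted). The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Standard genetic code in TCAG order: one letter per codon, "*" marks stop.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in this record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence listed above.
print(translate("ATGAACTCGA TTCCAGGTAG ACTCAAATCC"))  # MNSIPGRLKS
```

The output matches the first ten residues of the protein sequence. Applying the same function to the last line of the gene sequence, which starts in frame, gives the final residues `AAKAAEEEMQRLGYGK` and stops at the terminal `TAG`.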