Gene Cthe_1570 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_1570 
Symbol 
ID4810077 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp1898462 
End bp1899523 
Gene Length1062 bp 
Protein Length353 aa 
Translation table11 
GC content44% 
IMG OID640106988 
Productextracellular solute-binding protein 
Protein accessionYP_001037989 
Protein GI125974079 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones40 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAGGTT TTATAAAATT GATCAGTGCA GTCGCTCTGT TTTCCATTTT TATAGGCTTG 
TTTTCCGGAT GTGGCAGGAC GAATACGGGA GGTACCCAGG ACAACGGGAA AACGGGGACA
TCTTCAGCAG AATACAAATA CGGAAAAATT GATATTCCCG GTAAAGACGG TTCACTTTGC
GGTGCACCTA TCTATATTGC ATATGAAAAG GGCTTTTTCA AGCAAGAGGG CTTTGATGTC
AACCTTATAT CGGCAGATAC CGAAACCCGT AAAATCGGTT TGAACAACGG TACCATCCCG
ATTGTCAACG GGGACTTTCA GTTCTTTCCC TCTATTGAAA ACAATGTGAA GGTAAAGGTG
GTGGACGGTC TCCACTATGG ATGCATAAAA CTGATTGTTC CGAAGGATTC ACCGATTCAA
GGAGTTCAAG ACCTTAGGGG AAAAAAGATC AGCGTTGATG AAATAGGCGG CACTCCCCAT
CAGGTAGCAT CGGTGTGGCT GGAGAAAAAC GGAATTTCCG CAAAGCAGGA GGACGGAGAA
GTTACGTTCC TTCCCTTCTC CGACGGAAAT CTGGCAGTGG AAGCCCTGAG AAAAGGAGAG
GTTGATGTTG CGGCACTGTG GGATCCCTTC GGCTCTGTTC AGGAAAAAAC GGGAAATTAC
CGTGTAATTC TTGATATTTC CAAGGATGAA CCTTTTGCCG GAAAATACTG CTGTTTCCTT
TATGCTTCGG AAAAGCTGCT TGACGAAAAA CCAGAACAGG TTGCCGCATT GCTGCGTGCA
TATAGGGCAG CCCAAAACTG GATTTCGGAA AACCCGGAAG AAGCCGTCGA CATTATAATA
AATGGTAAAT ACGCGCAGAT TGAAGACAGA GAATTAGCCA TTAAGCTTAT CAAGAGCTAT
CAATACCCTT CTTATGCCGA ACGGGAAAAA AATAAAACAC AGGTTCGAGA CAATGTTTAC
TATTTTGCCG AACAATTGAA CCAAATTGGA TATTTAAAGA CGGATCCTGA TACTTTCACA
AAGAATGCTT ATGTCGAGGT CGACATCAAC CTGGGTTTAT AA
 
Protein sequence
MKGFIKLISA VALFSIFIGL FSGCGRTNTG GTQDNGKTGT SSAEYKYGKI DIPGKDGSLC 
GAPIYIAYEK GFFKQEGFDV NLISADTETR KIGLNNGTIP IVNGDFQFFP SIENNVKVKV
VDGLHYGCIK LIVPKDSPIQ GVQDLRGKKI SVDEIGGTPH QVASVWLEKN GISAKQEDGE
VTFLPFSDGN LAVEALRKGE VDVAALWDPF GSVQEKTGNY RVILDISKDE PFAGKYCCFL
YASEKLLDEK PEQVAALLRA YRAAQNWISE NPEEAVDIII NGKYAQIEDR ELAIKLIKSY
QYPSYAEREK NKTQVRDNVY YFAEQLNQIG YLKTDPDTFT KNAYVEVDIN LGL