Gene Cthe_2646 details


Gene Information

Locus tag: Cthe_2646
Symbol:
ID: 4808957
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 3127922
End bp: 3129145
Gene Length: 1224 bp
Protein Length: 407 aa
Translation table: 11
GC content: 38%
IMG OID: 640108059
Product: hypothetical protein
Protein accession: YP_001039038
Protein GI: 125975128
COG category: [C] Energy production and conversion; [G] Carbohydrate transport and metabolism
COG ID: [COG1819] Glycosyl transferases, related to UDP-glucuronosyltransferase
TIGRFAM ID:
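
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes one terminal stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature whose coordinates are 1-based and inclusive (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS that ends with a single stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the record above.
length = gene_length(3127922, 3129145)
print(length)                  # 1224
print(protein_length(length))  # 407
```

Both results agree with the Gene Length and Protein Length fields of the record.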


Plasmid Coverage information

Num covering plasmid clones: 34
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAGATGA AAAAAGTTTT CTTTGTGACT TACGGCGGAG GCCATGTAAG AAGCGTAATT 
CCGGTTATTA AAGAATTAAA ATCAAGGGGC CATAAAGTCT CTGTTCTCGG ATTAACAAGC
AGCGTTAATG ATTTAAAAAA AGAAGAGATT GAATTTAAGG GCATCAGGGA TTATTTGAAT
TTGTTCAAAG ATGAAGAAGC ACAAAAGATT TTAAAATACG GAGATATGTT TATTGATGAA
CATTTTGATG CCGGTTCAGG CCTGGATAAA TTTGAAATCA AAGTGTATTT GGGAATGAAT
CTATGGGATT TGTCCCTTCA GCTTAAAAGT TTTGAAGAAG CATTAAAACT TTTCAGAGAG
CGCGGCAGAA GCTGTTTTTT CCCCATAAAT TTAATGGAAA GGATATTAAG CTTTGAAAAA
CCGGACGTAA TTGTGGTTAC CAGCGGGAAA AGAGCTGAAA AAGCTGCAGC CTTCAGCGCC
AATAAAATGG ATGTAAAAGT GGTACGTATA GTTGACCTTC TGGGAGAAAA TTTGAAAATT
CCATACAAAG CAACGGTTTG TGTGTTAAAC GATTATGCCA AAGCAAACAT ACTTTCCTGC
AATGAAAACC TGAATGAACG GGACGTAGTC GTCACAGGGC AGCCAAATAT TGAACCGACT
TACACCGAAA AGCATTTTGA GGATTTTATA AAGAGGTACA ATCTTGATAA ATTCGACAAG
GTTATTTCTT TTTTCTCCCA GCCCAATATA GCTTACAGAG AGGATATCCT GGTCGAATTT
ATTAAGCTTA TGCAAAAAAG ACCAAACTTC ATGGGTATAT GGAAAACCCA TCCCAACGAG
CAAATGGACC TATATACCGG GTATTTGAAT ACATTGCCGC AAAATTTATT GATTGTAAAA
GAAGAGGATA CCAATTTGAT TTTAAGTAAG TCCAATTTGG TAATTACTTT TTACTCTACA
GTCGGATTAC AGGCCATAGC CGCAGACAAA CCTCTGATAA CAGTCAATTT TTCAAAAAAT
GCACATCCGG TGGAATATGA CAAGCTGGGC TGCGCCCTTC CTGTCAAAAA TACCGAAGAA
TTTGAAAATG CCATAAATCT TTTGCTTGAA AGCAGCAATT CAGATGCCCG TAATTTACAT
GCCCGCCTCA GGGAGGCAAG GAAAAAACTC ATGCCCCCTG CCGGGGCGGC CCAAAATATA
GCCAATGTTA TCGAATACTC ATAA
 
Protein sequence
MKMKKVFFVT YGGGHVRSVI PVIKELKSRG HKVSVLGLTS SVNDLKKEEI EFKGIRDYLN 
LFKDEEAQKI LKYGDMFIDE HFDAGSGLDK FEIKVYLGMN LWDLSLQLKS FEEALKLFRE
RGRSCFFPIN LMERILSFEK PDVIVVTSGK RAEKAAAFSA NKMDVKVVRI VDLLGENLKI
PYKATVCVLN DYAKANILSC NENLNERDVV VTGQPNIEPT YTEKHFEDFI KRYNLDKFDK
VISFFSQPNI AYREDILVEF IKLMQKRPNF MGIWKTHPNE QMDLYTGYLN TLPQNLLIVK
EEDTNLILSK SNLVITFYST VGLQAIAADK PLITVNFSKN AHPVEYDKLG CALPVKNTEE
FENAINLLLE SSNSDARNLH ARLREARKKL MPPAGAAQNI ANVIEYS