Gene Cthe_1748 details

Gene Information

Locus tag: Cthe_1748
Symbol:
ID: 4810178
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 2067664
End bp: 2069217
Gene Length: 1554 bp
Protein Length: 517 aa
Translation table: 11
GC content: 33%
IMG OID: 640107161
Product: hypothetical protein
Protein accession: YP_001038162
Protein GI: 125974252
COG category:
COG ID:
TIGRFAM ID:
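
The gene and protein lengths listed above follow from the start and end coordinates. The sketch below reproduces that arithmetic, assuming the coordinates are 1-based and inclusive and that the last codon of the CDS is a stop codon. The function names are illustrative and not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count of a CDS, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(2067664, 2069217)
print(length)                     # 1554
print(protein_length_aa(length))  # 517
```

Both values agree with the Gene Length and Protein Length fields of this record.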


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000000315273
Plasmid hitchhiking: No
Plasmid clonability: unclonable
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGTTAAATT ACTGGTGGGT AACAAGACCA AAGCGAAAAT TAAATTCTGT TCCTGAAGTA 
TTATCAGCAT TTGCTGAATT ATCATTAGAT CAAGAATGGC AGGGACAGAG AGAGTCTCAT
TTATCTTTTG AAGATGCCTT AGAGCAGGCA GGTTTAAAGC GTAAAGGGGA ACGCAGAGAT
CAGACAGGCG GAGGAGCACG AACATATAAG GCATGGCTTA CAAGCTTAGG GTTGATATTC
ACACAGGAAT CAACAGGAAA GATAAAGTTA ACATTAGCAG GTGAAGCCAT AATGGCAGGT
GACTCTCCTG TTGAAGTTTT GAAAAACCAA ATTTTAAAAT ATCAGTTTCC ATCTTCTTTT
TCATTGAGCA GAGGAGTTCA AGTCGCCCCA AGATTTAAAA TCAGACCATT TAGATTTTTA
TTGAGACTAT TAAATGATCC AGAGATAGAA TATTTGACAG AAGAGGAAAT TGCAAAAATT
ATAGTTACGA AGGCAGAAAA TGAAACAGAT AAATGTTATA GATATATTGT AGGTAAAATT
TTAGAATTCA GACAAAGCGG CGATATGATT CATGAAGAAG ATTTTTTTGA TAAATATAAA
TCTTCAAAAG GTGATATCAA TCTTGAACAT CCATATAGGC ATTTAATGGA TTTAGCAAAT
ACTATTGTAA ACTGGTTAGA ATATACACAG CTTGTAAAAA GAGATAATGG TGAAGTACGT
ATTCTTGAAG ATAAACGATT AGAAGTTCAG CAGATTTTAT CTGTTTCACC GCCTTTTATT
GATCGACCTG AACAACATGA ATATTTTCAA AGAAAATACG GCCTTGACCC TAAGCACAAG
AAAGATACTA GAAATCTTAC AGAAACTAAG ACTATTACAG CTAAAATTAT TGCTGAACAG
AAGATTAAAA AAGCATATAT TGTTGAATCT TTAAAACAGC CGATAACCAA AATAACGACA
GATCTTATAG ATAAAATTTC TGAGCAGACA GGTTTTGAGG ACAAGCTGGT AGAAGAAACC
CTTTTGAAAC TGTATCCAAG AGGGTCTGTT GGTGCATTTA TGACAGAGTA CTTTGAAATG
GCTTTTAAAG GGAGAGATGA GGCTTCAGAC TTTGAAAAAG CAACTGTGCA ATTATTTCAA
AATGTTTTTG GTTTTGAAGC AAAACATGTA GGACCTATAG GCCTTACGCC TGATGTTTTA
ATTTTATCTG ATAAAGATGG ATATCAGGCT ATTATAGATA ATAAGGCATA CAGTAAATAT
ACAATTAGCA ATGACCATCA TAATAGAATG GTTCACAATT ATATAGGAAA TTTAAATCGT
TATAGTAATT CTAGTGATCC GCTTGCCTTT TTTTCATATA TTGCAGGTGG CTTTGGAAAG
AACATTAATT CTCAAATTAT AGATATTGTT AATGCTACTG GTGTTTCTGG TTCAGCAATG
AGTGTATCTA ATATGATTAA ACTTGTTGAA TCATACGAGT CCAAGCATTA TACACATAAA
AACATTAGAG ATATATTTTC TGTTAATAGG CAGATATTGT TATCTGATTT ATAA
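
The gene sequence above is printed in groups of ten bases, so it has to be joined before it can be measured. The following is a minimal sketch of how the GC content field could be recomputed from such a block. The helper names are hypothetical, and the fragment used in the demo is only the first line of the sequence, not the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a grouped sequence block."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / len(seq)


# First line of the gene sequence, copied from the record above.
fragment = clean_sequence(
    "TTGTTAAATT ACTGGTGGGT AACAAGACCA AAGCGAAAAT TAAATTCTGT TCCTGAAGTA"
)
print(len(fragment))
print(round(gc_percent(fragment), 1))
```

Passing the complete sequence block through `clean_sequence` should give a string whose length matches the Gene Length field, which is a quick check that nothing was lost when copying.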
 
Protein sequence
MLNYWWVTRP KRKLNSVPEV LSAFAELSLD QEWQGQRESH LSFEDALEQA GLKRKGERRD 
QTGGGARTYK AWLTSLGLIF TQESTGKIKL TLAGEAIMAG DSPVEVLKNQ ILKYQFPSSF
SLSRGVQVAP RFKIRPFRFL LRLLNDPEIE YLTEEEIAKI IVTKAENETD KCYRYIVGKI
LEFRQSGDMI HEEDFFDKYK SSKGDINLEH PYRHLMDLAN TIVNWLEYTQ LVKRDNGEVR
ILEDKRLEVQ QILSVSPPFI DRPEQHEYFQ RKYGLDPKHK KDTRNLTETK TITAKIIAEQ
KIKKAYIVES LKQPITKITT DLIDKISEQT GFEDKLVEET LLKLYPRGSV GAFMTEYFEM
AFKGRDEASD FEKATVQLFQ NVFGFEAKHV GPIGLTPDVL ILSDKDGYQA IIDNKAYSKY
TISNDHHNRM VHNYIGNLNR YSNSSDPLAF FSYIAGGFGK NINSQIIDIV NATGVSGSAM
SVSNMIKLVE SYESKHYTHK NIRDIFSVNR QILLSDL
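
The protein sequence corresponds to the gene sequence read with translation table 11. The sketch below illustrates that relationship under two assumptions: table 11 uses the same amino acid assignments as the standard genetic code, and an alternative start codon such as TTG in the first position is read as methionine. The function and constant names are illustrative only.

```python
from itertools import product

# Standard amino acid assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Initiator codons of table 11; in first position they are read as M.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate a CDS, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    residues = []
    for i in range(0, len(seq), 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            residues.append("M")
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence shown in this record.
print(translate_cds("TTGTTAAATT ACTGGTGGGT AACAAGACCA"))  # MLNYWWVTRP
```

The output matches the first ten residues of the protein sequence, including the leading M that comes from the TTG start codon.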