Gene Cthe_2010 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_2010 
Symbol 
ID4810942 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp2388911 
End bp2390149 
Gene Length1239 bp 
Protein Length412 aa 
Translation table11 
GC content33% 
IMG OID640107422 
Producthypothetical protein 
Protein accessionYP_001038417 
Protein GI125974507 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.0742406 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGAGATT TATTATGTGA TGAAAGTTAT TTATTAGAAA CAATAGAATT TAACAAGGAT 
GAAATTTATG AAAAGAAAGA AAAGATTATT ATGTTGAAAA ATGATATGGA AAAAGGTATA
CAAAGATATC CTAGAGACAA TCAAAGCATA ATCTATGCTA CGTATAGAGG AATGTTTATG
TCTAATTTAG ATATACTTAT GGCTGAATAT TCTTTAGGAA ACCATCCAGA TACAATGTTG
GAGGATTATT TAGATGGTAT AATATATTTA GAAAATATTG GTAATGAAAG AGCGGGGTAT
ATTAGCCTTT TGTGGATGTT ATCGTTAGGT ATACTTTTAG AAGTAGATAA TGAAAATTTA
AAAAGGCTTG CTTGTGTGAT AGAGAAGCAA AAAATAGAAG ATGCACTGAT AGATTTTCTT
TTAAAAGCTT GTGATATAGG ATGGTATCAT AATACAAGTG AATATGAAAG AAAAAATCCA
TATGCAAAGA CGGCTGAAAT TATACAAATA GCATTACATG ATAAAGACAG AGAAAAAGCT
TCGAAAAGGC TACAACAATA TGTAGAGAAA GAATGGATTA AGGGACATAA TGATCTGGAC
TTCAAAAATG CGCATAAAGA ACCCGGCTAC GTTGGCTTGT GGAGTTTTGA GGCTGCAGCA
TTGGCAAAGA TACTGGGATT GGACGACAGC GCACTGAAAG ATAACAACCA TTACCCTTAT
GATTTGGCAC ATTATAAAAA TGGAATGAGT TTTGATTTAA GCTGGTATGG TGTGCCAGTT
GAAGAGGAAG CCAAGGAAGA AGAGTCAATA GTGTATGGAA TACCGAACAA ACCTGAGTTG
GAGCAAATAA TACCTGCAAA ATTCCACAGT TTTGTGAATG AAGTGATAGG AGACTACAAT
ACATTGACTG ATGAAGAGTT TTGGAAGAAG TATAATTTGA GAGAAATCTG GTTTGATGTT
AAGGAGTACA AAGAAGATAA TAAAGCCAAA AATATGTTGG GAACGATTAT AGTGTTTTTG
CTTGTAGAGA AGGAGTATAT TTTGCAGTTG GATTATAAGG AAGATTTGGT AGATTACATA
GAAGATATAG ATAATTATTG GGGCAAAGAG GAAGTAAAGT TGATAAGCTT TGAAGTGGAC
AATGACCAGC AGTATTATGC ATACGTACCG AAAACCGCAG CAATAGATTC ATTGTACGAG
GTAAAATTGA CAGAAGTGGA GAAGATAGAG GAAGTTTAG
 
Protein sequence
MRDLLCDESY LLETIEFNKD EIYEKKEKII MLKNDMEKGI QRYPRDNQSI IYATYRGMFM 
SNLDILMAEY SLGNHPDTML EDYLDGIIYL ENIGNERAGY ISLLWMLSLG ILLEVDNENL
KRLACVIEKQ KIEDALIDFL LKACDIGWYH NTSEYERKNP YAKTAEIIQI ALHDKDREKA
SKRLQQYVEK EWIKGHNDLD FKNAHKEPGY VGLWSFEAAA LAKILGLDDS ALKDNNHYPY
DLAHYKNGMS FDLSWYGVPV EEEAKEEESI VYGIPNKPEL EQIIPAKFHS FVNEVIGDYN
TLTDEEFWKK YNLREIWFDV KEYKEDNKAK NMLGTIIVFL LVEKEYILQL DYKEDLVDYI
EDIDNYWGKE EVKLISFEVD NDQQYYAYVP KTAAIDSLYE VKLTEVEKIE EV