Gene Cthe_2882 details


Gene Information

Locus tag: Cthe_2882
Symbol:
ID: 4809089
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 3408444
End bp: 3409751
Gene Length: 1308 bp
Protein Length: 435 aa
Translation table: 11
GC content: 47%
IMG OID: 640108301
Product: histidinol dehydrogenase
Protein accession: YP_001039273
Protein GI: 125975363
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0141] Histidinol dehydrogenase
TIGRFAM ID: [TIGR00069] histidinol dehydrogenase
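
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length; neither convention is stated in the record. The function names are illustrative only.

```python
# Values copied from the Gene Information table.
START_BP = 3408444
END_BP = 3409751
GENE_LENGTH_BP = 1308
PROTEIN_LENGTH_AA = 435


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon (assumed)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions the listed gene length matches the coordinate span, and the protein length matches the codon count less the stop codon.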


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATAAAAA TTATTGATTT AAGATGCGGC AAAGACAGCG ACATATTTGA AAACCTTGCA 
TCAAGAAGTC AGCTGGAGTA TAGGGATGTT TTGGACCGGG TGGAGGAGAT AGTGGCAAAT
GTTCGGCGAA ACGGAGACAA AGCCGTGCTG GAATATACAG CCATGTTTGA CAAGGTTCAG
CTCACTTTAG GAAAATTAAG GGTTACGGAG AAAGAAATAA AAGAGGCGTA CACAAAGGTT
GACCCCAAAC TTGTTGAAGT GATAAAAAGG TCAAGGGACA ATATTTGGAA TTTTCATGAA
AAGCAGAAGG AGAAATCCTG GTTTTCCACC GAAAAGGAAG GGGTAATTGT CGGACAGCTT
TACAGACCTT TGGAGGTTGT CGGCGTGTAT GTTCCCGGCG GGACAGCGGC CTACCCTTCA
TCGGTTCTCA TGTGCGCAGT GCCTGCAAAG GTGGCCGGTG TAAGCAAAAT AGTAATGACC
ACGCCTCCCG GAAAGGATGA AAAGATAAAT CCTGCAATAC TGGTGGCAGC CAATGAAGCC
GGGGTTGATG AAATATACAA AGTGGGTGGG GCGCAGGCCG TAGCCGCCCA GGCCTTTGGA
ACGGAGACAA TCCCGAAAGT GGACAAAATT GTGGGGCCGG GGAACATATA TGTGGCAATG
GCAAAGAGGA CGGTATACGG CTATTGCGAT ATTGACATGA TAGCCGGACC CAGCGAGATA
ATGGTGGTTG CCGATGAGAC CGCAAATCCT GTGTTTGTGG CGGCGGATCT TTTATCCCAG
GCAGAGCATG ATATACTTGC TTCATCAATT TTGGTTACAA CTTCTGAGGA TATTGCCAAA
GAGGTTCAAA GGGAGCTTGA GGCTCAGCTC GCGGTTTTGG AAAGAAAAGA AATAGCCGGA
AAATCGATAG CTGACTATGG AGCGATAATT ATTGTGGAAA GCCTCAAGGA TGCCGCGACG
GTGGTTAACA GAATTGCGCC GGAGCATCTG GAACTTTGCG TAAAAGATCC CTTTGCCGCA
CTGGGGGATA TAAAGAATGC GGGTGCGATA TTCCTTGGCA ACTATTCCAC AGAGCCTTTG
GGAGACTATT TTGCAGGACC CAACCATGTG CTCCCCACAA GCGGTACGGC AAGATTCTTC
TCACCTTTAA ATCTTTCGGA TTTTATGAAG AAAAGCAGCA TTATTTCATA TACAAGAGAT
GCCCTTCAAA AGGTTAAAGA CGATGTCATA CTCTTTGCAG AGTCCGAAGG ATTGGGAGCC
CATGCAAATG CCATTAGAGT GAGGTTTCAG GACGGACAGG ACAAATAA
 
Protein sequence
MIKIIDLRCG KDSDIFENLA SRSQLEYRDV LDRVEEIVAN VRRNGDKAVL EYTAMFDKVQ 
LTLGKLRVTE KEIKEAYTKV DPKLVEVIKR SRDNIWNFHE KQKEKSWFST EKEGVIVGQL
YRPLEVVGVY VPGGTAAYPS SVLMCAVPAK VAGVSKIVMT TPPGKDEKIN PAILVAANEA
GVDEIYKVGG AQAVAAQAFG TETIPKVDKI VGPGNIYVAM AKRTVYGYCD IDMIAGPSEI
MVVADETANP VFVAADLLSQ AEHDILASSI LVTTSEDIAK EVQRELEAQL AVLERKEIAG
KSIADYGAII IVESLKDAAT VVNRIAPEHL ELCVKDPFAA LGDIKNAGAI FLGNYSTEPL
GDYFAGPNHV LPTSGTARFF SPLNLSDFMK KSSIISYTRD ALQKVKDDVI LFAESEGLGA
HANAIRVRFQ DGQDK
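
The relationship between the gene sequence, the GC content, and the protein sequence can be illustrated with a short sketch. It is not the database's own method. It uses only the Python standard library and builds the codon table from the conventional TCAG ordering. The amino acid assignments of translation table 11 are the same as those of the standard code; the table differs in which codons may act as starts, which this sketch does not handle (the gene here begins with ATG, so that does not matter). The names `clean`, `gc_content`, and `translate` are illustrative.

```python
# Codon table built from the conventional TCAG ordering of the genetic code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the 10-base block layout."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon from the first base, stopping at a stop codon."""
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - 2, 3):
        amino_acid = CODON_TABLE[s[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


if __name__ == "__main__":
    # First three blocks of the gene sequence shown above.
    print(translate("ATGATAAAAA TTATTGATTT AAGATGCGGC"))
```

Pasting the full gene sequence into `translate` and `gc_content` gives values that can be compared with the protein sequence and the GC content listed in the record.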