Gene Cthe_0374 details

Gene Information

Locus tag: Cthe_0374
Symbol:
ID: 4808451
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 470198
End bp: 471532
Gene Length: 1335 bp
Protein Length: 444 aa
Translation table: 11
GC content: 43%
IMG OID: 640105788
Product: glutamate dehydrogenase
Protein accession: YP_001036805
Protein GI: 125972895
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0334] Glutamate dehydrogenase/leucine dehydrogenase
TIGRFAM ID:
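
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based inclusive positions and that the gene length includes the stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(470198, 471532)
print(length)                  # gene length in bp
print(protein_length(length))  # protein length in aa
```

Under these assumptions, the listed coordinates give 1335 bp and 444 aa, matching the Gene Length and Protein Length fields.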


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000129016
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAGTTAT TGGCTGGTGT AATGGAACAA GTCATTAAAA GAAATCCAAA TGAACCTGAG 
TTTCATCAAG CGGTAAGAGA AGTGTTGGAG TCGCTGGAAA TAGTCGCCGA AAAAAATCCG
GAGTACTTAA AAGCAGGTAT ATTTGAAAGG ATTGTTGAAC CTGAAAGACA GATTATTTTC
AGAGTACCGT GGGTGGATGA CAATGGCAAG GTACAGGTAA ACAGAGGTTT TAGAGTTCAG
TTCAACAGTG CAATTGGTCC TTACAAGGGC GGAATAAGAT TCCATCCTTC GGTTAACTTG
GGAATTATCA AATTCCTTGG TTTCGAGCAG ATTTTCAAGA ATTCATTGAC CGGCCTTCCA
ATGGGGGGAG GAAAAGGCGG CAGCGACTTT GATCCGAAAG GAAAATCCGA CGGAGAAATC
ATGAGGTTCT GTCAGAGCTT TATGACCGAG CTTTACAGAC ATATCGGACC GGATACCGAC
GTTCCTGCGG GAGATATCGG TGTAGGTGCC CGTGAAATAG GTTATATGTT CGGCATGTAC
AGAAAAATAA GAAACGAGTT TACCGGAGTT CTGACAGGAA AAGGACTGAC ATGGGGCGGA
AGCCTTGTAA GAACTGAGGC TACAGGTTAT GGTCTCTGCT ACTTCATGGA AGAAGCAATG
AAGACAATAA AAGGTAAATC TTTTGAAGGT GCGACAGTTG TTATCTCAGG TTCGGGCAAT
GTGGCCATTT ATGCAACGGA AAAAGCTCAG CAGCTTGGTG CTAAAGTAGT TGCATTGAGC
GATTCAAACG GATATGTTTA TGATCCTGAC GGAATAAAAC TCGATACGGT TAAGCAAATA
AAAGAGGTAG AAAGAAAGAG AATCAGTGAA TATGTAAAAT ATCATCCTAA TGCAAAATAT
ACAGAAGGAT GTTCAGGAAT ATGGTCAGTC AAGTGTGATG TTGCGCTTCC GTGTGCAACT
CAGAACGAGC TTGACGGAAA CGCGGCAAAG ACTCTTGTTG AAAACGGATG TTATGCGGTA
GGAGAAGGTG CAAACATGCC GTGTACGCCT GAAGCTATTG ATATATTTAT GAAGAACGGC
GTTCTTTATG CTCCAGGAAA AGCTTCAAAT GCCGGCGGTG TTGCAACTTC CGGACTTGAA
ATGTGCCAGA ACAGCATGAG GTATTCCTGG TCTTTTGAAG AAGTTGACGC CAAGTTGAAG
GATATTATGG TTAACATATT CAGAAATGTA AGAGCGGTAG CAAAAGAATA CGGCCAGGAA
GACAATCTTG TTTTGGGTGC AAATATTGCA GGATTCCTGA AAGTTGCAAA TGCTATGATG
GCACAGGGAG TGTAA
 
Protein sequence
MKLLAGVMEQ VIKRNPNEPE FHQAVREVLE SLEIVAEKNP EYLKAGIFER IVEPERQIIF 
RVPWVDDNGK VQVNRGFRVQ FNSAIGPYKG GIRFHPSVNL GIIKFLGFEQ IFKNSLTGLP
MGGGKGGSDF DPKGKSDGEI MRFCQSFMTE LYRHIGPDTD VPAGDIGVGA REIGYMFGMY
RKIRNEFTGV LTGKGLTWGG SLVRTEATGY GLCYFMEEAM KTIKGKSFEG ATVVISGSGN
VAIYATEKAQ QLGAKVVALS DSNGYVYDPD GIKLDTVKQI KEVERKRISE YVKYHPNAKY
TEGCSGIWSV KCDVALPCAT QNELDGNAAK TLVENGCYAV GEGANMPCTP EAIDIFMKNG
VLYAPGKASN AGGVATSGLE MCQNSMRYSW SFEEVDAKLK DIMVNIFRNV RAVAKEYGQE
DNLVLGANIA GFLKVANAMM AQGV
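
The GC content and protein sequence can be recomputed from the gene sequence. The following is a minimal sketch, not the database's own method: it strips the spaces and line breaks used in the sequence layout, computes percent G+C, and translates codons up to the first stop. The names `clean`, `gc_content`, and `translate` are hypothetical. Translation table 11 uses the same codon-to-amino-acid assignments as the standard code; it differs only in permitted start codons, which this sketch does not handle specially (the gene here starts with ATG).

```python
# Codon table in NCBI order (bases T, C, A, G); assignments shared by
# translation tables 1 and 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove whitespace from a block-formatted sequence and uppercase it."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    s = clean(seq)
    return 100.0 * (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGAAGTTAT TGGCTGGTGT AATGGAACAA"))  # MKLLAGVMEQ
```

Passing the full gene sequence to `gc_content` and `translate` allows the results to be compared with the listed GC content and protein sequence.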