Gene Cthe_1796 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_1796 
Symbol 
ID4810041 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp2120816 
End bp2121913 
Gene Length1098 bp 
Protein Length365 aa 
Translation table11 
GC content41% 
IMG OID640107210 
Productprephenate dehydrogenase 
Protein accessionYP_001038210 
Protein GI125974300 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0287] Prephenate dehydrogenase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAGGTTG AAAAGATATC CATTATCGGG CTTGGACTTA TCGGCGGGTC GCTGGCGAAA 
GCTTTGAAAG AAAAGCTTGG CATTGAGTCA ATAACCGCCG TTGACATCAA TGAGAAAAGT
CTGAGCCAGG CTCTTAAAGA GGGTTTTATA AAAGAAGGTT TTACCGAACT TAACGAATCG
GTATATAATT CTGACATTAT TTTCATATGT ACACCGGTTA AGGATGCTGT TGAGTATATA
ACCCGACTGC ACGGCAAAGT GAAAGCCGGA TGCATCCTGA CGGACACAGC AAGTACAAAG
GGGGAAATTA TAGATTATGT AAATTCATTG GATAATCCCC CCTGCTTCAT AGGGGGGCAT
CCAATGGCCG GTACTGAGAA GGCAGGTTTT TCATCAAGTT TTTCACATTT GTTTGAAAAT
GCGTACTATA TAATGTCGCC TTCAAAAAAT TGCCCCGAAG AATCCCTTGA GTACTTGGCA
GAAATAATCA GAGGAATCGG CGCAATACCG ATAAAGCTTG ACTCCAAAGA ACACGATATT
ATCACCGCAA CCATAAGCCA TGTACCGCAT GTAATTGCTT CCGCCCTGGT AAACCTTGTG
AAATTCTCTG ATTCCCCCGA CGGCAAAATG CAAACTTTGG CAGCAGGAGG ATTTAAGGAT
ATAACAAGAA TTGCATCATC AAACCCTAAG ATGTGGGAAA ATATTATTCT CAGCAACAAG
GAAATAGTTA AATCGACTTT GAATAAATTT ACCGAGACAA TAAACACTTT TATTGAATAT
ATTGATAACG AAAATTCCAA CGGCATATAC AATTTTTTCG ATTCTGCAAA AAAGTTTCGT
GATTCCATTC CAAACAACAG GAAAGGACTC ATTGAACCGC AGAACGAGCT TATTGTAGAT
GTTGTTGACA AGCCCGGCAT CATCGGTGAA ATAGCAACCA TTCTCGGAAA CAACGGTATT
AATATAAAAA ACATTAATGT TTCCAACAGC CGGGAGTTTG AGCAGGGGTG TCTCAGAATC
ACACTGCCCG ATTCAGGCAG TGTGGCCGAG GCTTATGAAC TGCTCGCAAA AAAGGGTTAT
AAAGTGTTTA AAATTTGA
 
Protein sequence
MQVEKISIIG LGLIGGSLAK ALKEKLGIES ITAVDINEKS LSQALKEGFI KEGFTELNES 
VYNSDIIFIC TPVKDAVEYI TRLHGKVKAG CILTDTASTK GEIIDYVNSL DNPPCFIGGH
PMAGTEKAGF SSSFSHLFEN AYYIMSPSKN CPEESLEYLA EIIRGIGAIP IKLDSKEHDI
ITATISHVPH VIASALVNLV KFSDSPDGKM QTLAAGGFKD ITRIASSNPK MWENIILSNK
EIVKSTLNKF TETINTFIEY IDNENSNGIY NFFDSAKKFR DSIPNNRKGL IEPQNELIVD
VVDKPGIIGE IATILGNNGI NIKNINVSNS REFEQGCLRI TLPDSGSVAE AYELLAKKGY
KVFKI