Gene Cthe_1376 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_1376 
Symbol 
ID4809371 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp1679668 
End bp1680915 
Gene Length1248 bp 
Protein Length415 aa 
Translation table11 
GC content43% 
IMG OID640106800 
Producthomoserine dehydrogenase 
Protein accessionYP_001037801 
Protein GI125973891 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0460] Homoserine dehydrogenase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000000372588 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTTAATA TAGCAGTGAT GGGATACGGA GTGGTCGGCT CCGGAGTTGT TGAAGTTATA 
AGAAAAAACA GTGTCAGCAT ATCAAAGAAA GCGGGTCAGG AGATTCGCGT AAAAAAAATA
CTGGATATCA GGGATTTTCC GGACAGTCCT GAACGGGATT TGTTTACAAA GAATCCTGAT
GAAATTTTTG ATGACCCGTC AATAGGAATA GTTGTTGAGA CCATAGGCGG AATAGGTGCT
GCGTACGAAT TTACAAAGAA GGCTTTGAGC AAAGGAAAGA ATGTTGTAAC CTCGAACAAG
GAGCTGGTTG CAACCCATGG ACCTGAACTT TTGAAGCTTG CAAAGGAAAA TGGAGTAAAC
TATCTGTTTG AAGCAAGTGT CGGCGGTGGA ATTCCCATTA TCAGGCCTTT GAACCGCTGC
CTTGCCGCAA ATGAAATACA CAGCATCATA GGAATACTCA ACGGAACTAC GAACTACATA
TTAACACAGA TGAAAAGGCA GGGAAAAGAT TTTGACGAGG CTTTGAAAGA GGCACAGCAG
AAAGGATATG CGGAAGCGGA TCCAACCGCG GACATAGAAG GGCATGATGC ATGCAGAAAA
ATTGCAATAC TCTCATCCAT TGCGTACAAT GAATTTGTTG ATTACAAAAA GATACATACG
GAAGGCATAA AAAAAATAAG CCTTGCGGAT ATGAAATATG CCGAAAGCAT GGATTCAACC
ATCAAGCTTG TCGCCATAAG CGAAAAAATC GGTGACGGTA TTATGGCAAG GGTTGCTCCT
GCGATAGTAA GCAGCAAAAG TCCGCTTTAC AGTGTTGAAG ATGTTTTTAA TGCTATTGTT
GTGAGAGGAG ATGCAATTGG AGAAGTGATG TTTTACGGCC CGGGAGCGGG CAAGCTCCCC
ACGGCAAGCG CCGTTGTGGC GGATGTAATT GAAATTGTGA AGCATTGGGG TACCTGCGGC
AGCTATAACT GGGTTGTAAA AGACGGCGGC AACGTCATTG ATTTGAAAGA AACCAGGACA
AGGTATTTTG TGAGACTGAA AGTGGAGAAT GAAGCTGAAG CTAAAAAAGC GGTGGAGAAT
GCTTTTGGAA ATGTTGAATG GGTAAAAGCG TATGATGCAA GTGTACAGGA TGAATTGGCA
TTTGTTACTT CCTGCGTTTT GGAGAAAGAC TATTGCAATT CTCTTCAACA ACTTAAAGGC
AGCAAAGCTG TAAAAGATGT GGTAAATGCC ATAAGAGTCC TGGACTAA
 
Protein sequence
MVNIAVMGYG VVGSGVVEVI RKNSVSISKK AGQEIRVKKI LDIRDFPDSP ERDLFTKNPD 
EIFDDPSIGI VVETIGGIGA AYEFTKKALS KGKNVVTSNK ELVATHGPEL LKLAKENGVN
YLFEASVGGG IPIIRPLNRC LAANEIHSII GILNGTTNYI LTQMKRQGKD FDEALKEAQQ
KGYAEADPTA DIEGHDACRK IAILSSIAYN EFVDYKKIHT EGIKKISLAD MKYAESMDST
IKLVAISEKI GDGIMARVAP AIVSSKSPLY SVEDVFNAIV VRGDAIGEVM FYGPGAGKLP
TASAVVADVI EIVKHWGTCG SYNWVVKDGG NVIDLKETRT RYFVRLKVEN EAEAKKAVEN
AFGNVEWVKA YDASVQDELA FVTSCVLEKD YCNSLQQLKG SKAVKDVVNA IRVLD