Gene Cthe_1083 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_1083 
Symbol 
ID4811381 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp1289870 
End bp1290919 
Gene Length1050 bp 
Protein Length349 aa 
Translation table11 
GC content38% 
IMG OID640106505 
Productputative homoserine kinase type II (protein kinase fold)-like protein 
Protein accessionYP_001037508 
Protein GI125973598 
COG category[R] General function prediction only 
COG ID[COG2334] Putative homoserine kinase type II (protein kinase fold) 
TIGRFAM ID[TIGR02906] spore coat protein, CotS family 


Plasmid Coverage information

Num covering plasmid clones40 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCTATAA ACCATGAACC CCTTTTTGAT GTGCTTTCAC AATATGATAT AAAGGTCGTC 
TCGATAAGAA ATGAAAGCTA CAAGGATAAA AAAGGTGTTT GGTGGATACA AACCCCTGAT
GAATACAAAA TTCTAAAAAA GATATCAAAT TCGGAAGACA CTTTTAAATA TATATTGAGT
GCTGCGGAGC ACCTAAGAAA AAACGGAGTA AATATTCCTG CTGTATACAA AACAAAGGAC
GGAAAAGACT ATGTGAATAT TAACGGAACC TGCTACGTTT TATATGAGGC GGTTGAAGGC
AAAAATCCTT CATATAATTC ACCTGAAGAC TTCAGGGCGA TTGTCAGAGA ACTTGCCGGA
TTTCATGCCG CATCAGTGGG ATTTTCGCCT CCGGACAACA CAAAACCAAA AATTCATCTG
GGTAAATGGG TTGAACAATA CACAGAACAA GTGGAAGACA TGAACAGGTT CTATCAAACC
GAACTTGAGA AAAGCGAAAA CGACAGAATA GGAAAAGTAA TTATCGAAGA GTTTCCCGCC
TTTTATGAAA GGGCAAAACA AGCGATTGAA GGATTGAAGG GAAAAGAATA CCAAGACTGG
GTTGAAAAAG TCAAAAGCCG GGGCGGGCTT TGCCATCAGG ATTTTGCAGC TGGAAATCTT
TTAAAAAATC CTTCGGGAAA AATTTTTGTT CTCGACACGG ATTCAATTAC CATAGACATT
CCGGCACGGG ATATAAGAAA GCTCCTTAAC AAAATCATGA AGAAAAACGG AAAATGGGAT
TTGGAAATTC TTCGCAAGTT TATACGAATT TATCAATCAG AAAATCCATT GAGTTTTTCC
GAATGGACGG TTGTAAAGTT CGACCTCATG TTCCCTCATC TGTTCCTGGG AGCTATGAAT
AAATTTTATT ATAAAAGAGA CAAAGAATGG AGTTTTGAAA AGTATCTGAA AAGAATAAAT
GAAATGACCG CTTTGGAAAA GACCATTACA CCTGTTTTGG AAAACTTCGA CTCCATTGTT
TATGAAGAGA TTAATCAAAG GAAGGACTGA
 
Protein sequence
MPINHEPLFD VLSQYDIKVV SIRNESYKDK KGVWWIQTPD EYKILKKISN SEDTFKYILS 
AAEHLRKNGV NIPAVYKTKD GKDYVNINGT CYVLYEAVEG KNPSYNSPED FRAIVRELAG
FHAASVGFSP PDNTKPKIHL GKWVEQYTEQ VEDMNRFYQT ELEKSENDRI GKVIIEEFPA
FYERAKQAIE GLKGKEYQDW VEKVKSRGGL CHQDFAAGNL LKNPSGKIFV LDTDSITIDI
PARDIRKLLN KIMKKNGKWD LEILRKFIRI YQSENPLSFS EWTVVKFDLM FPHLFLGAMN
KFYYKRDKEW SFEKYLKRIN EMTALEKTIT PVLENFDSIV YEEINQRKD