Gene Cthe_1084 details

Gene Information

Locus tag: Cthe_1084
Symbol:
ID: 4811382
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 1290925
End bp: 1291962
Gene Length: 1038 bp
Protein Length: 345 aa
Translation table: 11
GC content: 42%
IMG OID: 640106506
Product: spore coat protein
Protein accession: YP_001037509
Protein GI: 125973599
COG category: [R] General function prediction only
COG ID: [COG2334] Putative homoserine kinase type II (protein kinase fold)
TIGRFAM ID: [TIGR02906] spore coat protein, CotS family
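
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends with a single stop codon that is not counted in the protein length. The function and constant names are illustrative and not part of the source record.

```python
# Values copied from the Gene Information fields above.
START_BP = 1290925
END_BP = 1291962
GENE_LENGTH_BP = 1038
PROTEIN_LENGTH_AA = 345


def span_length(start, end):
    """Length of a feature given 1-based, inclusive coordinates (assumption)."""
    return end - start + 1


def expected_protein_length(gene_length_bp):
    """Number of codons in an unspliced CDS, minus the terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))            # 1038
print(expected_protein_length(GENE_LENGTH_BP))  # 345
```

Both results agree with the Gene Length and Protein Length fields, which is consistent with the inclusive-coordinate assumption.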


Plasmid Coverage information

Num covering plasmid clones: 39
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGAATGAAG TTGGCAAAAA CCCAAACATG GATTTGAGTA AGCTTGCAAG TTCTGTATTG 
GAAGAGTATG GAATCGAACC GGAAAACATT AGTGTAGTTC AAAGTGCAAA TATAAAAACC
GTATGGAGAA TAAAAACGAA GGACCGTGAA CTGTGTCTCA AAAGATTGAA ACATCCATTA
GACAAAGCTC TCTTTTCCGT AAACGCCCAG GATTTTATAT ACAATCATGG CGGAAATGTC
GCGGGAATAA TCCGGGATAA AGAAGGAAAT CTTATTCATT CTTTCAACGA CCAGCTGTTC
GTTGTATATG AATGGCTTTA CGGAAGGGAT TTGTCCTTTG TCAATGCTGA TGACTTAAAA
TCCGCCCTGC ACGGCCTTGC CAAATTTCAT ATTGCGTCAA AGGGTTATGT CGCCCCGGAA
GGTGCCAAAG TCTCTTCCAA GCTCGGCAGG TGGCCTGAAC AGTACAAATC CATGGCAGAC
AAACTTTCTT CCTGGAAAGA AGCATCCCTG GGAAAACCTG CTTCAGCTTC TGTCAATGCT
TATCTCAAAA ATGTTGACGA AATGCTTGAT ATCTGCCATC GGGCCATGGA GCTTTTAAAT
GCCTCAAAAT ATGCCGAGTT GGCAGGTGAA AATTCCAAAT CGGCTGTTTT ATGCCATCAG
GATTACGGCA AGGGAAATGC ACTTTTTACA GACAATGGTG TTTATGTCAT AGATCTTGAC
GGAGTAACCT GGGACCATCC TGGACGGGAT CTTCGAAAAA TAATCGGCAA GCTGTCGGAG
AACAGAGGAG CCTGGTCTTT GGATCAAATC GAAAAAATCC TTGACTGGTA CAGCGAAATA
AATCCTCTTT CCACCGCAGA CAGGGAACTT ATTTATATTG ACCTTATGTA CCCCCACTGG
TTTTTTGGCC TTGTTAAAAA CATTTTCAAG AACAATAAAA GCGAAAGTCC GTCAAAGATT
GAAAAAACAG CAAGGCTGGA AACTTCCAAA GTACCATTGC TTGCCGAAAA GCTTCGGGAT
ATAAAATCGC AGGGCTAA
 
Protein sequence
MNEVGKNPNM DLSKLASSVL EEYGIEPENI SVVQSANIKT VWRIKTKDRE LCLKRLKHPL 
DKALFSVNAQ DFIYNHGGNV AGIIRDKEGN LIHSFNDQLF VVYEWLYGRD LSFVNADDLK
SALHGLAKFH IASKGYVAPE GAKVSSKLGR WPEQYKSMAD KLSSWKEASL GKPASASVNA
YLKNVDEMLD ICHRAMELLN ASKYAELAGE NSKSAVLCHQ DYGKGNALFT DNGVYVIDLD
GVTWDHPGRD LRKIIGKLSE NRGAWSLDQI EKILDWYSEI NPLSTADREL IYIDLMYPHW
FFGLVKNIFK NNKSESPSKI EKTARLETSK VPLLAEKLRD IKSQG
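
The protein sequence above begins with M although the gene sequence begins with TTG. This is expected under translation table 11, where TTG is one of several alternative start codons that are read as methionine at the initiation position. The sketch below illustrates that relationship using only the Python standard library. The helper names (`translate`, `gc_content`) are hypothetical, the start-codon set is the one listed for NCBI table 11, and the code assumes an unambiguous A/C/G/T sequence whose reading frame starts at the first base.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering. Table 11 assigns the
# same amino acids as the standard code and differs in its start codons.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Start codons listed for NCBI translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(dna):
    """Remove the spaces and line breaks used in the 10-base display blocks."""
    return "".join(dna.split()).upper()


def translate(dna):
    """Translate a CDS, reading a valid first codon as M and stopping at a stop codon."""
    seq = clean(dna)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        if i == 0 and codon in START_CODONS:
            aa = "M"  # alternative start codons initiate with methionine
        protein.append(aa)
    return "".join(protein)


def gc_content(dna):
    """Percentage of G and C bases in the sequence."""
    seq = clean(dna)
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


# First 30 bases of the gene sequence above.
print(translate("TTGAATGAAG TTGGCAAAAA CCCAAACATG"))  # MNEVGKNPNM
```

The printed peptide matches the first ten residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_content` would allow the same comparison against the complete protein and the GC content field.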