Gene Cthe_1028 details

Gene Information

Locus tag: Cthe_1028
Symbol:
ID: 4811322
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 1229687
End bp: 1230886
Gene Length: 1200 bp
Protein Length: 399 aa
Translation table: 11
GC content: 41%
IMG OID: 640106446
Product: acetate kinase
Protein accession: YP_001037453
Protein GI: 125973543
COG category: [C] Energy production and conversion
COG ID: [COG0282] Acetate kinase
TIGRFAM ID: [TIGR00016] acetate kinase
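
The Start bp, End bp, Gene Length and Protein Length values above agree with one another if the coordinates are read as 1-based and inclusive, and if the gene length counts the stop codon. Both points are assumptions, since the record does not state its coordinate convention. The helper names below are hypothetical and serve only as a minimal sketch of the arithmetic:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming the gene length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("gene length is not a whole number of codons")
    return gene_len_bp // 3 - 1


length = gene_length(1229687, 1230886)
print(length)                           # 1200
print(expected_protein_length(length))  # 399
```

Under these assumptions, 1200 bp gives 400 codons, of which 399 code for amino acids.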


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00000413101
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAATATTT TGGTTATTAA TACCGGAAGC TCATCACTAA AGTATCAGCT GATTGACATG 
ACAAACGAGT CTGTGCTTGC AAAAGGTGTG TGTGACAGAA TTGGTCTTGA ACATTCCTTT
TTAAAGCATA CAAAGACCGG AGGGGAAACC GTAGTTATAG AAAAAGACCT GTACAATCAC
AAGCTTGCCA TACAGGAGGT AATTTCGGCT CTTACGGATG AAAAAATCGG AGTCATAAAA
AGCATGTCGG AAATTTCTGC CGTCGGTCAT CGTATTGTTC ACGGCGGAGA GAAGTTTAAG
GAATCTGCCA TAATTGATGA AGATGTAATG AAAGCAATCA GGGATTGTGT TGAACTGGCT
CCGCTCCACA ATCCGTCAAA TATAATCGGA ATTGAAGCCT GTAAACAGAT ACTGCCCGAT
GTGCCGATGG TTGCTGTGTT TGACACAGCT TTTCATCAGA CAATGCCAAG GCATGCATAT
ATTTATGCCC TCCCTTATGA GATATATGAG AAGTATAAAT TGAGAAAATA CGGATTCCAC
GGAACTTCCC ACAAATATGT GGCCCACAGG GCGGCTCAGA TGCTGGGCAA ACCTATTGAG
AGCCTGAAGC TGATAACCTG CCATCTTGGA AACGGAGCAA GTATTTGTGC GGTAAAAGGC
GGAAAATCCG TTGACACCTC AATGGGATTT ACTCCTCTGC AGGGGTTGTG CATGGGTACC
AGAAGCGGCA ATGTTGACCC TGCGGTTATA ACTTATTTGA TGGAAAAGGA AAAAATGAAT
ATTAACGATA TAAACAATTT CCTTAACAAG AAATCAGGTG TGCTTGGAAT TTCAGGTGTA
AGCAGTGATT TCAGAGATGT TCAGGATGCC GCAGAAAAGG GAGATGACAG GGCGCAGCTG
GCATTGGATA TTTTCTGCTA TGGTGTTAGG AAATATATTG GAAAATATAT TGCAGTGCTG
AACGGCGTTG ATGCGGTGGT ATTCACTGCA GGTATCGGCG AAAACAATGC TTATATAAGA
AGAGAAGTTT TGAAGGATAT GGACTTTTTC GGAATTAAAA TAGATTTGGA TAAAAATGAA
GTGAAAGGCA AAGAAGCGGA TATCAGTGCT CCCGATGCGA AAGTAAAGAC TTTGGTTATC
CCGACAAATG AGGAGCTTGA GATTGCAAGG GAGACTTTAA GACTTGTAAA AAACTTATAA
 
Protein sequence
MNILVINTGS SSLKYQLIDM TNESVLAKGV CDRIGLEHSF LKHTKTGGET VVIEKDLYNH 
KLAIQEVISA LTDEKIGVIK SMSEISAVGH RIVHGGEKFK ESAIIDEDVM KAIRDCVELA
PLHNPSNIIG IEACKQILPD VPMVAVFDTA FHQTMPRHAY IYALPYEIYE KYKLRKYGFH
GTSHKYVAHR AAQMLGKPIE SLKLITCHLG NGASICAVKG GKSVDTSMGF TPLQGLCMGT
RSGNVDPAVI TYLMEKEKMN INDINNFLNK KSGVLGISGV SSDFRDVQDA AEKGDDRAQL
ALDIFCYGVR KYIGKYIAVL NGVDAVVFTA GIGENNAYIR REVLKDMDFF GIKIDLDKNE
VKGKEADISA PDAKVKTLVI PTNEELEIAR ETLRLVKNL
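
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. Translation table 11 assigns the same amino acid to each codon as the standard genetic code (the two tables differ in which codons may act as starts), so a standard codon table is enough here. The sketch below is a minimal pure-Python illustration; the function names are hypothetical, and it assumes the sequence is given in the coding orientation and in frame, as it is printed above. It also includes a GC percentage helper for comparison with the GC content field.

```python
from itertools import product

# Codons in TCAG order, paired with their amino acids (standard assignments,
# which table 11 shares); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the 10-letter block layout."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First and last lines of the gene sequence above.
print(translate("ATGAATATTT TGGTTATTAA TACCGGAAGC"))  # MNILVINTGS
print(translate("CCGACAAATG AGGAGCTTGA GATTGCAAGG GAGACTTTAA GACTTGTAAA AAACTTATAA"))
# PTNEELEIARETLRLVKNL
```

Passing the full gene sequence to translate() and gc_percent() allows a direct comparison with the protein sequence and the GC content listed in this record.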