Gene Information |
Locus tag | Cthe_1028 |
Symbol | |
ID | 4811322 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 1229687 |
End bp | 1230886 |
Gene Length | 1200 bp |
Protein Length | 399 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 640106446 |
Product | acetate kinase |
Protein accession | YP_001037453 |
Protein GI | 125973543 |
COG category | [C] Energy production and conversion |
COG ID | [COG0282] Acetate kinase |
TIGRFAM ID | [TIGR00016] acetate kinase |
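The length fields above can be cross-checked against the coordinates. The sketch below assumes 1-based, inclusive coordinates (the record does not state its convention) and an unspliced CDS whose final codon is a stop codon. The function names are hypothetical and exist only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS that ends with a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(1229687, 1230886)
print(length, protein_length(length))  # 1200 399
```

Under these assumptions the result agrees with the Gene Length (1200 bp) and Protein Length (399 aa) rows.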
| |
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.00000413101 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATATTT TGGTTATTAA TACCGGAAGC TCATCACTAA AGTATCAGCT GATTGACATG ACAAACGAGT CTGTGCTTGC AAAAGGTGTG TGTGACAGAA TTGGTCTTGA ACATTCCTTT TTAAAGCATA CAAAGACCGG AGGGGAAACC GTAGTTATAG AAAAAGACCT GTACAATCAC AAGCTTGCCA TACAGGAGGT AATTTCGGCT CTTACGGATG AAAAAATCGG AGTCATAAAA AGCATGTCGG AAATTTCTGC CGTCGGTCAT CGTATTGTTC ACGGCGGAGA GAAGTTTAAG GAATCTGCCA TAATTGATGA AGATGTAATG AAAGCAATCA GGGATTGTGT TGAACTGGCT CCGCTCCACA ATCCGTCAAA TATAATCGGA ATTGAAGCCT GTAAACAGAT ACTGCCCGAT GTGCCGATGG TTGCTGTGTT TGACACAGCT TTTCATCAGA CAATGCCAAG GCATGCATAT ATTTATGCCC TCCCTTATGA GATATATGAG AAGTATAAAT TGAGAAAATA CGGATTCCAC GGAACTTCCC ACAAATATGT GGCCCACAGG GCGGCTCAGA TGCTGGGCAA ACCTATTGAG AGCCTGAAGC TGATAACCTG CCATCTTGGA AACGGAGCAA GTATTTGTGC GGTAAAAGGC GGAAAATCCG TTGACACCTC AATGGGATTT ACTCCTCTGC AGGGGTTGTG CATGGGTACC AGAAGCGGCA ATGTTGACCC TGCGGTTATA ACTTATTTGA TGGAAAAGGA AAAAATGAAT ATTAACGATA TAAACAATTT CCTTAACAAG AAATCAGGTG TGCTTGGAAT TTCAGGTGTA AGCAGTGATT TCAGAGATGT TCAGGATGCC GCAGAAAAGG GAGATGACAG GGCGCAGCTG GCATTGGATA TTTTCTGCTA TGGTGTTAGG AAATATATTG GAAAATATAT TGCAGTGCTG AACGGCGTTG ATGCGGTGGT ATTCACTGCA GGTATCGGCG AAAACAATGC TTATATAAGA AGAGAAGTTT TGAAGGATAT GGACTTTTTC GGAATTAAAA TAGATTTGGA TAAAAATGAA GTGAAAGGCA AAGAAGCGGA TATCAGTGCT CCCGATGCGA AAGTAAAGAC TTTGGTTATC CCGACAAATG AGGAGCTTGA GATTGCAAGG GAGACTTTAA GACTTGTAAA AAACTTATAA
Protein sequence | MNILVINTGS SSLKYQLIDM TNESVLAKGV CDRIGLEHSF LKHTKTGGET VVIEKDLYNH KLAIQEVISA LTDEKIGVIK SMSEISAVGH RIVHGGEKFK ESAIIDEDVM KAIRDCVELA PLHNPSNIIG IEACKQILPD VPMVAVFDTA FHQTMPRHAY IYALPYEIYE KYKLRKYGFH GTSHKYVAHR AAQMLGKPIE SLKLITCHLG NGASICAVKG GKSVDTSMGF TPLQGLCMGT RSGNVDPAVI TYLMEKEKMN INDINNFLNK KSGVLGISGV SSDFRDVQDA AEKGDDRAQL ALDIFCYGVR KYIGKYIAVL NGVDAVVFTA GIGENNAYIR REVLKDMDFF GIKIDLDKNE VKGKEADISA PDAKVKTLVI PTNEELEIAR ETLRLVKNL
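The sequence rows above are stored as space-separated blocks of ten characters. As a minimal sketch of how they might be checked against the GC content and Translation table rows, the Python below strips the spacing, computes the GC fraction, and translates a coding-strand sequence with the amino-acid assignments of NCBI translation table 11. The helper names (`clean`, `gc_content`, `translate`) are hypothetical, and the code assumes the sequence contains only A, C, G and T.

```python
BASES = "TCAG"
# Amino-acid assignments of NCBI translation table 11,
# with codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate a coding-strand sequence, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]  # KeyError on non-ACGT input
        if residue == "*":
            break
        residues.append(residue)
    return "".join(residues)


# First three blocks of the Gene sequence row above.
prefix = "ATGAATATTT TGGTTATTAA TACCGGAAGC"
print(translate(prefix))  # MNILVINTGS
```

The translated prefix matches the first block of the Protein sequence row. Passing the full Gene sequence string to `gc_content` and `translate` would allow the same comparison against the GC content and Protein sequence rows.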