Gene Emin_0720 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEmin_0720 
Symbol 
ID6263652 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameElusimicrobium minutum Pei191 
KingdomBacteria 
Replicon accessionNC_010644 
Strand
Start bp794279 
End bp795637 
Gene Length1359 bp 
Protein Length452 aa 
Translation table11 
GC content43% 
IMG OID642611192 
Productacetate kinase 
Protein accessionYP_001875612 
Protein GI187251130 
COG category[C] Energy production and conversion 
COG ID[COG0282] Acetate kinase 
TIGRFAM ID[TIGR00016] acetate kinase 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.0988409 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.0000000373865 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGATAATAT TGTTTTTAAA TTGCGGCAGT TCATCCGTAC GCTACAGCGT TTACGACTGG 
GCCCGAAAAG AAAGTCTGGC CGCCGGCCTT GTTGAAAGGG TTACAATGCC CGGCACTGCG
ATTACCCATG AACGCCCCGG CAAAGATAAA GTTTCTTTTG AAAAGGAATG TAAAGACCAT
ACGGAAGCCG TTAAGTTAGT TGTAGATACC CTTATCAATA AAGAATACGG CGTTATAAAT
GATGTTAAAC TTATTTCGGC TGTTGGGCAC CGCATAGTTC ACGGCGGCAA ATTCGCAAAA
TCAGTGCTTG TTGACGACGC TGTTATGAAA GAACTTAAAG AAATTTCCGA CCTTGCCCCC
CTTCACAATC CTGCGCACAT TATGGGTATA GAAGCGGCTC AAAAAATTAT TCCCGGCGTT
AAAAACATTT GTATTATGGA TACCGCGTTT CACCAAACAA TGCCTGACCA TGTTTTTATG
TACGCTTTAC CGTATGAATG GTATACGGAC TTAAAAGTAA GAAGGTACGG CTTTCACGGT
TCATCGGTTT TATATTGCGC TAAAAGAGCT GCTGTTCTTT TAGGCAAAAA ATCAAATGAA
GTTAACGTTA TTGTCTGCCA TATTGGCAAC GGCGCAAGCA TAACCGCTGT TAAAGAAGGC
AAATGCTTTG ACACAAGCAT GGGTTTAACC CCGCTTGAAG GCCTTGTTAT GGGCACGAGA
TCAGGCGATA TTGACCCCGC TATTCCTTTT TATGTAATGT CTAAAACAGG TGTATCACCT
GATGATATGT ACAATACTTT AAATAAAGGA TCGGGCGTTT TGGCCGTTTC AGGCAAATCA
GCGGACAGAC GTGACGTAGA GCTTGCCGCC GAAGCGGGTG ACCAGCGCTG CCAACTTGCG
GTAGATATGG AATCTTACCG CATTAAAAAA TACATTGGTG CTTACGCCGC CGCTTTAGGC
CGTGTTGACG CCGTTGTTTG GACAGCGGGC GTAGGTGAAA GAGGCCCGGT AATAAGAGAA
AAAGCGCTTA GAGACCTTGA ATATATGGGC CTTGAATATG ATCATGACAA AAACTTTTCC
GCCCTTACCA AAAACGCCGA AAGCGAAATT ACAGCACCAA ACAGCAAGGT AAAATCCTTT
GTTATTCCTA CCGATGAGGA AATTGTAGGC GTTGAGGATA TTGTGGCTAT ACTTGAAAAC
AGGTATGACG TTCACACAAA ATACACATAT ATCTTTGAAG ATAAGAACTA CCGCAACAAA
TTAAGAGATG AGCTTTTTGC CAAAGAAATC GCCAAAAAGC CATTCTTGCT AAAAGCGGCT
GTCAATGTGC CGGAAAATAT AAAAAACTTA GTTAAATAA
 
Protein sequence
MIILFLNCGS SSVRYSVYDW ARKESLAAGL VERVTMPGTA ITHERPGKDK VSFEKECKDH 
TEAVKLVVDT LINKEYGVIN DVKLISAVGH RIVHGGKFAK SVLVDDAVMK ELKEISDLAP
LHNPAHIMGI EAAQKIIPGV KNICIMDTAF HQTMPDHVFM YALPYEWYTD LKVRRYGFHG
SSVLYCAKRA AVLLGKKSNE VNVIVCHIGN GASITAVKEG KCFDTSMGLT PLEGLVMGTR
SGDIDPAIPF YVMSKTGVSP DDMYNTLNKG SGVLAVSGKS ADRRDVELAA EAGDQRCQLA
VDMESYRIKK YIGAYAAALG RVDAVVWTAG VGERGPVIRE KALRDLEYMG LEYDHDKNFS
ALTKNAESEI TAPNSKVKSF VIPTDEEIVG VEDIVAILEN RYDVHTKYTY IFEDKNYRNK
LRDELFAKEI AKKPFLLKAA VNVPENIKNL VK