Gene Emin_1366 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEmin_1366 
Symbol 
ID6263403 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameElusimicrobium minutum Pei191 
KingdomBacteria 
Replicon accessionNC_010644 
Strand
Start bp1468518 
End bp1469630 
Gene Length1113 bp 
Protein Length370 aa 
Translation table11 
GC content42% 
IMG OID642611847 
Productbutyrate kinase 
Protein accessionYP_001876253 
Protein GI187251771 
COG category[C] Energy production and conversion 
COG ID[COG3426] Butyrate kinase 
TIGRFAM ID[TIGR02707] butyrate kinase 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones79 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAACACA ATATTTTAGT TATAAATCCA GGGTCAACCT CGGATGATAT AGGCTATTAC 
AAAGGCCCAA AAACCGTATT TGAAGAATCA GCAAGATATT CTCAAGAAGA GTTGGATTCA
TTTGCAGGCA AAGAACTTTC CGAACAAATT CCTTTAAGAA GAAAATTTTT ATTAGACGTT
TTAAAAAAAC ATGAAATCAA TTTAAATGAA ATAGACGCCG TTATCGGCCG CGGCGGGCTG
TTAAAACATA TTGAAGGCGG CATTTATACA ATTAACGAAG CTATGCTTGC CGATTTAAAA
AGGGGTTATA ACGGCCATCA CCCGAGCAAT CTGGGCGGTA TTTTGGCGCG TGAAATCGCC
GAATCTTTGG GCAAACCATG TTTTATAGCG GACCCTGTGG TAGTGGACGA AATGGAGCCT
CTTGCCAGAT ACACAGGATT TAAAGAAATA AAAAGAAAAT CAATTTTTCA CGCTTTAAAC
CAAAAACGCG TGGCTATTAC CGCAGCCAAA GAACTGGGCA AAAAGTATAA AGAATGCAAC
TTTATAGTAA TGCACGGCGG CGGCGGCGTA AGTGTGGGCG CACATAAAAA AGGTAAAGTT
ATAGACGTGT CTGACGGCTT TGAAGGCGCA GGCCCGATGA CTCCGCAAAG AAGCGGCGTT
TTACCCAGTT TAGAGCTTGT TGAAATGTGT TTCAGCGGGC AGTATACAAT ACAGGAGCTG
CGTAAAAAAA TGCGCGGCCG CGGCGGCATG ATAGCGCATA CGGGCACTTC CGATATTGCG
GATTTATATA ATTATATTTC CTCCGGGAAA AAGAAGCCCG GCTCAACAAT CAATTGTTCA
AGAGAAGCGG CGCAGGAAGC ATTTGACGCC ATGATTTACC AAATCTCAAA AGAAATAGGC
GCTATGGCTA CCGTACTTAA AGGGGATGTT GACGCTATTA TTTTAACAGG CGGGCTTGCC
TATAATGAAT ATTTAGTTAA TATGATAAAG GAAAGAACAG GATTTATTAC GGATAAATTT
TTTGTGTATC CCGGAGGCGA TGAAAAGGCC GCTTTAAAAG AAGCCGCCGC GCGCGCTTTG
GAAAACCCTG AAATAATTAA ACAATATAAA TAA
 
Protein sequence
MEHNILVINP GSTSDDIGYY KGPKTVFEES ARYSQEELDS FAGKELSEQI PLRRKFLLDV 
LKKHEINLNE IDAVIGRGGL LKHIEGGIYT INEAMLADLK RGYNGHHPSN LGGILAREIA
ESLGKPCFIA DPVVVDEMEP LARYTGFKEI KRKSIFHALN QKRVAITAAK ELGKKYKECN
FIVMHGGGGV SVGAHKKGKV IDVSDGFEGA GPMTPQRSGV LPSLELVEMC FSGQYTIQEL
RKKMRGRGGM IAHTGTSDIA DLYNYISSGK KKPGSTINCS REAAQEAFDA MIYQISKEIG
AMATVLKGDV DAIILTGGLA YNEYLVNMIK ERTGFITDKF FVYPGGDEKA ALKEAAARAL
ENPEIIKQYK