Gene Emin_0344 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEmin_0344 
Symbol 
ID6263238 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameElusimicrobium minutum Pei191 
KingdomBacteria 
Replicon accessionNC_010644 
Strand
Start bp369077 
End bp371866 
Gene Length2790 bp 
Protein Length929 aa 
Translation table11 
GC content44% 
IMG OID642610810 
Productpyruvate phosphate dikinase 
Protein accessionYP_001875240 
Protein GI187250758 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0574] Phosphoenolpyruvate synthase/pyruvate phosphate dikinase 
TIGRFAM ID[TIGR01828] pyruvate, phosphate dikinase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00706754 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones53 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTAAAA CAGTAAAGAA AGCGGCAAAA AAAGCCGCGG TTAAAAAAGC AGTTAAAAGC 
ATGAAAAATG TTTATTACTT TGGCGGCGGC AAAGCCGACG GCAAAGGCTC AATGAAAGAA
CTGTTGGGCG GCAAGGGGGC AAACCTGGCT GAAATGGCCG GTCTTATGAA ACTTCCGGTT
CCTCCGGGCT TTACGATTAC TACAGAAGTT TGCACATATT ATTGGGATAA TAAAAAAAAT
TACCCTTCCT CATTAAAAGC TGAAGTTGAA TCTAATCTTA AAAAAGTTGA AAAAGAAACA
AAAAAAGTTT TCGGTTCCGT TGATAATCCC CTTTTGCTTT CCGTACGTTC CGGGGCAAGA
GCTTCCATGC CGGGTATGAT GGAAACTATT TTAAATATCG GTTTGACTGA AAAAACAATT
CCCGGTATGA TTAAAAAAAC GGGTAACGAA CGCTTTGTTT ATGACGCTTA CAGGCGTTTA
ATAATGATGT ATTCCGACGT TGTTATGGAA AAAGCCGCCG GCATTGAACC TAAAGACGAC
AAAGGCATAC GTAAAGTTTT AGATGGTATG CTGCATGACG TTAAATCCAA AAAAGGCGTT
AAAGACGATA CTGATTTAAC GGCGGAAGAT TTAAAAACCC TTTGCGCGGA ATTTAAGAAA
ACCGTTAAAA ATGTTTTAGG TAAAGAATTC CCCGATAACC CGATGTTGCA GCTTTGGGGC
GCCATAGGCG CTGTTTTCTC AAGCTGGAAC GGAAAAAGAG CTATCGCTTA CAGAAATATT
GAAAAAATTC CGCACGAATG GGGAACGGCT GTTAACGTAC AGGCCATGGT TTTCGGTAAC
ATGGGGACAG ACTCGGCCAC CGGCGTAGCT TTCTCAAGAA ACCCCGGCAA CGGCGATTCA
CACTTCTATG GCGAATACTT AATTAACGCC CAGGGTGAAG ACGTTGTGGC GGGTATCAGA
ACACCCAGCC CCATGAACAA ATGGTCCAAA AATACACATT CCGAACACTT GCCCACATTG
GAACAGGTTA TGCCTAAAGC TTATAAGGAA CTTGACGGCA TACAGAAAAA ACTTGAAAAA
CATTTCAGAG ATATGTTAGA TATTGAGTTT ACCATTGAAC AAGGTAGACT TTGGATGCTA
CAGTGCCGCG TAGGAAAAAG AAACGGCACC GCTGCCGTTC AAATGGCTTT AGACATGGTT
AAAGAAAAAC TTATCAGCCA AAACGAAGCC GTTTTAAGAG TTACAGCCTC TCAACTCGAC
GAACTTTTGC ACCCCGCTAT TGACCCTAAA GCGGAGGCTT TGGCTAAAAT TGTAGGAAAA
GGTTTGCCCG CAGGTCCCGG CGGCGCTTCA GGTAAAGTTG TTTTCACTTC AGAAGCGGCC
ATGGCGCTTA AAGCCAAAGG CGAAAAAGCT ATTTTGGTAA GAGAAGAAAC CAACCCCGAA
GACGTTGAAG GTATGAGAGC GGCTGAAGCT ATTTTAACAC AGCGCGGCGG TATGACCTCA
CACGCGGCGT TAGTAGCCCG CGGCTGGGGT AAATGCTGTA TAGTCGGCTG CGGCGAGCTT
GAAATTAATT TAAGCAAAAA AACCGCCTCA ATCGGCAGCG TTACTTTTAA AGAAGGCGAC
TTCATTACAC TTAACGGCAC AAAAGGTTTT GTTTACTTAG GCCAGCTTAA AATGTTAGAA
GCGGGCGAAG GCAACAATAA CTTAACTAAA TTCTTAGATA TGTGCGATAA AATAAAACGT
CTTGACGTAC GCACAAACGC CGACACTCCC GAAGACGCTG TAAGGGCCAA AAAATTCGGC
GCTAAGGGTA TCGGTTTATT CCGCATTGAA CACATGTTCT ACGGCACCAA CGCCGAAAAA
CCGCTGTTCA TCTTAAGAAA AATGATTGTT TCCAAAACAA CCGAGGAAAG AACAAAAGCT
GTTAATGAGC TTTTCCCGTT CATGAAAAAG GCCATCAAAG GTACAATTAA AGCCATGGCT
GGTTTTGGAG TTACAATCCG CTTGATGGAC CCGCCTTTGC ATGAGTTTAT TCCCCAGCAG
AAAGAGGTAA AAGAGCAGGT TTGCAAAGCT ATGGGAATTA CAATGGAAGA GTTTGATTCC
AGAGCCGCTG TCCTTCACGA AGTTAATCCT ATGATGGGAC ACAGAGGCGT GAGGCTTGGC
GTTACATATC CCGAAATTAC GGAAATGCAG TCCAGAGCAA TACTTGAATC CGCGGCCGAA
CTTATAAAAG AAGGCGTTAA AGCTATGCCT GAAATTATGG TTCCCGTTGT TTGCCATGAA
AACGAACTTA TTGACCAAAG AGCTATTATT GAACGCGTTT ATAAAGAAGT TGTTGCCAAA
ACAGGCGTTA AGAAATTACC GCTTTCAGTA GGCACCATGA TTGAGATTCC CAGAGCGGCT
ATTATGTCTC ACAAAATTGC CGAACAGGCG GATTTCTTCT CCTTCGGTAC AAACGACTTA
ACTCAAATGA CGTTCGGCTT CTCAAGAGAT GATATCGGCG GTTTTATGGG CGCTTACCTT
GAAAAAGGCG TTCTTAAAAA CGATCCGTTC CAAACGCTTG ACCAGGATGG CGTAGGCTAC
TTAATCAAGC AAGGCGTTAA GGGCGGCAGA AGCACAAAAG CCAAACTTAA AATCGGCATT
TGCGGCGAAC ACGGCGGCGA CGCGAAAAGC GTTGAATTCT GCCACAGGGA AGGATTTAAC
TATGTTTCCT GCTCACCGTT CAGAGTTCCG ATAGCCAGGC TTGCGGCTGC GCAGGCGGTT
GCTAAAGAAA TAAAAACTAA AAAGAAATAG
 
Protein sequence
MAKTVKKAAK KAAVKKAVKS MKNVYYFGGG KADGKGSMKE LLGGKGANLA EMAGLMKLPV 
PPGFTITTEV CTYYWDNKKN YPSSLKAEVE SNLKKVEKET KKVFGSVDNP LLLSVRSGAR
ASMPGMMETI LNIGLTEKTI PGMIKKTGNE RFVYDAYRRL IMMYSDVVME KAAGIEPKDD
KGIRKVLDGM LHDVKSKKGV KDDTDLTAED LKTLCAEFKK TVKNVLGKEF PDNPMLQLWG
AIGAVFSSWN GKRAIAYRNI EKIPHEWGTA VNVQAMVFGN MGTDSATGVA FSRNPGNGDS
HFYGEYLINA QGEDVVAGIR TPSPMNKWSK NTHSEHLPTL EQVMPKAYKE LDGIQKKLEK
HFRDMLDIEF TIEQGRLWML QCRVGKRNGT AAVQMALDMV KEKLISQNEA VLRVTASQLD
ELLHPAIDPK AEALAKIVGK GLPAGPGGAS GKVVFTSEAA MALKAKGEKA ILVREETNPE
DVEGMRAAEA ILTQRGGMTS HAALVARGWG KCCIVGCGEL EINLSKKTAS IGSVTFKEGD
FITLNGTKGF VYLGQLKMLE AGEGNNNLTK FLDMCDKIKR LDVRTNADTP EDAVRAKKFG
AKGIGLFRIE HMFYGTNAEK PLFILRKMIV SKTTEERTKA VNELFPFMKK AIKGTIKAMA
GFGVTIRLMD PPLHEFIPQQ KEVKEQVCKA MGITMEEFDS RAAVLHEVNP MMGHRGVRLG
VTYPEITEMQ SRAILESAAE LIKEGVKAMP EIMVPVVCHE NELIDQRAII ERVYKEVVAK
TGVKKLPLSV GTMIEIPRAA IMSHKIAEQA DFFSFGTNDL TQMTFGFSRD DIGGFMGAYL
EKGVLKNDPF QTLDQDGVGY LIKQGVKGGR STKAKLKIGI CGEHGGDAKS VEFCHREGFN
YVSCSPFRVP IARLAAAQAV AKEIKTKKK