Gene GM21_3384 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_3384 
Symbol 
ID8138751 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp3916054 
End bp3917814 
Gene Length1761 bp 
Protein Length586 aa 
Translation table11 
GC content64% 
IMG OID644871002 
Productphosphoenolpyruvate-protein phosphotransferase 
Protein accessionYP_003023167 
Protein GI253701978 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1080] Phosphoenolpyruvate-protein kinase (PTS system EI component in bacteria) 
TIGRFAM ID[TIGR01417] phosphoenolpyruvate-protein phosphotransferase 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones112 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGTTTGG AGAGGAGTGA TTCCAAAGAG CTGAAGGGGA TCGGCGCCTC CGCCGGTATC 
GCCATCGGTC AGGTCCGCAT CACGGATCGG CGCCGGGTGT CGGTGACCGA GGTGGTGGTG
GCGCACGAGG AGATCCCGGG AGAGGTGGCG CGCTTCGCCA AGGCGATCGA GCGGGCCAAA
GGGGAGCTCT GCGCCCTTAA GGACCAGCTC TCCAGCACCC ACGGCCCGGA ACACCTCTGC
GTCATCGACG CGCACCTGAT GCTTTTGGAC GACACCATGC TGGTGGGCGA AACCACCCAA
TACATAGAGC GGCTCGGCAT CAACGCCGAA GGGGCGCTGA AAAGGACCCT GTCGCGCTTC
AAGTCCTTCT TCGACGGGGT GGAAGACGAC TACCTGCGCG AGCGCGGCAA CGACGTGGAG
ACGGTGGTCG AGCGGGTGCT TAGGAACATG GTAGGGCAGA AGCAGGAATC CATCGCCAAG
ATCGAGGGGA AGGTGATCGT GGTGGCCCAC GACCTCTCCC CGGCGGACAT CCTGCAGATC
GACAAGGAGA AGGTGATCGG CTTCATCACC GACCTGGGGG GGAAGACCTC CCACTCCTCG
ATCCTGGCGC GCTCCTTCGA GATCCCTGCC GTGGTGGGTC TGGAGCGGGC CACAGTGGAG
GTTGGGGAGG GGGATACCCT GATCGTCGAC GGCGCCACCG GCGTGGTCAT AGTCAACCCG
GGGGTCGAGC AGTTCAGGGA CTACCTGCAG AAAAAACAGC GCTACGAGTA CGTGGAAAGG
GAACTGCTGA AGCTGCGGGA CCTGCCGGCC GTGACTCTCG ACGGCCACCG GATGCGGTTA
AAGGGGAACG TCGAGTTCAT CCAGGAAGTG GCCGGCATCC ACCGGCACGG CGGCGAGGGG
ATCGGCCTGT ACCGCACCGA GATGCTCTTC TTGAACCGCA ACACGCTTCC CGGCGAGGAG
GAGCAGTTCG AGGCCTACGC GGCGCTGGTG AAGAAGATGG CCCCGCAATC GGTGACCATC
AGGACGCTGG ACATAGGCGG CGACAAGTGC CTGACCGATC TGGAGCTTAC CGACGAAATG
AACCCTGCCT TGGGGGTAAG GGCGATCCGC CTCTCGCTGC GCCAGCCCGA GGCCTTCAAG
GCCCAGCTGC GGGCCATCCT CCGGGCCAGC GCCTTCGGCG AAGTCCGCCT CTTCTTCCCT
ATGGTCTCCG GGGTGGCCGA GGTACGCGCC GCCAAGGCGC TCTTGGAGGA AGCAAAATCC
GAACTGCGCG GGCTCCATCC GTTCGACGAG AAGATCCAGG TCGGCATCAT GATCGAGATC
CCCTCGGCCG TGGTCATCGC CGACCTACTG GCCGCGGAGG TCGACTTCTT CAGCGTCGGG
ACCAACGATC TCATCCAGTA CACCCTCGCC ATCGACCGTA CCAACGAACA TCTGTCCTCG
CTTTACGAGC CGCTGCACCC GGCGGTGCTC CGTAGTCTGA AGATGGTGGT GGACGCGGCG
CACGGCGCGG GAATCGACGC CTGCATCTGC GGCGAGATGG CGGGGGAGCC GGAGTACCTA
CCCGTCCTTC TCGGTCTCGG CTTCGACGAG CTTTCCATGA ACGCGGTCTC AATCCCGCGC
GTCAAGAAGA TCCTGCGCCG CTGCAGCATG GAGGAGGCGC GCCTGGTCGC CTCGCGCGCC
CTTTCCTTTT CGACCGCGGC CGAAGCGGAC GCCTACCTGA AGGGAGAGAT AGCGGCACGC
TTCTCCGAGA GCTTCGACTG A
 
Protein sequence
MGLERSDSKE LKGIGASAGI AIGQVRITDR RRVSVTEVVV AHEEIPGEVA RFAKAIERAK 
GELCALKDQL SSTHGPEHLC VIDAHLMLLD DTMLVGETTQ YIERLGINAE GALKRTLSRF
KSFFDGVEDD YLRERGNDVE TVVERVLRNM VGQKQESIAK IEGKVIVVAH DLSPADILQI
DKEKVIGFIT DLGGKTSHSS ILARSFEIPA VVGLERATVE VGEGDTLIVD GATGVVIVNP
GVEQFRDYLQ KKQRYEYVER ELLKLRDLPA VTLDGHRMRL KGNVEFIQEV AGIHRHGGEG
IGLYRTEMLF LNRNTLPGEE EQFEAYAALV KKMAPQSVTI RTLDIGGDKC LTDLELTDEM
NPALGVRAIR LSLRQPEAFK AQLRAILRAS AFGEVRLFFP MVSGVAEVRA AKALLEEAKS
ELRGLHPFDE KIQVGIMIEI PSAVVIADLL AAEVDFFSVG TNDLIQYTLA IDRTNEHLSS
LYEPLHPAVL RSLKMVVDAA HGAGIDACIC GEMAGEPEYL PVLLGLGFDE LSMNAVSIPR
VKKILRRCSM EEARLVASRA LSFSTAAEAD AYLKGEIAAR FSESFD