Gene GM21_3388 details


Gene Information

Locus tag: GM21_3388
Symbol:
ID: 8138755
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 3919329
End bp: 3920279
Gene Length: 951 bp
Protein Length: 316 aa
Translation table: 11
GC content: 63%
IMG OID: 644871006
Product: HPr kinase/phosphorylase
Protein accession: YP_003023171
Protein GI: 253701982
COG category: [T] Signal transduction mechanisms
COG ID: [COG1493] Serine kinase of the HPr protein, regulates carbohydrate metabolism
TIGRFAM ID: [TIGR00679] Hpr(Ser) kinase/phosphatase
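
The reported lengths can be cross-checked against the coordinates. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. Both assumptions are consistent with the values in this record but are not stated by it. The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Number of residues, assuming a complete CDS with one stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_length // 3 - 1


length = gene_length_bp(3919329, 3920279)
print(length)                     # 951
print(protein_length_aa(length))  # 316
```

Both results agree with the Gene Length and Protein Length fields above.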


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 119
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCCTCA GCATCAAGGA TCTTCTGGAA GACAAGGCCT ACGGCCTGGA CCTGCAGCTT 
CTCTGCGGCG AAGAGGGGCT GGCGAACCGT CTCTACAGTT CCCGCATCCA GAAACCGGGG
CTCGCGCTCA CCGGGTACAC GGAGCACCTG CACCCGGACC GGGTCCAGGT ACTGGGCAAC
ACCGAGATCT CCTATCTGAG CCAGATCCCC GAGGAAGTGG GAAGCCGGCA CATCGAGAAG
CTCTGCAGCT ACCCCATCGC CTGCTTCATC GTCACCAAGG GGCTCGATCC CCCCGAATTC
CTGAAGAAGA GCGCCCAGGA CGCCGGGATC CCGCTTCTGG GCACGCACCA CCAGTCCTCC
ACCTTCATCT CCCTCATCAC CAAGTTCCTG GAGGAGAGCC TGCTCCCCTC GACCCACATC
CACGGCGTGC TGGTAGACGT CCTTGGGGTG GGGGTGCTGC TCTTAGGCAA GAGCGGCATA
GGCAAGAGCG AGTGCGCCCT CGACCTGGTG ATCCGCGGAC ACCGGCTGGT CGCCGACGAC
GTGATCCACA TCAAGAAGAA GATGCCGGCG GCCCTGGTCG GGCAGGCGGG AGAGTCGATC
CAGTACCACA TGGAGATCCG GGGCTTAGGG ATCATCAACG TGAAGGACCT CTACGGGGTT
TCCTCCATCC GGGAAAAGAA GATCATCGAC ATGGTCCTGG AGCTTATGGA GTGGGACGCA
AGCCAGGAGT ACGAGCGCCT GGGGATCGAC GAGCCGGTGA AGACCATCTT GGGCGTGGAC
CTGCCGCACC TGAAGATCCC GGTACGCCCG GGGCGCAACC TCACCTCCAT CATCGAGGTG
GCCGCGAGGA ACTTCCTCCT GAAGGGGATG GGGTACCACT CCGCCCGGGA ATTCCACGAA
AAGCTCCTGG CCCGCCTGGA GTTCCGCCCG CTGGGCAACG AGGTGGAATA G
 
Protein sequence
MSLSIKDLLE DKAYGLDLQL LCGEEGLANR LYSSRIQKPG LALTGYTEHL HPDRVQVLGN 
TEISYLSQIP EEVGSRHIEK LCSYPIACFI VTKGLDPPEF LKKSAQDAGI PLLGTHHQSS
TFISLITKFL EESLLPSTHI HGVLVDVLGV GVLLLGKSGI GKSECALDLV IRGHRLVADD
VIHIKKKMPA ALVGQAGESI QYHMEIRGLG IINVKDLYGV SSIREKKIID MVLELMEWDA
SQEYERLGID EPVKTILGVD LPHLKIPVRP GRNLTSIIEV AARNFLLKGM GYHSAREFHE
KLLARLEFRP LGNEVE