Gene ECH74115_0287 details


Gene Information

Locus tag: ECH74115_0287
Symbol: proB
ID: 6970410
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 300060
End bp: 301163
Gene Length: 1104 bp
Protein Length: 367 aa
Translation table: 11
GC content: 56%
IMG OID: 643384353
Product: gamma-glutamyl kinase
Protein accession: YP_002268869
Protein GI: 209398531
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0263] Glutamate 5-kinase
TIGRFAM ID: [TIGR01027] glutamate 5-kinase
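
The coordinate and length fields above are consistent with one another, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative only and are not part of the database.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 300060
END_BP = 301163
GENE_LENGTH_BP = 1104
PROTEIN_LENGTH_AA = 367


def span_length(start, end):
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(length_bp):
    """Number of amino-acid codons, assuming one terminal stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


computed_length = span_length(START_BP, END_BP)
computed_protein = coding_codons(computed_length)

print("gene length matches:", computed_length == GENE_LENGTH_BP)
print("protein length matches:", computed_protein == PROTEIN_LENGTH_AA)
```

Under these assumptions, 301163 - 300060 + 1 gives 1104 bp, and 1104 / 3 - 1 gives 367 aa, in agreement with the listed values.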


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.406641
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 53
Fosmid unclonability p-value: 0.77952
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTGACA GCCAGACGCT GGTGGTAAAA CTCGGCACCA GTGTGCTAAC AGGCGGATCG 
CGCCGCCTGA ACCGTGCCCA TATCGTTGAA CTTGTTCGCC AGTGCGCGCA GTTACATGCC
GCCGGGCATC GGATTGTTAT TGTGACGTCG GGCGCGATCG CCGCCGGACG TGAGCACCTG
GGTTACCCGG AACTGCCAGC GACTATCGCC TCGAAACAAC TGCTGGCGGC GGTAGGGCAG
AGTCGACTGA TTCAACTGTG GGAACAGCTG TTTTCGATTT ATGGCATTCA CGTCGGGCAA
ATGCTGCTGA CTCGTGCTGA TATGGAAGAC CGTGAACGCT TCCTGAACGC CCGCGACACC
CTGCGTGCGT TGCTCGATAA CAATATCGTT CCGGTAATCA ATGAGAACGA TGCTGTCGCT
ACGGCAGAGA TTAAAGTCGG CGATAACGAC AACCTTTCTG CACTGGCGGC GATTCTGGCG
GGTGCCGATA AACTGTTGTT ACTGACCGAT CAAAAAGGTT TGTATACCGC TGACCCGCGC
AGCAATCCGC AGGCAGAACT GATTAAAGAT GTTTACGGCA TTGATGACGC ACTGCGCGCG
ATTGCTGGTG ACAGCGTTTC AGGCCTCGGA ACTGGCGGCA TGAGTACCAA ATTGCAGGCC
GCTGACGTGG CTTGCCGTGC GGGTATCGAC ACCATTATTG CCGCGGGCAG CAAGCCGGGC
GTTATTGGTG ATGTGATGGA AGGCATTTCC GTCGGTACGC TGTTCCATGC CCAGGCGACT
CCGCTTGAAA ACCGTAAACG CTGGATTTTC GGTGCGCCGC CTGCGGGTGA AATCACGGTA
GATGAAGGGG CAACCGCCGC CATTCTTGAA CGCGGCAGCT CCCTGTTGCC GAAAGGCATT
AAAAGCGTGA CTGGCAACTT CTCGCGTGGT GAAGTCATCC GCATTTGTAA CCTCGAAGGT
CGCGATATCG CCCACGGCGT CAGTCGTTAC AACAGCGATG CATTACGCCG TATTGCCGGA
CACCACTCGC AAGAAATTGA TGCAATACTG GGATATGAAT ACGGCCCGGT TGCCGTTCAC
CGTGATGACA TGATCACCCG TTAA
 
Protein sequence
MSDSQTLVVK LGTSVLTGGS RRLNRAHIVE LVRQCAQLHA AGHRIVIVTS GAIAAGREHL 
GYPELPATIA SKQLLAAVGQ SRLIQLWEQL FSIYGIHVGQ MLLTRADMED RERFLNARDT
LRALLDNNIV PVINENDAVA TAEIKVGDND NLSALAAILA GADKLLLLTD QKGLYTADPR
SNPQAELIKD VYGIDDALRA IAGDSVSGLG TGGMSTKLQA ADVACRAGID TIIAAGSKPG
VIGDVMEGIS VGTLFHAQAT PLENRKRWIF GAPPAGEITV DEGATAAILE RGSSLLPKGI
KSVTGNFSRG EVIRICNLEG RDIAHGVSRY NSDALRRIAG HHSQEIDAIL GYEYGPVAVH
RDDMITR
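
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It assumes the amino-acid assignments of NCBI translation table 11, which are the same as those of the standard code (the tables differ in permitted start codons, which this sketch does not model). The helper names are hypothetical, and only the first and last blocks of the gene sequence are used here rather than the full record.

```python
from itertools import product

# Amino-acid assignments for codons enumerated in TCAG order.
# These match NCBI translation table 11 (same assignments as table 1).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate a nucleotide string codon by codon, stopping at a stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and final 24 bases of the gene sequence in this record.
first_block = "ATGAGTGACA GCCAGACGCT GGTGGTAAAA"
last_block = "CGTGATGACA TGATCACCCG TTAA"

print(translate(first_block))
print(translate(last_block))
```

The two fragments translate to the first ten and last seven residues of the listed protein sequence (MSDSQTLVVK and RDDMITR), with the final TAA codon acting as the stop.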