Gene EcSMS35_0296 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_0296 
SymbolproB 
ID6145427 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp304156 
End bp305259 
Gene Length1104 bp 
Protein Length367 aa 
Translation table11 
GC content56% 
IMG OID641615193 
Productgamma-glutamyl kinase 
Protein accessionYP_001742402 
Protein GI170681970 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0263] Glutamate 5-kinase 
TIGRFAM ID[TIGR01027] glutamate 5-kinase 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value0.00410823 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAGTGACA GCCAGACGCT GGTGGTAAAA CTCGGCACCA GTGTGCTAAC AGGCGGATCG 
CGCCGTCTGA ACCGTGCCCA TATCGTTGAA CTTGTTCGCC AGTGCGCGCA GTTACATGCC
GCCGGGCATC GGATTGTTAT TGTGACGTCG GGCGCGATCG CCGCCGGACG TGAGCACCTG
GGTTACCCGG AACTGCCAGC GACTATCGCC TCGAAACAAC TGCTGGCGGC GGTAGGGCAG
AGTCGACTGA TTCAACTATG GGAACAGCTG TTTTCTATTT ATGGCATTCA CGTCGGGCAA
ATGCTGCTGA CCCGTGCTGA TATGGAAGAC CGTGAACGCT TCCTGAACGC CCGCGACACC
TTGCGTGCGT TGCTCGATAA CAATATCGTT CCGGTAATCA ATGAGAACGA TGCTGTCGCT
ACTGCAGAGA TTAAGGTCGG TGATAACGAT AACCTTTCTG CGCTGGCGGC GATTCTGGCG
GGTGCCGATA AACTGTTGCT GCTGACCGAT CAAAAAGGTT TGTACACCGC TGACCCGCGC
AGCAATCCGC AGGCAGAACT GATTAAAGAT GTTTACGGCA TTGATGACGC ACTGCGCGCG
ATTGCCGGTG ACAGCGTTTC AGGCCTCGGA ACTGGCGGCA TGAGTACCAA ATTGCAGGCC
GCGGACGTGG CTTGCCGTGC GGGTATCGAC ACCATTATTG CCGCGGGCAG CAAGCCGGGC
GTTATTGGTG ATGTGATGGA AGGCATTTCC GTGGGTACGC TGTTCCATGC CCAGGCGACT
CCGCTTGAAA ACCGTAAACG CTGGATTTTC GGTGCGCCGC CGGCGGGTGA AATCACGGTA
GATGAAGGGG CAACTGCCGC CATTCTTGAA CGCGGCAGCT CCCTGTTGCC GAAAGGGATT
AAAAGCGTGA CTGGCAATTT CTCGCGTGGT GAAGTCATCC GCATTTGCAA CCTCGAAGGT
CGCGATATCG CCCACGGTGT CAGTCGTTAC AACAGCGATG CATTACGCCG TATTGCCGGA
CACCACTCGC AAGAAATTGA TGCAATACTG GGATATGAAT ACGGCCCGGT TGCCGTTCAC
CGTGATGACA TGATTACCCG TTAA
 
Protein sequence
MSDSQTLVVK LGTSVLTGGS RRLNRAHIVE LVRQCAQLHA AGHRIVIVTS GAIAAGREHL 
GYPELPATIA SKQLLAAVGQ SRLIQLWEQL FSIYGIHVGQ MLLTRADMED RERFLNARDT
LRALLDNNIV PVINENDAVA TAEIKVGDND NLSALAAILA GADKLLLLTD QKGLYTADPR
SNPQAELIKD VYGIDDALRA IAGDSVSGLG TGGMSTKLQA ADVACRAGID TIIAAGSKPG
VIGDVMEGIS VGTLFHAQAT PLENRKRWIF GAPPAGEITV DEGATAAILE RGSSLLPKGI
KSVTGNFSRG EVIRICNLEG RDIAHGVSRY NSDALRRIAG HHSQEIDAIL GYEYGPVAVH
RDDMITR