Gene ECH74115_1984 details

Gene Information

Locus tag: ECH74115_1984
Symbol: abgB
ID: 6970418
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 1876984
End bp: 1878429
Gene Length: 1446 bp
Protein Length: 481 aa
Translation table: 11
GC content: 52%
IMG OID: 643385908
Product: aminobenzoyl-glutamate utilization protein B
Protein accession: YP_002270397
Protein GI: 209398564
COG category: [R] General function prediction only
COG ID: [COG1473] Metal-dependent amidase/aminoacylase/carboxypeptidase
TIGRFAM ID: [TIGR01891] amidohydrolase
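
The start, end, gene length, and protein length fields above are related by simple arithmetic. The sketch below shows one way to check them. It assumes the coordinates are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 1876984
end_bp = 1878429

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1446 481
```

Both results agree with the Gene Length and Protein Length fields listed in the record.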


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 48
Fosmid unclonability p-value: 0.397765
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAGGAAA TCTATCGTTT TATCGACGAT GCGATTGAAG CCGATCGCCA ACGTTATACC 
GATATTGCCG ATCAAATCTG GGATCATCCA GAAACACGTT TTGAAGAGTT CTGGTCAGCG
GAGCATCTGG CTTCGGCGCT GGAATCTGCA GGCTTCACCG TTACCCGCAA CGTAGGCAAT
ATCCCAAATG CCTTTATTGC TTCGTTTGGT CAAGGCAAAC CGGTTATCGC CCTGCTGGGA
GAATATGACG CCCTGGCAGG TTTAAGTCAG CAAGCAGGTT GCGCGCAACC TACATCCGTG
ACGCCCGGTG AAAATGGTCA CGGTTGCGGA CACAATTTGC TGGGAACCGC CGCCTTTGCC
GCTGCAATAG CCATCAAGAA ATGGCTGGAA CAATATGGGC AAGGCGGCAC AGTGCGCTTT
TATGGTTGTC CTGGCGAAGA AGGCGGCTCG GGTAAAACGT TCATGGTCCG CGAGGGGGTA
TTTGATGATG TGGATGCGGC ACTCACCTGG CACCCGGAAG CCTTTGCCGG TATGTTCAAT
ACCCGTACGC TGGCAAACAT TCAGGCATCA TGGCGCTTTA AAGGGATCGC AGCACATGCC
GCGAATTCCC CTCATTTGGG ACGCAGCGCC CTTGATGCCG TAACGTTGAT GACCACTGGC
ACCAACTTCC TCAACGAACA TATTATTGAA AAAGCGCGCG TACACTATGC CATCACAAAT
AGTGGCGGGA TCTCGCCCAA CGTGGTCCAG GCGCAGGCAG AAGTGCTTTA TCTTATCCGC
GCCCCCGAAA TGACCGACGT GCAGCATATT TATGATCGGG TCGCCAAAAT CGCCGAAGGT
GCGGCATTGA TGACCGAAAC CACGGTTGAA TGCCGCTTCG ACAAAGCCTG TTCCAGTTAT
CTCCCGAATC GCACCTTAGA AAATGCCATG TACCAGGCCC TATCCCATTT TGGTACCCCG
GAATGGAACT CCGAAGAACT GGCTTTTGCG AAACAAATTC AGGCTACGCT TACCTCCAAC
GATCGGCAAA ACAGTCTGAA TAATATCGCC GCAACCGGTG GCGAAAACGG CAAGGTTTTT
GCACTACGTC ATCGTGAAAC GGTACTGGCG AATGAAGTCG CTCCATATGC CGCCACCGAT
AACGTGCTTG CGGCATCGAC TGATGTCGGC GACGTCAGTT GGAAACTGCC TGTTGCCCAG
TGTTTCAGCC CCTGTTTTGC CGTCGGTACA CCCCTACATA CGTGGCAACT GGTTAGCCAG
GGGCGAACAT CTATTGCTCA TAAAGGAATG CTGCTGGCGG CGAAAACTAT GGCAGCAACC
ACACTCAATC TCTTCATTGA TTCAGGGCTA TTGCAAGAAT GCCAACAAGA GCATCAGCAA
GTTACGGACA CGCAACCGTA TCACTGCCCT ATCCCGAAAA ACGTGACACC GTCACCTTTA
AAATAA
 
Protein sequence
MQEIYRFIDD AIEADRQRYT DIADQIWDHP ETRFEEFWSA EHLASALESA GFTVTRNVGN 
IPNAFIASFG QGKPVIALLG EYDALAGLSQ QAGCAQPTSV TPGENGHGCG HNLLGTAAFA
AAIAIKKWLE QYGQGGTVRF YGCPGEEGGS GKTFMVREGV FDDVDAALTW HPEAFAGMFN
TRTLANIQAS WRFKGIAAHA ANSPHLGRSA LDAVTLMTTG TNFLNEHIIE KARVHYAITN
SGGISPNVVQ AQAEVLYLIR APEMTDVQHI YDRVAKIAEG AALMTETTVE CRFDKACSSY
LPNRTLENAM YQALSHFGTP EWNSEELAFA KQIQATLTSN DRQNSLNNIA ATGGENGKVF
ALRHRETVLA NEVAPYAATD NVLAASTDVG DVSWKLPVAQ CFSPCFAVGT PLHTWQLVSQ
GRTSIAHKGM LLAAKTMAAT TLNLFIDSGL LQECQQEHQQ VTDTQPYHCP IPKNVTPSPL
K
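
The gene and protein sequences above can be cross-checked by translating the DNA codon by codon. The following is a minimal sketch in plain Python, not the database's own method. It relies on the fact that translation table 11 assigns the same amino acid to each codon as the standard genetic code (the two tables differ in which codons may act as start codons). The helper names `clean`, `translate`, and `gc_percent` are hypothetical and chosen for illustration only.

```python
# Codon table in the conventional TCAG ordering. Translation table 11 uses
# the same codon-to-amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display grouping."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base and stop at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence, copied from the record above.
first_line = "ATGCAGGAAA TCTATCGTTT TATCGACGAT GCGATTGAAG CCGATCGCCA ACGTTATACC"
print(translate(first_line))  # MQEIYRFIDDAIEADRQRYT
```

The printed residues match the first 20 residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_percent` would allow the same comparison against the complete protein sequence and the GC content field.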