Gene EcHS_A1453 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A1453 
SymbolabgB 
ID5591777 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp1448062 
End bp1449507 
Gene Length1446 bp 
Protein Length481 aa 
Translation table11 
GC content53% 
IMG OID640920607 
Productaminobenzoyl-glutamate utilization protein B 
Protein accessionYP_001458166 
Protein GI157160848 
COG category[R] General function prediction only 
COG ID[COG1473] Metal-dependent amidase/aminoacylase/carboxypeptidase 
TIGRFAM ID[TIGR01891] amidohydrolase 


Plasmid Coverage information

Num covering plasmid clones46 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAGGAAA TCTATCGTTT TATCGACGAT GCGATTGAAG CCGATCGCCA ACGTTATACC 
GATATTGCCG ATCAAATCTG GGATCATCCA GAAACACGTT TTGAAGAGTT CTGGTCAGCG
GAGCATCTGG CTTCGGCGCT GGAATCTGCA GGCTTCACCG TTACCCGCAA CGTAGGCAAT
ATCCCAAATG CCTTTATTGC TTCGTTTGGT CAAGGCAAAC CGGTTATCGC CCTGCTGGGA
GAATATGACG CCCTGGCAGG TTTAAGTCAG CAAGCAGGTT GCGCGCAACC TACATCCGTG
ACGCCCGGTG AAAATGGTCA CGGTTGCGGA CACAATTTGC TGGGAACCGC CGCCTTTGCC
GCTGCAATAG CCGTCAAGAA ATGGCTGGAA CAATATGGGC AAGGCGGCAC GGTGCGCTTT
TATGGTTGTC CTGGCGAAGA AGGCGGCTCG GGTAAAACGT TCATGGTTCG CGAGGGGGTA
TTTGATGATG TGGATGCGGC ACTCACCTGG CACCCGGAAG CCTTTGCCGG TATGTTCAAT
ACCCGCACGC TGGCAAACAT TCAGGCATCA TGGCGCTTTA AAGGGATCGC AGCACATGCC
GCGAATTCCC CTCATTTGGG ACGCAGCGCC CTTGATGCCG TAACGTTGAT GACCACTGGC
ACCAACTTCC TCAACGAACA TATTATTGAA AAAGCGCGCG TACACTATGC CATCACAAAT
AGCGGCGGGA TCTCGCCCAA CGTGGTCCAG GCGCAGGCAG AAGTGCTTTA TCTTATCCGC
GCCCCCGAAA TGACCGACGT GCAGCATATT TATGATCGGG TCGCCAAAAT CGCCGAAGGT
GCGGCATTGA TGACCGAAAC CACGGTTGAA TGCCGCTTCG ACAAAGCCTG TTCCAGTTAT
CTCCCGAATC GCACCTTAGA AAATGCCATG TACCAGGCCC TATCCCATTT TGGTACCCCG
GAATGGAACT CCGAAGAACT GGCTTTTGCG AAACAAATTC AGGCTACGCT CACCTCCAAC
GATCGGCAAA ACAGTCTGAA TAATATCGCC GCAACCGGTG GCGAAAACGG CAAGGTTTTT
GCACTACGTC ATCGTGAAAC GGTACTGGCG AATGAAGTCG CTCCATATGC CGCCACCGAT
AACGTGCTTG CGGCATCGAC TGATGTCGGC GACGTCAGTT GGAAACTGCC TGTTGCCCAG
TGTTTCAGCC CCTGTTTTGC CGTCGGTACA CCGCTACATA CGTGGCAACT GGTTAGCCAG
GGGCGAACAT CTATTGCTCA TAAAGGAATG CTGCTGGCGG CGAAAACTAT GGCAGCAACC
ACAGTCAATC TCTTCCTTGA TTCAGGGCTA TTGCAAGAAT GCCAACAAGA GCATCAGCAA
GTAACGGACA CGCAACCGTA TCACTGCCCT ATCCCGAAAA ACGTGACACC GTCACCTTTA
AAATAA
 
Protein sequence
MQEIYRFIDD AIEADRQRYT DIADQIWDHP ETRFEEFWSA EHLASALESA GFTVTRNVGN 
IPNAFIASFG QGKPVIALLG EYDALAGLSQ QAGCAQPTSV TPGENGHGCG HNLLGTAAFA
AAIAVKKWLE QYGQGGTVRF YGCPGEEGGS GKTFMVREGV FDDVDAALTW HPEAFAGMFN
TRTLANIQAS WRFKGIAAHA ANSPHLGRSA LDAVTLMTTG TNFLNEHIIE KARVHYAITN
SGGISPNVVQ AQAEVLYLIR APEMTDVQHI YDRVAKIAEG AALMTETTVE CRFDKACSSY
LPNRTLENAM YQALSHFGTP EWNSEELAFA KQIQATLTSN DRQNSLNNIA ATGGENGKVF
ALRHRETVLA NEVAPYAATD NVLAASTDVG DVSWKLPVAQ CFSPCFAVGT PLHTWQLVSQ
GRTSIAHKGM LLAAKTMAAT TVNLFLDSGL LQECQQEHQQ VTDTQPYHCP IPKNVTPSPL
K