Gene EcSMS35_1785 details

Gene Information

Locus tag: EcSMS35_1785
Symbol: abgB
ID: 6144073
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 1803199
End bp: 1804644
Gene Length: 1446 bp
Protein Length: 481 aa
Translation table: 11
GC content: 53%
IMG OID: 641616661
Product: aminobenzoyl-glutamate utilization protein B
Protein accession: YP_001743839
Protein GI: 170682636
COG category: [R] General function prediction only
COG ID: [COG1473] Metal-dependent amidase/aminoacylase/carboxypeptidase
TIGRFAM ID: [TIGR01891] amidohydrolase
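
The Start bp, End bp, Gene Length, and Protein Length fields can be cross-checked with simple arithmetic. The sketch below is only a consistency check, not part of the record. It assumes the coordinates are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The variable names are illustrative.

```python
# Values copied from the Gene Information fields above.
start_bp = 1803199
end_bp = 1804644

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: one terminal stop codon, which is not translated.
stop_codons = 1
codons, remainder = divmod(gene_length_bp, 3)
protein_length_aa = codons - stop_codons

print(gene_length_bp, remainder, protein_length_aa)  # 1446 0 481
```

Under these assumptions the computed values agree with the listed Gene Length (1446 bp) and Protein Length (481 aa).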


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 52
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAGGAAA TCTATCGTTT TATCGACGAT GCGATTGAAG CCGATCGCCA ACGTTATACC 
GATATTGCCG ATCAAATCTG GGATCATCCA GAAACACGTT TTGAAGAGTT CTGGTCAGCG
GAGCATCTGG CTTCGGCGCT GGAGTCTGCA GGCTTCACCG TTACCCGCAA CGTAGGCAAT
ATCCCAAATG CCTTTATTGC TTCGTTTGGT CAAGGCAAAC CGGTTATCGC CCTGCTGGGG
GAATATGACG CCCTGGCAGG TTTAAGTCAG CAAGCAAGTT GCGCGCAACC TACATCCGCG
ACGCCCGGTG AAAATGGTCA CGGTTGCGGA CACAATTTGC TGGGAACCGC CGCCTTTGCC
GCTGCAATAG CCGTCAAGAA ATGGCTGGAA CAATATGGGC AAGGCGGCAC GGTGCGCTTT
TATGGTTGTC CTGGCGAAGA AGGCGGCTCG GGTAAAACGT TCATGGTCCG CGAGGGGGTA
TTTGATGATG TGGATGCGGC ACTCACCTGG CACCCGGAAG CCTTTGCCGG TATGTTCAAT
ACCCGCACGC TGGCAAACAT TCAGGCATCA TGGCGCTTTA AAGGGATCGC AGCACATGCC
GCGAATTCCC CTCATTTGGG ACGCAGCGCC CTTGATGCCG TAACGTTGAT GACCACTGGC
ACCAACTTCC TCAACGAACA TATTATTGAA AAAGCGCGCG TACACTATGC CATCACAGAT
AGCGGCGGGA TCTCGCCCAA CGTGGTCCAG GCGCAGGCAG AAGTGCTTTA TCTTATCCGC
GCCCCCGAAA TGACCGATGT GCAGCATATT TATGATCGGG TCGCCAAAAT CGCCGAAGGT
GCGGCATTGA TGACCGAAAC CACGGTTGAA TGCCGCTTCG ACAAAGCCTG TTCCAGTTAT
CTCCCGAATC GCACCTTAGA AAATGCCATG TACCGAGCCC TATCCCATTT TGGTACCCCG
GAATGGAACT GCGAAGAACT GGCTTTTGCG AAACAAATTC AGGCTACGCT CACCCCCAAC
GATCGGCAAA ACAGTCTGAA TAATATCGCT GCAACCGGTG GCGAAAACGG CAAGGCTTTT
GCACTACGTC ATCGTGAAAC GGTACTGGCG AATGAAGTCG CTCCATATGC CGCCACCGAT
AACGTGCTTG CGGCATCGAC TGATGTCGGC GACGTCAGTT GGAAACTGCC TGTTGCCCAG
TGTTTCAGCC CCTGCTTTGC CGTCGGTACC CCGCTACATA CGTGGCAACT GGTTAGCCAG
GGGCGAACAT CTATTGCTCA TAAAGGAATG CTGCTGGCGG CGAAAACTAT GGCAGCAACC
ACACTTAATC TCTTCATTGA TTCAGGGCTA TTGCAAGAAT GCCAACAAGA GCATCAGCAA
GTTACGGACA CGCAACCGTA TCACTGCCCT ATCCCGAAAA ACGTGACACC GTCACCTTTA
AAATAA
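
The gene sequence above is laid out in space-separated blocks of ten bases, so any calculation on it first has to strip the whitespace. As a minimal sketch, the hypothetical helper `gc_percent` below (not part of the record) shows one way a GC percentage could be computed from text in this layout. It assumes the input contains only the letters A, C, G, and T plus whitespace.

```python
def gc_percent(blocked_sequence: str) -> float:
    """Return the percentage of G and C bases in a whitespace-blocked sequence."""
    # Joining the split pieces removes spaces and line breaks between blocks.
    bases = "".join(blocked_sequence.split()).upper()
    if not bases:
        raise ValueError("sequence is empty")
    gc_count = bases.count("G") + bases.count("C")
    return 100.0 * gc_count / len(bases)


# The first ten-base block of the gene sequence listed above.
first_block = "ATGCAGGAAA"
print(gc_percent(first_block))  # 40.0
```

Passing the full gene sequence instead of the first block would give a figure comparable to the GC content field in the Gene Information section.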
 
Protein sequence
MQEIYRFIDD AIEADRQRYT DIADQIWDHP ETRFEEFWSA EHLASALESA GFTVTRNVGN 
IPNAFIASFG QGKPVIALLG EYDALAGLSQ QASCAQPTSA TPGENGHGCG HNLLGTAAFA
AAIAVKKWLE QYGQGGTVRF YGCPGEEGGS GKTFMVREGV FDDVDAALTW HPEAFAGMFN
TRTLANIQAS WRFKGIAAHA ANSPHLGRSA LDAVTLMTTG TNFLNEHIIE KARVHYAITD
SGGISPNVVQ AQAEVLYLIR APEMTDVQHI YDRVAKIAEG AALMTETTVE CRFDKACSSY
LPNRTLENAM YRALSHFGTP EWNCEELAFA KQIQATLTPN DRQNSLNNIA ATGGENGKAF
ALRHRETVLA NEVAPYAATD NVLAASTDVG DVSWKLPVAQ CFSPCFAVGT PLHTWQLVSQ
GRTSIAHKGM LLAAKTMAAT TLNLFIDSGL LQECQQEHQQ VTDTQPYHCP IPKNVTPSPL
K