Gene EcSMS35_1784 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_1784 
SymbolabgA 
ID6147178 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp1801898 
End bp1803199 
Gene Length1302 bp 
Protein Length433 aa 
Translation table11 
GC content55% 
IMG OID641616660 
Productaminobenzoyl-glutamate utilization protein A 
Protein accessionYP_001743838 
Protein GI170679911 
COG category[R] General function prediction only 
COG ID[COG1473] Metal-dependent amidase/aminoacylase/carboxypeptidase 
TIGRFAM ID[TIGR01891] amidohydrolase 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones54 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGAATCAAT TTATTAATTC GCTTGCCCCA AAATTATCGC ACTGGCGACG TGATTTTCAC 
CACTATGCAG AGTCTGGCTG GGTGGAATTC CGCACTGCCA CCGTTGTTGC GGAAGAATTG
CACCAGCTCG GTTATTCACT GGCGCTGGGC CGCGAAGTCG TTAATGAAAG TAGCCGGATG
GGATTACCTG ATGAATTCAC TCTACAACGC GAATTCGAGC GCGCTCGTCA ACAGGGGGCG
CTAGAACAAT GGATTGCGGC TTTTGAAGGC GGTTTCACTG GCATCGTCGC TACCCTGGAT
ACTGGTCGCC CCGGTCCGGT GATGGCTTTC CGTGTCGATA TGGACGCGCT GGATCTCAGT
GAAGAGCAGG ATGTCAGCCA TCGCCCCTAC CGCGACGGTT TTGCGTCATG TAACGCCGGA
ATGATGCATG CCTGTGGTCA TGATGGGCAT ACCGCCATTG GGCTTGGGCT GGCGCATACC
CTTAAACAAT TCGAGTCCGG ACTACATGGC GTCATCAAAC TGATTTTTCA GCCTGCAGAG
GAAGGTACGC GTGGCGCGCG GGCGATGGTC GATGCAGGTG TCGTAGATGA TGTTGATTAT
TTTACTGCCG TGCACATTGG CACTGGCGTA CCTGCGGGCA CCGTGGTGTG CGGCAGTGAT
AATTTTATGG CAACCACCAA ATTTGACGCG CACTTCACCG GGACCGCCGC TCACGCAGGC
GCAAAACCAG AAGACGGTCA CAATGCCTTG TTGGCGGCAG CACAAGCAAC TCTTGCACTG
CATGCAATCG CCCCGCACAG CGAAGGCGCT TCCAGAGTAA ACGTGGGCGT TATGCAGGCA
GGAAGCGGTC GTAATGTTGT TCCTGCCTCG GCGTTGCTGA AAGTGGAAAC ACGCGGGGCC
AGCGACGTCA TTAATCAATA TGTTTTTGAA CGTGCACAAC AAGCGATTCA GGGCGCAGCA
ACCATGTATG GTGTCGGCGT TGAAACTCGT CTGATGGGTG CTGCTACCGC CAGTTCTCCT
TCGCCGCAAT GGGTCGCATG GTTGCAAATC CAGGCGGCTC AGGTCGCGGG GGTCAATCAG
GCCATTGAAC GTGTTGAAGC GCCTGCGGGT TCCGAAGATG CCACATTAAT GATGGCCCGC
GTGCAGCGAC ATCAAGGGCA AGCCTCCTAC ATGGTATTTG GCACACAGCT GGCGGCAGGT
CATCACAATG AAAAATTCGA TTTTGACGAG CAGGTTCTCG CTATTGCCGT CGAAACGCTG
GCGCGCACCG CGCTCAATTT TCCCTGGACG CGAGGTATCT GA
 
Protein sequence
MNQFINSLAP KLSHWRRDFH HYAESGWVEF RTATVVAEEL HQLGYSLALG REVVNESSRM 
GLPDEFTLQR EFERARQQGA LEQWIAAFEG GFTGIVATLD TGRPGPVMAF RVDMDALDLS
EEQDVSHRPY RDGFASCNAG MMHACGHDGH TAIGLGLAHT LKQFESGLHG VIKLIFQPAE
EGTRGARAMV DAGVVDDVDY FTAVHIGTGV PAGTVVCGSD NFMATTKFDA HFTGTAAHAG
AKPEDGHNAL LAAAQATLAL HAIAPHSEGA SRVNVGVMQA GSGRNVVPAS ALLKVETRGA
SDVINQYVFE RAQQAIQGAA TMYGVGVETR LMGAATASSP SPQWVAWLQI QAAQVAGVNQ
AIERVEAPAG SEDATLMMAR VQRHQGQASY MVFGTQLAAG HHNEKFDFDE QVLAIAVETL
ARTALNFPWT RGI