Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_1784 |
Symbol | abgA |
ID | 6147178 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 1801898 |
End bp | 1803199 |
Gene Length | 1302 bp |
Protein Length | 433 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 641616660 |
Product | aminobenzoyl-glutamate utilization protein A |
Protein accession | YP_001743838 |
Protein GI | 170679911 |
COG category | [R] General function prediction only |
COG ID | [COG1473] Metal-dependent amidase/aminoacylase/carboxypeptidase |
TIGRFAM ID | [TIGR01891] amidohydrolase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 54 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGAATCAAT TTATTAATTC GCTTGCCCCA AAATTATCGC ACTGGCGACG TGATTTTCAC CACTATGCAG AGTCTGGCTG GGTGGAATTC CGCACTGCCA CCGTTGTTGC GGAAGAATTG CACCAGCTCG GTTATTCACT GGCGCTGGGC CGCGAAGTCG TTAATGAAAG TAGCCGGATG GGATTACCTG ATGAATTCAC TCTACAACGC GAATTCGAGC GCGCTCGTCA ACAGGGGGCG CTAGAACAAT GGATTGCGGC TTTTGAAGGC GGTTTCACTG GCATCGTCGC TACCCTGGAT ACTGGTCGCC CCGGTCCGGT GATGGCTTTC CGTGTCGATA TGGACGCGCT GGATCTCAGT GAAGAGCAGG ATGTCAGCCA TCGCCCCTAC CGCGACGGTT TTGCGTCATG TAACGCCGGA ATGATGCATG CCTGTGGTCA TGATGGGCAT ACCGCCATTG GGCTTGGGCT GGCGCATACC CTTAAACAAT TCGAGTCCGG ACTACATGGC GTCATCAAAC TGATTTTTCA GCCTGCAGAG GAAGGTACGC GTGGCGCGCG GGCGATGGTC GATGCAGGTG TCGTAGATGA TGTTGATTAT TTTACTGCCG TGCACATTGG CACTGGCGTA CCTGCGGGCA CCGTGGTGTG CGGCAGTGAT AATTTTATGG CAACCACCAA ATTTGACGCG CACTTCACCG GGACCGCCGC TCACGCAGGC GCAAAACCAG AAGACGGTCA CAATGCCTTG TTGGCGGCAG CACAAGCAAC TCTTGCACTG CATGCAATCG CCCCGCACAG CGAAGGCGCT TCCAGAGTAA ACGTGGGCGT TATGCAGGCA GGAAGCGGTC GTAATGTTGT TCCTGCCTCG GCGTTGCTGA AAGTGGAAAC ACGCGGGGCC AGCGACGTCA TTAATCAATA TGTTTTTGAA CGTGCACAAC AAGCGATTCA GGGCGCAGCA ACCATGTATG GTGTCGGCGT TGAAACTCGT CTGATGGGTG CTGCTACCGC CAGTTCTCCT TCGCCGCAAT GGGTCGCATG GTTGCAAATC CAGGCGGCTC AGGTCGCGGG GGTCAATCAG GCCATTGAAC GTGTTGAAGC GCCTGCGGGT TCCGAAGATG CCACATTAAT GATGGCCCGC GTGCAGCGAC ATCAAGGGCA AGCCTCCTAC ATGGTATTTG GCACACAGCT GGCGGCAGGT CATCACAATG AAAAATTCGA TTTTGACGAG CAGGTTCTCG CTATTGCCGT CGAAACGCTG GCGCGCACCG CGCTCAATTT TCCCTGGACG CGAGGTATCT GA
|
Protein sequence | MNQFINSLAP KLSHWRRDFH HYAESGWVEF RTATVVAEEL HQLGYSLALG REVVNESSRM GLPDEFTLQR EFERARQQGA LEQWIAAFEG GFTGIVATLD TGRPGPVMAF RVDMDALDLS EEQDVSHRPY RDGFASCNAG MMHACGHDGH TAIGLGLAHT LKQFESGLHG VIKLIFQPAE EGTRGARAMV DAGVVDDVDY FTAVHIGTGV PAGTVVCGSD NFMATTKFDA HFTGTAAHAG AKPEDGHNAL LAAAQATLAL HAIAPHSEGA SRVNVGVMQA GSGRNVVPAS ALLKVETRGA SDVINQYVFE RAQQAIQGAA TMYGVGVETR LMGAATASSP SPQWVAWLQI QAAQVAGVNQ AIERVEAPAG SEDATLMMAR VQRHQGQASY MVFGTQLAAG HHNEKFDFDE QVLAIAVETL ARTALNFPWT RGI
|
| |