Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_1985 |
Symbol | abgA |
ID | 6969625 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 1878429 |
End bp | 1879739 |
Gene Length | 1311 bp |
Protein Length | 436 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 643385909 |
Product | aminobenzoyl-glutamate utilization protein A |
Protein accession | YP_002270398 |
Protein GI | 209397530 |
COG category | [R] General function prediction only |
COG ID | [COG1473] Metal-dependent amidase/aminoacylase/carboxypeptidase |
TIGRFAM ID | [TIGR01891] amidohydrolase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 50 |
Fosmid unclonability p-value | 0.556014 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAGTCTT TGAATCAATT TGTTAATTCG CTTGCCCCAA AATTATCGCA CTGGCGACGT GATTTTCATC ACTATGCAGA GTCTGGCTGG GTGGAATTCC GCACTGCCAC CCTTGTTGCG GAAGAATTGC AGCAGCTCGG CTACTCACTG GCGCTGGGCC GCGAAGTAGT TAATGAAAAT AGCCGGATGG GACTACCTGA TGAACTCACT CTACAACGCG AATTCGAGCG CGCTCGTCAA CAGGGTGCGC TAGCACAATG GATTGCGGCT TTTGAAGGTG GTTTCACGGG TATCGTCGCC ACCCTGGATA CCGGTCGCCC CGGTCCGGTG ATGGCTTTCC GTGTCGATAT GGACGCGCTG GATCTCAGTG AAGAGCAGGA TGTCAGCCAT CGCCCCTACC GCGACGGTTT TGCGTCATGT AACGCCGGAA TGATGCATGC CTGTGGTCAT GATGGGCATA CCACCATTGG GCTTGGGCTG GCGCATACCC TTAAACAATT CGAGTCCGGA CTACATGGCG TCATCAAACT GATTTTTCAG CCTGCAGAGG AAGGTACGCG TGGCGCGCGG GCGATGGTCG ATGCAGGTGT CGTAGATGAT GTTGATTATT TTACTGCCGT GCACATTGGC ACTGGCGTAC CTGCGGGCAC CGTGGTGTGC GGCAGTGATA ATTTTATGGC AACCACCAAA TTTGACGCGC ACTTCACCGG TACCGCCGCT CACGCAGGCG CAAAACCAGA AGACGGTTAC AATGCCTTGT TGGCGGCAGC ACAAGCCACT CTTGCACTGC ATGCAATCGC CCCGCACAGC GAAGGAGCTT CCAGAGTAAA CGTGGGCGTT ATGCAGGCAG GAAGCGGTCG TAACGTTGTT CCTGCCTCGG CGTTGCTGAA AGTGGAAACA CGCGGGGCCA GCGACGTCAT TAATCAATAT GTTTTTGACC GTGCTCAGCA AGCGATTCAG GGCGCAGCAA CCATGTATGG TGTCGGCGTT GAAACTCGTC TGATGGGTGC AGCTACCGCC AGTTCTCCTT CGCCGCAATG GGTCGCATGG TTGCAAAGTC AGGCGGCTCA GGTCGCGGGG GTCAATCAGG CCATTGAACG TGTTGAAGCG CCTGCGGGTT CCGAAGATGC CACATTAATG ATGGCCCGCG TGCAGCAACA TCAAGGGCAA GCCTCCTACG TGGTGTTTGG CACACAGCTG GCGGCAGGTC ATCACAACGA AAAATTCGAT TTTGACGAGC AGGTTCTCGC TATTGCCGTC GAAACGCTGG CGCGCACCGC GCTCAATTTT CCCTGGACGC GAGGTATCTG A
|
Protein sequence | MESLNQFVNS LAPKLSHWRR DFHHYAESGW VEFRTATLVA EELQQLGYSL ALGREVVNEN SRMGLPDELT LQREFERARQ QGALAQWIAA FEGGFTGIVA TLDTGRPGPV MAFRVDMDAL DLSEEQDVSH RPYRDGFASC NAGMMHACGH DGHTTIGLGL AHTLKQFESG LHGVIKLIFQ PAEEGTRGAR AMVDAGVVDD VDYFTAVHIG TGVPAGTVVC GSDNFMATTK FDAHFTGTAA HAGAKPEDGY NALLAAAQAT LALHAIAPHS EGASRVNVGV MQAGSGRNVV PASALLKVET RGASDVINQY VFDRAQQAIQ GAATMYGVGV ETRLMGAATA SSPSPQWVAW LQSQAAQVAG VNQAIERVEA PAGSEDATLM MARVQQHQGQ ASYVVFGTQL AAGHHNEKFD FDEQVLAIAV ETLARTALNF PWTRGI
|
| |