Gene Information |
Locus tag | EcSMS35_1785 |
Symbol | abgB |
ID | 6144073 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 1803199 |
End bp | 1804644 |
Gene Length | 1446 bp |
Protein Length | 481 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641616661 |
Product | aminobenzoyl-glutamate utilization protein B |
Protein accession | YP_001743839 |
Protein GI | 170682636 |
COG category | [R] General function prediction only |
COG ID | [COG1473] Metal-dependent amidase/aminoacylase/carboxypeptidase |
TIGRFAM ID | [TIGR01891] amidohydrolase |
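
The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a minimal sketch. It assumes 1-based, inclusive coordinates and a single terminal stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical helpers for illustration and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length.

    Assumes the CDS length is a multiple of 3 and, if has_stop_codon is True,
    that the final codon is a stop codon that is not counted.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    return codons - 1 if has_stop_codon else codons


# Values copied from the record above (Start bp, End bp).
length = gene_length(1803199, 1804644)
print(length)                  # 1446
print(protein_length(length))  # 481
```

Both results agree with the Gene Length (1446 bp) and Protein Length (481 aa) rows.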
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 52 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAGGAAA TCTATCGTTT TATCGACGAT GCGATTGAAG CCGATCGCCA ACGTTATACC GATATTGCCG ATCAAATCTG GGATCATCCA GAAACACGTT TTGAAGAGTT CTGGTCAGCG GAGCATCTGG CTTCGGCGCT GGAGTCTGCA GGCTTCACCG TTACCCGCAA CGTAGGCAAT ATCCCAAATG CCTTTATTGC TTCGTTTGGT CAAGGCAAAC CGGTTATCGC CCTGCTGGGG GAATATGACG CCCTGGCAGG TTTAAGTCAG CAAGCAAGTT GCGCGCAACC TACATCCGCG ACGCCCGGTG AAAATGGTCA CGGTTGCGGA CACAATTTGC TGGGAACCGC CGCCTTTGCC GCTGCAATAG CCGTCAAGAA ATGGCTGGAA CAATATGGGC AAGGCGGCAC GGTGCGCTTT TATGGTTGTC CTGGCGAAGA AGGCGGCTCG GGTAAAACGT TCATGGTCCG CGAGGGGGTA TTTGATGATG TGGATGCGGC ACTCACCTGG CACCCGGAAG CCTTTGCCGG TATGTTCAAT ACCCGCACGC TGGCAAACAT TCAGGCATCA TGGCGCTTTA AAGGGATCGC AGCACATGCC GCGAATTCCC CTCATTTGGG ACGCAGCGCC CTTGATGCCG TAACGTTGAT GACCACTGGC ACCAACTTCC TCAACGAACA TATTATTGAA AAAGCGCGCG TACACTATGC CATCACAGAT AGCGGCGGGA TCTCGCCCAA CGTGGTCCAG GCGCAGGCAG AAGTGCTTTA TCTTATCCGC GCCCCCGAAA TGACCGATGT GCAGCATATT TATGATCGGG TCGCCAAAAT CGCCGAAGGT GCGGCATTGA TGACCGAAAC CACGGTTGAA TGCCGCTTCG ACAAAGCCTG TTCCAGTTAT CTCCCGAATC GCACCTTAGA AAATGCCATG TACCGAGCCC TATCCCATTT TGGTACCCCG GAATGGAACT GCGAAGAACT GGCTTTTGCG AAACAAATTC AGGCTACGCT CACCCCCAAC GATCGGCAAA ACAGTCTGAA TAATATCGCT GCAACCGGTG GCGAAAACGG CAAGGCTTTT GCACTACGTC ATCGTGAAAC GGTACTGGCG AATGAAGTCG CTCCATATGC CGCCACCGAT AACGTGCTTG CGGCATCGAC TGATGTCGGC GACGTCAGTT GGAAACTGCC TGTTGCCCAG TGTTTCAGCC CCTGCTTTGC CGTCGGTACC CCGCTACATA CGTGGCAACT GGTTAGCCAG GGGCGAACAT CTATTGCTCA TAAAGGAATG CTGCTGGCGG CGAAAACTAT GGCAGCAACC ACACTTAATC TCTTCATTGA TTCAGGGCTA TTGCAAGAAT GCCAACAAGA GCATCAGCAA GTTACGGACA CGCAACCGTA TCACTGCCCT ATCCCGAAAA ACGTGACACC GTCACCTTTA AAATAA
|
Protein sequence | MQEIYRFIDD AIEADRQRYT DIADQIWDHP ETRFEEFWSA EHLASALESA GFTVTRNVGN IPNAFIASFG QGKPVIALLG EYDALAGLSQ QASCAQPTSA TPGENGHGCG HNLLGTAAFA AAIAVKKWLE QYGQGGTVRF YGCPGEEGGS GKTFMVREGV FDDVDAALTW HPEAFAGMFN TRTLANIQAS WRFKGIAAHA ANSPHLGRSA LDAVTLMTTG TNFLNEHIIE KARVHYAITD SGGISPNVVQ AQAEVLYLIR APEMTDVQHI YDRVAKIAEG AALMTETTVE CRFDKACSSY LPNRTLENAM YRALSHFGTP EWNCEELAFA KQIQATLTPN DRQNSLNNIA ATGGENGKAF ALRHRETVLA NEVAPYAATD NVLAASTDVG DVSWKLPVAQ CFSPCFAVGT PLHTWQLVSQ GRTSIAHKGM LLAAKTMAAT TLNLFIDSGL LQECQQEHQQ VTDTQPYHCP IPKNVTPSPL K
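
The gene and protein sequences above are related by translation table 11, whose amino-acid assignments for sense codons match the standard genetic code. The following is a minimal sketch of how the two fields could be cross-checked and how a GC percentage could be computed from the space-separated sequence blocks. The helpers `clean`, `gc_percent`, and `translate` are hypothetical names introduced here. The sketch assumes only the unambiguous bases A, C, G, T, and it does not handle alternative start codons, which is sufficient here because this gene begins with ATG.

```python
# Codon table in the conventional TCAG ordering. Table 11 uses the same
# amino-acid assignments as the standard code for sense codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-character blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a non-empty nucleotide sequence."""
    seq = clean(seq)
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons from position 0, stopping at the first stop codon.

    Raises KeyError on ambiguous bases such as N (not handled in this sketch).
    """
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence from the record above.
first_blocks = "ATGCAGGAAA TCTATCGTTT TATCGACGAT"
print(translate(first_blocks))  # MQEIYRFIDD
```

The printed peptide matches the first block of the protein sequence. Passing the full gene sequence to `translate` and `gc_percent` would allow the Protein sequence and GC content fields to be compared in the same way.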