Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_4871 |
Symbol | iadA |
ID | 6144094 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 4983394 |
End bp | 4984566 |
Gene Length | 1173 bp |
Protein Length | 390 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 641619675 |
Product | isoaspartyl dipeptidase |
Protein accession | YP_001746782 |
Protein GI | 170683847 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0044] Dihydroorotase and related cyclic amidohydrolases |
TIGRFAM ID | [TIGR01975] isoaspartyl dipeptidase IadA |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 52 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATTGATT ATACCGCAGC CGGTTTTACC CTGCTGCAGG GAGCGCATTT GTATGCGCCG GAAGATCGGG GAGTTTGCGA TGTCCTCGTC GCTAACGGCA AAATTATCGC CGTTGCCAGT AATATCCCTT CTGACATTGT ACCGGACTGC ACGGTTGTCG ATCTCAGCGG GCAGATCCTC TGCCCAGGTT TTATTGATCA ACACGTTCAT TTGATTGGCG GTGGCGGTGA AGCAGGTCCC ACGACGCGCA CGCCGGAAGT GGCGTTAAGT CGCCTGACGG AAGCGGGCGT CACGTCAGTG GTTGGTCTGC TGGGCACCGA CTCTATCTCT CGCCACCCGG AATCCCTGCT CGCCAAGACC CGTGCGCTCA ATGAAGAAGG CATCAGCGCC TGGATGCTGA CCGGCGCTTA TCATGTCCCT TCCCGCACTA TTACGGGTTC CGTGGAAAAA GACGTGGCGA TTATCGATCG CGTGATTGGT GTGAAATGCG CCATCTCTGA TCACCGTTCT GCCGCACCGG ACGTTTATCA CCTGGCTAAT ATGGCGGCGG AATCCCGCGT TGGCGGTTTG CTTGGCGGTA AACCTGGCGT CACCGTGTTC CACATGGGCG ACAGTAAAAA GGCGTTACAG CCTGTCTATG ACCTGCTGGA AAACTGCGAT GTGCCGATCA GCAAGCTGCT GCCGACCCAC GTTAACCGCA ACGTTCCGTT GTTTGAGCAG GCGCTGGAGT TTGCGCACAA AGGCGGCACT ATCGACATCA CCAGCAGCAT TGACGAACCG GTCGCCCCTG CCGAAGGTAT TGCCCGCGCC GTTCAGGCGG GTATTCCGCT GGCACGCGTC ACCCTCAGCT CCGACGGCAA CGGTAGCCAG CCGTTCTTCG ATGACGAAGG GAATTTAACC CATATCGGTG TTGCCGGTTT TGAAACGTTG CTGGAAACCG TGCAGGTGCT GGTCAAAGAC TATGATTTCA GTATCAGCGA TGCCCTGCGC CCGCTCACCA GTAGCGTAGC CGGTTTCCTT AACCTGACCG GGAAAGGCGA AATTCTGCCA GGCAATGATG CAGACTTACT GGTCATGACG CCAGAATTGC GCATTGAGCA GGTATACGCT CGCGGCAAAC TGATGGTCAA AGACGGCAAA GCCTGCGTGA AAGGAACGTT TGAAACGGCT TAA
|
Protein sequence | MIDYTAAGFT LLQGAHLYAP EDRGVCDVLV ANGKIIAVAS NIPSDIVPDC TVVDLSGQIL CPGFIDQHVH LIGGGGEAGP TTRTPEVALS RLTEAGVTSV VGLLGTDSIS RHPESLLAKT RALNEEGISA WMLTGAYHVP SRTITGSVEK DVAIIDRVIG VKCAISDHRS AAPDVYHLAN MAAESRVGGL LGGKPGVTVF HMGDSKKALQ PVYDLLENCD VPISKLLPTH VNRNVPLFEQ ALEFAHKGGT IDITSSIDEP VAPAEGIARA VQAGIPLARV TLSSDGNGSQ PFFDDEGNLT HIGVAGFETL LETVQVLVKD YDFSISDALR PLTSSVAGFL NLTGKGEILP GNDADLLVMT PELRIEQVYA RGKLMVKDGK ACVKGTFETA
|
| |