Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_4640 |
Symbol | amiB |
ID | 6143598 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 4740847 |
End bp | 4742184 |
Gene Length | 1338 bp |
Protein Length | 445 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641619456 |
Product | N-acetylmuramoyl-l-alanine amidase II |
Protein accession | YP_001746564 |
Protein GI | 170680514 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0860] N-acetylmuramoyl-L-alanine amidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.4247 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 37 |
Fosmid unclonability p-value | 0.0772035 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATGTATC GCATCAGAAA TTGGTTGGTA GCGACGTTGC TGCTGCTGTG CGCGCAGGTG GGTGCCGCGA CGCTCTCTGA TATTCAGGTT TCTAACGGCA ACCAACAGGC GCGGATAACG TTGAGTTTTA TTGGCGATCC TGATTATGCG TTTAGCCATC AAAGCAAACG CATCGTGGCG CTCGATATCA AACAAACGGG CGTGATTCAG GGACTGCCGT TGTTGTTCAG CGGCAATAAT CTGGTGAAGG CGATTCGCTC TGGAACGCCT AAAGATGCAC AAACGCTACG GCTGGTGGTC GATCTTACCG AAAATGGTAA AACCGAAGCG GTGAAGCGGC AGAATGGCAG CAATTACACT GTCGTCTTTA CGATTAACGC CGATGCGCCG CCACCGCCTC CTCCGCCGCC TGTGGTTGCG AAACGCGTTG AAACGCCTGC GGTTGGCGCA CCGCGCGTCA GCGAACCGGC GCGCAATCCG TTTAAAACGG AAAGTAACCG CACTACGGGT GTTATCAGCA GTAATACGGT AACGCGTCCG GCAGCGCGCG CGACGGCTAA CACTGGCGAT AAAATTATCA TCGCTATTGA TGCCGGACAC GGCGGTCAGG ATCCTGGCGC TATCGGCCCC GGTGGTACGC GGGAGAAAAA TGTCACCATC GCCATCGCAC GTAAATTACG TACTTTGCTC AATGACGATC CAATGTTTAA AGGCGTTTTA ACCCGTGACG GGGATTACTT TATTTCGGTG ATGGGGCGCA GCGATGTGGC ACGTAAGCAA AACGCCAATT TCCTCGTGTC GATTCACGCT GATGCCGCAC CAAACCGCAG TGCGACTGGC GCTTCCGTAT GGGTGCTCTC TAACCGTCGT GCAAACAGCG AGATGGCAAG CTGGCTGGAA CAGCATGAGA AACAGTCGGA GCTACTGGGC GGAGCGGGCG ATGTGCTGGC GAACAGTCAG TCTGACCCCT ATTTGAGCCA GGCGGTGCTG GATTTACAGT TCGGTCATTC CCAGCGGGTA GGGTATGATG TAGCGACCAG TATGATCAGT CAGTTGCAAC GCATTGGCGA AATTCATAAA CGTCGACCAG AACACGCCAG CCTTGGCGTT CTGCGTTCGC CGGATATCCC ATCAGTACTG GTCGAAACCG GTTTTATCAG CAACAACAGC GAAGAACGTT TGCTGGCGAG CGACGATTAC CAACAACAGC TGGCAGAAGC CATTTATAAA GGTCTGCGCA ATTATTTCCT TGCGCATCCG ATGCAATCTG CGCCGCAGGG TGCAACGGCA CAAACTGCCA GTACGGTGAC GACGCCAGAT CGTACGCTGC CAAACTAA
|
Protein sequence | MMYRIRNWLV ATLLLLCAQV GAATLSDIQV SNGNQQARIT LSFIGDPDYA FSHQSKRIVA LDIKQTGVIQ GLPLLFSGNN LVKAIRSGTP KDAQTLRLVV DLTENGKTEA VKRQNGSNYT VVFTINADAP PPPPPPPVVA KRVETPAVGA PRVSEPARNP FKTESNRTTG VISSNTVTRP AARATANTGD KIIIAIDAGH GGQDPGAIGP GGTREKNVTI AIARKLRTLL NDDPMFKGVL TRDGDYFISV MGRSDVARKQ NANFLVSIHA DAAPNRSATG ASVWVLSNRR ANSEMASWLE QHEKQSELLG GAGDVLANSQ SDPYLSQAVL DLQFGHSQRV GYDVATSMIS QLQRIGEIHK RRPEHASLGV LRSPDIPSVL VETGFISNNS EERLLASDDY QQQLAEAIYK GLRNYFLAHP MQSAPQGATA QTASTVTTPD RTLPN
|
| |