Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_2879 |
Symbol | iap |
ID | 6144091 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 2949779 |
End bp | 2950816 |
Gene Length | 1038 bp |
Protein Length | 345 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 641617748 |
Product | alkaline phosphatase isozyme conversion aminopeptidase |
Protein accession | YP_001744903 |
Protein GI | 170683777 |
COG category | [R] General function prediction only |
COG ID | [COG2234] Predicted aminopeptidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 52 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTTTCCG CATTGCGCCA CCGTACCGCT GCCCTGGCGC TCGGCGTATG CTTTATTCTC CCCGTACACG CCTCGTCACC TAAACCTGGC GATTTTGCTA ATACCCAGGC ACGACATATT GCTACTTTCT TTCCGGGACG CATGACCGGA ACTCCAGCAG AAATGTTATC TGCCGATTAT ATTCGCCAAC AGTTTCAGCA AATGGGTTAT CGCAGTGATA TTCGGACATT TAATAGTCGG TATATTTATA CCGCCCGCGA TAATCGCAAG AGCTGGCATA ACGTGACGGG AAGTACGGTG ATTGCCGCTC ATGAAGGCAA AGCGCCGCAG CAGATCATCA TTATGGCGCA TCTGGATACT TACGCCCCGC TGAGCGATGC TGACGCCGAT GCCAATCTCG GCGGGCTGAC GTTACAAGGA ATGGATGATA ACGCCGCAGG TTTAGGTGTC ATGCTGGAAT TGGCAGAACG CCTGAAAAAT ACGCCTACCG AGTATGGTAT TCGATTTGTG GCGACCAGCG GCGAAGAGGA AGGGAAATTA GGCGCTGAGA ATTTACTCAA GCGGATGAGT GACTCCGAAA AGAAAAATAC ACTGCTGGTG ATTAATCTCG ATAACTTAAT TGTTGGCGAT AAATTGTATT TCAACAGCGG TGTAAAAACC CCTGAAGCAG TAAGGAAATT AACGCGCGAC AGGGCGCTGG CAATTGCGCG TAGTCATGGA ATTGCCGCAA CGACCAATCC GGGTTTGAAT AAAAATTATC CGAAGGGTAC TGGCTGTTGT AATGACGCCG AAGTTTTCGA CAAAGCCGGT ATTGCGGTAC TTTCGGTGGA GGCGACCAAC TGGAATCTTG GAAATAAGGA TGGTTATCAG CAACGCGCAA AAACAGCTGC ATTCCCTGCG GGAAATAGCT GGCATGACGT AAGACTGGAT AATCAACAAC ATATTGATAA GGCACTTCCT GGAAGAATAG AACGTCGCTG CCGTGACGTT ATGCGGATAA TGCTACCTCT GGTGAAAGAG CTGGCGAAGG CGTCTTGA
|
Protein sequence | MFSALRHRTA ALALGVCFIL PVHASSPKPG DFANTQARHI ATFFPGRMTG TPAEMLSADY IRQQFQQMGY RSDIRTFNSR YIYTARDNRK SWHNVTGSTV IAAHEGKAPQ QIIIMAHLDT YAPLSDADAD ANLGGLTLQG MDDNAAGLGV MLELAERLKN TPTEYGIRFV ATSGEEEGKL GAENLLKRMS DSEKKNTLLV INLDNLIVGD KLYFNSGVKT PEAVRKLTRD RALAIARSHG IAATTNPGLN KNYPKGTGCC NDAEVFDKAG IAVLSVEATN WNLGNKDGYQ QRAKTAAFPA GNSWHDVRLD NQQHIDKALP GRIERRCRDV MRIMLPLVKE LAKAS
|
| |