Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_4740 |
Symbol | pepA |
ID | 6147238 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 4839389 |
End bp | 4840900 |
Gene Length | 1512 bp |
Protein Length | 503 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 641619555 |
Product | leucyl aminopeptidase |
Protein accession | YP_001746663 |
Protein GI | 170681979 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0260] Leucyl aminopeptidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.00000218455 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 46 |
Fosmid unclonability p-value | 0.712457 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAGTTTA GTGTAAAAAG CGGTAGCCCG GAGAAACAGC GGAGTGCCTG CATCGTCGTG GGCGTCTTCG AACCACGTCG CCTTTCTCCG ATTGCAGAAC AGCTCGATAA AATCAGCGAT GGGTACATCA GCGCCCTGCT ACGTCGGGGC GAACTGGAAG GAAAACCGGG GCAGACACTG TTGCTGCACC ATGTTCCGAA TGTACTTTCC GAGCGAATTC TCCTTATTGG TTGCGGCAAA GAACGTGAGC TGGATGAACG TCAGTACAAG CAGGTTATTC AGAAAACCAT TAATACGCTG AATGATACTG GCTCAATGGA AGCGGTCTGC TTTCTGACTG AACTGCACGT TAAAGGCCGT AACAACTACT GGAAAGTGCG TCAGGCTGTC GAGACTGCAA AAGAGACGCT CTACAGTTTC GATCAGCTGA AAACGAACAA GAGCGAACCG CGTCGTCCGC TGCGTAAAAT GGTGTTCAAC GTGCCGACCC GCCGTGAACT GACCAGCGGT GAGCGCGCGA TCCAGCACGG TCTGGCGATT GCCGCCGGGA TTAAAGCAGC AAAAGATCTC GGCAATATGC CGCCGAATAT CTGTAACGCC GCTTACCTCG CTTCACAAGC GCGCCAGCTG GCTGACAGCT ACAGCAAGAA TGTCATCACC CGCGTTATCG GCGAACAGCA GATGAAAGAG CTGGGGATGC ATTCTTATCT GGCGGTCGGT CAGGGTTCGC AGAACGAATC GCTGATGTCG GTGATTGAGT ACAAAGGCAA CGCGTCGGAA GATGCTCGCC CAATCGTGCT GGTGGGTAAA GGTTTAACCT TCGACTCCGG CGGTATCTCC ATCAAGCCTT CAGAAGGCAT GGATGAGATG AAGTACGATA TGTGCGGCGC GGCGGCGGTT TACGGCGTGA TGCGTATGGT CGCGGAGCTG CAACTGCCGA TTAACGTTAT CGGCGTGCTG GCAGGCTGCG AAAACATGCC TGGCGGGCGT GCCTATCGTC CGGGCGATGT GTTAACCACC ATGTCCGGTC AAACCGTTGA AGTGCTGAAC ACCGATGCCG AAGGCCGCCT GGTACTGTGC GACGTGTTAA CTTACGTTGA ACGTTTTGAG CCGGAAGCGG TGATTGATGT GGCGACGCTG ACCGGTGCCT GCGTGATCGC GCTGGGTCAT CATATTACTG GTCTGATGGC GAACCATAAT CCGCTGGCCC ATGAACTGAT TGCCGCGTCT GAACAATCCG GTGACCGCGC ATGGCGCTTA CCGCTGGGTG ACGAGTATCA GGAACAACTG GAGTCCAATT TTGCCGATAT GGCGAACATT GGCGGTCGTC CTGGTGGGGC GATTACCGCA GGTTGCTTCC TGTCACGCTT TACCCGTAAG TACAACTGGG CGCACCTGGA TATTGCAGGA ACCGCCTGGC GTTCTGGTAA AGCAAAAGGC GCAACCGGTC GTCCGGTAGC GTTGCTGGCA CAGTTCCTGC TGAATCGCGC TGGGTTTAAC GGCGAAGAGT AA
|
Protein sequence | MEFSVKSGSP EKQRSACIVV GVFEPRRLSP IAEQLDKISD GYISALLRRG ELEGKPGQTL LLHHVPNVLS ERILLIGCGK ERELDERQYK QVIQKTINTL NDTGSMEAVC FLTELHVKGR NNYWKVRQAV ETAKETLYSF DQLKTNKSEP RRPLRKMVFN VPTRRELTSG ERAIQHGLAI AAGIKAAKDL GNMPPNICNA AYLASQARQL ADSYSKNVIT RVIGEQQMKE LGMHSYLAVG QGSQNESLMS VIEYKGNASE DARPIVLVGK GLTFDSGGIS IKPSEGMDEM KYDMCGAAAV YGVMRMVAEL QLPINVIGVL AGCENMPGGR AYRPGDVLTT MSGQTVEVLN TDAEGRLVLC DVLTYVERFE PEAVIDVATL TGACVIALGH HITGLMANHN PLAHELIAAS EQSGDRAWRL PLGDEYQEQL ESNFADMANI GGRPGGAITA GCFLSRFTRK YNWAHLDIAG TAWRSGKAKG ATGRPVALLA QFLLNRAGFN GEE
|
| |