Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_5781 |
Symbol | pepA |
ID | 6972163 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 5415178 |
End bp | 5416689 |
Gene Length | 1512 bp |
Protein Length | 503 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 643389411 |
Product | leucyl aminopeptidase |
Protein accession | YP_002273804 |
Protein GI | 209397461 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0260] Leucyl aminopeptidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.0325394 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 68 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAGTTTA GTGTAAAAAG CGGTAGCCCG GAGAAACAGC GGAGTGCCTG CATCGTCGTG GGCGTCTTCG AACCACGTCG CCTTTCTCCG ATTGCAGAAC AGCTCGATAA AATCAGCGAT GGGTACATCA GCGCCCTGCT ACGTCGGGGC GAACTGGAAG GAAAACCGGG GCAGACATTG TTGCTGCACC ATGTTCCGAA TGTACTTTCC GAGCGAATTC TCCTTATTGG TTGCGGCAAA GAACGTGAGC TGGATGAGCG TCAGTACAAG CAGGTTATTC AGAAAACCAT TAATACGCTG AATGATACTG GCTCAATGGA AGCGGTCTGC TTTCTGACTG AACTGCACGT TAAAGGCCGT AACAACTACT GGAAAGTGCG TCAGGCTGTC GAGACGGCAA AAGAGACGCT CTACAGTTTC GATCAGCTGA AAACGAACAA GAGCGAACCG CGTCGTCCGC TGCGTAAGAT GGTGTTCAAC GTGCCGACCC GCCGTGAACT GACCAGCGGT GAGCGCGCGA TCCAGCACGG TCTGGCGATT GCCGCCGGGA TTAAAGCCGC GAAAGATCTC GGTAATATGC CGCCGAATAT CTGTAACGCC GCTTACCTCG CTTCACAAGC GCGCCAGCTG GCTGACAGCT ACAGCAAGAA TGTTATCACC CGCGTTATCG GCGAACAGCA GATGAAAGAG CTGGGGATGC ATTCCTATCT GGCGGTCGGT CAGGGTTCGC AAAACGAATC GCTGATGTCG GTGATTGAGT ACAAAGGCAA CGCGTCGGAA GATGCACGCC CAATCGTGCT GGTGGGTAAA GGTTTAACCT TCGACTCCGG CGGTATCTCG ATCAAGCCTT CAGAAGGCAT GGATGAGATG AAGTACGATA TGTGCGGTGC GGCAGCGGTT TACGGCGTGA TGCGGATGGT CGCGGAGCTA CAACTGCCGA TTAACGTTAT CGGCGTGTTG GCAGGCTGCG AAAACATGCC TGGCGGACGA GCCTATCGTC CGGGCGATGT GTTAACCACC ATGTCCGGTC AAACCGTTGA AGTGCTGAAT ACCGACGCTG AAGGCCGCCT GGTACTGTGC GACGTGTTAA CTTACGTTGA GCGTTTTGAG CCGGAAGCGG TGATTGACGT GGCGACGCTG ACCGGTGCCT GCGTGATCGC GCTGGGTCAT CACATTACCG GTCTGATGGC GAACCATAAT CCGCTGGCCC ATGAACTGAT TGCCGCGTCT GAACAATCCG GTGACCGCGC ATGGCGCTTA CCGCTGGGTG ACGAGTATCA GGAACAACTG GAGTCCAATT TTGCCGATAT GGCGAACATT GGCGGTCGTC CTGGTGGGGC GATTACCGCA GGTTGCTTCC TGTCACGCTT TACCCGTAAG TACAACTGGG CGCACCTGGA TATTGCAGGA ACCGCCTGGC GTTCTGGTAA AGCAAAAGGC GCAACCGGTC GTCCGGTAGC GTTGCTGGCA CAGTTCCTGT TGAACCGCGC TGGGTTTAAC GGCGAAGAGT AA
|
Protein sequence | MEFSVKSGSP EKQRSACIVV GVFEPRRLSP IAEQLDKISD GYISALLRRG ELEGKPGQTL LLHHVPNVLS ERILLIGCGK ERELDERQYK QVIQKTINTL NDTGSMEAVC FLTELHVKGR NNYWKVRQAV ETAKETLYSF DQLKTNKSEP RRPLRKMVFN VPTRRELTSG ERAIQHGLAI AAGIKAAKDL GNMPPNICNA AYLASQARQL ADSYSKNVIT RVIGEQQMKE LGMHSYLAVG QGSQNESLMS VIEYKGNASE DARPIVLVGK GLTFDSGGIS IKPSEGMDEM KYDMCGAAAV YGVMRMVAEL QLPINVIGVL AGCENMPGGR AYRPGDVLTT MSGQTVEVLN TDAEGRLVLC DVLTYVERFE PEAVIDVATL TGACVIALGH HITGLMANHN PLAHELIAAS EQSGDRAWRL PLGDEYQEQL ESNFADMANI GGRPGGAITA GCFLSRFTRK YNWAHLDIAG TAWRSGKAKG ATGRPVALLA QFLLNRAGFN GEE
|
| |