Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcHS_A4516 |
Symbol | pepA |
ID | 5593274 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | - |
Start bp | 4520599 |
End bp | 4522110 |
Gene Length | 1512 bp |
Protein Length | 503 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 640923612 |
Product | leucyl aminopeptidase |
Protein accession | YP_001461053 |
Protein GI | 157163735 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0260] Leucyl aminopeptidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.00000167051 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAGTTTA GTGTAAAAAG CGGTAGCCCG GAGAAACAGC GGAGTGCCTG CATCGTCGTG GGCGTCTTCG AACCACGTCG CCTTTCTCCG ATTGCAGAAC AGCTCGATAA AATCAGCGAT GGGTACATCA GCGCCCTGCT ACGTCGGGGC GAACTGGAAG GAAAACCGGG GCAGACATTG TTGCTGCACC ATGTTCCGAA TGTACTTTCC GAGCGAATTC TCCTTATTGG TTGCGGCAAA GAACGTGAGC TGGATGAGCG TCAGTACAAG CAGGTTATTC AGAAAACCAT TAATACGCTG AATGATACTG GCTCAATGGA AGCGGTCTGC TTTCTGACTG AACTGCACGT TAAAGGCCGT AACAACTACT GGAAAGTGCG TCAGGCTGTC GAGACGGCAA AAGAGACGCT CTACAGTTTC GATCAGCTGA AAACGAACAA GAGCGAACCG CGTCGTCCGC TGCGTAAAAT GGTGTTCAAC GTGCCGACCC GCCGTGAACT GACCAGCGGT GAGCGCGCGA TCCAGCACGG TCTGGCGATT GCCGCCGGGA TTAAAGCAGC AAAAGATCTC GGCAATATGC CGCCGAATAT CTGTAACGCC GCTTACCTCG CTTCACAAGC GCGCCAGCTG GCTGACAGCT ACAGCAAGAA TGTCATCACC CGCGTTATCG GCGAACAGCA GATGAAAGAG CTGGGGATGC ATTCCTATCT GGCGGTCGGT CAGGGTTCGC AGAACGAATC GCTGATGTCG GTGATTGAGT ACAAAGGCAA CGCGTCGGAA GATGCACGCC CAATCGTGCT GGTGGGTAAA GGTTTAACCT TCGACTCCGG CGGTATCTCG ATCAAGCCTT CCGAAGGCAT GGATGAGATG AAGTACGATA TGTGCGGTGC GGCAGCGGTT TACGGCGTGA TGCGTATGGT CGCAGAGCTA CAACTGCCGA TTAACGTTAT CGGCGTGTTG GCAGGCTGCG AAAACATGCC TGGCGGACGA GCCTATCGTC CGGGCGATGT GTTAACCACC ATGTCCGGTC AAACCGTTGA AGTGCTGAAC ACCGACGCCG AAGGCCGCCT GGTACTGTGC GACGTGTTAA CTTACGTTGA GCGGTTTGAG CCGGAAGCGG TGATTGACGT GGCGACGCTG ACCGGTGCCT GCGTGATCGC GCTGGGTCAT CACATTACCG GTCTGATGGC GAACCACAAT CCGCTGGCCC ATGAGCTGAT TGCCGCGTCT GAACAATCCG GTGACCGCGC ATGGCGCTTA CCGCTGGGTG ACGAGTATCA GGAACAGCTG GAGTCCAATT TTGCCGATAT GGCGAACATT GGCGGTCGTC CTGGTGGGGC GATTACCGCA GGTTGCTTCC TGTCACGCTT TACCCGTAAG TACAACTGGG CGCACCTGGA TATCGCCGGT ACCGCCTGGC GTTCTGGTAA AGCAAAAGGC GCAACCGGTC GTCCGGTAGC GTTGCTGGCA CAGTTCCTGT TAAACCGCGC CGGGTTTAAC GGCGAAGAGT AA
|
Protein sequence | MEFSVKSGSP EKQRSACIVV GVFEPRRLSP IAEQLDKISD GYISALLRRG ELEGKPGQTL LLHHVPNVLS ERILLIGCGK ERELDERQYK QVIQKTINTL NDTGSMEAVC FLTELHVKGR NNYWKVRQAV ETAKETLYSF DQLKTNKSEP RRPLRKMVFN VPTRRELTSG ERAIQHGLAI AAGIKAAKDL GNMPPNICNA AYLASQARQL ADSYSKNVIT RVIGEQQMKE LGMHSYLAVG QGSQNESLMS VIEYKGNASE DARPIVLVGK GLTFDSGGIS IKPSEGMDEM KYDMCGAAAV YGVMRMVAEL QLPINVIGVL AGCENMPGGR AYRPGDVLTT MSGQTVEVLN TDAEGRLVLC DVLTYVERFE PEAVIDVATL TGACVIALGH HITGLMANHN PLAHELIAAS EQSGDRAWRL PLGDEYQEQL ESNFADMANI GGRPGGAITA GCFLSRFTRK YNWAHLDIAG TAWRSGKAKG ATGRPVALLA QFLLNRAGFN GEE
|
| |