Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SNSL254_A4826 |
Symbol | pepA |
ID | 6485470 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Newport str. SL254 |
Kingdom | Bacteria |
Replicon accession | NC_011080 |
Strand | - |
Start bp | 4698984 |
End bp | 4700495 |
Gene Length | 1512 bp |
Protein Length | 503 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 642740039 |
Product | leucyl aminopeptidase |
Protein accession | YP_002043717 |
Protein GI | 194444911 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0260] Leucyl aminopeptidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.956713 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 88 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAGTTCA GTGTAAAAAG CGGTAGCCCG GAGAAACAGC GGAGTGCCTG CATTGTCGTG GGCGTCTTTG AACCGCGTCG CCTTTCTCCG ATTGCAGAAC AGCTCGACAA AATTAGCGAC GGATACATCA GCGCATTGCT GCGTCGCGGC GAACTGGAAG GAAAACCGGG GCAGACTCTG TTGCTGCACC ATGTTCCTAA CGTTCTTTCC GAGCGAATCC TCCTCATTGG TTGCGGCAAA GAGCGCGAGC TTGATGAACG TCAGTATAAG CAGGTTATTC AGAAAACGAT AAATACTCTG AATGATACTG GTTCGATGGA AGCCGTCTGT TTCCTGACCG AACTGCACGT TAAAGGCCGC AACAACTACT GGAAAGTGCG TCAGGCCGTC GAAACGGCCA AAGAGACGCT TTATAGCTTT GATCAACTCA AGACCAACAA GAGCGAGCCG CGCCGCCCGC TACGTAAGAT GGTCTTTAAT GTGCCGACCC GCCGTGAGCT CACCAGCGGC GAACGCGCCA TTCAGCACGG TCTGGCCATC GCCGCCGGGA TTAAGGCAGC GAAAGATCTC GGCAACATGC CGCCCAATAT CTGTAACGCC GCCTACCTGG CGTCACAGGC GCGCCAGTTG GCTGACAGCT ACAGCAAAAA TGTCATCACC CGCGTCATCG GCGAACAGCA AATGCGCGAA CTGGGTATGA ACGCTTATCT GGCGGTCGGC CACGGTTCGC AGAATGAATC GCTGATGTCG GTGATTGAGT ACAAGGGCAA TCCGTCCGAA GACGCGCGCC CGATCGTGCT GGTGGGTAAA GGCCTGACCT TCGACTCCGG CGGCATCTCC ATCAAGCCAT CTGAAGGGAT GGACGAGATG AAGTACGACA TGTGCGGCGC GGCGGCGGTT TACGGCGTGA TGCGTATGGT CGCCGAGCTT CAGCTACCGA TTAACGTTAT CGGCGTACTG GCGGGCTGTG AAAACATGCC GGGCGGACGC GCGTATCGTC CGGGCGACGT GCTGACCACC ATGTCTGGTC AAACCGTTGA AGTCCTGAAC ACCGACGCCG AAGGCCGTCT GGTACTGTGC GACGTGCTGA CCTACGTTGA GCGCTTCGAA CCGGAAGCCG TCATTGACGT CGCGACGCTA ACTGGCGCCT GCGTGATTGC GCTGGGCCAT CACATTACCG GTCTGATGTC GAACCATAAC CCGCTGGCGC ATGAACTGAT CGGCGCGTCC GAGCAAGCGG GCGACCGCGC GTGGCGTCTG CCGCTGGGCG ATGAGTTCCA GGAACAACTG GAGTCCAACT TTGCGGATAT GGCGAATATT GGTGGTCGTC CTGGCGGCGC TATCACCGCG GGCTGCTTCC TGTCGCGCTT TACCCGTAAG TATAACTGGG CGCACCTGGA TATCGCCGGC ACCGCCTGGC GATCCGGCAA AGCGAAAGGC GCGACGGGTC GTCCGGTAGC GCTGCTGTCG CAGTTCCTGC TCAATCGTGC GGGCTTTAAC GGCGAAGAGT AA
|
Protein sequence | MEFSVKSGSP EKQRSACIVV GVFEPRRLSP IAEQLDKISD GYISALLRRG ELEGKPGQTL LLHHVPNVLS ERILLIGCGK ERELDERQYK QVIQKTINTL NDTGSMEAVC FLTELHVKGR NNYWKVRQAV ETAKETLYSF DQLKTNKSEP RRPLRKMVFN VPTRRELTSG ERAIQHGLAI AAGIKAAKDL GNMPPNICNA AYLASQARQL ADSYSKNVIT RVIGEQQMRE LGMNAYLAVG HGSQNESLMS VIEYKGNPSE DARPIVLVGK GLTFDSGGIS IKPSEGMDEM KYDMCGAAAV YGVMRMVAEL QLPINVIGVL AGCENMPGGR AYRPGDVLTT MSGQTVEVLN TDAEGRLVLC DVLTYVERFE PEAVIDVATL TGACVIALGH HITGLMSNHN PLAHELIGAS EQAGDRAWRL PLGDEFQEQL ESNFADMANI GGRPGGAITA GCFLSRFTRK YNWAHLDIAG TAWRSGKAKG ATGRPVALLS QFLLNRAGFN GEE
|
| |