Gene Information |
Locus tag | EcHS_A3698 |
Symbol | prlC |
ID | 5595305 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | - |
Start bp | 3685651 |
End bp | 3687693 |
Gene Length | 2043 bp |
Protein Length | 680 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 640922812 |
Product | oligopeptidase A |
Protein accession | YP_001460292 |
Protein GI | 157162974 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0339] Zn-dependent oligopeptidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 63 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGAATC CGTTACTGAC TCCCTTTGAA TTGCCTCCGT TTTCTAAAAT TCTCCCGGAA CATGTCGTTC CAGCCGTGAC TAAGGCATTG AACGACTGCC GCGAAAACGT GGAGCGCGTA GTAGCGCAAG GGGCACCGTA CACCTGGGAA AATCTCTGCC AGCCGCTGGC GGAAGTGGAC GATGTGTTGG GGCGTATCTT CTCCCCGGTC AGCCACCTGA ACTCGGTGAA AAATAGCCCG GAACTGCGTG AAGCCTACGA ACAAACCCTG CCGCTGCTGT CGGAATACAG CACCTGGGTA GGGCAACATG AAGGGCTGTA TAAAGCGTAT CGCGACCTGC GCGATGGCGA TCATTACGCC ACGCTGAACA CGGCGCAGAA GAAAGCGGTT GATAACGCAC TGCGCGACTT TGAACTCTCT GGCATAGGTC TGCCGAAAGA GAAACAGCAG CGTTACGGCG AAATTGCGAC CCGTCTTTCT GAACTGGGCA ACCAGTACAG CAACAACGTC CTCGATGCGA CAATGGGCTG GACCAAACTC GTTACCGACG AAGCGGAGCT GGCGGGGATG CCAGAAAGCG CGCTGGCTGC GGCAAAAGCC CAGGCCGAAG CGAAAGAGCT GGAAGGTTAT TTGCTGACGC TGGATATCCC AAGCTACTTG CCGGTAATGA CCTACTGCGA CAACCAGGCT CTGCGTGAAG AGATGTATCG CGCTTACAGC ACCCGCGCCT CCGATCAAGG CCCAAACGCC GGTAAATGGG ATAACAGCAA GGTGATGGAA GAGATCCTCG CGCTGCGTCA CGAACTGGCG CAACTGCTGG GCTTTGAAAA CTACGCCTTT AAATCCCTTG CCACTAAAAT GGCAGAAAAC CCGCAGCAGG TGCTGGATTT CTTAACCGAT CTGGCAAAAC GCGCGCGTCC ACAAGGCGAA AAAGAGCTGG CGCAATTGCG TGCCTTTGCC AAAGCCGAAT TTGGCGTCGA TGAGTTGCAG CCGTGGGATA TCGCTTACTA CAGCGAAAAA CAAAAACAGC ACCTCTACAG CATCAGTGAC GAACAGCTGC GTCCGTACTT CCCGGAAAAC AAAGCGGTTA ACGGCCTGTT TGAAGTGGTT AAGCGTATTT ACGGCATCAC CGCTAAAGAG CGTAAAGATG TTGATGTCTG GCATCCGGAT GTACGTTTCT TCGAACTGTA TGACGAAAAT AACGAACTGC GCGGTAGCTT CTACCTCGAT CTGTATGCCC GTGAAAACAA GCGCGGCGGG GCGTGGATGG ATGACTGCGT AGGCCAGATG CGTAAAGCTG ATGGTTCTTT GCAAAAACCG GTCGCGTATT TGACTTGTAA CTTCAACCGC CCGGTAAATG GTAAACCGGC GCTGTTCACT CACGACGAAG TGATCACCCT CTTCCACGAG TTCGGTCACG GCCTGCACCA TATGCTGACC CGCATCGAAA CCGCTGGTGT TTCCGGTATC AGCGGTGTGC CGTGGGATGC GGTCGAACTG CCGAGTCAGT TTATGGAAAA CTGGTGCTGG GAGCCGGAGG CGCTGGCGTT TATCTCTGGT CACTATGAAA CCGGCGAACC GCTGCCGAAA GAGTTGCTGG ATAAAATGCT GGCGGCGAAG AACTACCAGG CGGCGCTGTT TATTCTGCGT CAGCTGGAGT TCGGCCTGTT TGATTTCCGC CTTCATGCCG AGTTCCGCCC GGATCAGGGG GCAAAAATCC TCGAAACTCT GGCAGAAATC AAGAAACTGG TTGCCGTGGT GCCATCTCCG TCCTGGGGCC GTTTCCCGCA CGCTTTCAGC 
CATATTTTCG CCGGTGGTTA TGCCGCAGGT TACTACAGCT ACCTGTGGGC TGACGTACTG GCGGCAGATG CTTTCTCGCG CTTTGAGGAA GAGGGCATTT TCAACCGTGA AACCGGGCAG TCGTTCCTCG ACAACATTCT GAGCCGTGGC GGTTCAGAAG AGCCGATGGA TCTGTTCAAA CGCTTCCGTG GTCGTGAACC GCAGCTGGAT GCGATGCTGG AGCATTACGG CATTAAGGGC TGA
|
Protein sequence | MTNPLLTPFE LPPFSKILPE HVVPAVTKAL NDCRENVERV VAQGAPYTWE NLCQPLAEVD DVLGRIFSPV SHLNSVKNSP ELREAYEQTL PLLSEYSTWV GQHEGLYKAY RDLRDGDHYA TLNTAQKKAV DNALRDFELS GIGLPKEKQQ RYGEIATRLS ELGNQYSNNV LDATMGWTKL VTDEAELAGM PESALAAAKA QAEAKELEGY LLTLDIPSYL PVMTYCDNQA LREEMYRAYS TRASDQGPNA GKWDNSKVME EILALRHELA QLLGFENYAF KSLATKMAEN PQQVLDFLTD LAKRARPQGE KELAQLRAFA KAEFGVDELQ PWDIAYYSEK QKQHLYSISD EQLRPYFPEN KAVNGLFEVV KRIYGITAKE RKDVDVWHPD VRFFELYDEN NELRGSFYLD LYARENKRGG AWMDDCVGQM RKADGSLQKP VAYLTCNFNR PVNGKPALFT HDEVITLFHE FGHGLHHMLT RIETAGVSGI SGVPWDAVEL PSQFMENWCW EPEALAFISG HYETGEPLPK ELLDKMLAAK NYQAALFILR QLEFGLFDFR LHAEFRPDQG AKILETLAEI KKLVAVVPSP SWGRFPHAFS HIFAGGYAAG YYSYLWADVL AADAFSRFEE EGIFNRETGQ SFLDNILSRG GSEEPMDLFK RFRGREPQLD AMLEHYGIKG
|
| |
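The length fields in the Gene Information table follow from the coordinates. Below is a minimal sketch of that arithmetic, assuming the Start bp and End bp values are 1-based and inclusive, and that the coding sequence includes its stop codon (the gene sequence above ends in TGA). The function names are hypothetical and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS whose length includes the stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


# Values copied from the record above (locus tag EcHS_A3698).
length_bp = gene_length(3685651, 3687693)
print(length_bp)                  # 2043, matching "Gene Length"
print(protein_length(length_bp))  # 680, matching "Protein Length"
```

Under these assumptions, both computed values agree with the Gene Length and Protein Length fields in the record.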