Gene Information |
Locus tag | EcSMS35_3787 |
Symbol | prlC |
ID | 6142652 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 3854565 |
End bp | 3856607 |
Gene Length | 2043 bp |
Protein Length | 680 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 641618613 |
Product | oligopeptidase A |
Protein accession | YP_001745753 |
Protein GI | 170684269 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0339] Zn-dependent oligopeptidases |
TIGRFAM ID | |
|
|
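The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive (an assumption, though the listed numbers are consistent with it) and that the CDS ends with a single stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information table above.
START_BP = 3854565
END_BP = 3856607
GENE_LENGTH_BP = 2043
PROTEIN_LENGTH_AA = 680


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP))            # compare with Gene Length
    print(expected_protein_length(GENE_LENGTH_BP))  # compare with Protein Length
```

Under these assumptions the record is internally consistent: the coordinate span matches the stated gene length, and the gene length divided into codons, less the stop codon, matches the stated protein length.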
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 47 |
Fosmid unclonability p-value | 0.894892 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGAATC CGTTACTGAC TCCCTTTGAA TTGCCTCCGT TTTCTAAAAT TCTCCCGGAA CATGTCGTTC CAGCCGTGAC TAAGGCGCTG AACGACTGCC GCGAAAATGT GGAGCGCGTA GTAGCGCAAG GGGCACCGTA CACCTGGGAA AATCTCTGCC AGCCGTTGGC GGAAGTGGAC GATGTGCTGG GGCGTATCTT CTCCCCGGTC AGCCACCTGA ACTCGGTGAA AAATAGCCCG GAACTGCGTG AAGCGTACGA ACAAACCCTG CCGCTGCTGT CAGAATACAG CACCTGGGTA GGGCAACATG AAGGGCTGTA TAAGGCATAT CGCGACCTGC GCGATGGCGA TCATTACGCC ACGCTGAACA CGGCGCAGAA AAAAGCGGTT GATAACGCAC TGCGCGACTT CGAACTCTCT GGCATCGGTC TGCCGAAAGA GAAACAGCAG CGTTACGGCG AAATTGCTAC CCGTCTTTCT GAACTGGGCA ACCAGTACAG CAACAACGTC CTCGATGCGA CGATGGGCTG GACCAAACTC GTTACCGACG AAGCGGAGCT GGCGGGGATG CCAGAAAGCG CGCTGGCTGC GGCAAAAGCC CAGGCCGAAG CGAAAGAGCT GGAAGGTTAT TTGCTGACGC TGGATATCCC AAGCTACTTG CCGGTAATGA CCTACTGCGA CAACCAGGCT CTGCGTGAAG AGATGTATCG TGCTTACAGC ACCCGCGCTT CCGATCAAGG CCCGAACGCC GGTAAGTGGG ACAACAGCAA GGTGATGGAA GAGATCCTCG CGCTGCGTCA CGAACTGGCG CAACTGCTGG GCTTTGAAAA CTATGCCTTT AAATCCCTTG CTACTAAAAT GGCAGAAAAC CCGCAGCAGG TGCTGGATTT CTTAACCGAT CTGGCAAAAC GCGCGCGTCC ACAAGGCGAA AAAGAGCTGG CGCAACTGCG TGCCTTCGCC AAAGCCGAAT TTGGCGTCGA TGAGTTGCAG CCGTGGGATA TCGCTTACTA CAGCGAAAAA CAGAAACAGC ACCTCTACAG CATCAGCGAT GAACAACTGC GTCCGTACTT CCCGGAAAAC AAAGCGGTTA ACGGCCTGTT TGAAGTGGTG AAACGTATTT ACGGCATCAC CGCTAAAGAG CGTAAAGATG TTGATGTCTG GCATCCGGAT GTACGTTTCT TCGAACTGTA TGACGAGAAC AACGAACTGC GCGGCAGCTT CTACCTCGAC CTGTATGCCC GTGAAAACAA ACGCGGCGGG GCGTGGATGG ATGACTGCGT AGGCCAGATG CGTAAAGCCG ACGGTTCGCT GCAAAAACCG GTCGCGTATC TGACCTGCAA CTTCAACCGC CCGGTAAATG GTAAACCGGC GCTGTTTACC CATGACGAAG TGATCACCCT GTTCCACGAG TTCGGTCACG GCCTGCATCA TATGCTGACC CGCATTGAAA CCGCTGGAGT GTCTGGTATC AGCGGGGTGC CGTGGGATGC GGTCGAACTG CCGAGCCAGT TTATGGAAAA CTGGTGCTGG GAGCCGGAGG CGCTGGCGTT TATCTCCGGT CACTATGAAA CCGGCGAACC GCTGCCGAAA GAGTTGCTGG ATAAAATGCT GGCGGCGAAG AACTACCAGG CGGCGCTGTT TATTCTGCGC CAGCTGGAGT TCGGTCTGTT CGATTTCCGC CTCCATGCCG AGTTCCGCCC GGATCAGGGA GCGAAAATCC TCGAAACTCT GGCAGAAATC AAGAAACTGG TTGCCGTAGT ACCGTCTCCA TCCTGGGGCC GTTTCCCGCA CGCTTTCAGC 
CATATTTTCG CCGGTGGTTA TGCCGCAGGT TACTACAGCT ACCTGTGGGC CGACGTGCTG GCGGCAGATG CCTTCTCGCG CTTTGAGGAA GAGGGCATTT TCAACCGTGA AACCGGACAG TCGTTCCTCG ACAACATTCT GAGCCGTGGC GGTTCAGAAG AGCCGATGGA TCTGTTCAAA CGCTTCCGTG GTCGTGAACC GCAGCTGGAT GCGATGCTGG AGCATTACGG CATTAAGGGC TGA
|
Protein sequence | MTNPLLTPFE LPPFSKILPE HVVPAVTKAL NDCRENVERV VAQGAPYTWE NLCQPLAEVD DVLGRIFSPV SHLNSVKNSP ELREAYEQTL PLLSEYSTWV GQHEGLYKAY RDLRDGDHYA TLNTAQKKAV DNALRDFELS GIGLPKEKQQ RYGEIATRLS ELGNQYSNNV LDATMGWTKL VTDEAELAGM PESALAAAKA QAEAKELEGY LLTLDIPSYL PVMTYCDNQA LREEMYRAYS TRASDQGPNA GKWDNSKVME EILALRHELA QLLGFENYAF KSLATKMAEN PQQVLDFLTD LAKRARPQGE KELAQLRAFA KAEFGVDELQ PWDIAYYSEK QKQHLYSISD EQLRPYFPEN KAVNGLFEVV KRIYGITAKE RKDVDVWHPD VRFFELYDEN NELRGSFYLD LYARENKRGG AWMDDCVGQM RKADGSLQKP VAYLTCNFNR PVNGKPALFT HDEVITLFHE FGHGLHHMLT RIETAGVSGI SGVPWDAVEL PSQFMENWCW EPEALAFISG HYETGEPLPK ELLDKMLAAK NYQAALFILR QLEFGLFDFR LHAEFRPDQG AKILETLAEI KKLVAVVPSP SWGRFPHAFS HIFAGGYAAG YYSYLWADVL AADAFSRFEE EGIFNRETGQ SFLDNILSRG GSEEPMDLFK RFRGREPQLD AMLEHYGIKG
|
| |
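The relationship between the gene sequence and the protein sequence above can be sketched in a few lines of Python. This is an illustrative sketch, not the pipeline that produced the record. It assumes the gene sequence is listed in coding orientation (it begins with ATG and ends with TGA, even though the gene lies on the minus strand) and relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code, differing only in which codons may act as start codons. The helper names `clean`, `translate`, and `gc_content` are hypothetical.

```python
from itertools import product

# Codon table in the conventional TCAG ordering. Translation table 11 uses
# the same codon-to-amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding-strand sequence, stopping at the first stop codon.

    Alternative start codons are not handled; this gene starts with ATG.
    """
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_content(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


if __name__ == "__main__":
    # First three 10-base blocks of the gene sequence listed above.
    print(translate("ATGACGAATC CGTTACTGAC TCCCTTTGAA"))  # MTNPLLTPFE
```

The first thirty bases translate to the first ten residues of the listed protein sequence. Passing the full gene sequence to `translate` and `gc_content` would allow the Protein sequence and GC content fields to be checked in the same way.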