Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Phep_4121 |
Symbol | |
ID | 8255256 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pedobacter heparinus DSM 2366 |
Kingdom | Bacteria |
Replicon accession | NC_013061 |
Strand | - |
Start bp | 4978018 |
End bp | 4979718 |
Gene Length | 1701 bp |
Protein Length | 566 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 644937786 |
Product | carboxyl-terminal protease |
Protein accession | YP_003094374 |
Protein GI | 255534002 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0793] Periplasmic protease |
TIGRFAM ID | [TIGR00225] C-terminal peptidase (prc) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.132003 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 32 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAATAT ACAAAATTGC CGCAGTTGCT CTCCTTATTA CCTTTGCTTC TATAACCTGG TCTTTTAAGG AAGACCTTTT TCAGGTGTCA AAGAATCTGG ACATTTTTGC TTCTTTATAT AAAGAAATAA ACATCAATTA TGTAGAGGAA ACAAACCCTT CCAGCCTCAT GCGCAGCAGC ATTGATGCGA TGCTCGAAAA TCTGGACCCT TATACAGAAT ACGTTCCTGA ATCAGAAGTA GAAGATTATA AGCTGAAATA CGTGAGTACT CAATACGGTG GCATTGGGGC CAGCACCATT TTTATTGAGG GTAAGTTATT TGTAAATGAG GTTAATGAAG GCTATCCGGC CGATAAACAG GGAGTAAAAC CTGGCGATCA ACTTGTAAAA ATTAATGGTA ATGAGGTTAA GGGGAAAGAC CGGGCGCAGG TAAGCCAGTT GCTGAGGGGA CCAAAAGGCT CTGTTGTCGA ACTCCTGATC ATTAGGGAAG GTACCTTGAT TACCAAAAAC CTGACCCGTG ATGAAATCAA ACAGCCCAAT GTTGCCTACT CCGGCATGAC AGCAGATAAT ATTGCCTATA TCCGTTTGGA TAAATTCCTT GAAAACTCTG CTCAGGAGGT TAAGGATGCT GCAGTTACAT TGGGTAGACA GCAGCCTAAG GGTATGATCC TCGATTTGCG ATACAACGGT GGGGGGATAC TGCAGGAAGC TGTTAAAATT GTCAACATTT TTGTGGATAA GGATATCCTG ATCGTGACCC AGAAAGGAAG AAATCCGCAA AAAACCATTA CCTATAAAAC AATTAACCAG CCCTTATTTC CAAACGTTCC ACTGGTGGTG TTAATCAGTG GATCTTCTGC CTCGGCTTCT GAAATTGTTG CTGGAGCACT GCAGGACCTC GACAGGGCTA TAATTGTTGG ACAGAGGAGC TATGGAAAGG GGCTGGTTCA ACAAACCTTT AACCTGCCTT ATAACAGCCT TGTTAAGGTT ACTGTAGCCA AATATTTTAC CCCCTCGGGC AGGTGCATCC AGGCGCTTGA CTATGCGCAT AAGGATGCCA ACGGCAAAAC ACTCAAATTT GCAGATTCGC TGATGAGTAA ATTCAGTACA AAAACCGGGA GAAATGTATA TGACGGAAAT GGCATTTATC CTGATGTGCT GGTAAATAGC CCTAAGCTTA GCCCGGTAAC CATTTCACTG TTGAATAAGA ACCTGTTTTT TGATTATGCC AATAATTATA AAAAGAACAA TAAAGAAATT GCTCCGGCAG CTTCTTTTCA GCTTACGGAA AACGATTATG CCGCTTTTGT AAATACCATG GCAGGACGGG ATTACTCATA CACCTCACGT ACAGAACGCT TATTGTCTGA CCTGAGAACA GAGGCAGAAA AAGAGAATAA ACTGACGCTT GTTAAGGCCG ACCTTGAAGA TTTAAAAGAA AAAATGCTTG GTGCCAGAAA AACAGACCTG ACTACCTATA AAGCAGAGAT CAAAAGAGTT TTAGAAACCC AGATCGTAAG CCGCTACTAC TATGAAAAGG GTAAAGTGAT CCAGGCGTTT CAGTACGATA AGGAGCTGAA TGCAGCAAAA AGTCTGTTAA ATAACAACAA TAAAATGCTG GCCATCCTTA AAGGTGAGGG CGAATATAAA ACAATAGGCA GCCCTATAAA AACAATAGCA GCTGCCTCTG ATAATAATTA A
|
Protein sequence | MKIYKIAAVA LLITFASITW SFKEDLFQVS KNLDIFASLY KEININYVEE TNPSSLMRSS IDAMLENLDP YTEYVPESEV EDYKLKYVST QYGGIGASTI FIEGKLFVNE VNEGYPADKQ GVKPGDQLVK INGNEVKGKD RAQVSQLLRG PKGSVVELLI IREGTLITKN LTRDEIKQPN VAYSGMTADN IAYIRLDKFL ENSAQEVKDA AVTLGRQQPK GMILDLRYNG GGILQEAVKI VNIFVDKDIL IVTQKGRNPQ KTITYKTINQ PLFPNVPLVV LISGSSASAS EIVAGALQDL DRAIIVGQRS YGKGLVQQTF NLPYNSLVKV TVAKYFTPSG RCIQALDYAH KDANGKTLKF ADSLMSKFST KTGRNVYDGN GIYPDVLVNS PKLSPVTISL LNKNLFFDYA NNYKKNNKEI APAASFQLTE NDYAAFVNTM AGRDYSYTSR TERLLSDLRT EAEKENKLTL VKADLEDLKE KMLGARKTDL TTYKAEIKRV LETQIVSRYY YEKGKVIQAF QYDKELNAAK SLLNNNNKML AILKGEGEYK TIGSPIKTIA AASDNN
|
| |