Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Phep_2930 |
Symbol | |
ID | 8254041 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pedobacter heparinus DSM 2366 |
Kingdom | Bacteria |
Replicon accession | NC_013061 |
Strand | - |
Start bp | 3494760 |
End bp | 3495851 |
Gene Length | 1092 bp |
Protein Length | 363 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 644936578 |
Product | peptidase S1 and S6 chymotrypsin/Hap |
Protein accession | YP_003093190 |
Protein GI | 255532818 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.0248751 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGGAACG AGTTAGAGTT AGAAGGTATA ATTGAGGATT ACCTGAATGG TAAACTGAAT GAGGATGAAG CAAAAGCTTT TGAACAATTG CGCTTAAATG ATCCTGCTGT TGATCATAAG GTAGTTGCCC ATAAGGTTTT TATGGAGTCA TTAAACGATT ATGCAAGGGT TTCTGACCTG AAAAATACGA TGAACCTGAT ACATGAGCAG ATGGATGTAA ATGCCCTGAG CAGAAAACTG GGTCCTCATC CTTCTTTTAT TGTTAACCTG TGGCGTAAAA ATAAAGCTGC CATTGCTGTG GCCGCATCCT TTATCCTGCT AACAGTGGTT ACCATTTATT CTATACAGCA AACCACGCAA CAAACAGGTT CTTATGAAAA ACTGAATAAG GAAGTAAATA ACCTGAGAAG TTCAACAAAT AACCTGATCA GAAATGTAAA ATCCAATGCA CCTGCAAAAC CGAATGTAAA CCCCGGTAAG TTTGGCGGTA CCGGCTTTGC CCTTTCTTCA AATGGTTACA TCTTAACCAG TCACCATGTG ATTGAAAAGT CAGATTCAGT ATATGTTCAA AACTATAAAG GCGATTCTTA TAAAGTGAAA ATGGTTTACA GTGACCCTGT AAACGATATC GCCATCCTGA AAATTACCGA CAAGAATTTT TCTCATCTTT CAGCTTTGCC TTATTCGTTG AAAAAGAGCA TTGCAGGTAT GGGAGAACAG GTTTATACGC TGGGCTACCC TAAAGACGAT GTGGTATTTG GAAAGGGATA CCTGAGCTCT AAAACGGGTT TCAATGGAGA TACACTGGCT TACCAGGTAG CCATTACCGT TAATCCGGGC AATAGCGGTG GCCCCTTGCT AGACAATAAC GGAAATGTAA TTGGTATCAT CAATGCAAAA GAAAGCAATA CCGATGGGGC GGCTTTTGCA GTAAAATCCA AATACATTGC CGAAGCATTA AACGCCATCC CGCAGGATTC ACTTGTGAAA CGGATTGTAT CTGGTAAGAC CAACCAGTTG CAGGGCCTTA AACTGACCAA ACAAATTGAA AAGATGCAGG ACTTTGTATT TATGATTAAA GTTTATAATT AG
|
Protein sequence | MRNELELEGI IEDYLNGKLN EDEAKAFEQL RLNDPAVDHK VVAHKVFMES LNDYARVSDL KNTMNLIHEQ MDVNALSRKL GPHPSFIVNL WRKNKAAIAV AASFILLTVV TIYSIQQTTQ QTGSYEKLNK EVNNLRSSTN NLIRNVKSNA PAKPNVNPGK FGGTGFALSS NGYILTSHHV IEKSDSVYVQ NYKGDSYKVK MVYSDPVNDI AILKITDKNF SHLSALPYSL KKSIAGMGEQ VYTLGYPKDD VVFGKGYLSS KTGFNGDTLA YQVAITVNPG NSGGPLLDNN GNVIGIINAK ESNTDGAAFA VKSKYIAEAL NAIPQDSLVK RIVSGKTNQL QGLKLTKQIE KMQDFVFMIK VYN
|
| |