Gene Information |
Locus tag | Phep_0209 |
Symbol | |
ID | 8251294 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pedobacter heparinus DSM 2366 |
Kingdom | Bacteria |
Replicon accession | NC_013061 |
Strand | + |
Start bp | 244910 |
End bp | 246463 |
Gene Length | 1554 bp |
Protein Length | 517 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 644933859 |
Product | protease Do |
Protein accession | YP_003090497 |
Protein GI | 255530125 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | [TIGR02037] periplasmic serine protease, Do/DegQ family |
|
|
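The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the reported gene length includes the stop codon. Neither convention is stated in the record, although both are consistent with the listed values. The function names are illustrative only.

```python
def gene_length_bp(start_bp, end_bp):
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp):
    """Residues encoded by an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length_bp(244910, 246463)
print(length, protein_length_aa(length))
```

Under those assumptions the result agrees with the Gene Length (1554 bp) and Protein Length (517 aa) fields.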
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 0.0976741 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAAGGA TAGGATTAAT AGCGTTGGCT GCATTTATAG GTGGTGCGGC TGCAATTGGT GGCTATAAAT TATTAGAGAC CAAAAATAGT GAGATGTTAT CTTTTGCCGA CCAGCAAAAG GTGCTTTTTG CAAATAACCC CAAAATTTCT TCAGCAGGTG CTGTTGATTT TGTAGAAGCT GCAGCATCAG TTTCTCCGGC GGTAGTTCAC ATTAAAACTT CGTACGGTAG TAATTCTTCC GAAGGGCAGG CAAGGTCCTC TTCTCCTTTT GATATGTTTG ACGATCTGTT TGGTGGTGGC GGTGGCCGCC GAATGCAGCG TATGCCAAGG GCAGCTTCTG GTTCGGGAGT TATTTTAACC CCTGATGGTT ATATTGTAAC CAACAACCAC GTGGTAGACA ATGCAGATAA AATTGAAGTG ATCTTGTCCA ACAAACGGAA AGTAAGTGCC AAAGTGATCG GTAAAGACCC GAATACGGAC CTTGCCCTGA TTAAAGTTGA GGCCAATGAC CTGCCCGTAG TAAAAATGGG TAACTCTGAT AATGTTCAGA TCGGTGAATG GGTACTTGCA GTAGGTTTCC CGCTTGATCT GCAAACTACG GTTACTGCAG GTATTGTAAG TGCAAAAGCA AGAAGTATTG GCATTTTGGG CAGACAACAA CAAGGCCTGA CAGAAGAAGA GTATGAAGAA TACCGCAGAA CCGGTAAAGC ACCGGAAAGG GCAAATACCA GTATAGAATC TTATATCCAG ACCGATGCGG CCATTAATCC TGGAAACAGT GGTGGTGCAT TGGTAAATAC CAACGGAGAG CTGATCGGGA TCAATGCAGC AATTGCTTCG CAAACCGGTA CCAATGAAGG ATATGGCTTC GCTATTCCAA TTAACCTGGC TAAAAAGATC CTTGATGATT TTAGAAAATA TGGTAGTGTG AAACGTGGTT ATATAGGTGT TTCCTTTGTT CCACTTGATG CAGATAATGC TGACAAACTG AAAGTGAACG ATATCAATGG ACTTTATGTA AGTGAAGTAC TGCCAAATGG TGGTGGTGCT GCTGCGGGGA TTAAAAAAGG AGATATCATT AAAAAAGTTG AGGGCGTGGA AGTTTTTGAC TCTCCAGACC TTCAGGAAAG AATAGGCCGT TTGAGTCCTG GCGATAAAGT TCAGTTGACC TTGTTAAGAG ACGGTGCCTT AAAAGATGTT AAGGTTACTT TGAAAGGTGA TAACAGTGTA GGCCTGAATA CTTCAAAACT GGCTGAAAAA TCAACCGGTA CAAGCCTGAG TAAGTTAGGG GCTTCTTTTG AACCTGCCTC TGCGCAACTT AAAGCACGTT ATGGGGTGAA AAATGGAGTT GTAGTGAGCG ATATCGAAGC AGGTAAACTG TTCGATTCAT GGGAGATCCC TAAAGGCGTA TTGATCACCG CTGTGAACGG TACACCGGTA AACAGTGCTA AAGATGTGGA AAGCGCTCTG CCAAGATCAA GAAACGGTAT GACTACCATT TCAGGGGTTG GCCCTCAGGG TAGGTTTACC TATTCACTGA CGAACGGAAG ATAA
|
Protein sequence | MKRIGLIALA AFIGGAAAIG GYKLLETKNS EMLSFADQQK VLFANNPKIS SAGAVDFVEA AASVSPAVVH IKTSYGSNSS EGQARSSSPF DMFDDLFGGG GGRRMQRMPR AASGSGVILT PDGYIVTNNH VVDNADKIEV ILSNKRKVSA KVIGKDPNTD LALIKVEAND LPVVKMGNSD NVQIGEWVLA VGFPLDLQTT VTAGIVSAKA RSIGILGRQQ QGLTEEEYEE YRRTGKAPER ANTSIESYIQ TDAAINPGNS GGALVNTNGE LIGINAAIAS QTGTNEGYGF AIPINLAKKI LDDFRKYGSV KRGYIGVSFV PLDADNADKL KVNDINGLYV SEVLPNGGGA AAGIKKGDII KKVEGVEVFD SPDLQERIGR LSPGDKVQLT LLRDGALKDV KVTLKGDNSV GLNTSKLAEK STGTSLSKLG ASFEPASAQL KARYGVKNGV VVSDIEAGKL FDSWEIPKGV LITAVNGTPV NSAKDVESAL PRSRNGMTTI SGVGPQGRFT YSLTNGR
|
| |
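The gene and protein sequences above can be compared by translating the nucleotide sequence. The following is a minimal sketch, not the database's own method: it uses the standard codon assignments, which translation table 11 shares for internal codons, and it does not handle the alternative start codons that table 11 permits (this gene begins with ATG, so that case does not arise here). The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Standard codon assignments, with bases ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna):
    """Translate a DNA string from its first base until the first stop codon.

    Whitespace (such as the 10-base grouping used in this record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
print(translate("ATGAAAAGGA TAGGATTAAT AGCGTTGGCT"))
```

Passing the full gene sequence to `translate` and comparing the result with the protein sequence (after removing its spaces) would check the two fields against each other.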