Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ppha_1782 |
Symbol | |
ID | 6461948 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pelodictyon phaeoclathratiforme BU-1 |
Kingdom | Bacteria |
Replicon accession | NC_011060 |
Strand | - |
Start bp | 1881611 |
End bp | 1883107 |
Gene Length | 1497 bp |
Protein Length | 498 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 642727994 |
Product | protease Do |
Protein accession | YP_002018631 |
Protein GI | 194336837 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | [TIGR02037] periplasmic serine protease, Do/DeqQ family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAATACC TGCTTCTTGT TTTTGCGGGT ATTGCCGTAG GCGCACTTGT CTTTTCGAAT GTTGAGTTTA GCCTCTCTTC CAGGGGGTTG AGCTTTTCAA ACAGTCCAAG TTTTGCTACG GCAAAAAACA ATATTGAGAA TTATCCCATC CAGTCACTCA AAGGCTTTAA CGAGGCATTT GTGCAGATTG CCGAGTCGGC AACCCCTTCA GTAGTGACGA TTTTCACTGA AAAAACGGTA AACCAGCGGA TTGGCTCCCC GTTTGGTTTT TGGGGAAGGC CTTTCTTTGA TGATTTTTTT GATATGCCGG AGATTGCCCC TCGCAACGGC CAGAAAAAAG TATTGCAGGG CATCGGATCG GGAGTCGTCG TAACGGCTGA CGGTTATATT CTGACCAACA ATCATGTGAT CGATAATGCC GATGCCGTCT ATATCCGCAC CTTCGATAAC AAGAAGATTG ATGCCAAAGT AATAGGAAAA GATTCCAAAA CCGATCTTGC TGTTATCAAG GTCAATGCAA AAAATCTGAA ACCAATCCTG ATTGGCGACA GTGACAAACT CAGGGTTGGC GAATGGGTCA TTGCCATAGG AAGTCCACTT GGTGAAAATT TTGCACGAAC GGTTACTCAG GGTATCGTCA GTGCCAAAGG ACGGGCCAAT GTCGGGCTGG CCGATTATGA AGACTTCATA CAGACTGATG CGGCCATCAA TCCGGGAAAT TCCGGAGGCC CTCTTGTCAA CATCAACGGC GAGCTTGTCG GCATCAATAC GGCTATAGCA AGTCGAACCG GTGGATTTGA TGGTATAGGA TTTGCCGTTC CGTCAAACAT GGCCCAAAAA GTAATGACCG CACTCATTAC TACCGGAAAA GTTACCCGAG GTTATCTTGG CGTCAGTATC CAGGATATTG ATGAAAATAT TGCAAAAGCA ATGCACCTTA AGGCTGGAGA AGGGGCGCTT GTAGGAACGG TTGTCGAGGG TGGACCTGCG GCAAAGATTG GTATCAAAAC CGGCGATGTT ATCCTTGATA TCAACGATCA AAAGGTTACA GGCAGTATTG AACTGCGTAA TGCCATCTCC AGTCAGTTAC CCGGATCGAT GGTCAAGTTT CGGGTGCTCA GGAACGGAAC GATCATGCTT TTTCAGGCCC GTCTTGAAGA ACAGCCCGCC AGGGGTGTGG CTTCAGCCAT GACAGAAGAA CAGGAAAAAA TTCCTGCGGT ACTCGGTTTT AAGGCCGAGG AGCTTACAGC AAGGCTGGCA CAGAAATTGA ACCTGGTGCC CGGCTCTGGC AAGGTTGTAC TTACAGCTCT TGATCCTGCA TCCAATGCCT ACCTTGCCGG ATTGCGCGTC GGCGATATTA TACTTACCCT CAACCGTCAG AGTGTGAACT CGTTTGCCGG GTATAGTGCT CTTATCAAAA ACATCAAAAG CGGAGATCTT CTTTTTCTGC TGGTAGAAAG GAAGGGCAAT AAAATATATT TTGCGTTTAA TGTGTAA
|
Protein sequence | MKYLLLVFAG IAVGALVFSN VEFSLSSRGL SFSNSPSFAT AKNNIENYPI QSLKGFNEAF VQIAESATPS VVTIFTEKTV NQRIGSPFGF WGRPFFDDFF DMPEIAPRNG QKKVLQGIGS GVVVTADGYI LTNNHVIDNA DAVYIRTFDN KKIDAKVIGK DSKTDLAVIK VNAKNLKPIL IGDSDKLRVG EWVIAIGSPL GENFARTVTQ GIVSAKGRAN VGLADYEDFI QTDAAINPGN SGGPLVNING ELVGINTAIA SRTGGFDGIG FAVPSNMAQK VMTALITTGK VTRGYLGVSI QDIDENIAKA MHLKAGEGAL VGTVVEGGPA AKIGIKTGDV ILDINDQKVT GSIELRNAIS SQLPGSMVKF RVLRNGTIML FQARLEEQPA RGVASAMTEE QEKIPAVLGF KAEELTARLA QKLNLVPGSG KVVLTALDPA SNAYLAGLRV GDIILTLNRQ SVNSFAGYSA LIKNIKSGDL LFLLVERKGN KIYFAFNV
|
| |