Gene Information |
Locus tag | PG0862 |
Symbol | |
ID | 2552872 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Porphyromonas gingivalis W83 |
Kingdom | Bacteria |
Replicon accession | NC_002950 |
Strand | + |
Start bp | 922880 |
End bp | 926278 |
Gene Length | 3399 bp |
Protein Length | 1132 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 637149592 |
Product | type IIS restriction endonuclease, putative |
Protein accession | NP_905111 |
Protein GI | 34540632 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
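The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes 1-based, inclusive coordinates and a CDS that ends in a single stop codon. The function names are illustrative only and are not part of any database API.

```python
# Minimal consistency check for the coordinate fields in this record.
# Assumption: Start bp and End bp are 1-based and inclusive.

def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the terminal stop codon


length = gene_length(922880, 926278)
print(length)                  # 3399, matching "Gene Length"
print(protein_length(length))  # 1132, matching "Protein Length"
```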
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.00475816 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGAACTGA AACAATACTT ACAGCAGTCT TATCAGGGCA TCGAGTCTTT TCTGGAGAAA ATAGTCTTCC CAATCTTTGG TCAAGAGCTA TTTGAGGATG GTCATGGTGT ATCTATCCTA GAGATGTACC CCGACCTACG ACCAGCAGCA CAAGCAACGG GCATCCTAGA GATCAAGCAT ATCGGCAATA TCGATATGGA TTTCAATCCT ATCAACATCT TTGACATCAC GGTATCTAGC CAGATCAAGA TGGCTCGCAA TAAGGTTGGC ATCCAGAATA TTATCAGGCG TATCATGGAC ACCTATTCGA GTGCTTTTAT GATATTCCAC TACGAGGACA ATCCTTTGTG GGAATGGCGG TTTACCTTCT GCCACAAGGG GAGGAGCCAG TCCGACATTA CCAGCAGCAA GCGTTTCACC TTCTTGCTCG GCCCTGGGCA GCACTGTCGT ACAGCAGCAG ACAACTTCCA GAAGTTAATC GACAAGAAAC AACGCCAAGA CATCGAACTC AAAGATATTG AGGATGCTTT CTCCGTCGAG GCACTGACGA AACAGTTCTA CAAGGATCTC TTCGAGTGGT ACCAGTGGGC CATATCTCCC GAGGCCGATA TCAGTTTCCC TAATGATACT AGTACAAGTG AAGATGATAG AGAAGATCTA GAAACGAAGA TAATCCGCCT AATTACGCGT ATTATGTTTG TTTGGTTCAT CAAGCAAAAG GAGCTGGTTC CTCAGCATCT TTTCGATATT GCCTTCCTCA AGACCATCCT CAAGGATTTC GATCCCAATA GCACCACCGA GGGGAATTAT TACAATGCCA TACTGCAGAA TCTTTTCTTC GGTACACTCA ACCGAGCACG CCAAGACGAA GATGGCAAAC CACGACGATT TGCCACTGGG AGCAAGCGCG ATGTGAAGAC GCTCTATCGC TATGCAGAGC TGTTTAGCAT CAGCGAGAAG GAGGTAATCC AGCTCTTCGA CTCCATTCCG TTCCTTAATG GTGGTCTATT CGAGTGCCTC GACAAAACTC GCTACATCGA TGGAGTAGAG CGATGCTACA GCTTGGATGG CTTTAGTCGC AATGATACAC GCTTCGCAAA TGGACGCTTC AAGCATCGAG CTACGATCCC CAACAATCTA TTCTTTGCTC CTGAGAGAGG GCTGGTTTCT ATCCTAAGTC GCTACAACTT CACTATCGAG GAGAACTCAC CCGAGGAGCA ACAAGTGGCA CTAGACCCAG AGCTACTCGG CAAGGTCTTT GAGAATCTCC TCGGAGCATA CAACCCCGAG ACTCAGGAGA CTGCTCGCAA CCAGAGTGGC TCGTTCTATA CTCCTCGAGA GGTTGTCAAC TATATGGTGG ATGAGAGCCT GATTTCTTAT TTAGGAGATA GCGATCTGGT GCGCTCTCTG TTTAGACCAG ATTTCGTACT GCAGGAGGAC AACAAAGTGC AATGCGAAGC TATTGCAAGC AAACTCAAAG CAGTCAAGAT ACTAGACCCT GCTTGTGGCT CGGGAGCATT CCCAATGGGA TTGCTCAATA GGATGATTGA GCTACTTGAG CGCATATCTC CCCAAGAGAA AAGCTATGAT CTCAAGCTCT TTGTGATCGA GAATTGTCTC TACGGCTCAG ATATCCAGAG CATCGCTGCT CAGATCACCA AGCTACGTTT CTTTATCTCT CTGATCTGCG ACTGCGAGCG AGATGAAACA AAGCCCAACT TCGGCATTCC GACACTGCCC AACCTTGAGA CCAAGTTCGT AGCCGCAAAC TCCCTCATCG CCAAGAAGAA GATGGCACAG 
CATCGGAACC TCTTCGAAGA TCCCGCGATT GAAGAAACTA AAAACGAACT TATAGGGGTG CGGCACAAGC ATTTCTCTGC CAAGTCTACC TCGGCCAAGC TGCGCCTACG AGAGCAAGAC CAAGACTTGC GTAAGAAGCT TGCACAACTT CTGGCAGAGA ATGAAGACTT TGCACCTGAA GATGCTCTAC AGCTCGCGGC ATGGAATCCC TATGACCAGA ATGCCGTCAG CTCATTCTTC GACCCTGAAT GGATGTTTGG CCTCGCAGAT GGCTTCGACA TCGTGATTGG CAATCCCCCG TATGTACAAC TACAGAATAA TGGGGGAGAA TTAACCAAGC TCTATCAAGG ATGTGACTTC AAGACTTTTG CTCGTACTGG AGATGTATAT TGCCTATTCT ACGAGCGAGG CTGGCAACTA CTCAAAGAGG GAGGGCACCT CTGCTACATC ACCTCCAACA AATGGATGCG TGCAGGGTAT GGAGAAAAGA CAAGGCTATT CTTCGCCTCC AAGACTAACC CCAAGCTACT TGTAGACTTC GCTGGAGTTA AGGTCTTTGA GAGTGCTACG GTCGATACCA ACATCCTACT CTTCGCCAAG GAGGCCAACG CAGGACAGAC TCAAGCGGTC TCCCTGAATA AGAATGTCCA AATTGGCGGT AGTGAATTGT GCGAATATAT TCAACAGCAT GCAACAGCCT CTGCTTTCAT CTCCTCCGAG AGCTGGGTCA TCCTATCCCC TATCGAGCAG AGTATCAAGC GAAAAATAGA AGCCGTCGGC AAACCTCTCA AGGATTGGGA TATTAACATT TATCGCGGTG TGCTTACTGG ATGCAATGAA GCCTTTATCA TAGATGAAGA TAAGCGTAAC GAAATACTCA ATAACTGCCA GAGCGAGGAC GAGCGTAAGC GAACGGAAGA GATTATTCGA CCGATATTGA GGGGACGGGA CATCAAGCGT TACAGCTATG ATTGGGCAGG GCTGTATATT GTGTATATAC CTTGGCATTT CCCCTTGCAT TTAGACTCTA CAATCACGGG AGCATCAGAA AAAGCAGAAG ATGCATTTAA GGCTTCTTAT CCTGCTGTAT ATAGATATCT TGAAGGATAT AAAGATTTAT TAACAAAGAG GAATCAAGCA GAGACTGGCA TTCGCTACGA GTGGTATGCT CTTCAACGAT GGGGGGCTAA CTATTGGGAG GATTTCAGTA AGCCAAAAAT TATGTGGAAA AGGATCGGAT CCATACTTCG TTTTTGCTAT AACGAGAATG GAGCCTTAGG ATTAGATAGC ACCTGTTTTG CTGCTGGTAA AGGAGCTGAG TTTTTGTGCT GCTTGCTAAA CTCTCCTATG GGACATTATC TATTGAAAGA CAGCCCTAAA ACAGGGACAG GAGATTTGCT AATTAGTATT CAAGCAGTAG AACCAATCAA AGTTCCTCCT ATTACAGAGA CCTTAATAAG GTCTTTCAAA GCCCTATTAA CGAATGTATG TCAAATTGGA ACTGAGGAAC AAGAAACCTC TATCAACCAC CAAATTTTCT CTTTGTATAA CCTCTCAGAG GAAGAACAGA GATACATTAA AAACAACTTT AGTCATTGA
|
Protein sequence | MELKQYLQQS YQGIESFLEK IVFPIFGQEL FEDGHGVSIL EMYPDLRPAA QATGILEIKH IGNIDMDFNP INIFDITVSS QIKMARNKVG IQNIIRRIMD TYSSAFMIFH YEDNPLWEWR FTFCHKGRSQ SDITSSKRFT FLLGPGQHCR TAADNFQKLI DKKQRQDIEL KDIEDAFSVE ALTKQFYKDL FEWYQWAISP EADISFPNDT STSEDDREDL ETKIIRLITR IMFVWFIKQK ELVPQHLFDI AFLKTILKDF DPNSTTEGNY YNAILQNLFF GTLNRARQDE DGKPRRFATG SKRDVKTLYR YAELFSISEK EVIQLFDSIP FLNGGLFECL DKTRYIDGVE RCYSLDGFSR NDTRFANGRF KHRATIPNNL FFAPERGLVS ILSRYNFTIE ENSPEEQQVA LDPELLGKVF ENLLGAYNPE TQETARNQSG SFYTPREVVN YMVDESLISY LGDSDLVRSL FRPDFVLQED NKVQCEAIAS KLKAVKILDP ACGSGAFPMG LLNRMIELLE RISPQEKSYD LKLFVIENCL YGSDIQSIAA QITKLRFFIS LICDCERDET KPNFGIPTLP NLETKFVAAN SLIAKKKMAQ HRNLFEDPAI EETKNELIGV RHKHFSAKST SAKLRLREQD QDLRKKLAQL LAENEDFAPE DALQLAAWNP YDQNAVSSFF DPEWMFGLAD GFDIVIGNPP YVQLQNNGGE LTKLYQGCDF KTFARTGDVY CLFYERGWQL LKEGGHLCYI TSNKWMRAGY GEKTRLFFAS KTNPKLLVDF AGVKVFESAT VDTNILLFAK EANAGQTQAV SLNKNVQIGG SELCEYIQQH ATASAFISSE SWVILSPIEQ SIKRKIEAVG KPLKDWDINI YRGVLTGCNE AFIIDEDKRN EILNNCQSED ERKRTEEIIR PILRGRDIKR YSYDWAGLYI VYIPWHFPLH LDSTITGASE KAEDAFKASY PAVYRYLEGY KDLLTKRNQA ETGIRYEWYA LQRWGANYWE DFSKPKIMWK RIGSILRFCY NENGALGLDS TCFAAGKGAE FLCCLLNSPM GHYLLKDSPK TGTGDLLISI QAVEPIKVPP ITETLIRSFK ALLTNVCQIG TEEQETSINH QIFSLYNLSE EEQRYIKNNF SH
|
| |
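The gene and protein sequences above are related through translation table 11, and the GC content field can be recomputed from the gene sequence. The following is a rough sketch using only the Python standard library. It assumes the sequence is pasted in with its space-separated 10-base groups, and the helper names (`clean`, `gc_fraction`, `translate`) are hypothetical. Table 11 uses the same codon-to-amino-acid assignments as the standard code and differs in its permitted start codons, which does not matter here because this gene starts with ATG.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G). Table 11 shares these
# assignments with the standard code; only the start codons differ.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spacing between 10-base groups and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence in this record.
print(translate("ATGGAACTGA AACAATACTT ACAGCAGTCT"))  # MELKQYLQQS
```

Running `translate` and `gc_fraction` on the full gene sequence should reproduce the protein sequence and the approximate GC content listed in the record, assuming the sequence is copied without loss.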