Gene PG1542 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPG1542 
SymbolprtC 
ID2553029 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePorphyromonas gingivalis W83 
KingdomBacteria 
Replicon accessionNC_002950 
Strand
Start bp1620928 
End bp1622172 
Gene Length1245 bp 
Protein Length414 aa 
Translation table11 
GC content50% 
IMG OID637150179 
Productcollagenase 
Protein accessionNP_905681 
Protein GI34541202 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0826] Collagenase and related proteases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATGTAA ACGACTTCGA GATAATGGCT CCAGTCGGTT CGTACGAATC GCTTATGGCA 
GCCATCAAGG CAGGAGCAGA TTCAGTTTAC TTCGGGATTG AAGGACTGAA TATGCGTGCG
CGATCTGCCA ACAACTTCAC CACAGAAGAT CTGTACAAAA TAGCCGAGAT TTGCAGAGAT
AAAGGCGTAA AGAGCTATTT AACGGTGAAT ACCGTCATAT ACGATGAGGA CATAGCACTC
ATGCGCTCCG TCATCGATGC GGCACAAAAG GCACAAATAT CTGCCATTAT AGCTTCCGAC
GTAGCTGCGA TGATGTATGC CAACGAGATC GGAGTAGAAG TGCATCTGTC CACTCAGCTC
AATATCAGCA ACGCGGAGGC CCTACGCTTT TATTCGCGCT TTGCCGATGT GGTCGTATTG
GCAAGAGAGC TGAATATGGA TCAGGTGCGT ACAATCCACG AGACCATCGT CAGGGATAAT
ATCTGTGGGC CTAAAGGCCA TCCCGTACGT ATAGAGATGT TTGCTCACGG CGCTCTGTGT
ATGGCCGTTT CGGGCAAGTG CTATCTAAGC CTGCACGAAC ACAACAGCTC CGCCAACAGA
GGAGCCTGTG CGCAGATCTG CAGGAGGGGC TACACCGTCA AGGACAAGGA TAGCGGTCTG
GAACTGGACA TTGAGAACCA ATACATCATG TCGCCGAAAG ATCTGAAGAC TATTCATTTC
ATCAATAAGA TGATGGATGC CGGCGTACGA GTATTCAAGA TAGAAGGAAG GGCACGTGGC
CCCGAATACG TCTATACGGT CTGCCGCTGC TATAAAGAAG CGATCGAAGC CTACTGCAAC
GGCACCTATG ATGAAGAGGC CATAGGCCGG TGGGACGAAC AATTGGCTAC GGTATTCAAC
CGAGGCTTTT GGGATGGCTA CTACCTCGGA CAACGGCTCG GCGAATGGAC ACATCGTTAC
GGCTCAGGAG CTACGCGACA GAAAATATAT GTAGGCAAGG GGATCAAATA CTTCAGCCGT
CTCGGTGTGG CTGAATTCGA GATAGAGTCC GGCGAACTGC ATATAGGCGA TGAGATTGTG
ATCACCGGCC CTACTACAGG TGTGATCATC CAAAAGGTGG AAGAGATCCG ATACGAACTG
CAAACCGTGG AAAAGGCGAC AAAGGGACAA CGCATTTCCA TTCCGGTAAA GGAGAAAGTG
CGTCCGTCGG ACAAGCTCTA CCGGTTCGAC AAAAGAGAAG AATAA
 
Protein sequence
MNVNDFEIMA PVGSYESLMA AIKAGADSVY FGIEGLNMRA RSANNFTTED LYKIAEICRD 
KGVKSYLTVN TVIYDEDIAL MRSVIDAAQK AQISAIIASD VAAMMYANEI GVEVHLSTQL
NISNAEALRF YSRFADVVVL ARELNMDQVR TIHETIVRDN ICGPKGHPVR IEMFAHGALC
MAVSGKCYLS LHEHNSSANR GACAQICRRG YTVKDKDSGL ELDIENQYIM SPKDLKTIHF
INKMMDAGVR VFKIEGRARG PEYVYTVCRC YKEAIEAYCN GTYDEEAIGR WDEQLATVFN
RGFWDGYYLG QRLGEWTHRY GSGATRQKIY VGKGIKYFSR LGVAEFEIES GELHIGDEIV
ITGPTTGVII QKVEEIRYEL QTVEKATKGQ RISIPVKEKV RPSDKLYRFD KREE