Gene PG0414 details

Gene Information

Locus tag: PG0414
Symbol:
ID: 2551602
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Porphyromonas gingivalis W83
Kingdom: Bacteria
Replicon accession: NC_002950
Strand:
Start bp: 451862
End bp: 453742
Gene Length: 1881 bp
Protein Length: 626 aa
Translation table: 11
GC content: 51%
IMG OID: 637149186
Product: hypothetical protein
Protein accession: NP_904716
Protein GI: 34540237
COG category:
COG ID:
TIGRFAM ID:
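
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names are illustrative and not part of the source database.

```python
# Values copied from the Gene Information fields above.
START_BP = 451862
END_BP = 453742
GENE_LENGTH_BP = 1881
PROTEIN_LENGTH_AA = 626


def span_length(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumption)."""
    return end - start + 1


def coding_protein_length(gene_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1  # drop the stop codon


print(span_length(START_BP, END_BP))           # 1881
print(coding_protein_length(GENE_LENGTH_BP))   # 626
```

Under these assumptions both derived values agree with the Gene Length and Protein Length fields of the record.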


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value: 0.867976
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGAAAGG GAGAAAAACG AGAATCTCGG CTCGGTAGCC GACAACTGGG GGCGATAATT 
CTGATCGTCA CTCTCTCTTT TTCTGCCCTT GCCTCTCTCC AAGGTCCCCC TCCCAAAGGG
AGTAAGGGGA AAACGCATGT CATCCTCGAA CATGCCGATG AACTCCGTTA CGACAGACTC
TACAACCCCG ATGTACAGCG TCTGCTTGGC AATGTCGTGA TCAAGCATGA AGGAGCTGTG
ATGCGCTGCG ACAGCGCTCA TCTTAATCAG GAAGAGAACA CTTTCGAAGC ATTCGGCCAA
GTTTCCATGC AGCAGGGCGA CACCGTATCC ATGTTCGCCC GCTATCTCCA TTATGATGGA
AACATCAAAT ACGCTCGTCT TCGCCATGAA GTGCGACTGG AAAATCGTTC GGCTACTCTC
TTTACGGATA GTTTGGATTA TGACCGGGTC ATGAACCTGG GCTACTATTT CGAAGGTGGT
AGCATAGTCG ACTCTCTCAA TACGCTGACT TCCAGCTATG GAGAATATTC TCCCACCACA
TCCGATGCCA TCTTCCGAGA TAATGTCCAT TTGGAAAATA AGGACTATAC CATGGACACG
GAAGAACTTC ATTATAATAC AGATACCAAG ATCAGTCACA TATTGGGGCC TACGGAGATG
AGATCGGACT CCGGCTATAT CGTTTCTACG CGAGGAGTGT ACGATTCGAA CACCGATGTA
GGCATTTTGC TGGATCGCTC CATCGTTTAT TCCTCCAACG GGGCCAAGCA ATTGACGGGG
GACTCGATCT TTTACGACCG TCGTACCGGT TTTGGCGAAG CCTTCGGCAA TATGATCCTC
ACCGATACGG TGAACCGTTC TTCTCTTTAT GGGGAGTATG GTTATTACGA TGAGAAGAAG
GACTATGCTT TTGCCACCCA ACGATCTTAT ATGATCGACT TTTCCAAACC CGACACCTTG
TGGGCAGCAG CCGACACGCT TGAGATGATC ACGCAGCGTC GCGTCCCCGA GGATAGGCGG
ATAGCACGCG GGTACAGACA TGTACGGGTT TATCGAACTG ATGTCCAAGC CATTGCCGAC
TCTATGCAGT ACGACTCTCG CGATTCTCTG CTCTACCTTT ATGACAACCC CATTATGTGG
AATGAAGACT CCCAGTTGAG CGGCGATACG ATCCGGTTCA AATTCCGCAA CGACAGTCTG
GACTATGTCG ATGTGCTTAC CAAGGCTCTT GCCGTTCGGC GGATAGATTC CGTCATGTAT
GACCAGCTGG CCGGCAGACA TATCAGAGCC TATATGCAGG ACAGCCTTGT ACGCCAGATA
CAGGTGCATG GCAATGCCGA AGTCATCCAA TACGAACAGC ACAAACGATC GAAACGCTGG
TATCTGATGA ATCGAATCGA AGCTCCCTCT ATAATTGCCG ATTTCGAAGA AGGCCAACTC
AAGAAAGTAC TCTTGCGTGG AGTGGCATCG GGCAAAGGCT ACCCGATCAA AATGCTCACG
CCCGATCTTC AACGCTTGGC CTCTTTTCGA TGGGAGGAGG CTGTCAGGCC GAAATCGAAG
GAGGATCTTT TCCGCAGGCA GCCGGATTCC GTCTTGCAGG TGCATCGATC GCTGTCCGAT
TTGCGACGCT TTAGCGGTGC TTTGGCAGCC CTTCGTGCAT ACACGGCTCT GGCCGAAGAG
GAGAGAAAAG ACTCATTGAC AATCGCTGCC CTACAGACCG ACTCCATTCC TCCGACACCT
GCAGCCGGCA AAGAGGCTAC CGATCCTACA GACCGGCTCT CTCCTTATAT TGCCCGACCT
ACTACGGACA CCAAAGAGGA AGGCTTCTTC GATCTTTTCT TCACTCCATT CATCTTTAAT
AGAGAAAAAC TATGGGATTA G
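
A GC content figure like the one listed for this gene can be recomputed from the sequence block above. The sketch below is one plausible way to do it, assuming GC content means the percentage of G and C among the A/C/G/T bases; how the source page rounds the value or treats ambiguous bases is not stated, and `gc_content` is a hypothetical helper name.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C among the A/C/G/T bases of a sequence.

    Whitespace and any other characters are ignored, so the 10-base
    groups used in the record can be pasted in unchanged.
    """
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no A/C/G/T bases")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


# First row of the gene sequence; paste all rows to evaluate the whole gene.
first_row = "ATGAGAAAGG GAGAAAAACG AGAATCTCGG CTCGGTAGCC GACAACTGGG GGCGATAATT"
print(round(gc_content(first_row), 1))
```

Running it on the full 1881 bp sequence, rather than on one row, is what would be comparable with the GC content field of the record.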
 
Protein sequence
MRKGEKRESR LGSRQLGAII LIVTLSFSAL ASLQGPPPKG SKGKTHVILE HADELRYDRL 
YNPDVQRLLG NVVIKHEGAV MRCDSAHLNQ EENTFEAFGQ VSMQQGDTVS MFARYLHYDG
NIKYARLRHE VRLENRSATL FTDSLDYDRV MNLGYYFEGG SIVDSLNTLT SSYGEYSPTT
SDAIFRDNVH LENKDYTMDT EELHYNTDTK ISHILGPTEM RSDSGYIVST RGVYDSNTDV
GILLDRSIVY SSNGAKQLTG DSIFYDRRTG FGEAFGNMIL TDTVNRSSLY GEYGYYDEKK
DYAFATQRSY MIDFSKPDTL WAAADTLEMI TQRRVPEDRR IARGYRHVRV YRTDVQAIAD
SMQYDSRDSL LYLYDNPIMW NEDSQLSGDT IRFKFRNDSL DYVDVLTKAL AVRRIDSVMY
DQLAGRHIRA YMQDSLVRQI QVHGNAEVIQ YEQHKRSKRW YLMNRIEAPS IIADFEEGQL
KKVLLRGVAS GKGYPIKMLT PDLQRLASFR WEEAVRPKSK EDLFRRQPDS VLQVHRSLSD
LRRFSGALAA LRAYTALAEE ERKDSLTIAA LQTDSIPPTP AAGKEATDPT DRLSPYIARP
TTDTKEEGFF DLFFTPFIFN REKLWD
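
The protein sequence corresponds to a translation of the gene sequence under translation table 11. The sketch below illustrates this on the first and last codons of the gene. It assumes the NCBI table 11 codon-to-amino-acid assignments, which match the standard genetic code, and it does not handle alternative start codons or ambiguous bases; `translate` is an illustrative helper, not a function of the source database.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon.

    Whitespace is removed first; a codon containing anything other than
    A/C/G/T raises KeyError.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 and last 21 bases of the gene sequence in the record.
print(translate("ATGAGAAAGG GAGAAAAACG AGAATCTCGG"))  # MRKGEKRESR
print(translate("AGAGAAAAAC TATGGGATTA G"))           # REKLWD
```

The two outputs match the first ten and last six residues of the protein sequence listed above.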