Gene PG0350 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPG0350 
Symbol 
ID2551891 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePorphyromonas gingivalis W83 
KingdomBacteria 
Replicon accessionNC_002950 
Strand
Start bp385446 
End bp386900 
Gene Length1455 bp 
Protein Length484 aa 
Translation table11 
GC content49% 
IMG OID637149129 
Productinternalin-related protein 
Protein accessionNP_904662 
Protein GI34540183 
COG category[S] Function unknown 
COG ID[COG4886] Leucine-rich repeat (LRR) protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.0909966 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAGAA AACCGCTATT CTCAGCCCTT GTAATCCTTT CCGGCTTCTT CGGATCGGTT 
CACCCGGCCT CAGCACAGAA AGTTCCTGCA CCCGTCGATG GCGAGCGCAT TATCATGGAG
CTAAGTGAAG CCGATGTGGA GTGTACAATC AAAATAGAAG CCGAGGATGG CTATGCCAAC
GACATTTGGG CAGACCTCAA CGGAAACGGC AAGTACGATT CGGGGGAGAG GCTCGATTCA
GGTGAGTTTC GTGATGTTGA GTTCAGACAA ACAAAGGCCA TCGTCTATGG CAAAATGGCC
AAATTCTTGT TTAGAGGTTC TTCTGCAGGG GACTATGGTG CTACCTTTAT AGATATTAGC
AATTGTACCG GCCTGACTGC TTTCGACTGC TTTGCCAATC TGCTGACAGA ACTCGATCTG
TCCAAAGCAA ACGGTCTGAC TTTTGTAAAC TGCGGCAAAA ACCAGCTGAC CAAGCTTGAC
CTGCCCGCAA ATGCGGACAT TGAGACGCTG AACTGCTCCA AAAACAAGAT AACGAGTCTC
AACCTATCGA CCTATACCAA GCTGAAAGAG CTTTATGTGG GCGACAACGG GCTGACAGCC
TTGGATCTCT CCGCCAATAC GCTCCTCGAA GAGCTGGTGT ATTCTAACAA CGAGGTGACT
ACGATAAACC TGTCTGCCAA TACGAACTTG AAAAGCCTGT ATTGCATAAA CAATAAGATG
ACCGGACTCG ATGTCGCAGC CAACAAAGAG CTGAAAATAC TCCACTGCAA CAACAATCAG
CTGACCGCCC TCAATCTCTC GGCCAATACC AAGCTGACGA CTCTAAGCTT CTTCAACAAC
GAGCTGACAA ATATCGATCT CTCCGACAAC ACGGCTTTGG AGTGGCTTTT CTGCAACGGC
AATAAGCTGA CGAAGTTAGA TGTATCTGCC AACGCCAATC TGATAGCACT GCAATGCAGC
AACAACCAGC TGACTGCTCT GGATCTGTCA AAAACGCCGA AACTGACAAC GTTGAATTGC
TACTCCAACC GGATCAAAGA TACCGCCATG CGTGCATTGA TCGAAAGCCT GCCTACGATC
ACTGAAGGAG AAGGCAGGTT CGTTCCTTAC AACGACGATG AAGGAGGAGA AGAGGAGAAC
GTGTGTACAA CCGAACACGT GGAAATGGCC AAGGCCAAGA ATTGGAAGGT ACTTACCTCG
TGGGGAGAGC CTTTCCCCGG AATAACGGCT TTGATTTCCA TCGAAGGTGA GAGCGAATAT
TCCGTATATG CTCAAGATGG CATCCTCTAC CTCTCCGGTA TGGAGCAGGG CTTGCCCGTT
CAGGTATATA CCGTGGGAGG AAGCATGATG TACTCATCTG TCGCTTCCGG ATCAGCCATG
GAAATACAGC TCCCGAGAGG TGCAGCCTAT GTAGTACGTA TCGGCAGCCA TGCGATCAAA
ACCGCGATGC CGTAA
 
Protein sequence
MKRKPLFSAL VILSGFFGSV HPASAQKVPA PVDGERIIME LSEADVECTI KIEAEDGYAN 
DIWADLNGNG KYDSGERLDS GEFRDVEFRQ TKAIVYGKMA KFLFRGSSAG DYGATFIDIS
NCTGLTAFDC FANLLTELDL SKANGLTFVN CGKNQLTKLD LPANADIETL NCSKNKITSL
NLSTYTKLKE LYVGDNGLTA LDLSANTLLE ELVYSNNEVT TINLSANTNL KSLYCINNKM
TGLDVAANKE LKILHCNNNQ LTALNLSANT KLTTLSFFNN ELTNIDLSDN TALEWLFCNG
NKLTKLDVSA NANLIALQCS NNQLTALDLS KTPKLTTLNC YSNRIKDTAM RALIESLPTI
TEGEGRFVPY NDDEGGEEEN VCTTEHVEMA KAKNWKVLTS WGEPFPGITA LISIEGESEY
SVYAQDGILY LSGMEQGLPV QVYTVGGSMM YSSVASGSAM EIQLPRGAAY VVRIGSHAIK
TAMP