Gene PG1943 details

Gene Information

Locus tag: PG1943
Symbol:
ID: 2552167
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Porphyromonas gingivalis W83
Kingdom: Bacteria
Replicon accession: NC_002950
Strand:
Start bp: 2029064
End bp: 2030350
Gene Length: 1287 bp
Protein Length: 428 aa
Translation table: 11
GC content: 51%
IMG OID: 637150535
Product: hypothetical protein
Protein accession: NP_906025
Protein GI: 34541546
COG category:
COG ID:
TIGRFAM ID:
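
The coordinates and lengths in the table above are consistent with each other, which can be checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single untranslated stop codon; the variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 2029064
end_bp = 2030350

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends in one stop codon that is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints "1287 428"
```

Both values agree with the Gene Length and Protein Length fields listed in the table.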


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 28
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAAACGA GTAGCAAAAC GAAGAAAAAA ATGGAAGGTT CGCGCGATAG TACTTCGAAG 
AAACAGAAAA ACGGGGAGCA TGGGCATTCG GAGAGAAAAA TGGCCAACTC ATTGCAGATC
CTCATGCTTG CATCGCTCCT ATGGTTGGCA ATAGAAGATT TGGCCACTTC GGGACAATGG
GGAATATCTG CCATTACCGG GCTGGCTTTT TGGGCAATCC TACGGAAGCG CAGACTACTG
CCAAGAAAGA TCGATCAGCG GTTGCAATCA CTTCTCTTTC CCCTTCGGGT ACGAATCCGC
GCATTCGAGT TGAGACTGAT GCGTGGCAGA GCGACTCCAC CTGCAGACCC ATACATACAT
ATATGCCAAA ACTGCGGAGA TGGATATACG GGCAATTTCT GCAACCGCTG CGGACAAACC
TCTCGCACCG GACGGTATCA CTTCCGGCAG ATGATCCGAA ATGTGATAGG CGGCTTCACC
AATATCGACA GCGGTTTCGG GCGTACTATC GTGGAGCTGC TCTACCGCCC GGGTTATTTG
ATACGGGACT TTATCGGTGG CAAGAGGGTT GTCTACTTCC GTCCCTTTCA GGCCCTTTTC
GTGCTGGCCT CTCTGTACAT CATCTTTGCA CAACTGATCG GACCTGATCC CTTGCAGAAA
AAGCCCATTG GCGAAGAGCT TACGCGTCAA GAGTATCGGA TGGATCAAAA AGAAATATCC
TCCTTGCAAA AGGAGAATAA GGCAGTCGGC AGCGATAGCG GAGAAATGAT TGCCGAGCGA
CACACGGGGA GAAGAGCCTT TTTTTACAAA CAAATGAGGT TTGTCACCGA GCAGAAAAAT
AGGTTGAAGT CCCTCCCTTT CTTTTCAAGA ATATGGAGCC TGCTGATGGA GTGGTTTCAA
GGCAACAAGG CTGTCAGGAT CATCAGTATT CTTCCCATTC TCGCCGTGAG TACGAAAGCG
GCTTTCCGAC AAAAAGGAGC CGGAGCATAT AATCTGACGG AGCACATTAT CGCTCAGGCG
TACATAGCCT GCCAGCTATT GTTGCTGAAC GTTCTCGCCC TACCTTTCTA CAGCGATGCC
CGGGTGGGGT CGCTATACGG GCTACCTGCA TTGCTCCTAT TCCTTTTGTT TTGCTGGGAC
TACAAGCAGC TATTCCTCTG CTCTTGGTGG CGGAGCTTCT GGCGGACAAT CCTCGTTGTG
GTCTATTGCT TGCTCTTCTT GGCTCTGACG GCTTCGCTGA TTGCGGTAGT TATTGTGGCG
ATCGAGGCTG TCGGTACAGC TTCATAG
 
Protein sequence
MQTSSKTKKK MEGSRDSTSK KQKNGEHGHS ERKMANSLQI LMLASLLWLA IEDLATSGQW 
GISAITGLAF WAILRKRRLL PRKIDQRLQS LLFPLRVRIR AFELRLMRGR ATPPADPYIH
ICQNCGDGYT GNFCNRCGQT SRTGRYHFRQ MIRNVIGGFT NIDSGFGRTI VELLYRPGYL
IRDFIGGKRV VYFRPFQALF VLASLYIIFA QLIGPDPLQK KPIGEELTRQ EYRMDQKEIS
SLQKENKAVG SDSGEMIAER HTGRRAFFYK QMRFVTEQKN RLKSLPFFSR IWSLLMEWFQ
GNKAVRIISI LPILAVSTKA AFRQKGAGAY NLTEHIIAQA YIACQLLLLN VLALPFYSDA
RVGSLYGLPA LLLFLLFCWD YKQLFLCSWW RSFWRTILVV VYCLLFLALT ASLIAVVIVA
IEAVGTAS
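
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The following is a minimal sketch, not the method used to produce this record: it builds the codon table by hand, relying on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard code (the two differ only in which codons may act as start codons). The names `CODON_TABLE` and `translate` are hypothetical, and only the first line of the gene sequence is used as input.

```python
from itertools import product

# Codon assignments of the standard genetic code, ordered T, C, A, G.
# Translation table 11 uses the same assignments for elongation.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    # Remove the spaces and line breaks used to group the sequence in blocks of 10.
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence shown above.
first_line = "ATGCAAACGA GTAGCAAAAC GAAGAAAAAA ATGGAAGGTT CGCGCGATAG TACTTCGAAG"
print(translate(first_line))  # prints "MQTSSKTKKKMEGSRDSTSK"
```

The output matches the first 20 residues of the protein sequence. Passing the full gene sequence to the same function should yield the complete 428-residue protein.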