Gene PG1037 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPG1037 
Symbol 
ID2553084 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePorphyromonas gingivalis W83 
KingdomBacteria 
Replicon accessionNC_002950 
Strand
Start bp1101505 
End bp1102884 
Gene Length1380 bp 
Protein Length459 aa 
Translation table11 
GC content47% 
IMG OID637149742 
Producthypothetical protein 
Protein accessionNP_905257 
Protein GI34540778 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00000000133275 
Fosmid HitchhikerNo 
Fosmid clonabilityunclonable 
 

Sequence

Gene sequence
ATGAAAGAAC AGAAATTGTC TAATCGGTTT TGTCCGGAAG GCATGACCGT AGAAGAGTGG 
CAAGTAGGAT TAAGAAGAGA ATTTGCTGAA CAGAATCCTT TCGATGTTGA ACATTTGGAT
GAGAATCGGA TATGGGGCGA TTATTTGGTC GCAAACGGCC GCAACCGATA CCGAGTTGCA
TTCCGTGGTG TTCGGAGCGA ACGGAATTAT TGTTCTTGTC TCGATTTTAG AACGAATGGG
CTTGGAACAT GCAAGCATAT CGAGGCAGTA ACACTCTTTC TTCAAGATCA GGTACCCGGT
TATCCTTGGG GCGAATTTGA CTATACGCCT GAGTATAGCT CCATCTATGT AAGCTATAAA
GGCGGACGGA GCATCAAGAT GCGTGTTGGA CGAGAAGAGA CGGCACGCTT CGAGGCCTTG
CGTGCTCGCT TTTTCGATGA ATCGGGCACT TTGCCTCCTG AGAACTACAG CTTCTTACCG
CAGATTTGTG ATGAGGCCAA GGATATAGCT GCTTCATTCC GTTGTTATGA AGATGTATTT
GATCTGACCC AGGAGCATTG TCGCAGGATG GAGTGGCAAG ATTATTTACG TGAGACCAAT
CCTGATAAGC GTGTCGATAA TGAATACACG GCCGATATGC CGGAAGAAAT GCGTAGGCAT
GTGTACGATT TGGCATATGA AGGACATGGT GTGATGGTCG GTCTTCCTTC TGATGTAATT
GTCCGCGAAG TCGTTGCTCT TGCCGATGTA TATTTGCGCG ATCAGCCCGG GGCTACGGGA
TATGTGATTA TCAATGATCC CCAGCGCTTC ATCCTTTGGC GCAATGCCTT TGCCAATCCT
GCTCTTCGCC GCTTGCCCAT TGAGGTGATG CATGCATCTG CCTTTGTCAA GAAGGTGGCT
AATAATGTTC CCCCTTGTAC GTTCATGTTT GTGGAGCAGG CCGCTCGTTT GAAGGAGTGG
CGCGATCCCG TTTCGGTGGC TATCAAGAGG ATGCAGATCG ACCATCTTTA TATGAATCTG
GAGACCATCG AAGATCTCAC CCCTGTTCAG TTCTCCTCTG TGGTACAGCA TATAGATCCC
TTTATCTTAG GCCCGTTCTA TCGCTTTATC CGAGACTATC GTCCTATCTT CCCGCTTAGA
AATGATGGCT CCAATCTGCC TGATGATGTG CGCGACTTCA TGTTCTTGTA CACATCCGAT
TCGATTAGAG AAATGAACGA AATGCTTCCT CCTATCCGTA CGGCAGAAAT TTTGCTGCAT
GGTACTGATG CGGAAGAAGA GATAGACAAA CTCCTGCGTC ATCTTAATCG TGTTCTCTCG
AATGATACGC TGCGAGAAGT ATTTCTTAAA AGAATCAGTA TGATTGTCGG GGCATCGTAA
 
Protein sequence
MKEQKLSNRF CPEGMTVEEW QVGLRREFAE QNPFDVEHLD ENRIWGDYLV ANGRNRYRVA 
FRGVRSERNY CSCLDFRTNG LGTCKHIEAV TLFLQDQVPG YPWGEFDYTP EYSSIYVSYK
GGRSIKMRVG REETARFEAL RARFFDESGT LPPENYSFLP QICDEAKDIA ASFRCYEDVF
DLTQEHCRRM EWQDYLRETN PDKRVDNEYT ADMPEEMRRH VYDLAYEGHG VMVGLPSDVI
VREVVALADV YLRDQPGATG YVIINDPQRF ILWRNAFANP ALRRLPIEVM HASAFVKKVA
NNVPPCTFMF VEQAARLKEW RDPVSVAIKR MQIDHLYMNL ETIEDLTPVQ FSSVVQHIDP
FILGPFYRFI RDYRPIFPLR NDGSNLPDDV RDFMFLYTSD SIREMNEMLP PIRTAEILLH
GTDAEEEIDK LLRHLNRVLS NDTLREVFLK RISMIVGAS