Gene PG2043 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPG2043 
Symbol 
ID2552043 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePorphyromonas gingivalis W83 
KingdomBacteria 
Replicon accessionNC_002950 
Strand
Start bp2142358 
End bp2143452 
Gene Length1095 bp 
Protein Length364 aa 
Translation table11 
GC content58% 
IMG OID637150620 
Producthypothetical protein 
Protein accessionNP_906109 
Protein GI34541630 
COG category[S] Function unknown 
COG ID[COG0327] Uncharacterized conserved protein 
TIGRFAM ID[TIGR00486] dinuclear metal center protein, YbgI/SA1388 family 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00000126917 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGATCATTC AGGATATTAT AGAGGCTATC GAGGCGGTCT GCCCGAGGGC TTATCAAGAG 
AGCTATGACA ATAGTGGCGT GCAGGTGGGC GACACCAAGC GGGAGGCAAC GGGTGCCCTC
CTCTGTGTGG ATGTTACCGA AGCGGTATTG GAGGAGGCCA TTCGGCTGGG ATGCAATCTC
GTCATTGCCC ACCATCCGAT TCTTTTCAAA CCGCTCAAGC GATTGACCGG CAGCTCCTAC
GTGGAGCGAT GCGTGGAGCT GGCCGTACGG CACGGTCTGG TGCTATATGC GGCTCATACC
AATGCGGACA ACGCTCCGCA GGGACTGAAT GCGCTGCTGG CCGAACGCTT CGGCTTGCTG
AATACGCGAC CGCTGGAGCC GCTGCAAGGC AAGCTCTTAG AACTGGTCAC CTTCGTCCCC
ACGGAGTATG CCGATGCCGT GAGGCAGGCT TTGTGGCAGG CCGGTGCAGG CCGTTTGGGG
CATTACGATT GCTGTTCGTT CAGCCATGCC GGCACAGGGA CTTTCAGAGC TGCCGAGGGT
GCCAATCCCT TTGTGGGAGC GATAAGCGAA TTGCACCATG AGGCGGAGGA GCGGATCAGC
CTCGTACTGC CGGCATACAG GCAGGGTACT GTGCTGCAGG CTTTGCACGC GGCTCATCCG
TACGAGCTGC CGGCTGTCAG CCTGATCCCG CTGGCCAACG ATCATCCCTC GGCCGGAGGC
GGAATAGTGG GGGATCTGCC TTCGCCCATA AGCGAGCGGG AGATGCTGCT GCACATCAAG
GAGGTATTCG GTCTGAAGGT CCTGTCCCAT TCGGCTTGGA GGGAACGGCC GTTGAGGCGG
ATGGCTATAT GCGGCGGTAG CGGTGCTTTC ATGTGGCGGC GTGCAGCACA GGAGGGTGCA
GACCTCTTCC TGACAGGGGA GGCGAAGTAC AACGACTTCT TCGATGCAGG GGAGCATCTG
CTGCTGGTTA CGATCGGTCA TTACGAGAGC GAAGAGGTGG CTAATGAGCT ATTTATGCGC
ATAATATCGC AGAAATTCCC TACCTTTGCC ACCCACAAAT CATCGGTTGC AACCAATCCG
GTAAACTATT TGTAG
 
Protein sequence
MIIQDIIEAI EAVCPRAYQE SYDNSGVQVG DTKREATGAL LCVDVTEAVL EEAIRLGCNL 
VIAHHPILFK PLKRLTGSSY VERCVELAVR HGLVLYAAHT NADNAPQGLN ALLAERFGLL
NTRPLEPLQG KLLELVTFVP TEYADAVRQA LWQAGAGRLG HYDCCSFSHA GTGTFRAAEG
ANPFVGAISE LHHEAEERIS LVLPAYRQGT VLQALHAAHP YELPAVSLIP LANDHPSAGG
GIVGDLPSPI SEREMLLHIK EVFGLKVLSH SAWRERPLRR MAICGGSGAF MWRRAAQEGA
DLFLTGEAKY NDFFDAGEHL LLVTIGHYES EEVANELFMR IISQKFPTFA THKSSVATNP
VNYL