Gene PG0120 details

Gene Information

Locus tag: PG0120
Symbol: epsC
ID: 2551825
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Porphyromonas gingivalis W83
Kingdom: Bacteria
Replicon accession: NC_002950
Strand:
Start bp: 143144
End bp: 144304
Gene Length: 1161 bp
Protein Length: 386 aa
Translation table: 11
GC content: 47%
IMG OID: 637148931
Product: UDP-N-acetylglucosamine 2-epimerase
Protein accession: NP_904464
Protein GI: 34539985
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0381] UDP-N-acetylglucosamine 2-epimerase
TIGRFAM ID: [TIGR00236] UDP-N-acetylglucosamine 2-epimerase
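
The start, end, gene length, and protein length fields above are consistent with one another, which can be checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the gene length includes the stop codon; both are assumptions that fit the listed values, not something the record states. The function names are hypothetical.

```python
def gene_length_bp(start: int, end: int) -> int:
    """Length of a feature whose coordinates are 1-based and inclusive (assumed)."""
    return end - start + 1


def protein_length_aa(gene_len: int, includes_stop: bool = True) -> int:
    """Residue count for an unspliced CDS of the given length in bp."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len // 3
    # The stop codon does not encode a residue (assumed to be counted in gene_len).
    return codons - 1 if includes_stop else codons


length = gene_length_bp(143144, 144304)
print(length)                     # 1161
print(protein_length_aa(length))  # 386
```

Here 1161 bp is 387 codons, and dropping the stop codon leaves the 386 residues listed in the record.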


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0000198131
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAAAAAAG TGATGTTGGT CTTCGGGACG AGACCCGAAG CGATCAAGAT GGCTCCGCTG 
GTGAAGGAAT TTCAAGCGAG AGCAAGTGAG TTTGATACCA TTGTCTGTGT GACGGGTCAG
CATAGAGAGA TGCTCAAGCA AGTGCTGGAG CTATTTGATA TCAAGCCCGA TTATGACTTG
GAGATCATGA AGGAGGGGCA GGATCTCTAT GACGTAACTA CACGTGTGCT GTTGGGTATG
CGTGAAGTAC TCAAGAAGAC AAAGCCCGAT GTAGTACTCG TACACGGCGA TACGACTACA
AGTACTGCCG CTGCATTGGC TGCTTTCTAT CAACAGATTC CGGTAGGACA TGTGGAGGCA
GGGCTTCGCA CGCACAACAT TTACAGCCCA TGGCCGGAAG AGATGAACCG TCAGCTCACC
GGTAGGATGG CTACCTATCA CTTTGCTCCT ACGGAATTGA GTCGGGACAA TTTACTTGCA
GAAGGGATTG CTACAGATCG TATATTTATT ACAGGAAATA CAGTAATCGA TGCTCTACAA
CAAGTCGTTA CACGAGTTAA GGGTAATGCC GATTTGCGAA ATCAAGTGTC TCGAAAGCTA
CTTCAATTTG GATATGATGT GAATCGTTTA GAGGCTGGGC GTAGACTTGT TCTTATCACA
GGGCATCGCA GAGAAAACTT TGGCGAAGGA TTCCTTAATA TCTGCCGTGC TATTCAAACT
CTTAGCAAGC GTTTCCCGGA GGTAGACTTT GTTTATCCCA TGCACCTTAA CCCCAATGTG
CGTAAGCCTA TTCGCGAGAT CTTCGGCGAT AACCTTGGAG GCTTGGATAA TCTCTTTTTT
ATTGAGCCGC TGGAGTATTT GCAGTTTGTT ACGCTCATGG ATCGTTCGTC CATTGTTCTG
ACTGATAGTG GAGGTATTCA GGAAGAAGCT CCAGGGTTAG GCAAACCTGT ATTGGTAATG
CGAGATACTA CGGAGCGTCC CGAAGCGGTG AAAGCAGGAA CCGTGAAACT TGTAGGGACA
GATTATAATC AAATCGTCGA CAATGTCGAA AAACTACTGA CAGACAACGC CGCATATGCC
GAAATGAGCA GAGCCAATAA TCCGTACGGT GACGGAAAAG CATGCTCATA TATAGCGGAT
GCTCTTACTC GATGCATTTA G
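
A GC content figure like the one in the Gene Information fields is normally the share of G and C bases in the gene sequence. The sketch below shows one way to compute it from a sequence printed in space-separated blocks of ten, as here. The function name is hypothetical, and how the database rounds the value is not stated in the record.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C among the A/C/G/T characters in seq.

    Spaces, line breaks, and any other characters are ignored, so the
    blocked layout used in the record can be pasted in directly.
    """
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotide characters found")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


# First two blocks of the gene sequence: 6 G/C bases out of 20.
print(gc_percent("ATGAAAAAAG TGATGTTGGT"))  # 30.0
```

Running it on the full gene sequence should give a value comparable to the GC content field, subject to rounding.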
 
Protein sequence
MKKVMLVFGT RPEAIKMAPL VKEFQARASE FDTIVCVTGQ HREMLKQVLE LFDIKPDYDL 
EIMKEGQDLY DVTTRVLLGM REVLKKTKPD VVLVHGDTTT STAAALAAFY QQIPVGHVEA
GLRTHNIYSP WPEEMNRQLT GRMATYHFAP TELSRDNLLA EGIATDRIFI TGNTVIDALQ
QVVTRVKGNA DLRNQVSRKL LQFGYDVNRL EAGRRLVLIT GHRRENFGEG FLNICRAIQT
LSKRFPEVDF VYPMHLNPNV RKPIREIFGD NLGGLDNLFF IEPLEYLQFV TLMDRSSIVL
TDSGGIQEEA PGLGKPVLVM RDTTERPEAV KAGTVKLVGT DYNQIVDNVE KLLTDNAAYA
EMSRANNPYG DGKACSYIAD ALTRCI
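
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. Translation table 11 assigns the same amino acids to codons as the standard code during elongation and differs in which start codons it permits, so the minimal sketch below builds the standard codon table and translates literally. The names are hypothetical, and the sketch does not handle alternative start codons, which is not needed here because the gene begins with ATG.

```python
from itertools import product

# Standard codon assignments, in TCAG order for each codon position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str, to_stop: bool = True) -> str:
    """Translate a DNA coding sequence; blanks and line breaks are ignored."""
    dna = "".join(b for b in seq.upper() if b in "ACGT")
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*" and to_stop:
            break  # stop codon ends the protein
        residues.append(aa)
    return "".join(residues)


# Start and end of the gene sequence from the record.
print(translate("ATGAAAAAAG TGATGTTGGT C"))  # MKKVMLV
print(translate("GCTCTTACTC GATGCATTTA G"))  # ALTRCI
```

Both fragments match the corresponding ends of the protein sequence listed in the record.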