Gene PG1072 details

Gene Information

Locus tag: PG1072
Symbol:
ID: 2553210
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Porphyromonas gingivalis W83
Kingdom: Bacteria
Replicon accession: NC_002950
Strand:
Start bp: 1139116
End bp: 1140507
Gene Length: 1392 bp
Protein Length: 463 aa
Translation table: 11
GC content: 44%
IMG OID: 637149773
Product: MutS family protein
Protein accession: NP_905287
Protein GI: 34540808
COG category: [L] Replication, recombination and repair
COG ID: [COG0249] Mismatch repair ATPase (MutS family)
TIGRFAM ID:
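
The gene and protein lengths listed above can be cross-checked from the coordinates. The sketch below assumes that Start bp and End bp are 1-based and inclusive (the reported 1392 bp is consistent with that reading) and that the final codon of the CDS is a stop codon. The function names are hypothetical and used only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1  # subtract the stop codon


length = gene_length_bp(1139116, 1140507)
print(length, protein_length_aa(length))  # 1392 463
```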


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value: 0.6736
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAACTAC GGGAACAGAT AAATAGCATT AGTGGCTTTC GCTATGTAAT AGATGAGTTG 
TGCATACATT CATCTGTAGG GCGACGTTGT CTGATGGAGC AAGAATTCTT GACCGAGGCT
TCCGATATTG AAGTGCTTCT TTCTCGTGTA GAAATAGCCA TCTCATACCA AGCAGACCAA
CGAAAACAAA AAGGTCTGGA TGAAATTGCA CACAAATTAA TGCAGCTGCG TGACATCCAA
GGGACGATAT ATTCTCTTTC ACGCCACGTA GTTTGCACGG ACATTGATTT TTTCGAGATC
AAGTTCTTAG CAATTTTAAG TGAAGATATT CGGGATCTGA TCCGTTTTTA CCAGTTAGAT
GATCTCTCTT CTCCCCTACC CGATTTGTCG CATATCGTTT CCGTTTTGGA TCCGGAGGAA
AAGAAAATTC CTCATTTCTA TATATACGAT GCATATTCGG AGACATTGAG AGAGTTGAGA
GACAGGCTCA AAAAAGAAAC AAACGAAGAC GCCAGGATCG AAATCCGCAA TGAAAGTTTG
CAGGAGGAAG ACATAGTCCG TAAGCGACTT TCTCGCGAGT TGTCCCCTTA TGCTGGAGGA
TTGGCTACAG CTCTGGAATT GTTGGGAGCG ATAGATCTGT TATTAGCAAA GGTCAAACTA
TTCATTCAGC TTGGATGGAG TAAACCGGGT TCTGGTCATA GTGTTACGAA CTATATGGGA
CTGGTACATC CACATGTCCT TAGCCTCCTG GGGAAAAAAG GAGAAAAGTT CCAGCCGGTA
GATATAGCCC TACCCTCTCT GCCAACCTTA ATTACCGGTG CTAACATGGC AGGAAAAAGT
GTGCTGTTGC AAGGAGTTGC ATTAGCTCAG ATCCTCTATC AATATGGCTT CTATGTGCCG
GCACAAAAGG CAGAGATATG CCCTGTAGAA AAAGTGATGC TTTCACTTGG AGATGCACAA
GATATTAGAC AAGGGCTTTC CTCTTTCGGG GCGGAAATGA TGTGTCTTTC GTCCATTGCC
GATGAGGCCA GACAGGGAAA GCAACTACTC GTTTTAGTCG ATGAACCTGC AAGGACAACG
AATCCTGTAG AAGGACAAGC CATTGTCAGT GGACTATTGG CTATATTGAG CAGGTATAAG
ATCCGATCTC TCGTCACTAC GCATTATGGC AGTATAGACA TTCCATGTCG CCGCTTGAAA
GTGCGTGGTT TTAGAGAAGA CAAAGTGAAC TTACCTCTAC AAGTAAATTC CCTCAGCAAA
TGTGTGGACT ATACGCTTGA AGAAGTGAGC GAAAACGATG TTCCACACGA AGCAATACGC
ATAGCAGAGA TCCTTGGGGT TAACGAGGCT CTTATGACAG AATGCAAACA GTTTTTGAAC
AACACGAAAT AG
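
The gene sequence is printed in space-separated blocks of ten bases, so it has to be joined before any length or GC calculation. The following is a minimal sketch of that step. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the example uses only the first row of the sequence rather than the full record.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (spaces, newlines) and upper-case the bases."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First row of the gene sequence as displayed in this record.
first_row = "ATGAAACTAC GGGAACAGAT AAATAGCATT AGTGGCTTTC GCTATGTAAT AGATGAGTTG"
row = clean_sequence(first_row)
print(len(row))  # 60
print(f"{gc_fraction(row):.0%}")
```

Applied to the full cleaned sequence, `len` should match the Gene Length field and `gc_fraction` should be close to the GC content field, subject to how the database rounds that value.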
 
Protein sequence
MKLREQINSI SGFRYVIDEL CIHSSVGRRC LMEQEFLTEA SDIEVLLSRV EIAISYQADQ 
RKQKGLDEIA HKLMQLRDIQ GTIYSLSRHV VCTDIDFFEI KFLAILSEDI RDLIRFYQLD
DLSSPLPDLS HIVSVLDPEE KKIPHFYIYD AYSETLRELR DRLKKETNED ARIEIRNESL
QEEDIVRKRL SRELSPYAGG LATALELLGA IDLLLAKVKL FIQLGWSKPG SGHSVTNYMG
LVHPHVLSLL GKKGEKFQPV DIALPSLPTL ITGANMAGKS VLLQGVALAQ ILYQYGFYVP
AQKAEICPVE KVMLSLGDAQ DIRQGLSSFG AEMMCLSSIA DEARQGKQLL VLVDEPARTT
NPVEGQAIVS GLLAILSRYK IRSLVTTHYG SIDIPCRRLK VRGFREDKVN LPLQVNSLSK
CVDYTLEEVS ENDVPHEAIR IAEILGVNEA LMTECKQFLN NTK
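
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below is a plain-Python illustration, not the method used by the database. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in which start codons are permitted), and it assumes the input contains only A, C, G and T. The names `CODON_TABLE` and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; "*" marks a stop codon.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(seq: str) -> str:
    """Translate a coding sequence in frame 0 up to the first stop codon."""
    seq = "".join(seq.split()).upper()  # drop the block spacing and newlines
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]  # KeyError on non-ACGT input
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


print(translate("ATGAAACTAC GGGAACAG"))  # MKLREQ
print(translate("AACACGAAAT AG"))        # NTK
```

The two calls use the first eighteen bases and the last row of the gene sequence, and their output matches the start (MKLREQ) and end (NTK) of the protein sequence shown above.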