Gene Ppha_2225 details


Gene Information

Locus tag: Ppha_2225
Symbol:
ID: 6462037
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pelodictyon phaeoclathratiforme BU-1
Kingdom: Bacteria
Replicon accession: NC_011060
Strand:
Start bp: 2309576
End bp: 2311957
Gene Length: 2382 bp
Protein Length: 793 aa
Translation table: 11
GC content: 54%
IMG OID: 642728415
Product: Smr protein/MutS2
Protein accession: YP_002019039
Protein GI: 194337245
COG category: [L] Replication, recombination and repair
COG ID: [COG1193] Mismatch repair ATPase (MutS family)
TIGRFAM ID: [TIGR01069] MutS2 family protein
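
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the gene information fields of this record.
start_bp = 2309576
end_bp = 2311957
gene_length_bp = 2382
protein_length_aa = 793

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# An unspliced CDS should be a whole number of codons.
codons, remainder = divmod(span_bp, 3)

# Assumption: the last codon is a stop codon and does not encode a residue.
expected_protein_aa = codons - 1

print("span (bp):", span_bp)
print("codons:", codons, "remainder:", remainder)
print("expected protein length (aa):", expected_protein_aa)
```

Under these assumptions the span, the gene length, and the protein length agree with one another.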


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.57059
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAATGCCG TTAGCCTGAA AAAACTCGAA TTTGATAAAG TTGCCAACTA CGCAACGCAG 
TTCTGCCTCT CCGCGATGGG GCGCGACCGG CTTCTTGAGG CTGTGCCGCA GGTTGGTCGT
GAAGCGCTGG TAGCGGAGCT TGAACGGGTG CTTGAGTTAC GCAATCTGCT GCAGGAGGGG
AGTGCGCTCC CTTTTTCCTG GTTGCCGGAT ACGCGCCCGC TCCTGAAAAA ACTGGAAATT
CTTGAGAGTT ATCTTGAGCC GGAGGAGTTG CAGGATATTC ATCATCTGCT CTTTTCATCG
GTGCAGTTGC GCAAGTTCAT GTTCCTCAAC CGTGAGGTTT ATCCGCTGCT GAATGAGTTC
ACCATCAGGC TCTGGCTTGA GAAGAGTTTG CAGGCCTCTA TCCGACGGAT TATTGATGAG
CAGTCGAGGG TGCGTGATAC GGCCAGCGAG GCGCTTCTGA TGATCCGCCG TGAGCTGAGT
GGCAGCCGTG AGCTGATCCG TCGGAAGATG GAGCGGCTGC TCAGGCGTTG CCAGGAGAGT
GGCTGGCTGA TGGAGGATAC GATAGCCATC AAGAACGGGC GGCTGACGCT TGGCCTCAGG
GTGGAGTACA AGTACAAGAT AGCGGGCTAC ATCCAGGATT ATTCGGGGAG CGGGCAGACG
GTTTTTATTG AGCCTGCCGA AACGCTTGAG ATCAGTAACC GCATTCAGGA TCTGGAGATC
AGTGAGCGGA GGGAGATTGA GCGCATTCTG AAGGAGATGA CTGAAGAGCT GCGGCCCGAG
ATTGAAAACC TAAGGTATAA TGAAACTATC CTCGGTGAGT TTGATGCCTT GTATGCACGG
GCACGCTTTG CCGTTGAAAC GAACTCGGTG CTTCCCGGTA TTGCCAAAGG GCATTCGCTG
CGCATTGTGA AGGGGTTTCA TCCCTGGCTC CTGATTTCGC ATCATCAAAA AGAGGTTTTG
CCGCTCGATC TTGATCTGGA TGAGGATGAC CGGGTGCTTG TGATTTCAGG CCCCAATGCG
GGTGGAAAAT CGGTGGCGAT GAAGACGGCC GGTCTGCTCT GTTGTATGCT GGTGCACGGC
TATCTGCTGC CCTGCAGCGA GAGCTCGGTT TTTCCTCTTT TTGGCGATAT TTTTATTGAG
ATTGGCGACG ATCAGTCGAT TGAGAATGAC CTTTCCACCT TCAGTTCCCA TCTTGGCGCC
ATCAAAACCA TTCTTGATGT TGCCGGGAGC AGTGATCTGG TGCTGATTGA TGAGCTCTGC
GCCGGAACTG ATGTGGAGGA GGGTGGAGCC ATTGCCCGTG CGGTGATGGA GGAGCTGCTC
AATCGAGGCA CCAAAACCAT TGTGACCACC CATCTCGGCG ATTTGAAGGC CTATGCCCAT
GAGCGCGAGG GAGTGGTCAA CGGCGCGATG GAGTTCAACC GGGCAGGGCT GGTGCCTACC
TTCCGTTTTG TCAAGGGGTT GCCGGGCAAC AGCTTTGCCT TTGCAATGAT GAAGCGGATG
GGGTTTCCTG AAACCATGGT TGAGCGGGCA TCGGGATTTA TGCAGGATGA GCGTATCGGG
CTTGATCGGA TGCTTGATGA CCTGAGCCGC CTCTTTGAGG AGAATCGCCT GCTGAAGCAG
CAGCTTGAGG CAGAGCGGGC GGATCTTGCG GCACGCGAAC TCTCCCTGCG CACTGAGGAG
GCTCGAATGG AGCGGAAGCG CAGGGATCTG AAACTTGGCG CCTCAAGGGA ACTCCAGAAA
GAGGTGGAAT CTGCCAGAAA AGAGATCAAA GAGATTGTTC AGGAGGTCAA GAGCACACCA
ACTGACGCAA AAGCCGTGCA GGAGGCCAGA AAAAAACTTG GATTGAAAAA GCAGGAGGCT
GAAAAGAGTG AGCTTTCTCT TCGGGCTGAA GCCGAAATGG CAGTGCCTCT TGATCGGACT
ATCCGTGAGG GTGACCTGGT CAGGATTCTT GATACGAGTA CCTCCGGTGA GGTTGAGAGT
GTTAATCAGG AGAGTGTGGT GGTGCTGTGC GGTAATTTCA GGTTGACCAC ATCGCTCAAA
AATCTTGAGA AAACATCAAA AACTCAGGCT AAAAAAATTC ACAAGGAGCC TTTGCCCCGG
CAGCAAAAGG TCTCCTGGTC GGCTACGACT TCAGGTGTTG AATCAACAAA GCTTGATTTG
CGGGGGCTCA GTGGTGATGA GGCGATCATG AAAATTGACC GGTTTATTGA TACGATGCGC
CTTAACCGTA TTCATTCAGC GATGATCCTT CATGGCAAGG GAACCGGATC ACTCCGGCAG
AGGACGGCGG AGTTCCTGCA GCAGCACAGC GCGGTCAAGA GTTTTCGTCT TGGTGAGTGG
GGCGAAGGAG GGGCTGGAGT GACGGTGGTG GAGCTGGAGT AA
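
The gene sequence is printed in blocks of ten bases, so it needs to be joined before any computation. The sketch below shows one way to do that and to compute a GC fraction. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the sequence is used as input. The 54% figure in this record refers to the whole gene, so reproducing it would require pasting all sequence lines.

```python
def clean_sequence(text):
    """Remove the spaces and line breaks that group bases into blocks of ten."""
    return "".join(text.split()).upper()


def gc_fraction(seq):
    """Return the fraction of G and C bases in a nucleotide string."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First line of the gene sequence, copied as displayed in this record.
first_line = "ATGAATGCCG TTAGCCTGAA AAAACTCGAA TTTGATAAAG TTGCCAACTA CGCAACGCAG"

seq = clean_sequence(first_line)
print("bases:", len(seq))
print("GC fraction of this line:", round(gc_fraction(seq), 3))
```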
 
Protein sequence
MNAVSLKKLE FDKVANYATQ FCLSAMGRDR LLEAVPQVGR EALVAELERV LELRNLLQEG 
SALPFSWLPD TRPLLKKLEI LESYLEPEEL QDIHHLLFSS VQLRKFMFLN REVYPLLNEF
TIRLWLEKSL QASIRRIIDE QSRVRDTASE ALLMIRRELS GSRELIRRKM ERLLRRCQES
GWLMEDTIAI KNGRLTLGLR VEYKYKIAGY IQDYSGSGQT VFIEPAETLE ISNRIQDLEI
SERREIERIL KEMTEELRPE IENLRYNETI LGEFDALYAR ARFAVETNSV LPGIAKGHSL
RIVKGFHPWL LISHHQKEVL PLDLDLDEDD RVLVISGPNA GGKSVAMKTA GLLCCMLVHG
YLLPCSESSV FPLFGDIFIE IGDDQSIEND LSTFSSHLGA IKTILDVAGS SDLVLIDELC
AGTDVEEGGA IARAVMEELL NRGTKTIVTT HLGDLKAYAH EREGVVNGAM EFNRAGLVPT
FRFVKGLPGN SFAFAMMKRM GFPETMVERA SGFMQDERIG LDRMLDDLSR LFEENRLLKQ
QLEAERADLA ARELSLRTEE ARMERKRRDL KLGASRELQK EVESARKEIK EIVQEVKSTP
TDAKAVQEAR KKLGLKKQEA EKSELSLRAE AEMAVPLDRT IREGDLVRIL DTSTSGEVES
VNQESVVVLC GNFRLTTSLK NLEKTSKTQA KKIHKEPLPR QQKVSWSATT SGVESTKLDL
RGLSGDEAIM KIDRFIDTMR LNRIHSAMIL HGKGTGSLRQ RTAEFLQQHS AVKSFRLGEW
GEGGAGVTVV ELE
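
The protein sequence can be checked against the gene sequence by translating codons. The sketch below is a minimal illustration, not the database's own method. It builds the codon table from the amino acid string of the standard genetic code, on the assumption that translation table 11 assigns the same amino acids and differs only in which start codons are allowed. The function name `translate` and the two test fragments (the first line and the last four codons of the gene sequence) are chosen for illustration.

```python
from itertools import product

# Codons enumerated in T, C, A, G order, paired with the amino acid string of
# the standard genetic code. Assumption: table 11 uses the same assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence and its final four codons, as displayed.
first_dna_line = "ATGAATGCCG TTAGCCTGAA AAAACTCGAA TTTGATAAAG TTGCCAACTA CGCAACGCAG"
last_codons = "GAGCTGGAGTAA"

print(translate(first_dna_line))
print(translate(last_codons))
```

The two printed fragments can be compared by eye with the start and end of the protein sequence.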