Gene Phep_0656 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_0656 
Symbol 
ID8251744 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp763000 
End bp764814 
Gene Length1815 bp 
Protein Length604 aa 
Translation table11 
GC content43% 
IMG OID644934305 
ProductDNA mismatch repair protein MutS domain protein 
Protein accessionYP_003090940 
Protein GI255530568 
COG category[L] Replication, recombination and repair 
COG ID[COG0249] Mismatch repair ATPase (MutS family) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTAAAAA CAAAACATAG CATCCTTAAC GGTTATCAAA ATAAAGCTGC ACAACAACAA 
AAGCAGATTG ACGGGCTTAA ACGCAAATTA AATAACATCT CCTTTTCGAG GTTAGGGCTC
TTTATTGCCG AGATCCTTAT GGTAGCCCTG ATCATTAATT TTGGTTTCGA ATGGTTTTTT
GGGGTATTAC TTTTTGTGCC GCTGGTACTT TTTCTGGTCC TGGTCAAAAA ACAAACTACC
GTTCAAAAAG AGCTAGCTTA TACCAGGGCC TTGTTATGGG TTTATCAGAA CGAGATAAAC
CAGTTAAGCG ACGGTAAGAA CGGATATGAC AATGGTAACG CTTATGCTGA TGAATACCAT
CCGTACGCAT CCGATCTGGA TATTTTTGGA CAGGGTTCGC TGTATTCGTA TGTTAACCGT
TGCAATACCA ATGACGGACT AGACCTGCTG GCTGCCAATC TGAGCAGGGC AAACGATAAA
GCTACCATCC TGCAAAGACA GGAGGCCATA GCCGAATTGA TAAACCACAT CGCACAAACC
TTTCATTTCA GGGCCGAGTT ACAAGACCAT AAACCGGAAC AGCTGCGTGT GATAAAAAAT
AAACTACAGC ACGAATTGCC AGGGCAGCTT AAGTTTGCCC GCAACCGGAC CCTCAGGTTA
TATGTTAAAC TTGTTCCCTT TGTAACCATG GGTATGCTGG CCCTGGCTAT AGGGTATGGT
GGCTTGTTGT GGCAGTTTTT TGCCCTGGTC CTGCTTTTCA ATTCCGGCCT TACTTTTTTT
AACCTTGCTG CCATTAACCG GGTTTATAAT GGGTTTGGTA AAGGATCGGC CATGCTCAAT
GCATTTGCTG GTACCGTAAA ATGGACTGAG GATGTAAAAT GGAACAGCAC TTACATCAAA
GGCTTTTTTG ACAGCAGTAA ATCCGATCAG CCGGTGAGCG CACAGATCAG GAGCTTGTCG
GCCATTATCC AGGCTTTTGA TGCCAGACTA AACATCATTG TCAGCGCATT CCTGAACCTC
TTTTTACTAT GGGACCTGAA GTGTTCCATT AACCTGAGCA ACTGGCACGA CAAGTCGTCC
ATACAGTTGA TCAAAGGGAT GCTGCGGATC AGTCAGTTTG AAGAACTGAT CTCTTTTGCT
ACCTTAAGTT ATAACCAGCC CGACTGGAAC TTCCCTTTAA TTGAAGACAA TTTTCATTTT
AGCGCCAGCA AACTTGGTCA TCCGCTCATT CCTGAAAAGG TACGTGTACT AAACGATTTT
AATGTAACCG CTAATCCAAC TGTTGATATT GTAACAGGCT CTAATATGGC CGGAAAAAGT
ACTTTCCTGC GTACAGCGGG TATCAATATG GTACTTGCTT TTACCGGTGC AGCGGTTTGC
GCAGCCCAAA TGTCGGTATC CATTTTCAAT ATCCTGTCGT ATATGCGGAT CAAAGACTCG
TTAAACGACC AGACCTCTAC CTTTAAAGCA GAGCTGAACC GCTTAAAAAT GATCCTGGAT
GCGATACAGA CCAACCAGAA TTCTTTTGTG CTGATAGATG AAATGCTGAG GGGAACCAAT
AGCAAAGACA AATACCTCGG CTCCAAAGTG TTCATAGAAA AAATGATCGA ACAAAAGACC
CCTGCATTAT TTGCTACACA TGACCTGCAA CTGTCTGAGA TGGAGGAAGA CCATCCGGAA
AAGATCCGTA ATTATCATTT CGATATCCAG ATCTCGGAAG GAGAGATGAA CTTCGATTAT
AAGTTAAAAC ATGGGCCCTG TAAAACTTTT AATGCAGCTT TGCTGCTGAA ACAGATAGGC
TTAACGTTAA CTTAA
 
Protein sequence
MVKTKHSILN GYQNKAAQQQ KQIDGLKRKL NNISFSRLGL FIAEILMVAL IINFGFEWFF 
GVLLFVPLVL FLVLVKKQTT VQKELAYTRA LLWVYQNEIN QLSDGKNGYD NGNAYADEYH
PYASDLDIFG QGSLYSYVNR CNTNDGLDLL AANLSRANDK ATILQRQEAI AELINHIAQT
FHFRAELQDH KPEQLRVIKN KLQHELPGQL KFARNRTLRL YVKLVPFVTM GMLALAIGYG
GLLWQFFALV LLFNSGLTFF NLAAINRVYN GFGKGSAMLN AFAGTVKWTE DVKWNSTYIK
GFFDSSKSDQ PVSAQIRSLS AIIQAFDARL NIIVSAFLNL FLLWDLKCSI NLSNWHDKSS
IQLIKGMLRI SQFEELISFA TLSYNQPDWN FPLIEDNFHF SASKLGHPLI PEKVRVLNDF
NVTANPTVDI VTGSNMAGKS TFLRTAGINM VLAFTGAAVC AAQMSVSIFN ILSYMRIKDS
LNDQTSTFKA ELNRLKMILD AIQTNQNSFV LIDEMLRGTN SKDKYLGSKV FIEKMIEQKT
PALFATHDLQ LSEMEEDHPE KIRNYHFDIQ ISEGEMNFDY KLKHGPCKTF NAALLLKQIG
LTLT