Gene Information |
Locus tag | Phep_0656 |
Symbol | |
ID | 8251744 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pedobacter heparinus DSM 2366 |
Kingdom | Bacteria |
Replicon accession | NC_013061 |
Strand | + |
Start bp | 763000 |
End bp | 764814 |
Gene Length | 1815 bp |
Protein Length | 604 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 644934305 |
Product | DNA mismatch repair protein MutS domain protein |
Protein accession | YP_003090940 |
Protein GI | 255530568 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0249] Mismatch repair ATPase (MutS family) |
TIGRFAM ID | |
|
|
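The coordinate and length fields in the table above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming inclusive 1-based coordinates and a single untranslated terminal stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 763000
end_bp = 764814

# Assumption: coordinates are inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends in one stop codon that yields no residue.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)
```

Under these assumptions the results agree with the listed Gene Length (1815 bp) and Protein Length (604 aa), and the zero remainder indicates a whole number of codons.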
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 32 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTAAAAA CAAAACATAG CATCCTTAAC GGTTATCAAA ATAAAGCTGC ACAACAACAA AAGCAGATTG ACGGGCTTAA ACGCAAATTA AATAACATCT CCTTTTCGAG GTTAGGGCTC TTTATTGCCG AGATCCTTAT GGTAGCCCTG ATCATTAATT TTGGTTTCGA ATGGTTTTTT GGGGTATTAC TTTTTGTGCC GCTGGTACTT TTTCTGGTCC TGGTCAAAAA ACAAACTACC GTTCAAAAAG AGCTAGCTTA TACCAGGGCC TTGTTATGGG TTTATCAGAA CGAGATAAAC CAGTTAAGCG ACGGTAAGAA CGGATATGAC AATGGTAACG CTTATGCTGA TGAATACCAT CCGTACGCAT CCGATCTGGA TATTTTTGGA CAGGGTTCGC TGTATTCGTA TGTTAACCGT TGCAATACCA ATGACGGACT AGACCTGCTG GCTGCCAATC TGAGCAGGGC AAACGATAAA GCTACCATCC TGCAAAGACA GGAGGCCATA GCCGAATTGA TAAACCACAT CGCACAAACC TTTCATTTCA GGGCCGAGTT ACAAGACCAT AAACCGGAAC AGCTGCGTGT GATAAAAAAT AAACTACAGC ACGAATTGCC AGGGCAGCTT AAGTTTGCCC GCAACCGGAC CCTCAGGTTA TATGTTAAAC TTGTTCCCTT TGTAACCATG GGTATGCTGG CCCTGGCTAT AGGGTATGGT GGCTTGTTGT GGCAGTTTTT TGCCCTGGTC CTGCTTTTCA ATTCCGGCCT TACTTTTTTT AACCTTGCTG CCATTAACCG GGTTTATAAT GGGTTTGGTA AAGGATCGGC CATGCTCAAT GCATTTGCTG GTACCGTAAA ATGGACTGAG GATGTAAAAT GGAACAGCAC TTACATCAAA GGCTTTTTTG ACAGCAGTAA ATCCGATCAG CCGGTGAGCG CACAGATCAG GAGCTTGTCG GCCATTATCC AGGCTTTTGA TGCCAGACTA AACATCATTG TCAGCGCATT CCTGAACCTC TTTTTACTAT GGGACCTGAA GTGTTCCATT AACCTGAGCA ACTGGCACGA CAAGTCGTCC ATACAGTTGA TCAAAGGGAT GCTGCGGATC AGTCAGTTTG AAGAACTGAT CTCTTTTGCT ACCTTAAGTT ATAACCAGCC CGACTGGAAC TTCCCTTTAA TTGAAGACAA TTTTCATTTT AGCGCCAGCA AACTTGGTCA TCCGCTCATT CCTGAAAAGG TACGTGTACT AAACGATTTT AATGTAACCG CTAATCCAAC TGTTGATATT GTAACAGGCT CTAATATGGC CGGAAAAAGT ACTTTCCTGC GTACAGCGGG TATCAATATG GTACTTGCTT TTACCGGTGC AGCGGTTTGC GCAGCCCAAA TGTCGGTATC CATTTTCAAT ATCCTGTCGT ATATGCGGAT CAAAGACTCG TTAAACGACC AGACCTCTAC CTTTAAAGCA GAGCTGAACC GCTTAAAAAT GATCCTGGAT GCGATACAGA CCAACCAGAA TTCTTTTGTG CTGATAGATG AAATGCTGAG GGGAACCAAT AGCAAAGACA AATACCTCGG CTCCAAAGTG TTCATAGAAA AAATGATCGA ACAAAAGACC CCTGCATTAT TTGCTACACA TGACCTGCAA CTGTCTGAGA TGGAGGAAGA CCATCCGGAA AAGATCCGTA ATTATCATTT CGATATCCAG ATCTCGGAAG GAGAGATGAA CTTCGATTAT AAGTTAAAAC ATGGGCCCTG TAAAACTTTT AATGCAGCTT TGCTGCTGAA ACAGATAGGC TTAACGTTAA CTTAA
|
Protein sequence | MVKTKHSILN GYQNKAAQQQ KQIDGLKRKL NNISFSRLGL FIAEILMVAL IINFGFEWFF GVLLFVPLVL FLVLVKKQTT VQKELAYTRA LLWVYQNEIN QLSDGKNGYD NGNAYADEYH PYASDLDIFG QGSLYSYVNR CNTNDGLDLL AANLSRANDK ATILQRQEAI AELINHIAQT FHFRAELQDH KPEQLRVIKN KLQHELPGQL KFARNRTLRL YVKLVPFVTM GMLALAIGYG GLLWQFFALV LLFNSGLTFF NLAAINRVYN GFGKGSAMLN AFAGTVKWTE DVKWNSTYIK GFFDSSKSDQ PVSAQIRSLS AIIQAFDARL NIIVSAFLNL FLLWDLKCSI NLSNWHDKSS IQLIKGMLRI SQFEELISFA TLSYNQPDWN FPLIEDNFHF SASKLGHPLI PEKVRVLNDF NVTANPTVDI VTGSNMAGKS TFLRTAGINM VLAFTGAAVC AAQMSVSIFN ILSYMRIKDS LNDQTSTFKA ELNRLKMILD AIQTNQNSFV LIDEMLRGTN SKDKYLGSKV FIEKMIEQKT PALFATHDLQ LSEMEEDHPE KIRNYHFDIQ ISEGEMNFDY KLKHGPCKTF NAALLLKQIG LTLT
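The sequences above are displayed in space-separated groups of ten characters, so they need cleaning before any analysis. The following is a minimal sketch of that step; `clean_sequence` and `gc_fraction` are hypothetical helper names, and only the first three and last three groups of the gene sequence are used here as sample input, not the full record.

```python
def clean_sequence(raw: str) -> str:
    """Drop the display whitespace (spaces, line breaks) and uppercase."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C; 0.0 for an empty sequence."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# Sample input: first and last three groups of the gene sequence above.
head = clean_sequence("ATGGTAAAAA CAAAACATAG CATCCTTAAC")
tail = clean_sequence("ACAGATAGGC TTAACGTTAA CTTAA")

# Assumption: translation table 11 uses TAA, TAG and TGA as stop codons.
STOP_CODONS = {"TAA", "TAG", "TGA"}

starts_with_atg = head.startswith("ATG")
ends_with_stop = tail[-3:] in STOP_CODONS

print(len(head), gc_fraction(head), starts_with_atg, ends_with_stop)
```

Applying `gc_fraction` to the full cleaned gene sequence should give a value comparable to the GC content listed in the record, though that depends on how the source database rounds the figure.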