Gene Information |
Locus tag | M446_6533 |
Symbol | |
ID | 6133786 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium sp. 4-46 |
Kingdom | Bacteria |
Replicon accession | NC_010511 |
Strand | - |
Start bp | 7182683 |
End bp | 7185571 |
Gene Length | 2889 bp |
Protein Length | 962 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 641646621 |
Product | DNA mismatch repair protein MutS |
Protein accession | YP_001773224 |
Protein GI | 170744569 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0249] Mismatch repair ATPase (MutS family) |
TIGRFAM ID | [TIGR01070] DNA mismatch repair protein MutS |
|
|
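The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive positions and that the reported gene length includes the stop codon; the function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Protein length for an unspliced CDS whose length includes the stop codon (assumed)."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return length_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(7182683, 7185571)
print(length)                     # 2889
print(protein_length_aa(length))  # 962
```

Both results agree with the Gene Length (2889 bp) and Protein Length (962 aa) fields listed in the record.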
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.948742 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.317969 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATCCCCG GGGAGAACTC GGGCCTCGCC TGGAAGGACC GGGACCGTCT GCTAGAGTCC GGGCCGATGA CGATGGACAG CGATTTCGGC CGGCGCCCGA CCCGCGACGA GCCCCACGAG CCGGCGCAGG ACGCCCCGCC CCCGCGCGGG CGGCGCGGCG CGCCGCCGCC CGCCGAGGCG GTGGCCTCGC CCATGATGGC GCAGTACATC GAGATCAAGT CGGCCAATCC GGGCTTGCTC CTGTTCTACC GGATGGGGGA TTTCTACGAG CTGTTCTTCG AGGACGCGGA GATCGCCTCG CGGGCGCTCG GGATCGTGCT GACCCGGCGC GGCAAGCACG CGGGGGCGGA CATCCCGATG TGCGGGGTGC CGATTGACCG GGCGGACGAT TACCTGCAGC GGCTGATCGC GCTCGGCCAC CGGGTCGCGG TCTGCGAGCA GACCGAGGAC CCTGCCGAGG CGAAGAAGCG CGGCGCGAAA TCGGTGGTGC GGCGCGAGGT CGTGCGCCTC GTCACCCCCG GCACGATCAC CGAGGAGCGG CTGCTCGACC CGGCCCGCGC CAACCTGCTG GTGGCGCTGG CCCGGCGGCG CGCCTCCGAG ACCGGCTGGA CCTACGGCAT CGCGGCGGTC GACATCTCGA CCGGCCGCTT CACCCTCTCG GAGGTGGACG GCGCCGGGCT CGCCGCCGAA CTCGCCCGGC TCGACCCCCG CGAGATCGTG GTGGCGGAGG CGATCCACGC CGATCCCGAC CTCGCGCGGC TGTGGCGCGA CACCTCCGCC TCGGTGACGC CCCTCGGGCG CGGCGAGGCC GACCCGGCCT CGGCCGAGCG GGCGCTCAGG GAGCAGTTCG GCGTCGCCAC CCTGGACGGG TTCGGCGCCT TCGGCCGCGC CGAGGTGGCG GCGGCCGGGA CCGTGCTGCA CTACATCGCC CGCACGCAGC TCGGCGCCAA GGTGCCGCTC GGCCCGCCCG CGCGCCACGC CGCGGGCGGC ACCCTGCTGA TCGACGCGGC GACCCGGGCC AATCTCGAAC TCACCCGCAC GCTCTCGGGC GAGCGCGCCG GCAGCCTGCT GGCGGCCATC GACCGCACGG TCGGGGCGGC CGGCGCCCGG CTCCTCGCCG AGCGCCTCGC CAGCCCCTCG ACCGACCTCG CGCTGATCCG GCGCCGCCAG GACGCGGTGG CCTTCCTGGT CGCCGAGGGC GGCCTGCGCG CCGAATTGCG CGCCGACCTC GCCCGGGCGC CCGACATGGC GCGGGCGCTG TCCCGGGTCG GGGTCGGGCG GGCGGGGCCG CGCGACCTCG CCGCCCTGCG CGACGGGCTC GACGCGGCGC GCAGCATCGC GACGCGCCTC GCCGGGGCGG GCGCGCTGCC GGCCGAGATC GGCAAGGCGG CGCGGCTCCT CGCCAGCGTC GGCGACGGGC TCGTCGAGAC CCTGCAGGGG GCGCTCGCCG ACGACTTGCC CCTGTCCAAG CGCGACGGCA ACTTCGTGCG CGAGGGGCAC TCGCCCGAAC TCGACGAGGC GCGGGCGCTG CAGAGCGATT CCCGCCGCTT CGTCGCCGGG CTGCAGACCC GCTACGCCGA GGAGACCGGC TGCCGCACCC TGCGCATCAA GCACAACAAC CTGCTGGGCT TCTTCATCGA GGTGCCGCAG GCGGTCGGCG AGACCCTGCT CAAGGATCCC TGGCGCGGCA CCTTCGTGCA CCGCCAGACC ATGGTCGACG CGATGCGCTT CACCAGCGTG GAGCTCGGGG AGCTGGAATC GCGGATCGCC AACGCGGCGG GCCGGGGGCT GGCCCTGGAA 
CTCGCGGCCT TCGAGGCGCT CGCGGCCGCC GTGATGGGGC GGGCGGAGGC GATCACCGCC GCGGCGACCG CGCTCGCGGC GCTGGACGTG GCGGCCTCGC AGGCGGAGCT CGCGGTCGAA CTCGGCTGGG TCCGGCCCGT CCTCGACGAG AGCCTGACCT TCCGGGTCGA GGGCGCCCGC CACCCGGTGG TGGAGGCGGC CCTGCGCCAG GCGGGCGAAC CCTTCATCGC CAATTCCTGC GATCTGTCGG GGGAGAGGGC CGGCCCCGGC CGCGAGGCCG GCTCCGGCCG CGAGGCCGGC TCCGGCCGGG AGGCCGGCAG GATCCTGGTC GTCACCGGCC CGAACATGGG CGGCAAATCG ACCTTCCTGC GCCAGAACGC GCTGATCGCG GTGCTGGCCC AGATGGGGGC CTTCGTGCCG GCGCGCGCGG CCCATCTCGG GCTGGTGGAC CGGCTGTTCT CGCGGGTCGG CGCGGCCGAC GACCTCGCGC GGGGCCACTC GACCTTCATG GTCGAGATGG TGGAGACCGC CGCGATCCTG AATCAGGCGA CGCGGCGCTC CCTCGTGGTT CTCGACGAGA TCGGGCGCGG CACCGCGACC TTCGACGGGC TGTCGATCGC CTGGGCCTGC CTGGAGCACC TGCACGAGGT GACGGGCTGC CGGGCGCTGT TCGCGACCCA TTTCCACGAG CTCACGGGCC TCGCCAAGCG CCTGGAGCGG CTGTCGAACG CGACCCTCAA GGTCACCGAG TGGGAGGGCG ACGTGGTCTT CCTGCACGAG GTGGTGCCGG GTGCCGCGGA CCGCTCCTAC GGCCTGCAGG TGGCGCGGCT CGCGGGGCTG CCAGCCTCGG TGATCGCCCG CGCCAAGGTC ATCCTGGCCG ACCTGGAGAA GGGGGAGGGT GGGCGCGGGC GGCGGCCCCC GGCCGAGCTG CCGCTGTTTT CCGCGCTGCC GCCGGCACCG CCCGCGCCGC CCCCTGCCCC GAAGGCCGAT CCCCTGCGCG ACCTCCTCGA CAGCCTGGAT CCGGACGGGC TCACCCCGCG CGAAGCGCTC GACGCCCTCT ACCGGCTGAA GGCGGCGCGG AAGGCGTGA
|
Protein sequence | MIPGENSGLA WKDRDRLLES GPMTMDSDFG RRPTRDEPHE PAQDAPPPRG RRGAPPPAEA VASPMMAQYI EIKSANPGLL LFYRMGDFYE LFFEDAEIAS RALGIVLTRR GKHAGADIPM CGVPIDRADD YLQRLIALGH RVAVCEQTED PAEAKKRGAK SVVRREVVRL VTPGTITEER LLDPARANLL VALARRRASE TGWTYGIAAV DISTGRFTLS EVDGAGLAAE LARLDPREIV VAEAIHADPD LARLWRDTSA SVTPLGRGEA DPASAERALR EQFGVATLDG FGAFGRAEVA AAGTVLHYIA RTQLGAKVPL GPPARHAAGG TLLIDAATRA NLELTRTLSG ERAGSLLAAI DRTVGAAGAR LLAERLASPS TDLALIRRRQ DAVAFLVAEG GLRAELRADL ARAPDMARAL SRVGVGRAGP RDLAALRDGL DAARSIATRL AGAGALPAEI GKAARLLASV GDGLVETLQG ALADDLPLSK RDGNFVREGH SPELDEARAL QSDSRRFVAG LQTRYAEETG CRTLRIKHNN LLGFFIEVPQ AVGETLLKDP WRGTFVHRQT MVDAMRFTSV ELGELESRIA NAAGRGLALE LAAFEALAAA VMGRAEAITA AATALAALDV AASQAELAVE LGWVRPVLDE SLTFRVEGAR HPVVEAALRQ AGEPFIANSC DLSGERAGPG REAGSGREAG SGREAGRILV VTGPNMGGKS TFLRQNALIA VLAQMGAFVP ARAAHLGLVD RLFSRVGAAD DLARGHSTFM VEMVETAAIL NQATRRSLVV LDEIGRGTAT FDGLSIAWAC LEHLHEVTGC RALFATHFHE LTGLAKRLER LSNATLKVTE WEGDVVFLHE VVPGAADRSY GLQVARLAGL PASVIARAKV ILADLEKGEG GRGRRPPAEL PLFSALPPAP PAPPPAPKAD PLRDLLDSLD PDGLTPREAL DALYRLKAAR KA
|
| |