Gene M446_6533 details

Gene Information

Locus tag: M446_6533
Symbol:
ID: 6133786
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium sp. 4-46
Kingdom: Bacteria
Replicon accession: NC_010511
Strand:
Start bp: 7182683
End bp: 7185571
Gene Length: 2889 bp
Protein Length: 962 aa
Translation table: 11
GC content: 75%
IMG OID: 641646621
Product: DNA mismatch repair protein MutS
Protein accession: YP_001773224
Protein GI: 170744569
COG category: [L] Replication, recombination and repair
COG ID: [COG0249] Mismatch repair ATPase (MutS family)
TIGRFAM ID: [TIGR01070] DNA mismatch repair protein MutS
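
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the record above.
length = gene_length(7182683, 7185571)
print(length, protein_length(length))  # prints: 2889 962
```

Both results agree with the Gene Length and Protein Length fields of the record.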


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.948742
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.317969
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATCCCCG GGGAGAACTC GGGCCTCGCC TGGAAGGACC GGGACCGTCT GCTAGAGTCC 
GGGCCGATGA CGATGGACAG CGATTTCGGC CGGCGCCCGA CCCGCGACGA GCCCCACGAG
CCGGCGCAGG ACGCCCCGCC CCCGCGCGGG CGGCGCGGCG CGCCGCCGCC CGCCGAGGCG
GTGGCCTCGC CCATGATGGC GCAGTACATC GAGATCAAGT CGGCCAATCC GGGCTTGCTC
CTGTTCTACC GGATGGGGGA TTTCTACGAG CTGTTCTTCG AGGACGCGGA GATCGCCTCG
CGGGCGCTCG GGATCGTGCT GACCCGGCGC GGCAAGCACG CGGGGGCGGA CATCCCGATG
TGCGGGGTGC CGATTGACCG GGCGGACGAT TACCTGCAGC GGCTGATCGC GCTCGGCCAC
CGGGTCGCGG TCTGCGAGCA GACCGAGGAC CCTGCCGAGG CGAAGAAGCG CGGCGCGAAA
TCGGTGGTGC GGCGCGAGGT CGTGCGCCTC GTCACCCCCG GCACGATCAC CGAGGAGCGG
CTGCTCGACC CGGCCCGCGC CAACCTGCTG GTGGCGCTGG CCCGGCGGCG CGCCTCCGAG
ACCGGCTGGA CCTACGGCAT CGCGGCGGTC GACATCTCGA CCGGCCGCTT CACCCTCTCG
GAGGTGGACG GCGCCGGGCT CGCCGCCGAA CTCGCCCGGC TCGACCCCCG CGAGATCGTG
GTGGCGGAGG CGATCCACGC CGATCCCGAC CTCGCGCGGC TGTGGCGCGA CACCTCCGCC
TCGGTGACGC CCCTCGGGCG CGGCGAGGCC GACCCGGCCT CGGCCGAGCG GGCGCTCAGG
GAGCAGTTCG GCGTCGCCAC CCTGGACGGG TTCGGCGCCT TCGGCCGCGC CGAGGTGGCG
GCGGCCGGGA CCGTGCTGCA CTACATCGCC CGCACGCAGC TCGGCGCCAA GGTGCCGCTC
GGCCCGCCCG CGCGCCACGC CGCGGGCGGC ACCCTGCTGA TCGACGCGGC GACCCGGGCC
AATCTCGAAC TCACCCGCAC GCTCTCGGGC GAGCGCGCCG GCAGCCTGCT GGCGGCCATC
GACCGCACGG TCGGGGCGGC CGGCGCCCGG CTCCTCGCCG AGCGCCTCGC CAGCCCCTCG
ACCGACCTCG CGCTGATCCG GCGCCGCCAG GACGCGGTGG CCTTCCTGGT CGCCGAGGGC
GGCCTGCGCG CCGAATTGCG CGCCGACCTC GCCCGGGCGC CCGACATGGC GCGGGCGCTG
TCCCGGGTCG GGGTCGGGCG GGCGGGGCCG CGCGACCTCG CCGCCCTGCG CGACGGGCTC
GACGCGGCGC GCAGCATCGC GACGCGCCTC GCCGGGGCGG GCGCGCTGCC GGCCGAGATC
GGCAAGGCGG CGCGGCTCCT CGCCAGCGTC GGCGACGGGC TCGTCGAGAC CCTGCAGGGG
GCGCTCGCCG ACGACTTGCC CCTGTCCAAG CGCGACGGCA ACTTCGTGCG CGAGGGGCAC
TCGCCCGAAC TCGACGAGGC GCGGGCGCTG CAGAGCGATT CCCGCCGCTT CGTCGCCGGG
CTGCAGACCC GCTACGCCGA GGAGACCGGC TGCCGCACCC TGCGCATCAA GCACAACAAC
CTGCTGGGCT TCTTCATCGA GGTGCCGCAG GCGGTCGGCG AGACCCTGCT CAAGGATCCC
TGGCGCGGCA CCTTCGTGCA CCGCCAGACC ATGGTCGACG CGATGCGCTT CACCAGCGTG
GAGCTCGGGG AGCTGGAATC GCGGATCGCC AACGCGGCGG GCCGGGGGCT GGCCCTGGAA
CTCGCGGCCT TCGAGGCGCT CGCGGCCGCC GTGATGGGGC GGGCGGAGGC GATCACCGCC
GCGGCGACCG CGCTCGCGGC GCTGGACGTG GCGGCCTCGC AGGCGGAGCT CGCGGTCGAA
CTCGGCTGGG TCCGGCCCGT CCTCGACGAG AGCCTGACCT TCCGGGTCGA GGGCGCCCGC
CACCCGGTGG TGGAGGCGGC CCTGCGCCAG GCGGGCGAAC CCTTCATCGC CAATTCCTGC
GATCTGTCGG GGGAGAGGGC CGGCCCCGGC CGCGAGGCCG GCTCCGGCCG CGAGGCCGGC
TCCGGCCGGG AGGCCGGCAG GATCCTGGTC GTCACCGGCC CGAACATGGG CGGCAAATCG
ACCTTCCTGC GCCAGAACGC GCTGATCGCG GTGCTGGCCC AGATGGGGGC CTTCGTGCCG
GCGCGCGCGG CCCATCTCGG GCTGGTGGAC CGGCTGTTCT CGCGGGTCGG CGCGGCCGAC
GACCTCGCGC GGGGCCACTC GACCTTCATG GTCGAGATGG TGGAGACCGC CGCGATCCTG
AATCAGGCGA CGCGGCGCTC CCTCGTGGTT CTCGACGAGA TCGGGCGCGG CACCGCGACC
TTCGACGGGC TGTCGATCGC CTGGGCCTGC CTGGAGCACC TGCACGAGGT GACGGGCTGC
CGGGCGCTGT TCGCGACCCA TTTCCACGAG CTCACGGGCC TCGCCAAGCG CCTGGAGCGG
CTGTCGAACG CGACCCTCAA GGTCACCGAG TGGGAGGGCG ACGTGGTCTT CCTGCACGAG
GTGGTGCCGG GTGCCGCGGA CCGCTCCTAC GGCCTGCAGG TGGCGCGGCT CGCGGGGCTG
CCAGCCTCGG TGATCGCCCG CGCCAAGGTC ATCCTGGCCG ACCTGGAGAA GGGGGAGGGT
GGGCGCGGGC GGCGGCCCCC GGCCGAGCTG CCGCTGTTTT CCGCGCTGCC GCCGGCACCG
CCCGCGCCGC CCCCTGCCCC GAAGGCCGAT CCCCTGCGCG ACCTCCTCGA CAGCCTGGAT
CCGGACGGGC TCACCCCGCG CGAAGCGCTC GACGCCCTCT ACCGGCTGAA GGCGGCGCGG
AAGGCGTGA
 
Protein sequence
MIPGENSGLA WKDRDRLLES GPMTMDSDFG RRPTRDEPHE PAQDAPPPRG RRGAPPPAEA 
VASPMMAQYI EIKSANPGLL LFYRMGDFYE LFFEDAEIAS RALGIVLTRR GKHAGADIPM
CGVPIDRADD YLQRLIALGH RVAVCEQTED PAEAKKRGAK SVVRREVVRL VTPGTITEER
LLDPARANLL VALARRRASE TGWTYGIAAV DISTGRFTLS EVDGAGLAAE LARLDPREIV
VAEAIHADPD LARLWRDTSA SVTPLGRGEA DPASAERALR EQFGVATLDG FGAFGRAEVA
AAGTVLHYIA RTQLGAKVPL GPPARHAAGG TLLIDAATRA NLELTRTLSG ERAGSLLAAI
DRTVGAAGAR LLAERLASPS TDLALIRRRQ DAVAFLVAEG GLRAELRADL ARAPDMARAL
SRVGVGRAGP RDLAALRDGL DAARSIATRL AGAGALPAEI GKAARLLASV GDGLVETLQG
ALADDLPLSK RDGNFVREGH SPELDEARAL QSDSRRFVAG LQTRYAEETG CRTLRIKHNN
LLGFFIEVPQ AVGETLLKDP WRGTFVHRQT MVDAMRFTSV ELGELESRIA NAAGRGLALE
LAAFEALAAA VMGRAEAITA AATALAALDV AASQAELAVE LGWVRPVLDE SLTFRVEGAR
HPVVEAALRQ AGEPFIANSC DLSGERAGPG REAGSGREAG SGREAGRILV VTGPNMGGKS
TFLRQNALIA VLAQMGAFVP ARAAHLGLVD RLFSRVGAAD DLARGHSTFM VEMVETAAIL
NQATRRSLVV LDEIGRGTAT FDGLSIAWAC LEHLHEVTGC RALFATHFHE LTGLAKRLER
LSNATLKVTE WEGDVVFLHE VVPGAADRSY GLQVARLAGL PASVIARAKV ILADLEKGEG
GRGRRPPAEL PLFSALPPAP PAPPPAPKAD PLRDLLDSLD PDGLTPREAL DALYRLKAAR
KA
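
The two sequence blocks above are printed in groups of ten characters, so they need to be cleaned before any computation. The following is a minimal sketch, assuming plain Python with no external libraries, of how the gene sequence could be cleaned, its GC fraction computed, and its translation compared with the protein sequence. The names `clean_sequence`, `gc_fraction`, and `translate` are hypothetical helpers. The codon table is the standard one, whose amino acid assignments translation table 11 shares; the alternative start codons that table 11 permits are not handled here, which is harmless for this gene because it starts with ATG.

```python
# Standard codon table built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a grouped sequence block."""
    return "".join(block.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence from the record above.
first_line = "ATGATCCCCG GGGAGAACTC GGGCCTCGCC TGGAAGGACC GGGACCGTCT GCTAGAGTCC"
print(translate(clean_sequence(first_line)))  # prints: MIPGENSGLAWKDRDRLLES
```

The translated fragment matches the first twenty residues of the protein sequence. Applying `gc_fraction` to the full cleaned gene sequence would give the value that the record reports rounded as GC content.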