Gene Hoch_3422 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_3422 
Symbol 
ID8545810 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp4723176 
End bp4725818 
Gene Length2643 bp 
Protein Length880 aa 
Translation table11 
GC content71% 
IMG OID646388089 
ProductDNA mismatch repair protein MutS 
Protein accessionYP_003267817 
Protein GI262196608 
COG category[L] Replication, recombination and repair 
COG ID[COG0249] Mismatch repair ATPase (MutS family) 
TIGRFAM ID[TIGR01070] DNA mismatch repair protein MutS 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0998408 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.0667766 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAAAGC GCACGCCGCG AGACACGCCG ATGATGCGGC AGTACATGGA GATCAAGGAC 
CAGCATCCCG ACGCGGTGCT GTTCTTCCGC CTCGGCGATT TCTACGAAAT GTTCTACGAG
GACGCCGAGG TAGCGGCCAA GGCCCTCGAT CTCACGCTCA CCTGTCGCCA CAAGGACAGC
GCCAACCCCG TGCCCATGGC CGGCGTGCCG CATCACGCCG CGCGCGGCTA CATCGCCCGG
CTCACCGAAC AGGGCTACAA GGTCGTGGTC TGCGAGCAGG TCGAGGACCC CAAGCTGGTC
AAGGGCATCG TCAAGCGCGC GGTGGCCCAG GTCATCACGC CCGGCGTGGT GCTCGACGAG
GAGGTCCTCG ACCCCAAGCA GCCGCGCTAT CTGGCCGCGA TCGCGTGTGA GGGCGGGCGC
TACGGCCTGT CCTTTCTCGA CGTCTCCACC GGCGAGTTCC GGGCCACCGA GCTCGACAGC
GAGGACGGCT TGCTCGACGA GCTGGCCCGG GTGCGGCCGC GCGAGATCCT GGCCGGCGCG
CGCAACCTGG CCGAGGGCGG TCCGCTCACG GCGACCCAGC GCGACTTCAA CCAGGTCACC
TATTCGCCGG TCGAGCCCCA CACCTGGGGC CAGGCCAAGA CGCTGTTGGT CTCGCTGCTC
GCGGGCGACA GCGCCAGCCT CGGCCTCGAG GAGCGCATCC TGGCCTCGCG CGCGGCCGCC
GACGTCATCG GCTACGCGCG CAGCACGCAG CCCACCGGGG TGTTGCCCGT GAGCCGCTTG
CAGCTCTACG AGCCCGGCGA CACCATGATG CTCGACGAGG CGGCCATCGC CAACCTGGAG
CTGACCGAGA CGCTCATCGG CGGCCGGCGC GCGGGCACGC TGCTGTCGGT CATCGACGAG
ACCTGCACCG CCCCGGGCGG TCGTCTGCTG CGCCACTGGC TGCTGTATCC GCTCAGCGAG
GTGGCCCCGA TCCGGCGGCG CCAGGACGCG GTCGGCTATT TTGTCGAGCA CGCCAGCCTG
CGCCGTTCGG TGCGCGAGGT GCTCGAGGGC GTGCACGATC TCGAGCGTCT GGCCGCGCGC
GTCGGCCTCG GCGTGGCCAC GCCCCGCGAC CTGGGCCGGC TGCGCGACTC GCTGGTGCAG
CTCCCGTCGC TGTCGGCGCT CTTGGCCAGC CCGGTGGTCC AGGCCGGGGG CGAGAGCCCG
CTCGACGCGG TGCCCGCGCT GCTGCGTTTC AACAACGCCA TCCTCGGCGA GCTGGCCGAG
CTGCAGGGCT TGCTGTCGCG CGCGCTGGTC GACGAGCCCG GGCCGCTGGC GCGCGAGGGC
GGCTTCATCC GCGCCGGCTA CTGCCGCATC GTCGACGAGA GCCGCAGCCT GGCCGACGGC
AGCAAAGAGG CCATCTTGGC CATCGAGACC CGCGAGCGCG AGCGCACCGG CATCGGCTCG
CTCAAGGTCA AGTACAACCG CGTCTTCGGC TATTACATCG AGGTCACCAA GGCCAACCTG
GCCCGGGTCC CGGCCGAGTA TGTGCGCAAG CAGACCATCG CCACGGGCGA GCGCTACGTG
ACCACCGAGC TGGCCGAGCT CGAGACCAAA GTGCTGGGCG CCCAGGAGCG CTTGCTCGAG
CGCGAGCAGG AGCTGTTCGC CGAGCTGTGC GCGGCCGTGG GCGAGCGCGC CGGCGGGCTG
CGCGACGTCG GCGAGCGGGT CGCCGGCATC GATTGTCTGG CCAACCTGGC CGAGATCGCG
CACGTGCGCG ACTACCGCTG CCCCGAGGTC GACGACGGCG AGCGCCTCGA GATCGTCGAG
GGTCGTCACC CCGTGGTCGA GCGCACCGTG CCGACCGGGC GTTTCGTGCC CAACGACTGC
CAGCTCGACC CGCGCGAGGC GCAGATCCTG CTCATCACCG GGCCCAACAT GGCCGGCAAG
TCGACCTACA TGCGGCAGGT GGCCCAGATC GTCGTGCTCG CGCAGATGGG CAGCTTCGTG
CCGGCCAAGC GGGCGCGCGT GGGCATTGTC GACCGCGTGT ACACGCGCGT GGGGGCGGCC
GACAACCTGG CCCGCGGCGA GTCCACCTTC ATGGTCGAGA TGCGCGAGAC CTCGGCCATC
ATGCGCGGGG CCACGCGGCG TTCGCTGGTG GTGCTCGACG AGGTCGGCCG CGGCACCTCG
ACCTTCGACG GCGTGTCCAT CGCGTGGGCG GTGACCGAGT ATCTGCACGA CGCCATCGGC
GCGCGCACCT TGTTTGCCAC CCACTATCAC GAGCTGTGCG CGCTGTCCGA GGTGCGGCCG
CGGGTGCGCA ACGTGTCGAT GGCGATTCGC GAGCACGAGG GCGACATCGT GTTCCTGCGC
CAGGTGGTCG CCGGCGGCGC CAGCAAGAGT TACGGCATCG AGGTCGCGCG GCTGGCCGGG
CTGCCGCGAT CCTTGGTGTC GCGGGCGCGG CAGATCTTGG CGCAGCTCGA GGGCGGGCGC
GAGTGGCACC AGCCCAGCCA GCTCACCCTG TTCTCGGCCG CGCAGTCGGC GCCGCCCGAC
GAGCCCGAGG GCGAGGCCGG AGGCGCCATC GTCGAGCGCC TGCGCGGCCT CGACCCGCAG
CGCATGACGC CCATCGAGGC GCTGCAAGTG CTCGCCGAGC TGTGCTCGGA AGCCGGGCGC
TAG
 
Protein sequence
MAKRTPRDTP MMRQYMEIKD QHPDAVLFFR LGDFYEMFYE DAEVAAKALD LTLTCRHKDS 
ANPVPMAGVP HHAARGYIAR LTEQGYKVVV CEQVEDPKLV KGIVKRAVAQ VITPGVVLDE
EVLDPKQPRY LAAIACEGGR YGLSFLDVST GEFRATELDS EDGLLDELAR VRPREILAGA
RNLAEGGPLT ATQRDFNQVT YSPVEPHTWG QAKTLLVSLL AGDSASLGLE ERILASRAAA
DVIGYARSTQ PTGVLPVSRL QLYEPGDTMM LDEAAIANLE LTETLIGGRR AGTLLSVIDE
TCTAPGGRLL RHWLLYPLSE VAPIRRRQDA VGYFVEHASL RRSVREVLEG VHDLERLAAR
VGLGVATPRD LGRLRDSLVQ LPSLSALLAS PVVQAGGESP LDAVPALLRF NNAILGELAE
LQGLLSRALV DEPGPLAREG GFIRAGYCRI VDESRSLADG SKEAILAIET RERERTGIGS
LKVKYNRVFG YYIEVTKANL ARVPAEYVRK QTIATGERYV TTELAELETK VLGAQERLLE
REQELFAELC AAVGERAGGL RDVGERVAGI DCLANLAEIA HVRDYRCPEV DDGERLEIVE
GRHPVVERTV PTGRFVPNDC QLDPREAQIL LITGPNMAGK STYMRQVAQI VVLAQMGSFV
PAKRARVGIV DRVYTRVGAA DNLARGESTF MVEMRETSAI MRGATRRSLV VLDEVGRGTS
TFDGVSIAWA VTEYLHDAIG ARTLFATHYH ELCALSEVRP RVRNVSMAIR EHEGDIVFLR
QVVAGGASKS YGIEVARLAG LPRSLVSRAR QILAQLEGGR EWHQPSQLTL FSAAQSAPPD
EPEGEAGGAI VERLRGLDPQ RMTPIEALQV LAELCSEAGR