Gene Hoch_3217 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_3217 
Symbol 
ID8545605 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp4433225 
End bp4435513 
Gene Length2289 bp 
Protein Length762 aa 
Translation table11 
GC content74% 
IMG OID646387884 
ProductDNA mismatch repair protein MutL 
Protein accessionYP_003267612 
Protein GI262196403 
COG category[L] Replication, recombination and repair 
COG ID[COG0323] DNA mismatch repair enzyme (predicted ATPase) 
TIGRFAM ID[TIGR00585] DNA mismatch repair protein MutL 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.0687703 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACGAGC ACCGCCAAGC CAGCGCAGAC GCCGAGTCGT CGGAGAACCC ACCACGGCCC 
GAGATCCGCG TTCTGCCCGA CACCGTGGTC GACCAGATCG CGGCCGGCGA GGTGGTCGAG
CGGCCGGCCT CGGTGGTCAA GGAGCTGGTC GAGAACGCGC TCGACGCCCA CGCCACGCAC
GTCAACGTCG AGGTCGAGGC CGGCGGCAAG CAGCTCATCC GCGTGCTCGA CAACGGCATC
GGCATGACCG AGAGCGACGT GCGCTTGGCG CTCACGCGTC ACGCCACCAG CAAGCTGCGC
GCCCTCGACG ACCTCTACGG CCTGGGCACC ATGGGCTTCC GCGGCGAGGC GCTGCCGTCG
ATCGCGGCGG TCTCGCGCAT GAGCATCACC ACGCGGACGC GCGGTCAGGT CGCGGGCACC
AAGCTCGACA TCGAGGGCGG CCGCATCACG CAGATCAGCG AGGTGGGTGC GCCCGTGGGC
ACGCACATCG AGATCCGCGA CCTGCTGTTC AACGTGCCCG CGCGGCTCAA GTTCCTCAAG
GGCAACGCCA CCGAGGCCTC GCACGTGACC GATTCGGTGG CCAAGCTGGC CATGGTGCAC
CCCCAGGTGC ACGTGCGTCT GCGCCACGGC GGCCGGGTGG CGCTCGAGGC GCCGCAGCAC
AGCAGCGGGC TCGAGCGCGC GCGGGCGATT CTGGGCTCGC GTCTGGGGCG CGAGCTGCAC
GAGGTGAGCG GCGCCGAGAA CGGCGTGCGC GTCACCGCCT ATCTGGCCGC GCCCGATCTC
GCGCAGAGCA CCTCGCGTTC GACCTATCTG TTCGTCGGCA AGCGCGCGGT CAAAGACCGC
GGCCTGCTGC ACGCGGTGTC GATGGGCTAC GGCGAGCTGG TGCCCAAGGG GCGCTTCCCC
GTGGCCGTGC TGTGCCTCGA GGTGCCCGGC GGTGAGGTCG ACGTCAACGT CCACCCGCAG
AAGCTCGAGG TCCGCTTCTC CGACGGCCCC GCGGTGTTCG CGGCGGTGCG CCACGTGCTC
CGCCGCGGCG TGGCCGAGGC GCCGTGGCTG AGCGAGCAGC AGTCCGGCTC GCCCGTGCGC
ATGCGCGCCA AGGCGCACGT GTCGCCGCCG CGCGAGTCCG CGGGCGGCGG CCGAGCCTCG
CGTCTGGCCG AGCGTCAGGC CGCCGGCGCC GCGCGCATGC TCTTGCCCTT TGGCCGCGAC
GCCGCGCAGC CGGGCGCGAG CTGGCAGCCG CCGGCCCGCG ACGAGCGCGG CGGAGGGAAT
CCGCCCGGCG CTGCGGCCCC ACCGTCGGCC CAGGCGCCAT CCCGATCGCC GGACAGCCCG
GCGGACGAGC CCGGCGCCGC AGTTCGCGAT GGCGGCGCCG CGACCTCGAC GCGGGCCCGG
GAGGACGCCG GCGACAGCGG TGAGCGAGGC GGCGGACGCG GCCTGTCGCG CACGGCCTTT
CCGCTGGCGC CGCGCGAGTG GCCGGAGACC CCGCCCGGGC ATGGCAGCAG CCGCGCGTCC
GAGCCCGCGC CCTGGCCGTA TGCGGGCGAT GCGCCGCCAT CGCCGTCATC GCCGGACGAG
TCCGCGAGCG ACGACAGCCT CGCCCGCCAG CCGCTCGATG CCGGGCGCGC TGCGGCGGGC
GAGGGCGGGG CGGTTCCGTA CGCTTCGCCG GCCCCGGCGG CGGGAGCCGA GGCCGCGCCA
GCGGCGCACG TGCCCCGCGA TCCCTCGCGC TTCTTCACCG AGCTGAGCTA CATCGGCCAG
CTCGACCGGA CGTACCTGGT GTGCGAATCC AACGGCGAGA TGGTGCTGGT CGACCAGCAC
GCGGCCCACG AGCGCGTCGC CTTTCAACGC CTGCGCGATC GCTGGGCCCA GCACGCCGTG
CCCGTGCAGC GCCTGCTGCT GCCCAAGACC TTCGACCTGA GCCCCGAGCA GGCCGCCGTG
GCCGAAGACG CGCGCGCGAC CCTGCACGAC ATGGGCTTCG AGCTCGAGCA CTTCGGCGGC
ACCACCTACG CGCTCAAAGC GCTGCCCGCC GGTCTGCGCG AGTCAGACGT CGAGACCGTG
CTGCACGAGC TGCTCGACGA CCTGGCCGAG CGCGGCGGCA GTCGCGCGCT CGAGGAGCGG
CTCGATCTGG CGCTGGCGAC CATCGCTTGT CACTCGGTGG TGCGCGCCGG CGACGCGCTC
AGCGCGCAGG AAGTGCGGGC GCTGTTCAAG TCCCTCGACG AGGTCGACTT CAAGGCCCAC
TGTCCCCACG GACGGCCGGT GCTGCTGCGC ATCAGCGTCG ACGAGATCGC GCGTCGCTTC
GGCCGCTAA
 
Protein sequence
MDEHRQASAD AESSENPPRP EIRVLPDTVV DQIAAGEVVE RPASVVKELV ENALDAHATH 
VNVEVEAGGK QLIRVLDNGI GMTESDVRLA LTRHATSKLR ALDDLYGLGT MGFRGEALPS
IAAVSRMSIT TRTRGQVAGT KLDIEGGRIT QISEVGAPVG THIEIRDLLF NVPARLKFLK
GNATEASHVT DSVAKLAMVH PQVHVRLRHG GRVALEAPQH SSGLERARAI LGSRLGRELH
EVSGAENGVR VTAYLAAPDL AQSTSRSTYL FVGKRAVKDR GLLHAVSMGY GELVPKGRFP
VAVLCLEVPG GEVDVNVHPQ KLEVRFSDGP AVFAAVRHVL RRGVAEAPWL SEQQSGSPVR
MRAKAHVSPP RESAGGGRAS RLAERQAAGA ARMLLPFGRD AAQPGASWQP PARDERGGGN
PPGAAAPPSA QAPSRSPDSP ADEPGAAVRD GGAATSTRAR EDAGDSGERG GGRGLSRTAF
PLAPREWPET PPGHGSSRAS EPAPWPYAGD APPSPSSPDE SASDDSLARQ PLDAGRAAAG
EGGAVPYASP APAAGAEAAP AAHVPRDPSR FFTELSYIGQ LDRTYLVCES NGEMVLVDQH
AAHERVAFQR LRDRWAQHAV PVQRLLLPKT FDLSPEQAAV AEDARATLHD MGFELEHFGG
TTYALKALPA GLRESDVETV LHELLDDLAE RGGSRALEER LDLALATIAC HSVVRAGDAL
SAQEVRALFK SLDEVDFKAH CPHGRPVLLR ISVDEIARRF GR