Gene Hoch_2791 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_2791 
Symbol 
ID8545179 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp3830215 
End bp3832116 
Gene Length1902 bp 
Protein Length633 aa 
Translation table11 
GC content68% 
IMG OID646387483 
Producttype I restriction-modification system, M subunit 
Protein accessionYP_003267211 
Protein GI262196002 
COG category[V] Defense mechanisms 
COG ID[COG0286] Type I restriction-modification system methyltransferase subunit 
TIGRFAM ID[TIGR00497] type I restriction system adenine methylase (hsdM) 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCACCA AGCGTCAGAT CCTGCACGAG ATCAGCAAGA ACGAGCTCTT GCCCTTCCTC 
GACCGCTGGC AGCTCGAGGT CGACGACCGC CGCGTCAAAG AGCAACTCGT CGAAGCCCTC
GCGCGCTCCA AGCGCGCCCG CCTCGAGGAG CTGCTCGGCG AGCTATCGCG CGACACGCTC
AAGGCCGTGT GCCGCGCGCT CGACCTCGAC GACACCGGCC GCGCCAAGCT CACCCTGATC
GACCGCCTGC TCGGCCGCGA GTCCTCGCCC GCGTCGACCG ACGCCGACGC CGACGCCCCG
TCCAAGTCCC CGGCCCGGTC CACATCCAAG CCCAAGCCCG CGCCGGTCAT CGACGACCAG
GCCGAGGCCG AGCTCGCGGC CACCGAGGCC GAGCTCGCGG CCACCGAGCA GCTCACCACC
GCGCAGCTCG AGCGCTACCT GTGGGCCGCG GCCGACATCC TGCGCGGCCA GATCGACTCA
TCCGACTACA AGAACTACAT ATTTGGCCTG CTGTTCCTCA AGCGCCTGTC CGACGTCTTC
GAGGAAGAGG CCGAAAAACT CACCGCCGAG GGGCTGCCCG CCGCCGTGGC CTGGAACGAC
CCCGACGAGC ATCAGTTCTT CGTGCCCGAG CGCGCGCGCT GGTCCGAGAT CGCCAAGGTC
GCCACCGGCA TCGGCGAGGC GCTCAACGTC GCCTGCGCGG CCCTCGAGGA GGCCAACAGC
GGGCTCGACG GCGTGCTCGA GGGCATCGAC TTCAACGACG AGCGCCGCCT GGGCAACACC
AAGAACCGCG ACGCCGTGCT CGCCCGCCTG GTGCAGCACT TTGGCCAGCT CAGCCTCAAG
AACGCCGACC TCAGCGAGCC CGACATGCTC GGCCGCGCGT ACGAATACCT CATCGAGAAA
TTCGCCGACG ACGCCGGCAA AAAGGGCGGC GAGTTCTACA CCCCGCGCAA GGTCGTGCAG
CTCATCGTCG AGCTGCTCGC GCCCACCGCC GGCATGCGCA TCAGCGACCC CACCTGCGGT
TCCGGCGGCA TGCTCATCGA GTGCGCCCAC TACGTCGAGC GCCAGGGCGG CAACCCGCGC
AACCTGACGC TGCACGGCCA GGAGAAGAAC CTGGGCACCT GGGCCATCTG CAAGATGAAC
ATGCTGCTGC ACGGCCTGCC CAGCGCGCGC ATCGAAAAAG GCGACACCAT CCGCGACCCG
CGCCTGCTCG ATAACGGCGC GCTCCTGGTC TACGATCGCG TCATCGCCAA TCCGCCCTTT
TCGCTCGACG AGTGGGGCGT CGAGGTCGCC GAGGGCGACG GCCACGGCCG CTTTCGCTTC
GGCCTGCCGC CCAAGACCAA GGGCGACCTG GCGTTTTTGC AGCACATGGT CGCCACACTC
AACGAGGGCG GCCGCCTCGG CGTGGTCATG CCCCACGGCG TGCTGTTCCG GGGCTCGTCC
GAGGGCCGCA TCCGCAGCAA GCTGCTCGCC GAGGACCTGT TCGAGGCCGT CATCGGGCTG
GCGCCCAACC TGTTCTATGG CACCGGCATC CCGGCCGCCG TGCTCGTGCT CAGCCGCGAC
AAGGCCCGGG CGCGCAAAGG CAAGGTGTTG TTCGTCGACG CCTCGTCCGA GTTCGAGGCC
GGCAGCGCGC AGAACTACCT GCGCGATGTC CACGTAACCA AGATCGCGCG CGCGTTTCAC
GAGTATCGCG ACGTCGAGCG CTTCGCGCGC GTGGTTCCGC TGGCCGAGAT CGAGCAGAAC
GAGGGCAACC TCAACATCAG CCGCTACGTG GACACCAGCC AGGAAGAGGA GCGCATCGAC
GTGGCCGCCG CCGTGGCCCG GCTGCGCGAG CTAGAAGCCG CCCGCGACGA GGCCGAGGCG
ACCATGCATC GGTTTCTGGA GGAGTTGGGT TATGGCGGAT GA
 
Protein sequence
MPTKRQILHE ISKNELLPFL DRWQLEVDDR RVKEQLVEAL ARSKRARLEE LLGELSRDTL 
KAVCRALDLD DTGRAKLTLI DRLLGRESSP ASTDADADAP SKSPARSTSK PKPAPVIDDQ
AEAELAATEA ELAATEQLTT AQLERYLWAA ADILRGQIDS SDYKNYIFGL LFLKRLSDVF
EEEAEKLTAE GLPAAVAWND PDEHQFFVPE RARWSEIAKV ATGIGEALNV ACAALEEANS
GLDGVLEGID FNDERRLGNT KNRDAVLARL VQHFGQLSLK NADLSEPDML GRAYEYLIEK
FADDAGKKGG EFYTPRKVVQ LIVELLAPTA GMRISDPTCG SGGMLIECAH YVERQGGNPR
NLTLHGQEKN LGTWAICKMN MLLHGLPSAR IEKGDTIRDP RLLDNGALLV YDRVIANPPF
SLDEWGVEVA EGDGHGRFRF GLPPKTKGDL AFLQHMVATL NEGGRLGVVM PHGVLFRGSS
EGRIRSKLLA EDLFEAVIGL APNLFYGTGI PAAVLVLSRD KARARKGKVL FVDASSEFEA
GSAQNYLRDV HVTKIARAFH EYRDVERFAR VVPLAEIEQN EGNLNISRYV DTSQEEERID
VAAAVARLRE LEAARDEAEA TMHRFLEELG YGG