Gene Hoch_3385 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_3385 
Symbol 
ID8545773 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp4682595 
End bp4683833 
Gene Length1239 bp 
Protein Length412 aa 
Translation table11 
GC content75% 
IMG OID646388052 
Productputative RNA methylase 
Protein accessionYP_003267780 
Protein GI262196571 
COG category[R] General function prediction only 
COG ID[COG1092] Predicted SAM-dependent methyltransferases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.871032 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.230972 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACAGGCA GACCCGCGCG CGCTCTGGTC AAAGATCTGC GCGAGCGCAT CCGGCTCGGA 
CACCCGTGGA TCTACGAACG CGCGCTGGCG CGGGCGCGCG CCGACATCGG CCCAGGCGCG
CTGGTGAGCA TCACCCACGG CGGCGACGAC ATCGCCATCG GCTTCGCCGA TCCCGGCTCG
CCCATCGCCG TGCGCGTGCT CGCCAGCGCC CGCGGCGTGC GCGCCGATCG CGCCGAGCGC
GCGCTGTGGG CCGCCGAGGA CGCTTTCGGC GGCGCCTGGG CCGCCGAGCG CGCCGAGCAG
GCCGCGCGCG TGCGCACCGC CGACCCGCGC CTGGCCGGCC TCGACGGCAT GCGCCTGGTG
CACGGCGAGA ACGACTACAT GCCCGGCCTG GTCATCGACA GATACGCCGA CACCGGCGTG
GTGATGTTCG ACGGCCCCGG CGCCGCCGGC TTCTGGGAGC CGCGCATCGA CGGCGTGTGC
GAGGGCCTGC GCCGCGGCGG TGTCGAACTC GCCCGGCTGT GGGCACGGCC GCTGCCACGG
GTCGAGCGCG GCGGCGCGGG CCGTGTGCTG CGCGGCGATG CGCCGCCCGC GCGCATCCCC
ATCCACGAGG GCGAGGCGCG CTTCGAGGTC GACGTCCGCG CCGGCCAGAA GACCGGCTTC
TTCCTCGACC AGCGGCGCAA CCGCATGCTG GTCGGCGAGC TCGCCGCCGG CGCCGAGGTC
CTCAACCTCT ACGCCTACAC CGGCGGCTTC TCCGTGCACG CCGCGCTCGG CGGCGCCCAG
CGCGTCAGCT CCGTGGACAT CGCGCGCCCG GCCATCGCCA GCGCGCGCGA CAACTTCGCC
CTCAACGGCC TCGATCCCGA CGCCCACGAG TTCGCGGCCG AAGACGCCTT GGCCTTTCTC
GAGCGCGTCC AGCAGCGCGG GCGCCGCTTC GACCTGGTCA TCGTCGATCC GCCCAGCTTC
GCCCCCAGCG AGCGCGCCAA GCCCAAGGCC CTGCGCGCCT ACGGCAAGGT CAACGAGCTG
GCGCTGCGCG TCGTCGCCGC CGGCGGCACC CTGGTGAGCG CCTCGTGTTC GAGCCACGTC
GGCGGCGCCG ACATGAGCGA GATGCTGGCC CAGGCCGCGG CCCGCGCCGG CCGCGTAGTG
CGCATCGTCG AGCAGCGCGG CGCTGACCGC GACCACCCGG TGCGCCCGGG CTTCCCGGAG
GGCGAGTACC TCCAGGCGCT GTTCCTCAGC GTGGCCTGA
 
Protein sequence
MTGRPARALV KDLRERIRLG HPWIYERALA RARADIGPGA LVSITHGGDD IAIGFADPGS 
PIAVRVLASA RGVRADRAER ALWAAEDAFG GAWAAERAEQ AARVRTADPR LAGLDGMRLV
HGENDYMPGL VIDRYADTGV VMFDGPGAAG FWEPRIDGVC EGLRRGGVEL ARLWARPLPR
VERGGAGRVL RGDAPPARIP IHEGEARFEV DVRAGQKTGF FLDQRRNRML VGELAAGAEV
LNLYAYTGGF SVHAALGGAQ RVSSVDIARP AIASARDNFA LNGLDPDAHE FAAEDALAFL
ERVQQRGRRF DLVIVDPPSF APSERAKPKA LRAYGKVNEL ALRVVAAGGT LVSASCSSHV
GGADMSEMLA QAAARAGRVV RIVEQRGADR DHPVRPGFPE GEYLQALFLS VA