Gene Hoch_1950 details

Gene Information

Locus tag: Hoch_1950
Symbol:
ID: 8544332
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haliangium ochraceum DSM 14365
Kingdom: Bacteria
Replicon accession: NC_013440
Strand:
Start bp: 2687929
End bp: 2688888
Gene Length: 960 bp
Protein Length: 319 aa
Translation table: 11
GC content: 71%
IMG OID: 646386654
Product: S-adenosyl-methyltransferase MraW
Protein accession: YP_003266389
Protein GI: 262195180
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0275] Predicted S-adenosylmethionine-dependent methyltransferase involved in cell envelope biogenesis
TIGRFAM ID: [TIGR00006] S-adenosyl-methyltransferase MraW
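
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS is complete and ends in a single stop codon. The function names are hypothetical and not part of any database API.

```python
# Values copied from the record above.
START_BP = 2687929
END_BP = 2688888
GENE_LENGTH_BP = 960
PROTEIN_LENGTH_AA = 319


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one stop codon.

    Assumes an unspliced, complete CDS whose length is a multiple of 3.
    """
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions both comparisons hold: 2688888 - 2687929 + 1 = 960 bp, and 960 / 3 - 1 = 319 aa.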


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.0408916
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCCACGT CAGACGCATT CGTCCATCTG CCCGTACTCA AGGAGGAAGT CCTCGCGCAT 
ATGTCGCCCC GCCCCGGCGG CGTGTACTGC GATGGCACCT TGGGCGGCGG CGGTCATGCG
GGCGCCGTGC TCGCGCGCGC CAATCCCGAC GGCCGGCTGT ACGGCATCGA TCGCGACGCC
ACCGCGCTGG CGGCCGCGCA GCGCGCGCTG GCCGACTTTG GCGAGCGCGT GCAGTTCCTG
CGCGGCACCT ACGGCTACGC CGATGAGCTT CTGGCCGAGG CCGGGGCGCC GCCGCTCGAC
GGCATCCTGC TCGATATCGG GCCGTCCTCG CCGCAATTCG ACCGCGCCGA GCGCGGCTTT
TCGTTCCTCA GGCCCGGTCC CATCGACATG CGCATGGACC AGAGCAGCGG CGAGACCGCG
CTCGATCTCA TGCGCCGACT CGGGCCGGGC GAGCTCGCCG ACATCCTGTG GTCTTTCGGC
GAGGAGCGCT TCAGCAAGCG CATCGCGGCG CGCATCAAAG ACGCCGTCCG CGACCACCGC
CTCGAGACCA CGACCGACCT GGCCGCCCTC GTCGAGGACG CCATCCCCGC TTCCGTGCGC
CGACAGATGA AGACCCACCC CGCGACCAAA ACCTTCCAGG CGCTGCGCAT CGCCGTCAAC
GGCGAGCTCG ACCAGCTCGC GCGTTTTTTG CGCGTGTTCC CGCCGCTGCT CGCGCCCGGC
GGGCGCTGCG TGATCATCAG CTTCCACTCG CTCGAGGACC GGCTGGTCAA GCGCGCGTTT
CGCGATCTCG CGTGGTCCTC GCGGCTGCCG CCGGATCTGG CCCGCGCCGC GGGCGAGCGC
ATCGAGCCGG TGTGCGTGCC GGTGACGCGC AAGGCCGTGT TCGCCAGCGA GGACGAGATC
GCCAGCAACC CGCGGGCGCG CTCGGCGCGG CTGCGCGCGT GCGAGAAGGT GGCGGCATGA
 
Protein sequence
MPTSDAFVHL PVLKEEVLAH MSPRPGGVYC DGTLGGGGHA GAVLARANPD GRLYGIDRDA 
TALAAAQRAL ADFGERVQFL RGTYGYADEL LAEAGAPPLD GILLDIGPSS PQFDRAERGF
SFLRPGPIDM RMDQSSGETA LDLMRRLGPG ELADILWSFG EERFSKRIAA RIKDAVRDHR
LETTTDLAAL VEDAIPASVR RQMKTHPATK TFQALRIAVN GELDQLARFL RVFPPLLAPG
GRCVIISFHS LEDRLVKRAF RDLAWSSRLP PDLARAAGER IEPVCVPVTR KAVFASEDEI
ASNPRARSAR LRACEKVAA
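
The sequences above are printed in space-separated blocks of ten, so they need to be joined before any analysis. The following is a minimal sketch of how one might clean a blocked sequence, compute its GC fraction, and translate it. It assumes the standard amino-acid assignments, which translation table 11 shares with the standard code (table 11 differs in its permitted start codons, which this sketch does not model). The helper names are hypothetical, and `FIRST_BLOCKS` holds only the first three blocks of the gene sequence.

```python
# Codon table built from the conventional TCAG ordering.
# Assumption: table 11 uses the same codon-to-amino-acid assignments
# as the standard code; alternative start codons are not handled here.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove all whitespace (block spaces, newlines) and upper-case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s) if s else 0.0


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon.

    Raises KeyError on codons containing characters other than A, C, G, T.
    """
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[pos:pos + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
FIRST_BLOCKS = "ATGCCCACGT CAGACGCATT CGTCCATCTG"

if __name__ == "__main__":
    print(translate(FIRST_BLOCKS))
    print(round(gc_fraction(FIRST_BLOCKS), 2))
```

Translating `FIRST_BLOCKS` gives `MPTSDAFVHL`, which matches the first block of the protein sequence. Running the same functions on the full 960 bp sequence would be one way to compare against the reported protein length and GC content.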