Gene Hoch_1639 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_1639 
Symbol 
ID8544021 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp2234650 
End bp2235690 
Gene Length1041 bp 
Protein Length346 aa 
Translation table11 
GC content72% 
IMG OID646386347 
Product2OG-Fe(II) oxygenase 
Protein accessionYP_003266082 
Protein GI262194873 
COG category[R] General function prediction only 
COG ID[COG3491] Isopenicillin N synthase and related dioxygenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.459164 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACCACG CCGCCGGTTT CTCCCCTGCC CTGCCCGTCA TCGACATGGC GCCGCTGCTC 
GCGCCGGCCG GGGCTTCGTC GCGCGCGCGG GCCGAGGTGG TCGGCCAGCT CGAGGCCGCG
TGTCGCGACA GCGGCTTCTT CTACGTGGTC GGACACGGCG TCGACCAAGA ATGCCTGGCT
CGCCTCGACG CCGCCAGCCG ACGCTTTTTC GCGCTGCCGC TGGCCGACAA GATGGCCATC
GACATGGCCC GCGGCGGACG CGCGTGGCGC GGCTATTTTC CCGTCGGCGG CGAGCTGACC
TCGGGCCAGC CCGACCGCAA GGAGGGTCTG TACCTGGGCA CCGAGCTGCC CGCCGAGCAT
CCGCGCGTGC GCGCTGGCTG GCCGCTGCAC GGCGCCAACC TGTGGCCGGC CCAGGTTCAC
GAGCTGCGCC CGGCCGCGCT CGCGTACATG GACGCGGTCG CGCGCGCGGC CCAGGCGCTG
CTCGTCGGCC TGGGTCTGAG CCTGGGCCTC GACGCCGAGT ATTTCGCGCG CTGCTACACG
GCCGAACCCA CAGTGCTATT TCGCATCTTC CATTACCCGG CGTGCGCGAA CCCGACCGAG
GCCGCGGCCT GGGGCGTGGG CGAACACACC GACTACGGGC TGCTCACGCT GCTCGCGCAG
GACGCCTGTG GCGGCTTGCA GGTCAAGACG CCGCGCGGCT GGTGCGACGC GCCGCCGATC
GCGGGCGCGC TGGCGTGCAA CATCGGCGAT ATGCTCGACC GCCTGTCCGG CGGCTGGTTT
CGCTCTACCC CGCACCGCGT GCGCAACCTC AGCGGACGCT CGCGGCTGTC GTTCCCGCTG
TTCTTCGACC CCGACTTCGC CGCGCCCATG CAGCCGCTGC CGCAGCGCGG CCTCGACGCC
GCCGGCATCG AACGCGACCG CGAGCAGCGC TGGGACGGCG CCAGCGTGCA CCACTTCGAG
GGCAGCTACG GCGATTACCT GCTGTCCAAG GTCGCCCGCG TATTCCCGCA GCTCGGCCAG
CGCGTGCTCA GCGAGCAATG A
 
Protein sequence
MNHAAGFSPA LPVIDMAPLL APAGASSRAR AEVVGQLEAA CRDSGFFYVV GHGVDQECLA 
RLDAASRRFF ALPLADKMAI DMARGGRAWR GYFPVGGELT SGQPDRKEGL YLGTELPAEH
PRVRAGWPLH GANLWPAQVH ELRPAALAYM DAVARAAQAL LVGLGLSLGL DAEYFARCYT
AEPTVLFRIF HYPACANPTE AAAWGVGEHT DYGLLTLLAQ DACGGLQVKT PRGWCDAPPI
AGALACNIGD MLDRLSGGWF RSTPHRVRNL SGRSRLSFPL FFDPDFAAPM QPLPQRGLDA
AGIERDREQR WDGASVHHFE GSYGDYLLSK VARVFPQLGQ RVLSEQ