Gene Hoch_3787 details

Gene Information

Locus tag: Hoch_3787
Symbol:
ID: 8546180
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haliangium ochraceum DSM 14365
Kingdom: Bacteria
Replicon accession: NC_013440
Strand:
Start bp: 5204962
End bp: 5206539
Gene Length: 1578 bp
Protein Length: 525 aa
Translation table: 11
GC content: 73%
IMG OID: 646388457
Product: Radical SAM domain protein
Protein accession: YP_003268180
Protein GI: 262196971
COG category: [C] Energy production and conversion
COG ID: [COG1032] Fe-S oxidoreductase
TIGRFAM ID:
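
The coordinates and lengths listed above are mutually consistent, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop (assumed)."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


# Values taken from the record: Start bp 5204962, End bp 5206539.
length = gene_length_bp(5204962, 5206539)
print(length, protein_length_aa(length))  # 1578 525
```

Both results match the Gene Length (1578 bp) and Protein Length (525 aa) fields.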


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.182556
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.243034
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCCTCTC GCTCCTGGAA TATCCAGCAA GCCCTGCGCG ATCTCGTCGC TGACGAACAG 
GGCACGATCA TCAAAGACGC GCCCCATCGC GTCGCCCTGG TCTACCCCAG CCCGTATCGC
GCGGCCATGT CGTCGCTGGG CTATCAGACC ATCTATCGGC TGCTCAACGA GCTGCCCGAT
GTCGTGGCCG AGCGCGCGGT GCTGCCCGAC GACATCGCCG CCCACGACAA CTCGGGCACG
CCGCTGCTCA CGCTCGAGCA GCAGACGCCG GTGGGCGCGT TCCCGGTGCA GGCCTTCTCG
GTCGCCTACG AGCTCGAGAT CGCCGGCGTG GTCGACTGCC TGCGGCTCTC GGGCGTGCCG
GTGCTGCGCG GCGATCGCGG CCCGCAGCAC CCGCTGATCG TGGCCGGCGG GCCGCTCACC
TTCTCGAACC CCGCGCCGCT GGCGCCGTTC TTCGACATCG TGGTGATGGG CGAGGCCGAG
ACCACGCTGC CCGCGCTGCT GGAGGAGGCC CGCGGCATGA CCCGCGAGCG CGCCATCGAG
GCCTTCGCCG GTCGCCCCGG CTACTACGTG CCCGCGGCCG ACGGCGAGCG CGTCCCGCCC
GTGGCCCGCG CCGACGACGC GCTGCTGCCG GCGCGCTCGC AGATCATCAC GGCCAACACC
GAGCTGCGTT CGATGTTCCT CACCGAGGCG GTGCGCGGCT GCAGCCGCGG CTGCACCTAC
TGCGTGATGC GCCGCTCGAC CAACGGCGGC ATGCGCGCGC TGGCGCCCGA GAAGGTGCTC
GCCGGCATCC CCGAGCACGC GCGCCGCGTC GGCCTGGTCG GCGCCTCGGT CACCGACCAC
CCGCGCATCG TCGATATCGT CCGCGGCGTG GTCGAGGGCG GGCGCGAGGT CGGCATCTCG
TCGCTGCGCG CCGACCGCCT GGGCGACGAG CTGGTGAGCC TGTTGGCCCG CGGCGGCTAT
CGCACCCTGA CCGTGGCCGC CGATGGCGCC AGCGAGGCCA TGCGCCGCCG CGTCGAGCGC
CGCACCAGCG AGAAGCACCT GCTGCGCTGC GCCGAGCTGG CCCGCGACCA CGGCCTGCGC
ACGCTCAAGA TCTATCTGAT GGTGGGCGTG CCCGGCGAGA CCGACGACGA CATCGACGAG
CTCATCCGTT TCACAGCCGA GCTGGTCAAG GTGCACCCGC GCGTGGCCTT TGGCGTGGCG
CCCTTCGTGG CCAAGCGCAA CACCCCGCTC GACGGCAGCC CGTACGCCGG CATCCGCGCG
GTCGAGGCCC GGCTCAGTCG CCTGCGTCGG GGCCTGCGCG GCCGCGCCGA GCTGCGGCCG
ACCTCGGCGC GCTGGGCCTG GGTGGAATAC ATGATCGCCC AGGCCGGCAG CGCCGCCGGG
CTCGCGGTCA TGGACGCGCA CCGCGCCGGC GGTCGCTTTG CAGACTACAA GCGCGCGTTT
CGCGAGCGCG AGGTCACGCC CACCGGACCC GTGGCCCGGG TGCCCAGCAC CAGCGAGCGC
ATCGCGTTGC ACAAGCTCGG CCGGCGCGCG CCCGCGAGCA CGGCGGCGAG CACCGAAGCG
CAGAGCAGCG CGAGCTGA
 
Protein sequence
MASRSWNIQQ ALRDLVADEQ GTIIKDAPHR VALVYPSPYR AAMSSLGYQT IYRLLNELPD 
VVAERAVLPD DIAAHDNSGT PLLTLEQQTP VGAFPVQAFS VAYELEIAGV VDCLRLSGVP
VLRGDRGPQH PLIVAGGPLT FSNPAPLAPF FDIVVMGEAE TTLPALLEEA RGMTRERAIE
AFAGRPGYYV PAADGERVPP VARADDALLP ARSQIITANT ELRSMFLTEA VRGCSRGCTY
CVMRRSTNGG MRALAPEKVL AGIPEHARRV GLVGASVTDH PRIVDIVRGV VEGGREVGIS
SLRADRLGDE LVSLLARGGY RTLTVAADGA SEAMRRRVER RTSEKHLLRC AELARDHGLR
TLKIYLMVGV PGETDDDIDE LIRFTAELVK VHPRVAFGVA PFVAKRNTPL DGSPYAGIRA
VEARLSRLRR GLRGRAELRP TSARWAWVEY MIAQAGSAAG LAVMDAHRAG GRFADYKRAF
REREVTPTGP VARVPSTSER IALHKLGRRA PASTAASTEA QSSAS
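
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of that cleanup, together with a GC-fraction calculation of the kind that would underlie the GC content field. The helper names are hypothetical, and only the first row of the gene sequence is used here as sample input.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First row of the gene sequence as printed in the record.
first_row = "ATGGCCTCTC GCTCCTGGAA TATCCAGCAA GCCCTGCGCG ATCTCGTCGC TGACGAACAG "
seq = clean_sequence(first_row)
print(len(seq))               # 60
print(seq.startswith("ATG"))  # True
print(round(gc_fraction(seq), 2))
```

Applying `clean_sequence` to all rows of the gene sequence and passing the result to `gc_fraction` would give a value to compare against the GC content field.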