Gene Hoch_1997 details


Gene Information

Locus tag: Hoch_1997
Symbol:
ID: 8544379
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haliangium ochraceum DSM 14365
Kingdom: Bacteria
Replicon accession: NC_013440
Strand:
Start bp: 2755305
End bp: 2757518
Gene Length: 2214 bp
Protein Length: 737 aa
Translation table: 11
GC content: 69%
IMG OID: 646386701
Product: protein of unknown function DUF1111
Protein accession: YP_003266436
Protein GI: 262195227
COG category: [C] Energy production and conversion
COG ID: [COG3488] Predicted thiol oxidoreductase
TIGRFAM ID:
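
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 2755305
END_BP = 2757518


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming an unspliced CDS whose last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # prints: 2214 737
```

Both results agree with the Gene Length and Protein Length fields listed in the record.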


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.107987
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.248873
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCGGCGCC CGAGCCTCAC GCTGCTCTCA TCGAGCGATG GCGACGTGTA TAGCGAGCCC 
ACGGTCGAGC TTCGTTTCCG CATCGATAAT GCGCCCGCGC TCTCGTACCG TATCGTCGTC
GACGACGTAC AGGTCAGCGC GGTCTCGGGT GCATTCGCGG TTGGCGAGGA ACTCACCGCG
CTCCTCACCT TGGAGGAGGG CATCCGCTCG GTCGAGATCC TGCTTTTCGA CGGCGACGGG
GTGGCCGATA GCGAGCTTCT GCTGCTCGCG ATCGAGCTCC CCGCAGGGCC GTCGATCGTC
CTCGACGCGG CGCTGCCAGC GACGAGTTTC GAAGAGAGCC TGGTCGTTAG CGGTCGCGTC
GATGGGGGGC GTCCGCTCGT GTCGCTCACC CTCGTGAGCG GCGACGAGCG CTCGGACATC
GAGGTCACCG AGGCCGACGG CGGCGCTCTG TTCTCCGCGC TCGTACCGCT CACGCTGGGA
GACAACCCCT TCACACTGCA GCCCGTCGAC GATTTCGGAC GAACCGACGA GAAAACCTGG
AGCGTGCTCC GTTCTGTCGA TACGGAACCG CCACACGTCG ACGGGCGCGA CCACGTGCTC
GCCGTAGGCG ACGACGGTTC CCTGTGGGCC TGGGGGCTCA ACGGTAGCAG CCAGGTGGGC
CCCGAGGGGA TCGGAGGCTT CGTGGACGAC GTGCTCAGCC CGGTGGTGCT CGCCGGGGTA
GACGACGCGT TCGCCGTCTA CGCCAACGGC AACCAGGGGT TCTATGAGGA CGGCGCCGGT
CAGCTCTGGG GATGGGGACA GAACGGCGCT ACCGGCAACC TCGGCATCCC GGCCGAAGGC
GACGTCGCCG TGCCCTCGGC GCCGGTCTTC GGTATCCGCG GCGTTGTCGA TGTCGCCATC
GGCGCACTGC ATGGCGTCGT CGTCGACGGC GCTGCCGAAG ACGGCACACC GCTGCGTGAC
AGCGACGCCG GACGCTCGTC CGAAGGCGAC ACAGTAGATG CCTCCTCGCT GGCAGATGCC
GCGAGCGACG AAAATTCGGA CGCGAGTCCG ACGTCGATTC CCGATCCGCT GCGCATTGGC
GGTGCGCTCA CGGCCACAGA GTACGGGAGC CGCCCCTTTT TGACCACCGC GCCCGCTCTC
GACCTCGGCG GCGCGCCAGC CGTCTCCTTT GGCCGCGAAC TCTTCGTCGC CGACTGGGAC
GCCGCCCCCG GCTCGCGCGA ACTCATCGAC GGCCTTGGGC CCCTCTACCA TGCTCTCGCC
TGCCTCGGCT GCCACCCCGA GAGCGGGCGA GCTGCGTCAC TCGAAGCGGG GGGCTCCGTG
GCCCCCGGTT TGCTCCTGCG CCTCGTGCGG ACCGAGGGCG AAGGCCTCGT GCACGACCCG
TCACTAGGCG GTCAGCTACA GACGCTCGCG ATCGCCGGGG TGCCTGCCGA AGGGACGGCG
CAGTGGGAGC CCGCAGCCAT CGCCGAGTTG GCGCCACACT ACCATGAGGT CGCAGCTCGC
GCGCCTGCGC CTCGCTTCAC GGTCGCGATC GACCCCGCCT ACCCCGCGCT CGCGGACAAC
ACGAATTCCG GCCCGCGGCT CGCCCCTCAG CTCGTCGGCG TCGGTCTGCT CGAGCAGGTC
CCAGAGGACA CGCTGCTTGC CTGGGAAGAT ATCGAGGACG CGGATGGAGA CGGCATCTCG
GGGCGGGCGA GTTGGGTCGC GACGCCCTCC GGCCCACGGA TCGGACGCTT TGGCTGGAAC
GGTGACGGAC TCCAGGCCGT CGCCACCTTC CTCTCGTTGC TCGCGGTACC GGCTGCGCGT
CGCGAGAGGC GGGATCCGCA GGTAGAGGAA GGCGCGTCGC TGTTCCGTGC CGCGCGTTGC
GATGCCTGCC ACCGCGAGAC GCTCACCACC GGAGCGGTGG CGACGCAAGC GCTGCTCTCG
GAACAGACCT TTCACCCATA TACCGATCTG CTGCTACACG ACATGGGCGC CGCACTCGCC
GATCCTGTCG GCGAAGGCGA CACCGCCGCG CGCGAGTGGC GTACACCGCC GCTCTGGGGC
CTCGGCCTGA TCGAGGAGGC GGCCAACGCG CGCTTCCTCC ACGACGGGCG TGCGCTCTCG
CTCGAGGACG CCATCCTTTG GCATGGCGGC GAAGCCGAGG CCGCGCGCGC GGCCTTCGCC
GCGATGGGCC GCAGCGACCG CGACGCGCTG CTGGCGTTCG TGCGCTCACT GTGA
 
Protein sequence
MRRPSLTLLS SSDGDVYSEP TVELRFRIDN APALSYRIVV DDVQVSAVSG AFAVGEELTA 
LLTLEEGIRS VEILLFDGDG VADSELLLLA IELPAGPSIV LDAALPATSF EESLVVSGRV
DGGRPLVSLT LVSGDERSDI EVTEADGGAL FSALVPLTLG DNPFTLQPVD DFGRTDEKTW
SVLRSVDTEP PHVDGRDHVL AVGDDGSLWA WGLNGSSQVG PEGIGGFVDD VLSPVVLAGV
DDAFAVYANG NQGFYEDGAG QLWGWGQNGA TGNLGIPAEG DVAVPSAPVF GIRGVVDVAI
GALHGVVVDG AAEDGTPLRD SDAGRSSEGD TVDASSLADA ASDENSDASP TSIPDPLRIG
GALTATEYGS RPFLTTAPAL DLGGAPAVSF GRELFVADWD AAPGSRELID GLGPLYHALA
CLGCHPESGR AASLEAGGSV APGLLLRLVR TEGEGLVHDP SLGGQLQTLA IAGVPAEGTA
QWEPAAIAEL APHYHEVAAR APAPRFTVAI DPAYPALADN TNSGPRLAPQ LVGVGLLEQV
PEDTLLAWED IEDADGDGIS GRASWVATPS GPRIGRFGWN GDGLQAVATF LSLLAVPAAR
RERRDPQVEE GASLFRAARC DACHRETLTT GAVATQALLS EQTFHPYTDL LLHDMGAALA
DPVGEGDTAA REWRTPPLWG LGLIEEAANA RFLHDGRALS LEDAILWHGG EAEAARAAFA
AMGRSDRDAL LAFVRSL
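
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It assumes translation table 11, which uses the same amino acid assignments as the standard genetic code but permits alternative start codons such as GTG, and it assumes the first codon of the input is the initiator and is therefore read as methionine. The helper names `clean_sequence` and `translate_cds` are hypothetical, introduced only for this example.

```python
# Codon table built in the conventional TCAG order. Table 11 shares these
# amino acid assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for 10-base block layout."""
    return "".join(text.split()).upper()


def translate_cds(dna: str) -> str:
    """Translate a CDS, stopping at the first stop codon.

    Assumption: the first codon is an initiator, so it is emitted as 'M'
    even when it is GTG, which would otherwise encode valine.
    """
    seq = clean_sequence(dna)
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append("M" if pos == 0 else amino_acid)
    return "".join(residues)


# First 30 bases of the gene sequence listed above.
print(translate_cds("GTGCGGCGCC CGAGCCTCAC GCTGCTCTCA"))  # prints: MRRPSLTLLS
```

The output matches the first ten residues of the protein sequence, which is consistent with the GTG start codon being translated as methionine.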