Gene Hoch_2090 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_2090 
Symbol 
ID8544472 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp2889022 
End bp2891406 
Gene Length2385 bp 
Protein Length794 aa 
Translation table11 
GC content70% 
IMG OID646386793 
Productprotein of unknown function DUF1111 
Protein accessionYP_003266528 
Protein GI262195319 
COG category[C] Energy production and conversion 
COG ID[COG3488] Predicted thiol oxidoreductase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.126937 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGTCCGAAA TCAAACGCTA TCGCTCATAC TTATGGCGGC TCTCGCTGGC GCTCTTATGC 
GCGTCGCTCA TCACCGCTTG CGGCGACGAT GAATCGCCGC AAGACCCCGA CGCCGGGCCC
ACGCCCGACG CCGGACCGAC CCCGGAGATC GCCGAGGGCA TCTTCGCCCC CATGGGCGAG
GTCATCCCCT CGGCCAACGC GGAGGAGAAG GCCATCTTCG AACGCGGTCT GGCCGTGGCC
AAGCGCCGCT TCGACGACCA GACCGGCCTG GGCCCGCACT TCAACGTCAG CTTCTGCGGC
GCCTGCCACG AGAAGCCGGT GTTTGGCGGT GCCGCCGGTC GCTATCGCAA CTTCCTGCTC
ACCGGTCAGG TGCTCGAGGC CGACGGATCG TTCGTGCCCA CGGGTATCAG CGGCGTGCAA
CCGCAGTACT CGAGCGACGA GGGCGTGCGC TTCGAGACCG ATCCCAAGAC CAACCGCATC
GCCACCCGCA ACCCGATCCC GTTCTTCGGC GTCGGACTGC TGGCCGAAAT CCCCGAGAGC
GAGATCCTGT CGCGCGCCGA TCCCGACGAC GCCGACGGCG ACGGCATCAG CGGCCGCCCC
AACTACGAGG ACGGCTTCAT CGGCCGCTTC GGCATGAAGT CGCAGAGCGC GGCCATCGAA
GCGTTCATCC GCGGCCCGCT CAACAACCAC CTCGGCATCA CCTCGAACCC TCTGTCCGCC
GAGAGCCGCG CGCGTCTGCC GGTCGACAGC GCGGGTGACG ACGCGCTCAA CAACGCGGCC
GGTCTGGCGC CGCAGGCGGG CGCGCCCAAC GAGCCGCTGT TCGACGACGA CGACGTGCCC
GACCCCGAGC TGGGCGAGCA AGACCTCTTC GACCTGGTGG CCTTCTCCAT GCTCCTGGCC
GCGCCGCAAC CCGACGCGCC CACCGCCGAG AGCGAAGCCG GCCGCGCCCG CTTCGAAACC
CTGGGCTGCG CGAGCTGCCA CACGCCGGCG CTGCTCGGCC CGCGCGGGCT CATCCCCGCA
TACACCGACC TGCTCATCCA CGACATGGGC GAGGAGCTGG CCGACGGCAT CTTCATGAAG
TCGGCGACCG GCAGCGAGTT CCGCTCGGCG CCGCTGTGGG GCGTGGCCGC GACCGCGCCC
TACCTCCACG ACGGCCGCGC CGACACCCTC GAGGAGGCCA TCCTGTGGCA CGGCGGCGAA
GCCAAAGCCG CGCGCGACGC CTACGAGGCC CTGAGCGAGA CCGGCAAAGA CGAGGTCATC
GCCTTCCTGC TCTCGCTCGG CGGAGCCAGC CAGTACAGCG AAGGCCTGGT GCCTCCGGGC
GCGCCCATCC CCGAGGTCGG CAGCTACGGC GGCCCGGCCA GCGAGCTGTC GGCCGAGGAT
GAGGACCTGT TCCTCCTGGG GCGGCAGATC TTCGACCGCG ACTTCTTCGC CGTCGGCGGC
CTGGGGCCGT TCTTCAACGG CGACGCCTGC CGCGCCTGCC ACTTCGACCC GGTCATCGGC
GGCGCTGGTC CCAGCGACGT CAACGTCATG CGCCACGGCC TCATCGGCGA GGACGGCGCG
TTCCAGCCGC CCGAGGGCGG CGGCACCATC GCGCCGCGCC ACGGCATCTA CTACGATCAG
CGCCCGGGCG TGGACGAGAA CGCCAACGTC TTCGAGGCCC GGCAGACCCC GCCGCTCTTC
GGTCTGGGTC TGATCGAGAG CATCACCGAA GAGGAGATCC TCGCCAATGA GGACCCCGAG
GACGAAGACG GCGACGGCAT CAGCGGCCGC GCCCATTACC TGGTGAACGA CGGCCGCCTG
GGCCGCCTGG GCTGGAAAGC CAGCGTCCCC AGCGTGGCCG AGTTCGCGCG CGACGCCATG
TCCAACGAGA TGGGCGTGAC CGTGCCCTTG CGCGAGGGCT TCACCTTCGG CTTCCTCGAG
GACGACGACG GCATCGCCGA CCCCGAGATC GACGAGCAGA CCCTCGCGGC GCTCAGCTTC
TACATGAGCT CGCTGGCGCC GCCGCCGCGC ACCTCGGTCG ATCCCGACAC CGAGGCCGAG
GGCGAGATCC TGTTCGAGAC CACCGGCTGC GCCGACTGCC ACGTGCCCAC GCTCAAGGAC
GCGGGCGGCA ACGACGTGCC GCTGTACAGC GACCTCCTGC TGCACGACGT CGCGCCCGAG
GAGTTCTTCG GCATCGAGGA CGGCGACGCC AGCACGCGTG AGTTCCGCAC CCCGCCGCTG
TGGGGCCTGG GTACCTCGGC GCCGTACATG CACAACGGCG TCGCGTACAC CATCGAGAGC
GCCATCCTGC GTCACGACAG CGAGGCGCGC ACCGCCCGCG AGGCCTACGA GAACCTCCCG
CCGGCCGAGC AGAACGCGCT GCTCGTGTTC CTCGAGTCGC TCTGA
 
Protein sequence
MSEIKRYRSY LWRLSLALLC ASLITACGDD ESPQDPDAGP TPDAGPTPEI AEGIFAPMGE 
VIPSANAEEK AIFERGLAVA KRRFDDQTGL GPHFNVSFCG ACHEKPVFGG AAGRYRNFLL
TGQVLEADGS FVPTGISGVQ PQYSSDEGVR FETDPKTNRI ATRNPIPFFG VGLLAEIPES
EILSRADPDD ADGDGISGRP NYEDGFIGRF GMKSQSAAIE AFIRGPLNNH LGITSNPLSA
ESRARLPVDS AGDDALNNAA GLAPQAGAPN EPLFDDDDVP DPELGEQDLF DLVAFSMLLA
APQPDAPTAE SEAGRARFET LGCASCHTPA LLGPRGLIPA YTDLLIHDMG EELADGIFMK
SATGSEFRSA PLWGVAATAP YLHDGRADTL EEAILWHGGE AKAARDAYEA LSETGKDEVI
AFLLSLGGAS QYSEGLVPPG APIPEVGSYG GPASELSAED EDLFLLGRQI FDRDFFAVGG
LGPFFNGDAC RACHFDPVIG GAGPSDVNVM RHGLIGEDGA FQPPEGGGTI APRHGIYYDQ
RPGVDENANV FEARQTPPLF GLGLIESITE EEILANEDPE DEDGDGISGR AHYLVNDGRL
GRLGWKASVP SVAEFARDAM SNEMGVTVPL REGFTFGFLE DDDGIADPEI DEQTLAALSF
YMSSLAPPPR TSVDPDTEAE GEILFETTGC ADCHVPTLKD AGGNDVPLYS DLLLHDVAPE
EFFGIEDGDA STREFRTPPL WGLGTSAPYM HNGVAYTIES AILRHDSEAR TAREAYENLP
PAEQNALLVF LESL