Gene Hoch_3784 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_3784 
Symbol 
ID8546177 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp5197778 
End bp5200693 
Gene Length2916 bp 
Protein Length971 aa 
Translation table11 
GC content72% 
IMG OID646388454 
ProductProtein of unknown function DUF2344 
Protein accessionYP_003268177 
Protein GI262196968 
COG category[C] Energy production and conversion 
COG ID[COG1032] Fe-S oxidoreductase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.539179 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.106507 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCCACA TCTACGCAGA CTTCATCGAC CGCGTCGCCA AGCCGGCACG CTACCTGGGC 
GGCGAGTACC TGTCCGTCGT CAAGCCGGCC GAGGAAGTCG ACGTCCGCGT CGCCCTGGCG
TTCCCCGATG TCTACGACAT CGGCATGTCG CACCTCGGCA CCAAGATCCT CTATTCGCTG
CTCAACAAGC AGCCGCGGAT CGCGGCCGAG CGGGTGTTCG CCCCGTGGGT CGATATGGAG
GCCGAGCTGC GCGAGCGCGG GCTGCCCCTG GTGTCGCTGG AGTCGTCCAC GCCGCTCTCG
GACTTCGACG TCATCGGCTT CTCGCTGCAA TACGAGCTGA CCTTCACCAA CGTCCTCACC
ATCCTCGACC TCGGCGGCGT CCCGCTGCGC GCGGCCGAGC GCACGGACGA TGACCCCCTG
GTGCTGTGCG GCGGTCCCGT GGCCTCGCAC CCGGAGCCGG TCGCGCCCTT CTTCGACGCC
TGCTACATCG GCGAGGCCGA AGAGGAGCTG TCGGGGCTGC TGCTCGAGTG GGCCGAGATG
CGCCGCGCGG GCCGCGCCCG GCTCGACGCG CTGGCCGAGC TGGCCGGCCG CTATCCGATC
TATGTGCCCG CGCTCTACGA CACCGAGCGC GACCCCGAGA CCGACATGAT CGTGGTCGGC
GCGCCCCGCG ATGCCCGCGC GCCCGCCCGC GTGCGCCGCG GCGTGGTCCG CGATATCGAC
GCCTATCCCT TCCCCTCGGA TACGCCGGTG CCCTACGCCG AGGCGGTGTT CGACCGCGCC
GCGGTCGAGA TCGCGCGCGG CTGCACCGAG GGCTGCCGCT TCTGCCAGGC CGGCATGATC
TACCGCCCGG TGCGCGAGCG CTCGCCCGAG TCGATCGCCA AGAGCGTGAT CGACAGCGTC
GAGAAGGCCG GCTACGACGA GACCGCGCTC ACCTGCCTGA GCACGGCCGA CTTCTCGAGC
ATCACGCCGC TGGTCAAGAA CGTGATGAGC GAGCTGCGCA AGCGCAAGGT CACGCTGTCG
GTGTCGTCGC TGCGCGCCTA CGGCCTGGGC GAGGACATCC TCGACGAGAT GGCCTCGATG
CGCATCACGG GGCTCACCTT CGCGCCCGAG GCCGGCACCC AGCGCATGCG CGACGTGGTC
AACAAGAACG TGACCGAGGC GCATATCGAG GAGTCGACGA CGCGCGTGTT CGCGCGCGGC
TGGCACCGGC TCAAGCTGTA CTTCATGATC GGCCTGCCGA CGGAGGAGGA CGATGACGTG
GTCGGCATCG TCAACACCGG CCAGCGCATG CTGCACATCG GCCGCCGCGA GGCCGGCAAA
CGCGCCGAGG TCACGGTCAG CGTGTCCTCG CACGTGCCCA AGCCGCACAC GCCCTTTCAG
TGGTGCGCCC AGGACTCGCT GCCCGAGATC AAGCGCAAGC AGCAGCTCCT ACGCGGCGCG
CTGCGCGACC GCAACCTGCG CCTCAAATAC CACGACGCCG GCATCAGCTT CGTCGAGGGC
GTGATGTCGC GCGGCGACCG GCGCGTGGCC GACGCCATCG AGATGGCCTG GCGCCGCGGC
GCCCGCTTCG ATGGCTGGGA TGAGCTCTTC GACCTGGGCA TGTGGCAAGA GGTTTTCGGC
GCGTGCGAGA TCGACGCCGA CGTGTACCTG TCCACGCGTC CGATCACGGC CCGGCTGCCC
TGGGACCACA TCGATGTCGG TCTCGAGGAC GGCTTCCTGC TCGGCGAGTA CCGCAAGGCG
CTCAAGAGCC GGCTGTCGCC GCCCTGCGGC AAGGTCGCCG GCCAGCTCGT GCACCACAAT
AACCTCGACG ACGCGCGCGC CGACCAGCGC CGCCTGGTGT GCTACGACTG CGGCGTCGCC
TGCGATCTGT CGAAGATGCG CAGCGACCGG CTGGTGGCCC TGGGCGCGCT CGGCGCCGAA
CACGCGCCGC GGCGGCCCGA GCCCCGCGCC GAAGCGGCCG AGGCCAGCGA CACGGCGGCG
GCGACGGGCG ACGCCCAGCC GGCGGCCGAC GGCGACAAGG GCGCGAGCGC CGAGACGGCG
GCGAGCGAGC GGCCGCGCAA GGGCAAGAAG AGCCGCCGCG GGCCCAAGGT GTCGTTCCCC
GACCTGCCCA AGGTGGGCTA CCGGCTGCGC TACGCCAAGC TGGGCCGCGC GGCCTATCTC
GGGCACCTGG ACACCGGCCG CATGCTGGCG CGCCTGTTCC GCCGCGCGGA CCTGACTCTG
GCCTACAGCC GCGGCTATCA CCCCAAGCCG ATCATCCAGT TCAGCCCGGC GCTGCCGCTG
GGCGTGGCCA GCATGGGCGA ATTGCTCGAC GTGAGCGTCG AGGCGCCCTC GGCGGTGCCG
GCCGAGGCGC TGCTGCGGCG GCTGCGCGAG GTCTCGCCCG AGGGCATCCT GTTCGGCGAT
GCCTGGGCGC TGCCGCCGGG CAGCCCGGGC CTGGGCAAGC TGATCGAGGC CTACGATCTG
CTGCTGGCGC CGGCGCCCGG TCTGCCCGCG GACGAGGCCG CGCTGATGCG CGTGGCCGAC
GAGTTCCTGG GCCGCGCGTC GGTGCTGGTG CCGCGCAAGG AGCGCGAGAT CGATGTGCGC
GCCTTCGTCT CGCGCATCGA CGTGCTGGCC GAGCGCGCGG CCGAGCGGCT GGCGGGCGCG
CTGGGCTGGC CGCTGGCCGA GACCGCGAGC GCGCCCGCGC TGCTGCAGGT GCGGGTGCAC
ATGACGCCGC AGGGCTCGGC CAAACCCACC GAGATCGCCG AGGCGCTGGG GCTGTGGGGC
GACCCCGACC CGCGCGCGCC GCACGCGCTG CTGGCGCGCC TGGGCTTCCC GGGCGTCGAG
CCCACGGCCG AGGACCACGC CCACGCCCGC GGCGAGGGCA TCCATCTGGC CGCCGCGCAC
TCCGAGGAGG TCTCGGCCGC CTCGGCGCCG TCCTGA
 
Protein sequence
MRHIYADFID RVAKPARYLG GEYLSVVKPA EEVDVRVALA FPDVYDIGMS HLGTKILYSL 
LNKQPRIAAE RVFAPWVDME AELRERGLPL VSLESSTPLS DFDVIGFSLQ YELTFTNVLT
ILDLGGVPLR AAERTDDDPL VLCGGPVASH PEPVAPFFDA CYIGEAEEEL SGLLLEWAEM
RRAGRARLDA LAELAGRYPI YVPALYDTER DPETDMIVVG APRDARAPAR VRRGVVRDID
AYPFPSDTPV PYAEAVFDRA AVEIARGCTE GCRFCQAGMI YRPVRERSPE SIAKSVIDSV
EKAGYDETAL TCLSTADFSS ITPLVKNVMS ELRKRKVTLS VSSLRAYGLG EDILDEMASM
RITGLTFAPE AGTQRMRDVV NKNVTEAHIE ESTTRVFARG WHRLKLYFMI GLPTEEDDDV
VGIVNTGQRM LHIGRREAGK RAEVTVSVSS HVPKPHTPFQ WCAQDSLPEI KRKQQLLRGA
LRDRNLRLKY HDAGISFVEG VMSRGDRRVA DAIEMAWRRG ARFDGWDELF DLGMWQEVFG
ACEIDADVYL STRPITARLP WDHIDVGLED GFLLGEYRKA LKSRLSPPCG KVAGQLVHHN
NLDDARADQR RLVCYDCGVA CDLSKMRSDR LVALGALGAE HAPRRPEPRA EAAEASDTAA
ATGDAQPAAD GDKGASAETA ASERPRKGKK SRRGPKVSFP DLPKVGYRLR YAKLGRAAYL
GHLDTGRMLA RLFRRADLTL AYSRGYHPKP IIQFSPALPL GVASMGELLD VSVEAPSAVP
AEALLRRLRE VSPEGILFGD AWALPPGSPG LGKLIEAYDL LLAPAPGLPA DEAALMRVAD
EFLGRASVLV PRKEREIDVR AFVSRIDVLA ERAAERLAGA LGWPLAETAS APALLQVRVH
MTPQGSAKPT EIAEALGLWG DPDPRAPHAL LARLGFPGVE PTAEDHAHAR GEGIHLAAAH
SEEVSAASAP S