Gene Hoch_4901 details

Gene Information

Locus tag: Hoch_4901
Symbol:
ID: 8547308
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haliangium ochraceum DSM 14365
Kingdom: Bacteria
Replicon accession: NC_013440
Strand:
Start bp: 6767336
End bp: 6773299
Gene Length: 5964 bp
Protein Length: 1987 aa
Translation table: 11
GC content: 71%
IMG OID: 646389574
Product: hypothetical protein
Protein accession: YP_003269283
Protein GI: 262198074
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG2885] Outer membrane protein and related peptidoglycan-associated (lipo)proteins
TIGRFAM ID: [TIGR03382] Myxococcales GC_trans_RRR domain
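
The gene and protein lengths in this record can be cross-checked from the start and end coordinates. The sketch below assumes the coordinates are 1-based and inclusive, that the gene is unspliced (as the record states), and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
# Values copied from the record above.
START_BP = 6767336
END_BP = 6773299


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residues encoded by an unspliced CDS of gene_len bp, excluding the stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 5964
print(protein_length(length_bp))  # 1987
```

Both values agree with the Gene Length and Protein Length fields of the record.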


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.958107
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGAAAGC AGAGTATTGC GATTGCCTGT CTGGGCGCGC TCGGCCTGCT CGTGCACGCC 
GACTCCGCCC ACGCGGAGGG AAGCGCCGAG CTGGGCACGA CCCAGGGTGT CGATAGCACC
ACGGTGTTCC GCATCGACAT CGTCGATGCC GAGACCGAGA GCATCTTCTG GGAGGGTCCC
GTCCCGCTGA CGCTGCAGCT TCCCAGTGGG GGGTTCCACC CGACCTTCCC GGTGCTCCCC
AGCGGTCAGA CGGCGCCGGC GATCGCCGGC GAGGTCGGCG TCTACCAGCT CACGCTGGTC
GGCGCCGACA TCCCAGCCGG TACCGCCTGG AACCTGTCGG TGCGCGATGA TCAGGGCGAG
GAGATCCCAG GCCGCGTGTT CTCGCGCAAC TGGGTGATGC AGCCGGACGA CGCCAGCGCG
GACGGCGCGC TGTCGGCCAG CTTCTTCGCC AAGACGCCGA CGGCGGCCGG CCAGACCGGC
GTCATCGAGC TGCGCGTGGA CGGCGCCAAT AGCAACGACG GCGGGCCCAC GGCGTTCGGC
TTCCAGGCCA ACGCCAGCGG CATCCCGGGC GCCAACGCCG GCCGCAGCGT GCCCGACAAC
GACACCGGCA ATCCCATCGG CGTCGGCAAT ATCGCGCTGT ATCTCAACCC GCCCGCGACC
GCGACCTACT CGGTGGGCGC GCCCCAGGTC TCGAGCCTGG TGTTCCGCGG CGGCAGCCTG
GCGTGCAACA GCGCCGAGCC CGGCGCCGGC GGCACCTTCG CGTTCAACAG CAGCGTGGCC
GGCACCTTCC ACCTTATGTG CGACTTCGAC GCCGACGGCG TCGACATCTC GGACGATGAC
GACCTGCTCG TGGTCGGCAC CGCGACCCCG GGTACCAACA CGGTGGCCTG GAACGGCGAG
GACCGCAACG GCAGCACGGT CGCGGCCGGT AGCTATCAAT GTCAGGTGCG CCTCACCAGC
GGCGAGCTGC ACCTGGTGGC CGAGGACGTG GAGACCATGT TCCCGGGCCT GCGCATGTTC
GCCATCGACA GCGTCGGCAC CTCGACGCCG CTGGCCATGT ACTGGAACGA CTCGCAGATC
CAGGCCAGCG CCATCCCCAT GCCCAACGGC GAGCTCAGCC TCGAGACCTC GGGCGCGGCG
GGCGTGAGCT CGGGCGATCC CACGGGCGCG CCGGAGGCCA ACACCAACGC CCGCGCCTGG
GGCGACTTCG GCGAGGACGG CAAGGGCGAC GGCAGCTTGC TCGACACCTA CACCTGGCTG
GCCGATGACC TCAGCGCCTC GACCGAGGCC GTGGTCATCG ACAGCGACGC CGATAGCGAC
GGCGATGGGC TCACCGATCT CGAGGAGCTG TGCGATCTGG GCACCGACCC GGACGACACC
GACTCGGACG ATGACGGCCG CCTCGACGGT GACGAGGGCA CGGCCGACAC CGACGGCGAC
GGCCTGGCCG ACCCGCTCGA CCCGGACAGC GACAACGACG GCATCCTCGA CGGCACCGAG
GTCGGCATCA CCACGCCGAG CGCGGACACC GACGTGAGCC GCGGTAACTT CGTGCCCGAC
GCCGATCCCA ACTCCACGAC CGATCCGACC TCGGCGCACA GCGACACCCA GGGTTTCTCG
GATCCCACGA TCGGCGACCC CGACGGCTCC GAGGACAGCA ACCACAACGG CCGCGTCGAC
GCGGGCGAGA GCGATCCCAC CGAGTCCGCC GACGACTCGG CCATCGGCGG CCAACCGCTG
GTGGACAGCG ACGGCGACGG CCTGGTCGAC GCCGAGGAGA CCTTCTTCGG CTCCGATCCC
AACGACGCCG ACACCGACGA CGACGGCCTG CTCGACGGCG ATGAGCCCAA CTGGACCACG
GACACCGACG GCGACGGCGT GATCAACGTC TTCGACCCGG ACAGCGACAA CGACGGTCTG
TTCGACGGTA CCGAGGCCGG CGTGGCCACG GCCGACGCCG ACACCGACCT GAGCCGCGGC
GTGTTCCGCG CCGACGCCGC GCCCGGCACC ACGACCTTCG TGGTGGTCGC GGACTCGGAC
CGCGGCGGCG TGCGCGACGG CGCCGAGGAC CCCAACTACA ACGGCCGCAT CGACGGCGGC
GAGCTCGACC CCAACGACGG CGCCGACGAC GCGCTCAACG TGCCGCTCGA CACCGATGAC
GACGGGCTCA CCGACGACGA GGAGTCGCTC ATCGGCAGCG ATCCCAACGA CGCCGACACC
GACGACGACG GCCTGCGCGA CGGCGACGAG GTCCACTGGA TGCTCGACAG CGACGGCGAC
AGCCTGATCA ACGTGCTCGA CGCCGACAGC GACAACGACG GCCTGTTCGA CGGCACCGAG
GCCGGGGTGG CTGCGGCCGA CGCCGACACG GACGAGAGCC GCGGCACCTT CATCGCCGAC
GCCGACCCGA GCACCACGAC CAGCATGATC GACGCCGACA CCGACGACGG CGGCGTGCGC
GACGGCGCCG AGGACGCCAA CCACAACGGC GCCATCGACG GCGGCGAGAC CGACCCCAAC
GAGGGCGCCG ACGACGTGGC CCCCGAGGAC AGCGACGGCG ACGGCCTCAC CGATGTGGAG
GAGGCGACCT TCGGCTCGGA TCCCAACGAC GCCGACAGCG ACGACGACGG CCTGCGCGAC
GGCGACGAGC CCAACTGGAA CATCGACCAG GACGGCGACG GCTTCATCGG CGCGCTCGAC
CCGGACAGCG ACAACGACGG CATCTTCGAC GGCACCGAGG CCGGCGTGGT GACCGCGGAT
CCCGACACCG ACCTGGGCGC TGGCGCGTTC GTGGCCGACG CCGACCCCAG CTCCACGACC
AGCCCGATCG CGGCTGACAG CGACGGCGGC GGCGTGGATG ACGGCGCCGA GGATCCCAAC
CACAACGGCG CCATCGACAG CGGCGAGCTC GACCCCGAGG ACGCGGGCGA CGACGGCACG
CCGCCGGCCG ACAGCGACGG CGACGGCCTC AGCGACGATG AGGAGGCGGC GTTTGGCACC
GATCCCGACG ACGCCGACAC CGACGACGAC GGCGTGCGCG ACGGCGACGA GTACAACTGG
GCCCACGACT TCGACGGCGA CGGGCTGATC AACGCCCGCG ACGGCGACTC GGACGACGAT
GGTCTGTTCG ACGGCACCGA GCGCGGTGTG GTCACGCCGG ACCCGGACAC CAACACCAGC
GAGGGCGGCA GCTTCATCGC CGACGCCGAC CCGAGCACCA CCACCAACTC GCTCGACCGC
GACACCGACG ACGGCGGCGT GGAGGACGGT CTCGAGGACC TCAACCACAA CGGCGCCCTG
GACGCGCGCG AGATCGATCC CAACCTCGCC GATGACGACG GCCTGCTCGA CCGCGATCAG
GACACCATCA TCGACACCGA CGAGGGCGTG GCCGACGACG ACGACGACGG CATCCCCAAC
TACGAGGATC TCGACTCCGA CGGCGACGGC ATCCTCGACG AGGACGAGGC CGGCGATCTC
GACCGCGAGA CCGCGCCGGT GGACACCGAC GAAGACGGCA CGCCCGACTT CCTCGACCTC
GACACCGACA ACGACACCCT GCCCGATGCC GACGAGGCCG GCGACGACCA GCTCGACACC
CCGCCGGTGG ACACCGACGG CGACGGCACG GCCGATTTCC GCGACCTCGA CAGCGACGGC
GACGGCACGC CGGACGCGGA GGACCCGTGC CCGACCGATC CCGACGACGC CTGCATGATG
GTGGGCGAGG ACGATCGCGA CGGCGACGGC ATCCCGGACA ATACGGACAA CTGTCCGGAT
ACGCCCAACC AGGGTCAGCT CGACCAGGAC GGCGACGGCC TGGGCGACCT GTGCGACAGC
GACGCCGACG GCGACGGCTT CGACGATGAC ATCAGCATCG GCGGCGGCGG TTGCTCGTCG
GCGGGCGATG GCTCGCTGGG CGCGCTGCTG CTGGTGCTCC TGGCCCTGGG CCTGGTGACC
CGGCGGCGCC GCCGGCAGGC GATCGCGCTG GCCGCGACCC TGGCCGTGGT GCTGGTCCTG
GGCAGCGCGT CTTCGGCGCG CGCCCAGGTC CAGGACGAGG GCGCCTTTCC GGCCGAGCGC
TTTCGCCTGG CCGCCGATGA GGAGGGCGTG CTGCACACCG AGTGGGGCGC GGTGCCCGGG
CACATGGCCT GGGACCTGGC GCTGCTGTTC GGCTATCAGG ACGACCCGCT GACCATCTAT
CGCGAGCGCG ATGGCGATCG CGAGCGCGAC GGCGCGCTGG TGTCGAGCCG CGTGAGCGGC
AGCCTGGTCG CCAGCCTGGC GCTGTGGAAT CGCCTGGCGC TGGCCATCGA GCTGCCTCTG
ATCCTCACCC AGGACGACGA TCCGGTGAGC GGCGTGCCCG TGGGCGATCT CGAGCGCACC
GGCATCGGCG ACATCCGGGT GGCGCCCAAG GTGCAACTGC TCTCGGCCGC GAACAGCGGC
GTGGACCTGG CCATCATCCC GACCTTCACC CTGCCCACGG GCTCGGCCGA GGACTACCGC
GGCGAGCAGG GCGTGTCCTT TGCGCCCGAG GTGGCCATCT CGCGCGCGAT GGGCGCCTGG
CGTCTGGCCT CGAACATCGG CTACCGGGCG CGGCAAAACG CCCACCTGGC CGACCTCGAC
GTCAACGACG AGCTGTTCTT GCGCCTGGGC GCGGGCTATC GCCTGGGCGA GACCGGCGGT
CCGCCGCTCG AGCTCGACCT GGGCTTGTCG GCGGCCACGG CGGCCGCGTC GCCGCTGGGC
GATTACAACC AGAACCACCT CGAGCTGCTC ACCGGCGCGC GCTACCACCT GCCCGGCCCG
TTCTCGATCG GCCTGGGCTA CGGCGTGGGT GTGACGAACG GCTTCGGCAC CCCGGACTGG
CGCCTGTTCC TCACGGTGCG CGCGGCCGGG CGCTCCGATC CCGACAGCGA CGGCGACGGC
ATCCTCGATG ATGTGGACGC GTGTCCGAAC GAGCCCGAGG ACAAGGACGG CTTCGAGGAC
CGCGACGGCT GCCCCGAGAC CGACAACGAC GGCGACGGCA TCCCCGACAC CGAGGATGGC
GCACCCAACG ACCCCGAGGA CAAGGACGGC TATCAGGACG AGGACGGCGT GCCCGATCCC
GACAACGACG ACGACGGCAT CCCCGACACC GAGGAGGCCT GCCCCGACGA GCCCGAGAAC
AAGAACGGCT ACCAGGACGA GGACGGCTGC CCCGACGAGC TGCCCGATAC CGATGGCGAC
GGTCACGTGG ACCGCGTCGA CGAGTGCCCG GAGCAGCCCG AGGACATGGA CGGCTTCGAG
GACGAGGACG GCTGCCCGGA CGAGGACAAC GACGAGGACG GCGTGGTCGA CAGCGCGGAC
AAGTGTCCCA ACGAGGCCGG CCCGGTCGAG AACCGCGGCT GCCCGGACAC CGATCGCGAC
GGCGACGGCG TGGTCGATCG TCTGGACAAC TGTCCCGACG AGGCCGGCAG CGAGCGCAAC
CAGGGCTGCA AGCGCCGCCA GCGCGTGCGC CTGTCCGGCG ACCGCCTCGA GATCCTCGAC
CGCGTGTACT TCCGCAGCAA TCGCGCGGTG CTGCAGCGGC GCTCGAACCC GCTGCTGCAG
AACGTGGCCC AGGTGCTCAT CGCGCACCCG GAGATCGAGC ACGTGCGGGT CGAGGGTCAC
ACCGACAACC GCGGTGATCC CACCTACAAC ATGAACCTGT CGCAGAGCCG CGCCGAGGCC
GTGGTCGCGT TCCTGGTCCA GGAAGGCGTC GAGGCCAAGC GCCTGGCCGC GGTCGGCTTT
GGCGAGACCC AGCCGCTCGA GGACAACAAG ACCCGGCGCG GTCGCGCGGC CAACCGCCGC
GTCGAGTTCA ACATCCTGTG GGACAAGCCG GCCGATCCGC CCGCCGAGGT GATCGAGGAG
CCGAGCGACG ACGCCGCCGG TGAAGGCGCT GGCGACGACG CTGGCGACGC TGCCGGCGAG
GGCGAGGCCA GCCAGGGCTC GTAG
 
Protein sequence
MRKQSIAIAC LGALGLLVHA DSAHAEGSAE LGTTQGVDST TVFRIDIVDA ETESIFWEGP 
VPLTLQLPSG GFHPTFPVLP SGQTAPAIAG EVGVYQLTLV GADIPAGTAW NLSVRDDQGE
EIPGRVFSRN WVMQPDDASA DGALSASFFA KTPTAAGQTG VIELRVDGAN SNDGGPTAFG
FQANASGIPG ANAGRSVPDN DTGNPIGVGN IALYLNPPAT ATYSVGAPQV SSLVFRGGSL
ACNSAEPGAG GTFAFNSSVA GTFHLMCDFD ADGVDISDDD DLLVVGTATP GTNTVAWNGE
DRNGSTVAAG SYQCQVRLTS GELHLVAEDV ETMFPGLRMF AIDSVGTSTP LAMYWNDSQI
QASAIPMPNG ELSLETSGAA GVSSGDPTGA PEANTNARAW GDFGEDGKGD GSLLDTYTWL
ADDLSASTEA VVIDSDADSD GDGLTDLEEL CDLGTDPDDT DSDDDGRLDG DEGTADTDGD
GLADPLDPDS DNDGILDGTE VGITTPSADT DVSRGNFVPD ADPNSTTDPT SAHSDTQGFS
DPTIGDPDGS EDSNHNGRVD AGESDPTESA DDSAIGGQPL VDSDGDGLVD AEETFFGSDP
NDADTDDDGL LDGDEPNWTT DTDGDGVINV FDPDSDNDGL FDGTEAGVAT ADADTDLSRG
VFRADAAPGT TTFVVVADSD RGGVRDGAED PNYNGRIDGG ELDPNDGADD ALNVPLDTDD
DGLTDDEESL IGSDPNDADT DDDGLRDGDE VHWMLDSDGD SLINVLDADS DNDGLFDGTE
AGVAAADADT DESRGTFIAD ADPSTTTSMI DADTDDGGVR DGAEDANHNG AIDGGETDPN
EGADDVAPED SDGDGLTDVE EATFGSDPND ADSDDDGLRD GDEPNWNIDQ DGDGFIGALD
PDSDNDGIFD GTEAGVVTAD PDTDLGAGAF VADADPSSTT SPIAADSDGG GVDDGAEDPN
HNGAIDSGEL DPEDAGDDGT PPADSDGDGL SDDEEAAFGT DPDDADTDDD GVRDGDEYNW
AHDFDGDGLI NARDGDSDDD GLFDGTERGV VTPDPDTNTS EGGSFIADAD PSTTTNSLDR
DTDDGGVEDG LEDLNHNGAL DAREIDPNLA DDDGLLDRDQ DTIIDTDEGV ADDDDDGIPN
YEDLDSDGDG ILDEDEAGDL DRETAPVDTD EDGTPDFLDL DTDNDTLPDA DEAGDDQLDT
PPVDTDGDGT ADFRDLDSDG DGTPDAEDPC PTDPDDACMM VGEDDRDGDG IPDNTDNCPD
TPNQGQLDQD GDGLGDLCDS DADGDGFDDD ISIGGGGCSS AGDGSLGALL LVLLALGLVT
RRRRRQAIAL AATLAVVLVL GSASSARAQV QDEGAFPAER FRLAADEEGV LHTEWGAVPG
HMAWDLALLF GYQDDPLTIY RERDGDRERD GALVSSRVSG SLVASLALWN RLALAIELPL
ILTQDDDPVS GVPVGDLERT GIGDIRVAPK VQLLSAANSG VDLAIIPTFT LPTGSAEDYR
GEQGVSFAPE VAISRAMGAW RLASNIGYRA RQNAHLADLD VNDELFLRLG AGYRLGETGG
PPLELDLGLS AATAAASPLG DYNQNHLELL TGARYHLPGP FSIGLGYGVG VTNGFGTPDW
RLFLTVRAAG RSDPDSDGDG ILDDVDACPN EPEDKDGFED RDGCPETDND GDGIPDTEDG
APNDPEDKDG YQDEDGVPDP DNDDDGIPDT EEACPDEPEN KNGYQDEDGC PDELPDTDGD
GHVDRVDECP EQPEDMDGFE DEDGCPDEDN DEDGVVDSAD KCPNEAGPVE NRGCPDTDRD
GDGVVDRLDN CPDEAGSERN QGCKRRQRVR LSGDRLEILD RVYFRSNRAV LQRRSNPLLQ
NVAQVLIAHP EIEHVRVEGH TDNRGDPTYN MNLSQSRAEA VVAFLVQEGV EAKRLAAVGF
GETQPLEDNK TRRGRAANRR VEFNILWDKP ADPPAEVIEE PSDDAAGEGA GDDAGDAAGE
GEASQGS
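
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The minimal sketch below illustrates this on the first and last blocks of the gene sequence. It assumes that translation table 11 assigns the same amino acids to elongation codons as the standard genetic code (the two differ in which codons may serve as starts; this gene begins with ATG, so no special start handling is needed here). The `translate` helper and the compact table layout are illustrative choices, not the database's own method.

```python
# Codon table built from the conventional TCAG ordering of the genetic code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 0, stopping at the first stop codon.

    Whitespace (such as the 10-base block spacing used above) is ignored.
    Codons containing letters other than A, C, G, T raise KeyError.
    """
    seq = "".join(dna.split()).upper()
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First 30 bases of the gene sequence above.
print(translate("ATGAGAAAGC AGAGTATTGC GATTGCCTGT"))  # MRKQSIAIAC
# Last 24 bases of the gene sequence above, ending in the TAG stop codon.
print(translate("GGCGAGGCCA GCCAGGGCTC GTAG"))        # GEASQGS
```

The two outputs match the first ten and last seven residues of the protein sequence listed above.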