Gene Hoch_0066 details

Gene Information

Locus tag: Hoch_0066
Symbol:
ID: 8542436
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haliangium ochraceum DSM 14365
Kingdom: Bacteria
Replicon accession: NC_013440
Strand:
Start bp: 95846
End bp: 99292
Gene Length: 3447 bp
Protein Length: 1148 aa
Translation table: 11
GC content: 73%
IMG OID: 646384853
Product: Lantibiotic dehydratase domain protein
Protein accession: YP_003264600
Protein GI: 262193391
COG category:
COG ID:
TIGRFAM ID:
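
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the reported protein length excludes the stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 95846
end_bp = 99292
gene_length_bp = 3447
protein_length_aa = 1148

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# An unspliced CDS is a whole number of codons. Assumption: the final codon is
# the stop codon and is not counted in the protein length.
codons, remainder = divmod(span_bp, 3)
expected_protein_aa = codons - 1

print(span_bp == gene_length_bp)                 # span matches the Gene Length field
print(remainder == 0)                            # length is a multiple of 3
print(expected_protein_aa == protein_length_aa)  # codons minus stop matches Protein Length
```

Under these assumptions the three fields agree: 3447 bp is 1149 codons, that is, 1148 amino acids plus one stop codon.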


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.433774
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCAAAT CGCCGAATTC CCCGCCGGTA AGCGCTGACG CGCGCTTCGT GCTGCGCACC 
CCGCTGTTGC CGCTGCAGAG CTTTCTCGAC TGGACCGCGG CCGGCACCGG CGTCTCGGTC
GAGGCGGTGC GGAACGCGCG CGCGCACCTG CGTCGCCTGA TCGATCAGCC CGTGGTGCGC
GAGGCGCTGT ATCTGGCGTC TCCGGGGCTG GTCGGCGACA TTCCTCTGTG GGAGCGGGAG
CCGCACAGCG TCCGCGGGCA GAAGATCGAG CGCGCGCTGG TGCGCTACGT GTCGCGCATG
AGCACCCGGG CGACGCCCTT CGGCCTGTTC TCGGGCGTCG CAGTTGGACA CGTGGGCGAA
AACACCCAGC TCGCGTGCGT GGACGCCGGC GCGTACCGGC GCAGCACGCG CCTCGACAAC
GACTATCTGT TCGCGCTGTG CAGCGCGCTG CGCGAGATCC CGGCGCTGCG CGAGGCGCTG
CACTGGCGGC CCAACAGCAG TCTGTACTCG CTGGCCGGCC GCTACCGCTA CGCCGAGGCG
CGCCTGCGCG GCACCCTGCG CAGCTATCAT CTAGTTGCCA TCGGCAGCAT GTCGTATATC
GCCGACACCC TGGAGCGCGC GCGGGGCGGC GCCTCCTTGC AGGCCCTGGC CCGGGCGCTG
GTCGCGGACG ACCCCGAGAT CGAGATGGGC GAGGCCGAGG CGTTTATCGA CGAGCTGGTC
GAAAGCCAGG TGCTCGAGTG CGATCTCGAG CCCGCGGTCA CCGGCCTCGA GCCGCTCGCC
GGGCTGCTGG CGATCCTCGA GGCGATCGCG CCGAGCGCGC GCGTGACCGG GGTGCTGCGC
GGGGTCTCGG CGCGCCTCGC GGCCCTCGAC GAGCGCGGCG TGGGCTGCGA CATCGCCGCC
TACGAGGACA TCGAGGATTC GCTGCGCGAG CTGCCGGCGG CCATCGACAA GGCCCGCCTG
TTTCAGGTCG ATCTCATCAA GCCGGCGCCC GAGGCGGTGC TGGGACGCGG GCTGGTCGAC
ACTGTGGCGC GCGGGGTCGA GGTGCTGCGG CGGCTCACGC CGCAACCCGG TAGCGGTCTC
CTCGATCGCT TCCGCGAGGC CTTTCGCGAG CGCTACGAAT CCCGCGAGCT GCCGCTGGTC
GAGGTTCTCG ACGAGGAGTC CGGGATCGGC TTCGGCACCA GCGATGACCC GGCGGCGTCG
GGCGCGCCGC TGGTCGCCGA TCTGCGCTTC GCGCCGCGAG CGGGCGATGG CCAGGAGACC
TGGACGGACT TCCATCACGA GCTGCTGCGC CGCCTCGAGC ACATCTGGGC CGAGGGCGGC
CGCGAGCTGG TGCTCGGCGA CGACGATATC GACGCCCTGT CCGCGAGCCA GCCGGCGCGC
CAGCCCGACG CCTTCTCGGT GCTGGGCGCG GTGCGGGCCA GCTCGCCGCA GGCGCTGGCC
GAGGGCGATT TCGAGGTCGA TCTGCGCAGC GCGGCCGGGC CCTCGGGCGC GCGTCTGCTG
GGCCGCTTTT GCCACGGCTC CGAGCCGGTT CACGAGCTGG TGCGCGCGCA TCTGCGCGCC
GAGGAGGCGC TGCGCCCCGA GGCCTGCTTC GCCGAGGTGG TGCATCTCAA CGAGGGCCGG
CTGGGCAACA TCCTGTGCCG CCCGGTGCTG CGCAGCTACG AGATCCCGTT TCTCGGACGC
TCGGGCGCGC CGCCCGAGCG CCGTCTGCCG GTGCAGGATC TGCTGGTGAG CGTGCGCGGG
GAGCGCATCG TGCTGCGCTC GCGCAGCCTC GATCGCGAGG TGGTGCCGCG CCTGAGCACG
GCGCACAACT TCTCGCGGCG CAGCATCGGG ATCTATCGCT TCCTGTGCGC GCTGCAGGCG
CAGGATGGCG ACGCGGTGTC GTGGTCGTGG GGACCGCTGG CGCGGGCCCG CTTCCTGCCG
CGCGTGCGCT GGGGCCGCGT GCTGTTCACC CGCGCCCGCT GGCTGCTCGA CGAGCGCGCG
CTGGCGCCGC TGGCCGAGGC CGTGCGCACG CGTCGCCACA AGCGCGCGAA GCCGGACACG
CGTACGGACG CGAAGCCGGG CGAGGCCGCC GCCGCCGAGC AGCGGGCGCT GGCCGAGCAG
CGGGCGCTGG CCGAGCAGCG CTCGATCTTC GAGGCGCTGC TCGGGCTGCG CGAGGCGTTG
GGGCTGCCGC GGCATGTGCG GCTGGGCGAG GGCGACAACG AGCTGGCGGT CGATCTCGAC
AACCCGCTGT CGGCGCTGAG CTTTGCGCAC CTGGTGGCCA AGCGCTCGAG CGCGACCCTG
TACGAATTCG ACCCCGAGCC CGCGCGCCAG CCCGCCCGCG GCCCCGAGGG CCGCTTTACC
CACGAGCTGG TGCTGTTGTT CACACGCGAC CCGAGCGCGG CCCGGACCGG CGCTCGCGAG
GACATGAAGA CGGCACCCGC GCTCGCGAGC GCGGCCGTCG AGGACGACGT CCCGGCGCGC
GGCACCCCGG CGCGCGGCGT CCCGGCGCGC GGCACCCCGG CGGTCGGTAC CCCGGTCGTG
GCAGCGGCGG CGAGCGCGGT GCCGCGCAGC TTCGCGCCGG GCGGGCCGTG GCTGTATCTC
AAGCTGTACA CGGGCGTGTC CACGGCCGAT ACCGTGCTGC GCGATGTCAT CGCCCCGGTG
CGCGAGCTGG CGTTCGACTC GGGCGCGGCG CAGCAGTGGT TTTTTCTGCG CTATCACGAC
ACCGGGCCGC ATCTGCGGGT GCGCTTCCGC GGTCCGCCCG GGCGACTCTA CAGCGAAGTC
CTGCCGGCCG CGCACGCGGC TCTGCAGCCG CTGATCGAGG ATGGCAGCGT GTGGCGCGTG
CAGATCGACA CCTACGAGCG CGAGCTGGAG CGCTACGGCG GCGCCGCCGG CATCGAGCTG
TACGAGGAGA TCTTCTGGCA CGACAGCGAC GCGGTGCTCG ACATCGTCGA GCTGCTCGAG
GGCGACGCCG GCGCCGACGC GCGCTGGCGC CTGGCCCTGC GCGGCGCCGA CATGCTGCTC
GACGATTTCG GCATGAGCAC GCGCGCGCGG CGCGAGCTGA TGGCGCGCGC GCGCGACAGC
TTCCGCGCCG AGTTCCGCGC CGACACCGCC ATGTTCAAGA AGATCGGCGA GCGCTTTCGC
GCCGAGCGCG GTGAGCTCGA GCGCCTGCTC GGCGCCGACC CGGCCGACGA CGCCGCCAGC
GATCTCGCGC CCGGGCTCGA GCTGCTGGCC CGGCGCAGCG AGCGCGTGCG CGCCGCGATC
GGCGCCTATC TCGACCGCGT GTCCAACCGC GACAGCGCCA TCCACATGCT CGAGCGCTGC
GCCAGCAGCG TGGTGCACAT GCACGTCAAC CGAATGCTCC ATGTGAGCCA GCGCGCGCAG
GAGCTGGTGC TCTATGATTT CCTGCACCGC TGGTACGCGG CCCGTAGCGC CCGCAAAACT
ACCTTGACGA AGAGTACGGA GAAGTGA
 
Protein sequence
MSKSPNSPPV SADARFVLRT PLLPLQSFLD WTAAGTGVSV EAVRNARAHL RRLIDQPVVR 
EALYLASPGL VGDIPLWERE PHSVRGQKIE RALVRYVSRM STRATPFGLF SGVAVGHVGE
NTQLACVDAG AYRRSTRLDN DYLFALCSAL REIPALREAL HWRPNSSLYS LAGRYRYAEA
RLRGTLRSYH LVAIGSMSYI ADTLERARGG ASLQALARAL VADDPEIEMG EAEAFIDELV
ESQVLECDLE PAVTGLEPLA GLLAILEAIA PSARVTGVLR GVSARLAALD ERGVGCDIAA
YEDIEDSLRE LPAAIDKARL FQVDLIKPAP EAVLGRGLVD TVARGVEVLR RLTPQPGSGL
LDRFREAFRE RYESRELPLV EVLDEESGIG FGTSDDPAAS GAPLVADLRF APRAGDGQET
WTDFHHELLR RLEHIWAEGG RELVLGDDDI DALSASQPAR QPDAFSVLGA VRASSPQALA
EGDFEVDLRS AAGPSGARLL GRFCHGSEPV HELVRAHLRA EEALRPEACF AEVVHLNEGR
LGNILCRPVL RSYEIPFLGR SGAPPERRLP VQDLLVSVRG ERIVLRSRSL DREVVPRLST
AHNFSRRSIG IYRFLCALQA QDGDAVSWSW GPLARARFLP RVRWGRVLFT RARWLLDERA
LAPLAEAVRT RRHKRAKPDT RTDAKPGEAA AAEQRALAEQ RALAEQRSIF EALLGLREAL
GLPRHVRLGE GDNELAVDLD NPLSALSFAH LVAKRSSATL YEFDPEPARQ PARGPEGRFT
HELVLLFTRD PSAARTGARE DMKTAPALAS AAVEDDVPAR GTPARGVPAR GTPAVGTPVV
AAAASAVPRS FAPGGPWLYL KLYTGVSTAD TVLRDVIAPV RELAFDSGAA QQWFFLRYHD
TGPHLRVRFR GPPGRLYSEV LPAAHAALQP LIEDGSVWRV QIDTYERELE RYGGAAGIEL
YEEIFWHDSD AVLDIVELLE GDAGADARWR LALRGADMLL DDFGMSTRAR RELMARARDS
FRAEFRADTA MFKKIGERFR AERGELERLL GADPADDAAS DLAPGLELLA RRSERVRAAI
GAYLDRVSNR DSAIHMLERC ASSVVHMHVN RMLHVSQRAQ ELVLYDFLHR WYAARSARKT
TLTKSTEK
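
The relationship between the gene sequence and the protein sequence above can be sketched in a few lines of Python. This is an illustrative sketch rather than the database's own method: the helper names `clean`, `translate`, and `gc_percent` are hypothetical, and the codon table is the standard one, which assigns the same amino acids as translation table 11 (the two differ in which codons may act as start codons).

```python
# Standard genetic code in NCBI "TCAG" order; table 11 uses the same
# amino-acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used to lay out the sequence."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_percent(dna):
    """Percentage of G and C bases in a non-empty sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 30 bases and final 27 bases of the gene sequence listed above.
first_block = "ATGAGCAAAT CGCCGAATTC CCCGCCGGTA"
last_block = "ACCTTGACGA AGAGTACGGA GAAGTGA"
print(translate(first_block))  # MSKSPNSPPV
print(translate(last_block))   # TLTKSTEK
```

The two printed fragments match the start and end of the protein sequence listed above. Passing the full gene sequence to `gc_percent` would give a value to compare with the GC content field.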