Gene Information |
Locus tag | Hoch_0066 |
Symbol | |
ID | 8542436 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haliangium ochraceum DSM 14365 |
Kingdom | Bacteria |
Replicon accession | NC_013440 |
Strand | + |
Start bp | 95846 |
End bp | 99292 |
Gene Length | 3447 bp |
Protein Length | 1148 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 646384853 |
Product | Lantibiotic dehydratase domain protein |
Protein accession | YP_003264600 |
Protein GI | 262193391 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
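The coordinate and length fields in the table above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The function name `cds_lengths` is illustrative and not part of the record.

```python
def cds_lengths(start_bp, end_bp):
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a terminal stop codon.
    """
    gene_length = end_bp - start_bp + 1
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon.
    protein_length = gene_length // 3 - 1
    return gene_length, protein_length


# Values copied from the Start bp and End bp rows above.
gene_length, protein_length = cds_lengths(95846, 99292)
print(gene_length, protein_length)  # 3447 1148
```

Both results agree with the Gene Length and Protein Length rows.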
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.433774 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCAAAT CGCCGAATTC CCCGCCGGTA AGCGCTGACG CGCGCTTCGT GCTGCGCACC CCGCTGTTGC CGCTGCAGAG CTTTCTCGAC TGGACCGCGG CCGGCACCGG CGTCTCGGTC GAGGCGGTGC GGAACGCGCG CGCGCACCTG CGTCGCCTGA TCGATCAGCC CGTGGTGCGC GAGGCGCTGT ATCTGGCGTC TCCGGGGCTG GTCGGCGACA TTCCTCTGTG GGAGCGGGAG CCGCACAGCG TCCGCGGGCA GAAGATCGAG CGCGCGCTGG TGCGCTACGT GTCGCGCATG AGCACCCGGG CGACGCCCTT CGGCCTGTTC TCGGGCGTCG CAGTTGGACA CGTGGGCGAA AACACCCAGC TCGCGTGCGT GGACGCCGGC GCGTACCGGC GCAGCACGCG CCTCGACAAC GACTATCTGT TCGCGCTGTG CAGCGCGCTG CGCGAGATCC CGGCGCTGCG CGAGGCGCTG CACTGGCGGC CCAACAGCAG TCTGTACTCG CTGGCCGGCC GCTACCGCTA CGCCGAGGCG CGCCTGCGCG GCACCCTGCG CAGCTATCAT CTAGTTGCCA TCGGCAGCAT GTCGTATATC GCCGACACCC TGGAGCGCGC GCGGGGCGGC GCCTCCTTGC AGGCCCTGGC CCGGGCGCTG GTCGCGGACG ACCCCGAGAT CGAGATGGGC GAGGCCGAGG CGTTTATCGA CGAGCTGGTC GAAAGCCAGG TGCTCGAGTG CGATCTCGAG CCCGCGGTCA CCGGCCTCGA GCCGCTCGCC GGGCTGCTGG CGATCCTCGA GGCGATCGCG CCGAGCGCGC GCGTGACCGG GGTGCTGCGC GGGGTCTCGG CGCGCCTCGC GGCCCTCGAC GAGCGCGGCG TGGGCTGCGA CATCGCCGCC TACGAGGACA TCGAGGATTC GCTGCGCGAG CTGCCGGCGG CCATCGACAA GGCCCGCCTG TTTCAGGTCG ATCTCATCAA GCCGGCGCCC GAGGCGGTGC TGGGACGCGG GCTGGTCGAC ACTGTGGCGC GCGGGGTCGA GGTGCTGCGG CGGCTCACGC CGCAACCCGG TAGCGGTCTC CTCGATCGCT TCCGCGAGGC CTTTCGCGAG CGCTACGAAT CCCGCGAGCT GCCGCTGGTC GAGGTTCTCG ACGAGGAGTC CGGGATCGGC TTCGGCACCA GCGATGACCC GGCGGCGTCG GGCGCGCCGC TGGTCGCCGA TCTGCGCTTC GCGCCGCGAG CGGGCGATGG CCAGGAGACC TGGACGGACT TCCATCACGA GCTGCTGCGC CGCCTCGAGC ACATCTGGGC CGAGGGCGGC CGCGAGCTGG TGCTCGGCGA CGACGATATC GACGCCCTGT CCGCGAGCCA GCCGGCGCGC CAGCCCGACG CCTTCTCGGT GCTGGGCGCG GTGCGGGCCA GCTCGCCGCA GGCGCTGGCC GAGGGCGATT TCGAGGTCGA TCTGCGCAGC GCGGCCGGGC CCTCGGGCGC GCGTCTGCTG GGCCGCTTTT GCCACGGCTC CGAGCCGGTT CACGAGCTGG TGCGCGCGCA TCTGCGCGCC GAGGAGGCGC TGCGCCCCGA GGCCTGCTTC GCCGAGGTGG TGCATCTCAA CGAGGGCCGG CTGGGCAACA TCCTGTGCCG CCCGGTGCTG CGCAGCTACG AGATCCCGTT TCTCGGACGC TCGGGCGCGC CGCCCGAGCG CCGTCTGCCG GTGCAGGATC TGCTGGTGAG CGTGCGCGGG GAGCGCATCG TGCTGCGCTC GCGCAGCCTC GATCGCGAGG TGGTGCCGCG CCTGAGCACG 
GCGCACAACT TCTCGCGGCG CAGCATCGGG ATCTATCGCT TCCTGTGCGC GCTGCAGGCG CAGGATGGCG ACGCGGTGTC GTGGTCGTGG GGACCGCTGG CGCGGGCCCG CTTCCTGCCG CGCGTGCGCT GGGGCCGCGT GCTGTTCACC CGCGCCCGCT GGCTGCTCGA CGAGCGCGCG CTGGCGCCGC TGGCCGAGGC CGTGCGCACG CGTCGCCACA AGCGCGCGAA GCCGGACACG CGTACGGACG CGAAGCCGGG CGAGGCCGCC GCCGCCGAGC AGCGGGCGCT GGCCGAGCAG CGGGCGCTGG CCGAGCAGCG CTCGATCTTC GAGGCGCTGC TCGGGCTGCG CGAGGCGTTG GGGCTGCCGC GGCATGTGCG GCTGGGCGAG GGCGACAACG AGCTGGCGGT CGATCTCGAC AACCCGCTGT CGGCGCTGAG CTTTGCGCAC CTGGTGGCCA AGCGCTCGAG CGCGACCCTG TACGAATTCG ACCCCGAGCC CGCGCGCCAG CCCGCCCGCG GCCCCGAGGG CCGCTTTACC CACGAGCTGG TGCTGTTGTT CACACGCGAC CCGAGCGCGG CCCGGACCGG CGCTCGCGAG GACATGAAGA CGGCACCCGC GCTCGCGAGC GCGGCCGTCG AGGACGACGT CCCGGCGCGC GGCACCCCGG CGCGCGGCGT CCCGGCGCGC GGCACCCCGG CGGTCGGTAC CCCGGTCGTG GCAGCGGCGG CGAGCGCGGT GCCGCGCAGC TTCGCGCCGG GCGGGCCGTG GCTGTATCTC AAGCTGTACA CGGGCGTGTC CACGGCCGAT ACCGTGCTGC GCGATGTCAT CGCCCCGGTG CGCGAGCTGG CGTTCGACTC GGGCGCGGCG CAGCAGTGGT TTTTTCTGCG CTATCACGAC ACCGGGCCGC ATCTGCGGGT GCGCTTCCGC GGTCCGCCCG GGCGACTCTA CAGCGAAGTC CTGCCGGCCG CGCACGCGGC TCTGCAGCCG CTGATCGAGG ATGGCAGCGT GTGGCGCGTG CAGATCGACA CCTACGAGCG CGAGCTGGAG CGCTACGGCG GCGCCGCCGG CATCGAGCTG TACGAGGAGA TCTTCTGGCA CGACAGCGAC GCGGTGCTCG ACATCGTCGA GCTGCTCGAG GGCGACGCCG GCGCCGACGC GCGCTGGCGC CTGGCCCTGC GCGGCGCCGA CATGCTGCTC GACGATTTCG GCATGAGCAC GCGCGCGCGG CGCGAGCTGA TGGCGCGCGC GCGCGACAGC TTCCGCGCCG AGTTCCGCGC CGACACCGCC ATGTTCAAGA AGATCGGCGA GCGCTTTCGC GCCGAGCGCG GTGAGCTCGA GCGCCTGCTC GGCGCCGACC CGGCCGACGA CGCCGCCAGC GATCTCGCGC CCGGGCTCGA GCTGCTGGCC CGGCGCAGCG AGCGCGTGCG CGCCGCGATC GGCGCCTATC TCGACCGCGT GTCCAACCGC GACAGCGCCA TCCACATGCT CGAGCGCTGC GCCAGCAGCG TGGTGCACAT GCACGTCAAC CGAATGCTCC ATGTGAGCCA GCGCGCGCAG GAGCTGGTGC TCTATGATTT CCTGCACCGC TGGTACGCGG CCCGTAGCGC CCGCAAAACT ACCTTGACGA AGAGTACGGA GAAGTGA
|
Protein sequence | MSKSPNSPPV SADARFVLRT PLLPLQSFLD WTAAGTGVSV EAVRNARAHL RRLIDQPVVR EALYLASPGL VGDIPLWERE PHSVRGQKIE RALVRYVSRM STRATPFGLF SGVAVGHVGE NTQLACVDAG AYRRSTRLDN DYLFALCSAL REIPALREAL HWRPNSSLYS LAGRYRYAEA RLRGTLRSYH LVAIGSMSYI ADTLERARGG ASLQALARAL VADDPEIEMG EAEAFIDELV ESQVLECDLE PAVTGLEPLA GLLAILEAIA PSARVTGVLR GVSARLAALD ERGVGCDIAA YEDIEDSLRE LPAAIDKARL FQVDLIKPAP EAVLGRGLVD TVARGVEVLR RLTPQPGSGL LDRFREAFRE RYESRELPLV EVLDEESGIG FGTSDDPAAS GAPLVADLRF APRAGDGQET WTDFHHELLR RLEHIWAEGG RELVLGDDDI DALSASQPAR QPDAFSVLGA VRASSPQALA EGDFEVDLRS AAGPSGARLL GRFCHGSEPV HELVRAHLRA EEALRPEACF AEVVHLNEGR LGNILCRPVL RSYEIPFLGR SGAPPERRLP VQDLLVSVRG ERIVLRSRSL DREVVPRLST AHNFSRRSIG IYRFLCALQA QDGDAVSWSW GPLARARFLP RVRWGRVLFT RARWLLDERA LAPLAEAVRT RRHKRAKPDT RTDAKPGEAA AAEQRALAEQ RALAEQRSIF EALLGLREAL GLPRHVRLGE GDNELAVDLD NPLSALSFAH LVAKRSSATL YEFDPEPARQ PARGPEGRFT HELVLLFTRD PSAARTGARE DMKTAPALAS AAVEDDVPAR GTPARGVPAR GTPAVGTPVV AAAASAVPRS FAPGGPWLYL KLYTGVSTAD TVLRDVIAPV RELAFDSGAA QQWFFLRYHD TGPHLRVRFR GPPGRLYSEV LPAAHAALQP LIEDGSVWRV QIDTYERELE RYGGAAGIEL YEEIFWHDSD AVLDIVELLE GDAGADARWR LALRGADMLL DDFGMSTRAR RELMARARDS FRAEFRADTA MFKKIGERFR AERGELERLL GADPADDAAS DLAPGLELLA RRSERVRAAI GAYLDRVSNR DSAIHMLERC ASSVVHMHVN RMLHVSQRAQ ELVLYDFLHR WYAARSARKT TLTKSTEK
|
| |
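The sequence fields can be sanity-checked against the rest of the record with a short script. The sketch below is not the database's own method. It assumes the space-separated 10-base blocks shown above are purely cosmetic, and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the tables differ in permitted start codons, which this sketch does not handle). The helper names `clean`, `gc_fraction`, and `translate` are illustrative.

```python
# Codon table in the conventional TCAG ordering; the amino-acid
# assignments of translation table 11 match the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the whitespace between blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate in frame 0, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First three blocks of the gene sequence above.
head = "ATGAGCAAAT CGCCGAATTC CCCGCCGGTA"
# Last 15 bases of the gene sequence (the final two blocks with the
# leading "AG" dropped so that the fragment stays in frame).
tail = "AGTACGGA GAAGTGA"

print(translate(head))  # MSKSPNSPPV
print(translate(tail))  # STEK
print(round(gc_fraction(head), 3))
```

The two translated fragments match the first ten and last four residues of the protein sequence. Passing the full gene sequence to `gc_fraction` and `translate` would allow the same comparison against the GC content and Protein Length rows.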