Gene Hoch_0229 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_0229 
Symbol 
ID8542608 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp339038 
End bp342976 
Gene Length3939 bp 
Protein Length1312 aa 
Translation table11 
GC content71% 
IMG OID646385025 
Productpeptidase S8/S53 subtilisin kexin sedolisin 
Protein accessionYP_003264763 
Protein GI262193554 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.83599 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGAAGG TAATACGACC TTGGCTGTTT GCCGCTGGCT TACCCTTGTT CTTCGGACCT 
GCGGTCGGCT GTGACGGAGG GTTAGGAGAT ATCGATACCT CGCCTGCCGG CGAGGCCCCG
GCGCAGCGCC TGCCCGGAGA CACCGAGGCA GCGGCCCCCG GTGGTGGCAA CGGCGATGAC
GGCTCCGCGG GCAACGAGAG CGGCGTCGGC GATCCCGCGC TGGCCGACAA GCCGGCCTCG
TTCGTGCGCG ACCTGCGCGA CGACTTCGTC GAGGAGCCCG GGCTCACCGG CGAGCACAGC
TACATCGTGC ACTTCCGCGC CGCCCCGCTG GCCACCTACA GCGGCGGCGT CACCGGCCTG
GCGGCGACCA ACCCGCGCGC GGCCGGGACC TTTGGCAAGC TCGACGCCAA CAGCCCGGCC
AGCCGGGCGT ACAAGAGCTA CCTGCTGAGC GAGCAGGCAA GCGCCCTGAG CACGATCGAG
AAGCAGCTCG GGCGCCGGCT CAAGAGCGGC ACGCAGTACA CCACCGCGAT CAACGCCACG
CTCGTGCGCA TGAGCCAGGA CGAGGCCAAG CGCGTGTCGC GCATGGCCGG CGTGCGCTTC
GTCGAGCGCG ACCAGGCGGT GTGGCCCGAG ACCGACCGCG GTCCCATCTT CATCGGCGCG
CCCGGCATCT GGGACGGCAG CGCCACCGGC ATGCCCAGCG AAGGCGAGGG CATCACCGTC
GGCATCATCG ACTCGGGCAT GAACATCCAG GCCGAGATCG ACGGCGTCAC CGGCGCGCAT
CCGTCCTTCG CCGAGATGGG CGATGACGGC TACGTGCACA CCAACCCGCT GGGTTCGGGC
GTGTACCTCG GCGGCTGCGT CGACAACCCC GAGTGGTGCA ACGACAAGCT CATCGGCGTG
TTCTCGTTCC TCAATGCCCA GCCCAACCCG GGCGTCGACC CCAACGCGCC CGACGACGAT
CCGCTGTGGG GCTTCAAGGA CACCAGCGGC CACGGCTCGC ACGTGGCCTC GACCGCGGCC
GGCAACGTGC TCTTCGACGT GCCCGTGGTC GACGCCGACG GCAACCCCTC GTCGTTCTCC
TACGAGCGCA TCTCGGGTGT GGCCCCGCAC GCCAACCTGG TGTCGTTCAA GGTCTGCGCG
CCCTCGTGCT TTTTCTCGGA CATCGCCGGC GCGGTCGAGC AGGCCATCGA GGACGGCGTG
GTCGACGTGC TCAACCAGTC GATCGGCAAC TCGGGCGGCA GCCCGTGGAA CTCGACCTCG
GCGCAGGCGT TCTTGTCGGC GCGCGCGGCC GGCATCTTCG TCGCCGCCTC GGCCGGCAAC
GACGGCCCCG ACCCGGGCAC CGCCGGCCGC GGCAACAGCG CGCCGTGGGT GGCCGGCGTG
GCCGCGACCT CGCACGACCG CCGCTTCCCC GACAAGTTCC TCACCGACAT GGCCGGTGGC
GACACCCCGC CGCCGCCCGA GCTCAGCGGC CTCACCATCA GCGGCGGCAT CAGCGGACGC
CTGGTCTACG CCGGTGAGTT TCCGGTCGGC AACCCGGGCG AGCCCAACTT CGACCAGCCC
GAGCAGTGCC TCGAGCCCTT CCCGGCCGGC ACCTTCGAGC CCGACATGAT CGTCGTCTGC
GACCGCGGCG CCATCGCCCG CACGGCCAAG GGCCAGAACG TGCGCGACGG CGGCGCCGGC
GGCCTGGTGC TCGCCAACCT CGCCGGCGGC GCCACCAGCG TGGTCGCCGA TCCCCACGTG
CTCCCGGCGA TTCACATCGA CGCGGCGCAG GGCGACCTGC TGCGCGCGTG GCTGGCCAGC
GGCAGCGGTC ACACCGGCAC CATCACGGCC ACCGAGCAGC CCGTGAGCGA CCTGAGCGCG
GCCGACATCA TGGCCTCCTT CAGCTCGCGC GGCCCCTACG ACGCCTTCGA CATCCTGGCG
CCCAACGTCG CCGCCCCGGG CCTGTCGATC TTCGCCGCCG GCGCGCAGGT GCTCTTCGAC
CATCCCGGCT CGCCCTCGGT GCCCGGCCTG TTCGGCACCA TCCAGGGCAC CTCGATGGCC
AGCCCGCACG TCGCCGGCGC GGCCGCGCTG ATGAAGTCGG TGCATCCCGA CTGGAGCGAC
GCCGAGATCC TCAGCGCGTT CATGACCACG GGCCAGACCG AGGTGCGCAA AGAAGACGGC
GTGACCCCGG CTGATCCCCT CGATTACGGT GGTGGCCGCG TGCGCGTCGA CCTCGCCGCC
CAGGTCGCGC TGGTTTTCGA CCAGTCGGCC GAAGGCTTCG CGGCGGCCAA CCCGGCGCTC
GGCGGTGATC CGCTGGCGCT CAACGTCGCC GCCTTGACCG AGGACGTGTG CATCTCCAAC
TGCGTGTGGC AGCGCACCGT GCGCGCCGTC CGCGCGGGCA GCTTCGCGGC CACCGGCACC
GCCGGCATCA CCATCGAGCC GGCCGCGTTC ACCCTGGCCG AGGGCGAGGA GATGACGCTC
ACCTTCACCG CCAACACCGA CGGCCTGCCC GAGGAGATCT ACGGCTTTGG CGCCGTGACC
ATCACCTCGG ACATCGCCGA CGCGCCGGTG CAGACCATGC AGGTCGTGAT CAAGCCCGGC
CGCTCGAACA TCCCGCGCGC GATGTCCATC GAGGCCACGC GCGACCGGGA CTCGCGCCTG
ATCGACGAGC TGCGCGCGGT GCCCATCCCC GACCTCAGCG TCACCTCCTT CGGCCTCAGC
CGGGCGCAGG TCTCCGACGC CGCCTCCGGG GCCGACTCCG ACACCTCGTC GCCCTTCGAC
GACCTCGAGG ACGGCGTTCA CTTCGAGCTG GTGCCCTTCC CCGACTTCGC GCAGCAGTTC
ATCGCCGAGA CCATGAACTC GACCTCGCCC GACCTCGACC TGTTCGTCGG CCTCGACCTC
AACGGCGACG GCCTGCCCTC GGAGGACGAG CTGTACTGCT CGGCGGCCAC GGCCTCGGCG
GCCGAGCGCT GCACCTTCGA GCTCGGCGGC GACCTGTCCG ATTTCCCGGA CTTCTGGGTG
CTGGTGCAGA ACTTCCGCGC GTCCACGCCG GGCGCCGTGG ACACCTACGA CATCGCCACC
ACGTCCGTCG GCGTGAGTGA GGACGGCGCG CTGTCGCTGT CGGCGCCGAG CGCGGTGCCG
GGCGGACAGC CGTTCTCGGC GCGCGTGGTC TGGGACGCCG AGATGAGCGA GGGCGAGCTG
TATTACGGAC GACTGTCCGT GTTCAGCGAC AGCGCGCACA GCGACGCGAG CTCGCTCGGC
AACATCGACG TGCGCCTGCT GCGCGGGCCC GACGACGTGC AGATCATCAT GCCGCCGCGG
CTGCGCGCCG ACGAGGTGGC CGAGGTGACC GTGCGCGTGC AGCCCAACCA CACTGACGAG
CCGCGCGCGT ACGACATCGT CGTGCCCGTG GAGTTCGGCC TGTTCTACAT GCCCGGCTCG
GCCGCGGCCG ATGGTGGCAT CTTCGAGGAC GGCGCGGTGC GCTTCAGCAT CACGCGCGCG
CCGAACACCA CCGAGGCGCG GGACCTGCAC TTCCGCGTGC ATGTGCGGCT GCCGCTGGCG
GGCGTGGAGC TGCCGTTCAC CGAGACCGAC GCCGTCGACC TGCCATTCAC CGAGGAGGCC
ACGAGCGTCG CGTTTGCGAA CGTCCAGTAC TTCACATTCA TGGGCTTCCT GCCGCCGCTG
GTCGACGGCA GCACCGTGGT GGTGGGTGAG GTTGCGACCG CGGCCTTCCG GCTCCGCCTC
GTGACCGACA AGAGCACGGT CTTCAACGGC GTGGCGGACG CCACGGTGTA CAATGCCGAC
GGCGAGCCGG TGGCCGAGGG TCTGTTCCTC GCCCTCGGCA CCGATGTGCT CGAGTTCGAC
TTCGACACCG CCGGCCTCGA GCCGGGCGAG TACACCGTCG TGGCCGAATT GACCGACACC
TTCGTGTACT CGATGACCGT GAACCTGGTG GCCCCGTAG
 
Protein sequence
MKKVIRPWLF AAGLPLFFGP AVGCDGGLGD IDTSPAGEAP AQRLPGDTEA AAPGGGNGDD 
GSAGNESGVG DPALADKPAS FVRDLRDDFV EEPGLTGEHS YIVHFRAAPL ATYSGGVTGL
AATNPRAAGT FGKLDANSPA SRAYKSYLLS EQASALSTIE KQLGRRLKSG TQYTTAINAT
LVRMSQDEAK RVSRMAGVRF VERDQAVWPE TDRGPIFIGA PGIWDGSATG MPSEGEGITV
GIIDSGMNIQ AEIDGVTGAH PSFAEMGDDG YVHTNPLGSG VYLGGCVDNP EWCNDKLIGV
FSFLNAQPNP GVDPNAPDDD PLWGFKDTSG HGSHVASTAA GNVLFDVPVV DADGNPSSFS
YERISGVAPH ANLVSFKVCA PSCFFSDIAG AVEQAIEDGV VDVLNQSIGN SGGSPWNSTS
AQAFLSARAA GIFVAASAGN DGPDPGTAGR GNSAPWVAGV AATSHDRRFP DKFLTDMAGG
DTPPPPELSG LTISGGISGR LVYAGEFPVG NPGEPNFDQP EQCLEPFPAG TFEPDMIVVC
DRGAIARTAK GQNVRDGGAG GLVLANLAGG ATSVVADPHV LPAIHIDAAQ GDLLRAWLAS
GSGHTGTITA TEQPVSDLSA ADIMASFSSR GPYDAFDILA PNVAAPGLSI FAAGAQVLFD
HPGSPSVPGL FGTIQGTSMA SPHVAGAAAL MKSVHPDWSD AEILSAFMTT GQTEVRKEDG
VTPADPLDYG GGRVRVDLAA QVALVFDQSA EGFAAANPAL GGDPLALNVA ALTEDVCISN
CVWQRTVRAV RAGSFAATGT AGITIEPAAF TLAEGEEMTL TFTANTDGLP EEIYGFGAVT
ITSDIADAPV QTMQVVIKPG RSNIPRAMSI EATRDRDSRL IDELRAVPIP DLSVTSFGLS
RAQVSDAASG ADSDTSSPFD DLEDGVHFEL VPFPDFAQQF IAETMNSTSP DLDLFVGLDL
NGDGLPSEDE LYCSAATASA AERCTFELGG DLSDFPDFWV LVQNFRASTP GAVDTYDIAT
TSVGVSEDGA LSLSAPSAVP GGQPFSARVV WDAEMSEGEL YYGRLSVFSD SAHSDASSLG
NIDVRLLRGP DDVQIIMPPR LRADEVAEVT VRVQPNHTDE PRAYDIVVPV EFGLFYMPGS
AAADGGIFED GAVRFSITRA PNTTEARDLH FRVHVRLPLA GVELPFTETD AVDLPFTEEA
TSVAFANVQY FTFMGFLPPL VDGSTVVVGE VATAAFRLRL VTDKSTVFNG VADATVYNAD
GEPVAEGLFL ALGTDVLEFD FDTAGLEPGE YTVVAELTDT FVYSMTVNLV AP