Gene Hoch_3063 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_3063 
Symbol 
ID8545451 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp4226083 
End bp4228584 
Gene Length2502 bp 
Protein Length833 aa 
Translation table11 
GC content70% 
IMG OID646387734 
Productpeptidase S8/S53 subtilisin kexin sedolisin 
Protein accessionYP_003267462 
Protein GI262196253 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1404] Subtilisin-like serine proteases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.561591 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.891031 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACTCCC CGCGTAAGTC ACCATCGCTG CTCACGCTCG CCGCCTGCCT GGCCCTCGCC 
GGTTGCGGCG ACAACCTCGA CCCCATCGGC GCCGACCCGG ACGCGGGCGT GCCCACCCCG
GACGCGCCCT CGCCGGTGAA CTCCATTATT CGCCCGGAGA ATCCCATTCC CGGGCACTAT
CTGTTCCTGC TCGACGCCGA GCAGGTGCCG CCGCCGAGCG TGCAGGAGGT CGCCGATGAG
CTGACCGCGG ACCGCGGCGT GGTGCAGTCG ATCATGGACG GCAGCATCCT GGCCTTCATG
GCCAATGAGC TCGACGACGA GTCGGCGCTG GCGATCCTCG AGGACCCGCG CGTGACCGGC
GTCGGACAGG ACGGCTACCT CGAGGTCGAC TGGGCGCCGA CGCCGTTGGT CGACCAGAAC
AATCCGGTCT GGGGCCTCGA CCGCATCGAT CAGAGCGCGC TGCCCTTTGA CGACATCTAC
CAGCAGAACA CCACCGGCGC CGGCGTCCAC GCCTACATCA TCGACACCGG CGTCAACGCC
GCGCACAGCG AGTTCGCCGG TCGCATGGGC AACGGCATGA GCGCGATCCT CGACGGCGGC
GGCACGGCCG ACTGCCAAGG ACACGGCTCG CACGTCGCCG GCACCGTCGG CGGCACCCAG
TGGGGCGTGG CCAAGGACGT CACCATCCAC GCCGTGCGCG CGCTCGGCTG CGACGGTCGC
GGCAGCCTCA GCGGCGTCAT CAAGGCCGTC GATTGGGTCA CCATGCACCA CGTCAAACCC
GCCGTGGTCA ACATGAGCCT CGGCAGCGGA CGCAACGATC TGGTCAACCA GGCGGTGCAG
GCGTCCATCG ACGCCGGCTT GATCTACGCG GTCGCGGCCG GCAACGAGAA CACCGACTCG
TGCAATCGCA GCCCGGCCAG CGTCGGCGAC GCGCTCACGG TCGGCGCCTC GGACATCGAG
GACACGCGCG CCTGGTTCTC GAACTTCGGT AGCTGCGTCG ACCTGTTCGC GCCCGGCGTC
GATATCACCT CGAGCCTGCA CAACGACAAC AGCGGCAGCC GCACCATCAG CGGCACCTCC
ATGGCCACGC CGCACGTGGC CGGCGTCATC GCCCTGTACC TCGAGAGCCA CCCCGACGCC
ACCCAGGCCG AGGTCAACGC CGCCCTGATC GCGGCCGCGA CCCCGGACGT CATCGCCGAC
CCGCAGGGCT CGCCCAACCT GCTGCTGTAC TCGCTGTTCA TCGACCCGCC GGTGTCGCCG
TGTGTGGCTG ATCCCGGCGC GCCCGGCTGC GACCAGCCGG CGAGCTGCGC CGAGGTCCTG
GCCCGCGATC CCGCCGCCAC CGACGGCGAG TACACCCTGT ACATCGGCGG CGACGAGGGC
CAGCCCTGGA CCGCGTACTG CCACGACATG GCCGGCACCC CGGCCGAGTA CCTCACCCTG
GTCAACACCG GCGGCGACAG CAACTTCGGC CAGTACACGG CCGGCGGCTC GAGCCCGGGC
ACCAACGTGC GCACGAGCTA CACCCGCGTG CGCATCGACC CCGCGGATAT GTCGATCGTC
ATCGTGGACA AGACCTTCTC CACCTCGACC GGCAGCCTCG ACCACGGCGG AACCGCGGTC
GACCGCATGT CCTATGCCAT CGCCATGAGC TGCGATTCGT CGGACAGCGG CGTGGCCAAC
GTCGATCTGC GCGGCACGCC CTTCGCCATC GACGGTGAGT TCTGCCAGGG CGGCTTCAAC
TCCGTCGGCG GAACCACCGT GAGCGCGAAT GACCAGGTGG CCGACCTTAC CGGCGGCGGG
TTCTGCGGCT ACACCGCGCC CGTCGTGCCC GGCGCGCCCA ACTGCCCGGG CGGCTTCAAC
GACCAGCCCT CGCAGCACAG GCTGCCGCTG CGCTACCTGG CCGTGCCCGC GTCGTGCGCC
GAGATCAAAG CCGCGAACGC CGGCGCCGCC GACGGCGAGT ACACGCTGTA TGTCGATCGC
GACGAGAGCA AGCCCTGGAC CGCGTACTGC CACGACATGG ACGGCACGCC CAGCGAGTTC
CTGAGCCTGC CCAGCGGCGC CGACAAGAAC TTCGCCCAGT ACACGGGCGG CGGCGCGACC
CCGGGCGAGA GCGTGCGCAC CAGCTACAGC CGCGTGCGCA TCAACCCCAA GGATCTGAGC
ATCGACATCT TGGACCAGAC CTTCGCCAGC TCGAGCGGTT CGCTGGTGCA CGGCGGTCAG
CCCGTCAATT TCATGCCCTA CGGCGTGGCC ATGAGCTGCG ACCAGAGCGC CAGCGGCGTC
GCCAGCATCG ACCTCAGCGG CACGCCCTTC GCCGTGGCTG GCGAGTTCTG CCAGGCGGGC
TGGCGTCCGC AGGGCGCGTC TACCCCGAGC CAGGGCGGAC AGGTGGTCGA GATCAGCGGC
GGCGGCTACT GCGGCTGGTC GGCGCCGAGC GGCCAGAGCT GCCCGTACAA CCCCTTCAAC
GCCAACCCGC AGAACGCCGT CATCTCGCTC AGCTACCAGT GA
 
Protein sequence
MNSPRKSPSL LTLAACLALA GCGDNLDPIG ADPDAGVPTP DAPSPVNSII RPENPIPGHY 
LFLLDAEQVP PPSVQEVADE LTADRGVVQS IMDGSILAFM ANELDDESAL AILEDPRVTG
VGQDGYLEVD WAPTPLVDQN NPVWGLDRID QSALPFDDIY QQNTTGAGVH AYIIDTGVNA
AHSEFAGRMG NGMSAILDGG GTADCQGHGS HVAGTVGGTQ WGVAKDVTIH AVRALGCDGR
GSLSGVIKAV DWVTMHHVKP AVVNMSLGSG RNDLVNQAVQ ASIDAGLIYA VAAGNENTDS
CNRSPASVGD ALTVGASDIE DTRAWFSNFG SCVDLFAPGV DITSSLHNDN SGSRTISGTS
MATPHVAGVI ALYLESHPDA TQAEVNAALI AAATPDVIAD PQGSPNLLLY SLFIDPPVSP
CVADPGAPGC DQPASCAEVL ARDPAATDGE YTLYIGGDEG QPWTAYCHDM AGTPAEYLTL
VNTGGDSNFG QYTAGGSSPG TNVRTSYTRV RIDPADMSIV IVDKTFSTST GSLDHGGTAV
DRMSYAIAMS CDSSDSGVAN VDLRGTPFAI DGEFCQGGFN SVGGTTVSAN DQVADLTGGG
FCGYTAPVVP GAPNCPGGFN DQPSQHRLPL RYLAVPASCA EIKAANAGAA DGEYTLYVDR
DESKPWTAYC HDMDGTPSEF LSLPSGADKN FAQYTGGGAT PGESVRTSYS RVRINPKDLS
IDILDQTFAS SSGSLVHGGQ PVNFMPYGVA MSCDQSASGV ASIDLSGTPF AVAGEFCQAG
WRPQGASTPS QGGQVVEISG GGYCGWSAPS GQSCPYNPFN ANPQNAVISL SYQ