Gene Hoch_5330 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_5330 
Symbol 
ID8547742 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp7327007 
End bp7329334 
Gene Length2328 bp 
Protein Length775 aa 
Translation table11 
GC content67% 
IMG OID646390004 
Productpeptidase S8/S53 subtilisin kexin sedolisin 
Protein accessionYP_003269708 
Protein GI262198499 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG4935] Regulatory P domain of the subtilisin-like proprotein convertases and other proteases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.879852 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGAATCC GACGAGCGAA TTTGGGATAC GGCATCTGGT CAGGCGTCCT CTTGGGCGCT 
CTGGCTGTTT CCAGCGGCTG CACCGACACC GGGATGGCCG TAGGCGCCGA CGAGGCGACC
TTCACGAACA TCCTCGATGA CAAAGCCGGC CCCTCCTACG ACCCGAGCAC GATCATCGTC
AAGTTCCGCG ACGTGGCCAC GGCGACGGCG GTCAACGACG CGATGGCGCT GGTCAAGGGC
TCGTTCAAGG ACGAGAACTT CGACGGCAAG GACGACCAGT TCGCGCACAT CGCGGGCGGC
AACTTGGCCG CGGTCAAGCT GGACGACAAG AACGGCGCCG ACGCCGCGAT CAAGGCGCTC
GCCGGTCACC CGGCCATCGA GTACGCCGAG CGCAACTACC TGTACAGCAT CCAGGCTGTG
CCCAACGATC CCGACTTCGG TGATCTCTGG GGCCTGAACA ACACCGGACA GGGCGGCGGC
ACGCCGGGCG CCCACGTGTC GGCGCTCGAC GCCTGGGACC TGTCCGTGGG TTCGCACGAC
GTGATCGTCG GCGTGATCGA CACCGGCGTG GCGTACGACC ACCCGGACCT GGTCGCCAAC
ATGTACATCA ACCCGGGCGA GATTCCGGGC AACGGCATCG ATGACGACGA AAACGGCTTC
ATCGACGACG TCCACGGCAT GAACGCGATC ACCGGCAGCG GCGATCCCTA CGACGACAAC
GACCACGGTA CGCACGTCTC GGGCACCATC GGCGCCTCGG GCGACAACGG CGTGGGCGTG
GTCGGCGTCA ACTGGGACGT CACCATCATG GGCCTCAAGT TCCTCTCGGC CGCGGGTAGC
GGCTCGACCG AGGGCGCCAT CGCCTGCATC GACTACGCCG TCATGATGAA GAACAGCGGC
GTCGACATCC GCGCGCTCAA CAATAGCTGG GGCGGCGGCG GCTTCAGCCA GGCGCTCGAG
GACGCGATCG CGGCTGCCGA TGCGGCCGAC ATCCTGTTCG TCGCGGCTGC CGGTAACTCC
AACAGCAACA ACGACTCGAC GCCGAGCTAC CCGGCCAGCT ACGAGGTCCC CAACGTCCTG
GCCGTGGCCT CGACCACGCG CACCGACGCC CGCTCCTCGT TCTCGAGCTA CGGCGCCACC
TCGGTCGATC TCGGCGCCCC CGGCTCCGAC ATCCTCTCGA CCATCCCGGG CGGCGGCTAC
GCCAGCTTCA GCGGCACCTC GATGGCGACC CCGCACGTGA CCGGCGCGGC CGCGCTGGCG
CTGGCCGCCT CGCAGGGCTC GCTGTCCACC GCGGAGCTCA AGGACCTGCT GATGAACACC
GGCGATCCCA TCGCCTCGAT GGACGGCGTG ACCGTCAGCG GCAAGCGCCT CAACCTGGCC
AACCTGCTGG CCGAGGCCGA CCCGCAGCCG GGCTTCCGCA TGGAGGTTCC GACCACCCCG
CTCATCATCT CGCAGGGTGA CAGCGTCAGC CTCGGCTTCG CCGTCTCCTC GGTGCTCGAC
TACTCCATCC CGGTCGACAT GACCCTCGAA GCCACGCCGG CCCTCGACGC CGAGGTGGTC
TTCACGCCCA ACCCGGTGCC CGTGGACAGC AGCGGTGAGC TGAGCATCAC CACCTCGACC
GCGACCGCGC CCGGCACCTA CAGCCTGGTC GTCAACGGCA CCAGCGGCGA GCTGCTGCGC
TCGCGCACCC TCCAGCTCAC CGTGTACCCC GAGGGCACCA CGGTCCACGA GTACGAGACC
GTCGATGGCG TGTCGATCCC CGATAACGAT CCGGCCGGCG TCACCAGCAC CCTCTCGGTG
CCCGACTCGC TCGAGATCAT CGACCTGACC GTGGACGTGA ATATCACCCA CACCTACATC
GGCGACCTGA TCGTCGAGCT GACCTCGCCC GCGGGCACCA CGGTTCGCCT CCACGATCGC
GCTGGCGGCG GCACCGACAA CCTGTACCAG TCGTTCGACC CGGCCGATTT CGACGGCGAG
AGCGCGGCTG GCGAGTGGAC CCTGTTCGTG TCCGACAACG CCGGTATCGA CCTCGGTACC
ATCGATAGCT GGGGCCTCAC CATCGTGGGT CTGGGCTTCG AGCCGGGCAT CGGTCTCGAC
CTGGTCTCGG CCGAGCGTCG CGGCTCGCGC GGTGCCGACA TCACCATCAG CTGGAGCGGC
AGCGATGCCG AGTTGATCGA CGTCTACCGC TACGGCGTGC TCATCGACAC GGTGCCCAAC
ACCGACTCGT ACCGCGATCG TTTCCGCTCC AGCGGCTCGG TCTTCATCTA CCAGGTGTGC
GAGGCCGGAA CCGACACCTG CTCGCAAGAG ATGACGGTCA CTCTGTAA
 
Protein sequence
MRIRRANLGY GIWSGVLLGA LAVSSGCTDT GMAVGADEAT FTNILDDKAG PSYDPSTIIV 
KFRDVATATA VNDAMALVKG SFKDENFDGK DDQFAHIAGG NLAAVKLDDK NGADAAIKAL
AGHPAIEYAE RNYLYSIQAV PNDPDFGDLW GLNNTGQGGG TPGAHVSALD AWDLSVGSHD
VIVGVIDTGV AYDHPDLVAN MYINPGEIPG NGIDDDENGF IDDVHGMNAI TGSGDPYDDN
DHGTHVSGTI GASGDNGVGV VGVNWDVTIM GLKFLSAAGS GSTEGAIACI DYAVMMKNSG
VDIRALNNSW GGGGFSQALE DAIAAADAAD ILFVAAAGNS NSNNDSTPSY PASYEVPNVL
AVASTTRTDA RSSFSSYGAT SVDLGAPGSD ILSTIPGGGY ASFSGTSMAT PHVTGAAALA
LAASQGSLST AELKDLLMNT GDPIASMDGV TVSGKRLNLA NLLAEADPQP GFRMEVPTTP
LIISQGDSVS LGFAVSSVLD YSIPVDMTLE ATPALDAEVV FTPNPVPVDS SGELSITTST
ATAPGTYSLV VNGTSGELLR SRTLQLTVYP EGTTVHEYET VDGVSIPDND PAGVTSTLSV
PDSLEIIDLT VDVNITHTYI GDLIVELTSP AGTTVRLHDR AGGGTDNLYQ SFDPADFDGE
SAAGEWTLFV SDNAGIDLGT IDSWGLTIVG LGFEPGIGLD LVSAERRGSR GADITISWSG
SDAELIDVYR YGVLIDTVPN TDSYRDRFRS SGSVFIYQVC EAGTDTCSQE MTVTL