Gene Hoch_2267 details


Gene Information

Locus tag: Hoch_2267
Symbol:
ID: 8544653
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haliangium ochraceum DSM 14365
Kingdom: Bacteria
Replicon accession: NC_013440
Strand:
Start bp: 3152046
End bp: 3153590
Gene Length: 1545 bp
Protein Length: 514 aa
Translation table: 11
GC content: 68%
IMG OID: 646386972
Product: peptidase S8/S53 subtilisin kexin sedolisin
Protein accession: YP_003266703
Protein GI: 262195494
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG1404] Subtilisin-like serine proteases
TIGRFAM ID:
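
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the gene length includes the stop codon; the record does not state either convention, and the function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes the stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


length = gene_length_bp(3152046, 3153590)
print(length)                     # 1545
print(protein_length_aa(length))  # 514
```

Under those assumptions both values agree with the Gene Length and Protein Length fields of the record.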


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0802197
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.0539974
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCAAGCAC GCAACGTGTT CCGCTTTATG GCTGCTGGTC TGATCAGCAG CGCGCTCGCC 
ACTGGCTGCG CCACGGTCGA AGAAGATTTC ATTGAGGACG TCGGCTCCAC CAGCGCCCCG
CTGCTGGGCA TGGACAGCCC CGACATCATC CCCGGCCAAT ACATCGTGTC GTTCAAGAGC
GACATCGGTG CCCAGAACGT GAACGCGGCC ATGAACCGCG TCTCGCTCAA GAGCAAGAGC
AGCCGCATCC AGCACACCTA CGCCGGCGCC TTCAACGGCT TCGCTGCCAA GCTGAGCGAC
GCGGACCTGC AGGACATCCT CAAGAACGAC AGCGTCGCCT TCGTCGAGGC TGACCAGATG
ATGTACGCCA CGGCGACCAA GCCGAGCTCG GGTCAGCTCG AGCTCGACCG TCACCACGCC
TGCCCGGCTG TCGATGACGG CGTGTTCGAC GATCACGGCT GCGACGGCTC CGGCGTGCGC
GTGTACATCG TGGACACCGG CATCCGCGGC TCGCACAACG AGTTCACCGG CCGCATGGCC
ACGGGCTTCG ACGCCATCAA CGACGGCAAC GGCACCAACG ACTGCCAGGG TCACGGCACC
CACGTCGCCT CGACCGCGGC CGGCAACCAG TTCGGTATGG CCAACCGCGC CACCCTGGTT
CCGGTGCGCG TGCTGAGCTG CTCGGGTTCG GGCTCGAACT CGGGCGTCAT CGCCGGCGTC
AACTTCGTCG CCAGCGACTG CCAGGGCCGC CGCTGCGTGG CCAACATGAG CCTCGGCGGC
GGTGCCTCGA GCGCTCTCGA CAACGCCGTC ACCAGCGCGG TCAACGCCGG CATCGCCTTC
GCGGTCGCCG CCGGTAACGA CAACAGCAAC GCCTCGGGCT TCTCGCCGGC TCGCGCCGCC
GCGGCCATCA CCGTCGGCGC CGCTTCGGAC TCGGGTTACC CGGCCAGCAC CAACTCGAAC
AGCGTCACCC GCGCCTCGTT CTCGAACTTC GGCAGCGTGG TGGACATCTG GGCCTCGGGT
CTGTCGATCC TCGGCGCCAA CATCAACAGC AACAGCTCGA CCCAGACCAT CTCGGGCACC
TCGATGGCCA GCCCGCACGT GGCCGGCGCC ATCGCCCAGA TGCTGGGCTG CCTCGGCAAC
ATGACTCCGG CCCAGGTCGA GGCGCAGCTC AACGCCAAGT CGATCACCGG CGCGATGTCC
AACGAGCAGG GCGCTGAGGA CCGCTTCCTG TGCAGCGACT TCAACTCGGC CAACGACGCT
GGCGACTGCG ATTGCGGCGG TGGCAGCGAG CCGCCCCCGG CCGACTCCTG CGAGGGCCGC
TGCGGCACCT TCGACTCGGG CGCTTCCTGC CAGTGTGACG ACCAGTGCGC GGACTTCGGT
GACTGCTGCC CGGACAAGGC CGCTGAGTGC GACGCTCCGC AGCCCGGCCC CGACACCTGC
TTCCAGGCTT GCGGCGTGTT CAACAGCAGC CGTCAGTGCC AGTGCGACTC GGCTTGCTCG
AACTTCGGCG ACTGCTGCCC CGACCTCGGC CAGTTCTGCA ACTGA
 
Protein sequence
MQARNVFRFM AAGLISSALA TGCATVEEDF IEDVGSTSAP LLGMDSPDII PGQYIVSFKS 
DIGAQNVNAA MNRVSLKSKS SRIQHTYAGA FNGFAAKLSD ADLQDILKND SVAFVEADQM
MYATATKPSS GQLELDRHHA CPAVDDGVFD DHGCDGSGVR VYIVDTGIRG SHNEFTGRMA
TGFDAINDGN GTNDCQGHGT HVASTAAGNQ FGMANRATLV PVRVLSCSGS GSNSGVIAGV
NFVASDCQGR RCVANMSLGG GASSALDNAV TSAVNAGIAF AVAAGNDNSN ASGFSPARAA
AAITVGAASD SGYPASTNSN SVTRASFSNF GSVVDIWASG LSILGANINS NSSTQTISGT
SMASPHVAGA IAQMLGCLGN MTPAQVEAQL NAKSITGAMS NEQGAEDRFL CSDFNSANDA
GDCDCGGGSE PPPADSCEGR CGTFDSGASC QCDDQCADFG DCCPDKAAEC DAPQPGPDTC
FQACGVFNSS RQCQCDSACS NFGDCCPDLG QFCN
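
The relationship between the two sequence blocks can be sketched in code. The example below is a minimal illustration, not the database's own pipeline: it strips the 10-character grouping, translates with the codon assignments of NCBI translation table 11, and reads an initiator codon such as GTG as methionine. The helper names (`clean`, `translate`, `gc_fraction`) are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (same assignments in NCBI tables 1 and 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Initiator codons listed for table 11; at position 1 they are read as Met.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a CDS from its first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        codon = dna[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            protein.append("M")  # initiator codon is read as methionine
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":
            break  # stop codon ends the protein
        protein.append(residue)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First seven codons of the gene sequence above.
print(translate("GTGCAAGCAC GCAACGTGTT C"))  # MQARNVF
```

Passing the full gene sequence to `translate` should give a string that can be compared with the protein sequence after `clean` is applied to it, and `gc_fraction` can be compared with the GC content field in the same way.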