Gene Hoch_2054 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_2054 
Symbol 
ID8544436 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp2841254 
End bp2842477 
Gene Length1224 bp 
Protein Length407 aa 
Translation table11 
GC content68% 
IMG OID646386757 
Productpeptidase S8/S53 subtilisin kexin sedolisin 
Protein accessionYP_003266492 
Protein GI262195283 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1404] Subtilisin-like serine proteases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.390649 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.93164 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGTAAGA CCTTCAGCTC TCTTCTCGCG ACTGGCCTGG TGGGCGGTCT CCTCGGCGGC 
GCGCTCCTGA CCGGTTGCGC CATGGACTCC CTCGGCGATG CGCCCGAGGA CTTCGAGGTG
GGCACCACCA TGGCGCCGCT GCTGGGCGGC GGTCCCTCTG CGCAGGTCAT TCCCGATCAG
TACGTGGTGG TGTTCCACGA CGAGGTCGGC GTGGCCAGCG TCAACAGCGC CATGGACCTG
GTCGAGGAGA GCAAGGAAAA CGAGATTCTG CACACCTACA CGGTGCTCAA CGGTTTCGCC
GGCAAGCTCG ACGACGCCAC CCTGGACAAG CTGCGCCGCG ACTCGCGCGT GGCCTACATC
GAGCGCGACC AGGTGATGAG CATCGACGCC ACGCAGACCG GCGTGCGCCC CGGTCTCGAC
CGAGTCGATC AGCGCAACCT GCCCGGCAAT AACAAGTACA ATGACCGCGG CTTCAACGGC
TCCGGCGTAC ACGTGTACGT CATCGACACC GGCATCCGCT CGCACTCCGA GTTCGGCAAT
CGCCTCAAAG CGGGCGCCAC GGCTATCAAC GACGGCCGCG GCACCGACGA CTGCAACGGC
CACGGCACCC ACGTGGCCAG CACCATCGCG GGCACGGTCA CGGGCGTGGC CAAGAACGCC
CGCGTCCATC CGGTGCGCGT GCTCGACTGC AACGGCTCGG GCTCGACCTC GGGCGTCATC
GCCGGCGTCG ACTTCGTGCG CACCAATGGC GTGCGCCCGG CGGTGGCCAA CATGAGCCTG
GGCGGCGGCG CCTCAAGCGC GCTCGACAGC GCCATCAACA ACCTCTTCGA CTCCGGCGTG
CTGCCGGTGG TGGCCGCTGG CAACGAGAAC CAGGACGCCT GCAACGTGTC GCCGGCGCGC
GCCTCCAAGG CCTTCACCGT GGCCGCGGTC GACAACAACG ACCGCCGCGC CTCGTTCTCG
AACTTCGGCT CCTGCGTGAA CATCCACGCC CCGGGCGTGA GCGTGCGCGG CGCCAGCATC
AGCGGCAACA ACGCCTTCGT CAACCTGTCC GGCACCTCGA TGGCCAGCCC GCACGTGGCC
GGCGTGGCCG CGATGTTCCT CGACCGCAAG AACGGCGCCA CCGCCGGCAA CGTGCGCAAC
AAGCTGCAGA ACATCGCCAC CACCGGCAAG GTGACCAACC GCCAGGGCTC GCCCAACCGC
CTGCTGTTCA ACGACATCCG CTGA
 
Protein sequence
MRKTFSSLLA TGLVGGLLGG ALLTGCAMDS LGDAPEDFEV GTTMAPLLGG GPSAQVIPDQ 
YVVVFHDEVG VASVNSAMDL VEESKENEIL HTYTVLNGFA GKLDDATLDK LRRDSRVAYI
ERDQVMSIDA TQTGVRPGLD RVDQRNLPGN NKYNDRGFNG SGVHVYVIDT GIRSHSEFGN
RLKAGATAIN DGRGTDDCNG HGTHVASTIA GTVTGVAKNA RVHPVRVLDC NGSGSTSGVI
AGVDFVRTNG VRPAVANMSL GGGASSALDS AINNLFDSGV LPVVAAGNEN QDACNVSPAR
ASKAFTVAAV DNNDRRASFS NFGSCVNIHA PGVSVRGASI SGNNAFVNLS GTSMASPHVA
GVAAMFLDRK NGATAGNVRN KLQNIATTGK VTNRQGSPNR LLFNDIR