Gene Hoch_2015 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_2015 
Symbol 
ID8544397 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp2779743 
End bp2780957 
Gene Length1215 bp 
Protein Length404 aa 
Translation table11 
GC content69% 
IMG OID646386718 
Productpeptidase S8/S53 subtilisin kexin sedolisin 
Protein accessionYP_003266453 
Protein GI262195244 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1404] Subtilisin-like serine proteases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.600934 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTTCCTC GGAAGTTCGT GTTCCTCCTC GCCGCCAGCA CCCTTGGCGG CTCTCTGCTC 
ACCGGCTGCG CCACCGACAT GGAGGTGGAC GCCGGTAGCG ACCTCGACAC GGTGGCGACC
ACGGTCGCGC CGCTGCTCGG TGGTGAAGGG TATGCCGCGC GCATCCCCGG GCAGTATCTG
GTCATGTTCC ACGACGGCAT CGCGACGACC AGCGTCGACG CGGCCCTGGA CATGGTGGAA
GCCAGCCCGG CCAACGAAGT GCTGTTCACC TACTCGGTGA TCAACGGCTT CGCCGCCAAG
CTCGACGACA AGTCCCTGGA CGCGCTCCGT CGCAATCCGT CCGTGGCCTA TATCGAGTAC
GATCAGGTGG CCACCATCAA CGCCGTGCAG AGCGGCGCGC GCCCCGGTCT CGACCGCATC
GATCAGCGCA ACCGGCCGCA CAACGGCAGC TATGACGATC GCGGCTTCAA CGGCACCGGC
ACGCACATCT ACGTGATCGA CACCGGCATC CGCGCGCACT CCGAGTTCAG CGGCCGCCTG
GGCGCCGGCG CGACCGCCAT CAACGACGGC CGCGGCACCG ACGACTGCAA CGGCCACGGC
ACCCACGTCG CCAGCAGCGC CGCCGGCACC CTCACCGGCG TAGCCAAGAA CGCCACCCTG
CACGCCGTGC GCGTGCTCGA CTGCAACGGC TCGGGTTCGA ACTCGGGCGT CATCGCCGGC
ATCGACTTCG TGCGCACCAA CGGCGTGCGT CCGGCCGTGG CCAACATGAG CCTGGGCGGC
GGCGCCTCCT CGGCCGTGGA CACCGCCATC CGCAACCTGT TCAACAGCGG CGTGCTGCCG
GTGGTGGCCG CGGGCAACGA GAACCAGAAC GCCTGCAACG TCTCGCCCGC GCGCGCCCCC
GAGGCGCTCA CCGTGGCCGC GGTGGACGAC AACGACCGCC GCGCCTCGTT CTCGAACTTC
GGTAGCTGCG TGGACATCTT CGCCCCGGGC GTGAACGTGC GCGGCGCCAG CATCAACGGC
TCGAACTCGT TCGTCAACCT GTCCGGTACC TCGATGGCCA GCCCGCATGC CGCCGGTGTG
GCCGCCATGG TGCTCGACAA GAACACGGGC GCCTCGGCCA GCTCGGTGAC CAGCAGCATC
ATCTCGGCCG CGACCACGGG CGTGGTCAGC AACCGCAGCA GCGCGCCCAA CCGGCTGCTG
TTCAACGGTA TCTGA
 
Protein sequence
MFPRKFVFLL AASTLGGSLL TGCATDMEVD AGSDLDTVAT TVAPLLGGEG YAARIPGQYL 
VMFHDGIATT SVDAALDMVE ASPANEVLFT YSVINGFAAK LDDKSLDALR RNPSVAYIEY
DQVATINAVQ SGARPGLDRI DQRNRPHNGS YDDRGFNGTG THIYVIDTGI RAHSEFSGRL
GAGATAINDG RGTDDCNGHG THVASSAAGT LTGVAKNATL HAVRVLDCNG SGSNSGVIAG
IDFVRTNGVR PAVANMSLGG GASSAVDTAI RNLFNSGVLP VVAAGNENQN ACNVSPARAP
EALTVAAVDD NDRRASFSNF GSCVDIFAPG VNVRGASING SNSFVNLSGT SMASPHAAGV
AAMVLDKNTG ASASSVTSSI ISAATTGVVS NRSSAPNRLL FNGI