Gene Hoch_0055 details


Gene Information

Locus tag: Hoch_0055
Symbol:
ID: 8542425
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haliangium ochraceum DSM 14365
Kingdom: Bacteria
Replicon accession: NC_013440
Strand:
Start bp: 86097
End bp: 87347
Gene Length: 1251 bp
Protein Length: 416 aa
Translation table: 11
GC content: 72%
IMG OID: 646384843
Product: peptidase S1 and S6 chymotrypsin/Hap
Protein accession: YP_003264590
Protein GI: 262193381
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG5640] Secreted trypsin-like serine protease
TIGRFAM ID:
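
The coordinate and length fields above are consistent with one another. The following is a minimal Python sketch of that check, assuming the start and end positions are 1-based and inclusive and that the gene length includes the terminal stop codon. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int, includes_stop: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length.

    Assumes the CDS is a whole number of codons and, by default,
    that the last codon is a stop codon that encodes no residue.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    return codons - 1 if includes_stop else codons


# Values taken from the record above.
length = gene_length_bp(86097, 87347)
print(length)                     # 1251
print(protein_length_aa(length))  # 416
```

Both results agree with the Gene Length and Protein Length fields of this record.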


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.16054
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCCCCTC GCTCCGCTTC TTCTCCCGTC GCTCCGTCCA CCTCGCTGGC GCTGGCCGCC 
CTGCTGGCGC TGGCCGCAGG CAGCTCCGCC TGCGCCGAGG GCTCGATCTT CGGCGCCGAC
AAGTTCTGCG GCGACACGCG CACGGCGCCC AAGGTCTACT ACGGCACCCT GCTGCCCTCG
CTGGTGCCGC TCACGCCCGA GCAGTTGCTC TCGGTGGGCA GCTTCCGGCT GTGCAGCGGC
ACGCTGATCG CGCCCACCTG GGTGCTCACG GCCCAGCACT GCGGCCTCGC GGTCGGCGCC
AACTTCTGCA TGGGCGACTC GCCCAGTGAC CCGGACGTGT GCATCCGCTC CTCGGCCGTG
TACAACCATC CCGACGCCGA CATCACCCTG GTCGAGCTGA GCGCAGACGC GCGCGGCGCG
GTGCCGGGTG TGGTGCCGCT GCCGTACATG GACGAGTCCC TGGGCCAGGA GTGGATCGGC
CGCCGCGCCG AAGCTGCCGG CTACGGCCAG ACCGAGACCG GCGCGCTGGG CACGCGCTAC
TTCACGGCCG AGCCCATCGT CGCGCTCGAC AGCGGCTTCG CGACCATCGA CGGCGAGGGT
CAGCGCGGCG TGTGTTTTGG CGACTCGGGC GGACCGCTGC TGATCATCGC CGACGATGGC
ACCGTGCGCG TCGCCGGCGT GCTCAGCAAC GGCGACGACA CCTGCGTGGG CCGCGACAAC
TTCACCCGCG CCGACAGCTA CCGCGACTGG ATCGCCAAGT ACGCGGGCGA GCCGCAGCTC
GGCGAGGGCG ATCAGAGCTG CGTCCAGCTC GGCCGCGTCG GCCGCTGCGA GACCGAGCGC
GCGGTGTGGT GCGGCAACGA GCGGGTCCAG ACCGAGGTCT GCGCGGCCGG TACCGCGTGC
GGCTGGGACG CGGGCGACGG CGGCTTCCGC TGCATCGCGG GCGAGGATCC CTGCGGCGGG
GTGGACGCGG TCGGCACCTG CGACGGCGAG ATCGCCCGCT GGTGCGACAA CGGCGTGCCC
CAGGCCCGCG ACTGCGGGGT GTGCGGCGAG ATCTGCCTGC CCTCGGTCAA CGGCGTGGGC
GCCTACTGCG TGCCCGATAA CTGCAACGGG CTCGACTATC TCGGCCAGTG CGAGGGCGAC
GTGGTGGTGT GGTGCGACGC CGGTCAGCGG CTCGAACGCG ACTGCGCCGC GCTCGGCCAG
GTGTGCAAGC TCATCGACGA GCAGATCGGT TTTTACTGCG CCGATCCTTG A
 
Protein sequence
MPPRSASSPV APSTSLALAA LLALAAGSSA CAEGSIFGAD KFCGDTRTAP KVYYGTLLPS 
LVPLTPEQLL SVGSFRLCSG TLIAPTWVLT AQHCGLAVGA NFCMGDSPSD PDVCIRSSAV
YNHPDADITL VELSADARGA VPGVVPLPYM DESLGQEWIG RRAEAAGYGQ TETGALGTRY
FTAEPIVALD SGFATIDGEG QRGVCFGDSG GPLLIIADDG TVRVAGVLSN GDDTCVGRDN
FTRADSYRDW IAKYAGEPQL GEGDQSCVQL GRVGRCETER AVWCGNERVQ TEVCAAGTAC
GWDAGDGGFR CIAGEDPCGG VDAVGTCDGE IARWCDNGVP QARDCGVCGE ICLPSVNGVG
AYCVPDNCNG LDYLGQCEGD VVVWCDAGQR LERDCAALGQ VCKLIDEQIG FYCADP
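
The gene and protein sequences above can be cross-checked by translating the nucleotide sequence. The following is a minimal Python sketch, not the database's own method. It builds the codon table from the compact NCBI layout (bases in T, C, A, G order); translation table 11 uses the same codon-to-amino-acid assignments as the standard code. The helper names `clean`, `translate`, and `gc_percent` are hypothetical. The demo uses only the first 60 bases of the gene sequence.

```python
from itertools import product

# NCBI compact codon layout: first base varies slowest, third base fastest.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the 10-base block layout."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence above (60 bases, 20 codons).
first_line = "ATGCCCCCTC GCTCCGCTTC TTCTCCCGTC GCTCCGTCCA CCTCGCTGGC GCTGGCCGCC"
print(translate(first_line))  # MPPRSASSPVAPSTSLALAA
```

The printed residues match the first 20 residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_percent` would allow the same comparison against the complete protein sequence and the GC content field.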