Gene Hoch_2673 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_2673 
Symbol 
ID8545060 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp3683925 
End bp3685064 
Gene Length1140 bp 
Protein Length379 aa 
Translation table11 
GC content72% 
IMG OID646387368 
Productpeptidase S1 and S6 chymotrypsin/Hap 
Protein accessionYP_003267097 
Protein GI262195888 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.314842 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTACTGGC GCAGCGAATC GCAATCCCCC TCCCGTCGGC GCCGCGCCCT GCCCTCTGGC 
TGGCTCGCTC CGGCTGTCGC GCTGTGCGCC GCCGCCCTCA TCGCCTGCGG CGGCGCCGGC
GAGCAGCAGG GCGCCAGCGA GGCGGTCCAG GCCGCGCAAG AGCCGCTCAG ACCGGCCGAG
ATCGCCCAGC GCAGCAAGCC GGCCATCGTC CGCGTCGAGG TCCGCTCACC GCGCGGCGAA
GGCGTGGGCA CGGGCTTCAT CCTCGACGCC AGCGGCCGCA TCGCCACCAA CCTGCACGTC
ATCGGCGGTG CCACCGAGAT CGAGGTCGTG CTGCTCGACG GCACCCGTCT GCCGGTGAGC
ACCATCGCGG GCACCGACCC CGAGCGCGAT CTGGCCGTGA TCGAGGTCGA CAGCGAGCGC
GCGCTGCCGA CCCTGCCCCT GGGCAACAGC GATCAGGTGC TGGTCGGCGA CCCGGTGGTC
GCCATCGGCA ACCCGCTGGG CGTCCTCGAC TACACCGTGA GCGACGGTCT CATCAGCTCC
GTGCGCGAGA TCAACCCCGA GCTCAAGGTG CTGCAGATCT CGGCGCCCAT CTCGCAGGGC
TCGAGTGGCG GACCGCTGTT CAACCAGCTC GGCGAGGTCA TCGGCGTGGC CACCTTCATC
GCCGGCGCGG GCCAGAACCT CAATTTCGGC ATCCCCAGCA ACTACCTGCG CCCGCTGCTC
GAGCGCGACG ACCAGCTCAC CCCGCAGGCC CTGTCAGAGG CCCTGGCCGA GAAGTACGCG
CCGCCGCCCG AGCAGCCGCG CGGGCCGGTG CGCCGCCAGG TCCCGGCTCA TCCCCTGAGC
GTGCTCGAGG GCTGCGGCGA GGACGCCATG CAGCGCGCTG TGGACGAGAT CTCCGAGGCC
ATTCAGCTCG GCGCGCCGCT CTACAACCAG GGCAACCACG AGGCCTGCTT CCGCATCTAC
GAGGGCACGG CCATCCGCCT CGAGCGCGAG CTTGCGTGTC CGGGCCTGCG CGATGCCCTG
GGCCAGGGCC TGCTGCGCGC CTCGACCTTG AACGACCACA CCGCCAAAGC CTGGGCCATG
CGCGACGCGT TTGACGGCGT GCTCAGCGTG GTCGCCCGCA AACTCGGCGT CACGCCCTGA
 
Protein sequence
MYWRSESQSP SRRRRALPSG WLAPAVALCA AALIACGGAG EQQGASEAVQ AAQEPLRPAE 
IAQRSKPAIV RVEVRSPRGE GVGTGFILDA SGRIATNLHV IGGATEIEVV LLDGTRLPVS
TIAGTDPERD LAVIEVDSER ALPTLPLGNS DQVLVGDPVV AIGNPLGVLD YTVSDGLISS
VREINPELKV LQISAPISQG SSGGPLFNQL GEVIGVATFI AGAGQNLNFG IPSNYLRPLL
ERDDQLTPQA LSEALAEKYA PPPEQPRGPV RRQVPAHPLS VLEGCGEDAM QRAVDEISEA
IQLGAPLYNQ GNHEACFRIY EGTAIRLERE LACPGLRDAL GQGLLRASTL NDHTAKAWAM
RDAFDGVLSV VARKLGVTP