Gene Hoch_1830 details


Gene Information

Locus tag: Hoch_1830
Symbol:
ID: 8544212
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haliangium ochraceum DSM 14365
Kingdom: Bacteria
Replicon accession: NC_013440
Strand:
Start bp: 2521735
End bp: 2523132
Gene Length: 1398 bp
Protein Length: 465 aa
Translation table: 11
GC content: 75%
IMG OID: 646386536
Product: peptidase S1 and S6 chymotrypsin/Hap
Protein accession: YP_003266271
Protein GI: 262195062
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG5640] Secreted trypsin-like serine protease
TIGRFAM ID:
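
The start, end, and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon not counted in the protein length. The function names are illustrative and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


length = gene_length(2521735, 2523132)
print(length)                  # 1398
print(protein_length(length))  # 465
```

Both results agree with the Gene Length and Protein Length fields of this record.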


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Fosmid clones covering gene: 14
Fosmid unclonability p-value: 0.135779
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGATCCGCT TCGGCGCCGC GCTGCTGGCG CTGCTGTGCG GCGCGGCCGG CTGCGCGCTC 
GCGCCCCAGG CGCCGGGCCC GGCCTCGGAC TCGGAGGCGA GCGCGGCCGC GACGGCCACG
AACGCGCGCG CGATCGTCGG CGGCAGCGCC AGCGCGGGCG ATGCCGCCGT GGTCGCCATC
GGCCAGCGCC GCAGCACGTG CGAGGCGCGC TTGGTGGCGA GCTGCTCGGG CTCGCTGATC
GCGCCGCGGC TGGTGCTCAC GGCCGCCCAC TGCGTGCTCG ACCCGCGGCT GTCGAACGCG
CTCGAGGTGT ACTTCGGCGG CGACACTGAG ACCGACACCG CCGCCGACCC GGACGCGGCG
CTGCGCGAGG TCGTGCACGT GGCCGCGCAT CCCGACTACG CGGGCGCGGG CGACGCCGCC
GATCTCGCCG TGCTGGTGCT CGCGCGCGAG GCCCCGGTAG CGCCGCTGCG CGTCGACGCG
GGCGCACTCG ACAGCTCCTG GGAGGGCCGG CGCGTGCGCC TGCTCGGCTT TGGCCAGAGC
TACTCGAGCG ACCCGCGCAC CGGCGTCAAG CGCTCGGGCA CGGCCGTGAT CACGGCCGTC
GGCAGCGTGG ATTTTCGCGT CGAGCCCGAC CCGGCGATGT CGTGCCACGG CGACAGCGGT
GGGCCGGTGC TGGCCGAGCG CGACGGCGTC GAGGTGCTCA TCGGCGTGAG CGCGCGCGGC
GACCCCGGCT GCCAGCTCTA CGGCGACAAC GTGCGCGTCG ACGGCTTCCG CGACAGCTTC
CTGGCCGAGT GGCGCGCGGC CGCCGCCACC TGGCCCGCGC CCGCGGAGGT CAGCGCCGAG
GACCTCCGCG CGCGCGGCGA GGCGCTGTGC ACCGGCGCGT GCGCCAGCGA CGCCGAGTGC
CCGGCCGGCC TGCGCTGCCT GCCCAGCATC GGCGACGGCG TCGGCCCGAG CCTGCGCTGC
ACCCTGCCCG GATTCACGGC CGGCGTGTTC GGCGACGCGT GCAGCTCCGA CGCGGCCTGC
GATGATCGCT GCGCGCGCGT GCGCGGCGAT GACAGCGCCG ACGCCTGCCT GTGTTACCGG
TCCTGCACAG CTCCGCCCAC GGGCGACGCT GGCGGCGGCT GCCGGGTCGC CCCTGGAGCC
GGCCCCGGCG GCTCTGGCGG CTTTGGCGGC TCTGGCAGTC CCGTCCTCTG GCTCAGTGTG
TTGCTGCTCA CTTTGGTTTG CCGTGTGCGT GGCCGGCTTA GGTGTCTCAC ATTGCCGAAT
GCGCTTCGCG TAGGATGGGC CTCTTCGCAC TCAGAGAGAG GATCCAACAC CGATGTATCG
AAGCACGTGG AACTACGTAT CGACCGCAAT GATGAGCGCC ACGCTGGCGC TGGGGCTGGG
CGCTTGCGCC GCTGGTGA
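
The gene sequence is printed in space-separated blocks of 10 bases, so the whitespace has to be stripped before any analysis. As a rough sketch of how a GC content figure like the one in this record could be computed, the helpers below (hypothetical names, not taken from the database) clean a blocked sequence and return its G+C fraction. The example uses only the first two blocks of the sequence, so its result describes those 20 bases and not the whole gene.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(blocked.split()).upper()


def gc_fraction(blocked: str) -> float:
    """Fraction of G and C bases in the sequence (0.0 for empty input)."""
    seq = clean_sequence(blocked)
    if not seq:
        return 0.0
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


first_two_blocks = "GTGATCCGCT TCGGCGCCGC"
print(len(clean_sequence(first_two_blocks)))  # 20
print(f"{gc_fraction(first_two_blocks):.0%}")  # 75%
```

To apply this to the full gene, pass the complete multi-line sequence text to `gc_fraction`.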
 
Protein sequence
MIRFGAALLA LLCGAAGCAL APQAPGPASD SEASAAATAT NARAIVGGSA SAGDAAVVAI 
GQRRSTCEAR LVASCSGSLI APRLVLTAAH CVLDPRLSNA LEVYFGGDTE TDTAADPDAA
LREVVHVAAH PDYAGAGDAA DLAVLVLARE APVAPLRVDA GALDSSWEGR RVRLLGFGQS
YSSDPRTGVK RSGTAVITAV GSVDFRVEPD PAMSCHGDSG GPVLAERDGV EVLIGVSARG
DPGCQLYGDN VRVDGFRDSF LAEWRAAAAT WPAPAEVSAE DLRARGEALC TGACASDAEC
PAGLRCLPSI GDGVGPSLRC TLPGFTAGVF GDACSSDAAC DDRCARVRGD DSADACLCYR
SCTAPPTGDA GGGCRVAPGA GPGGSGGFGG SGSPVLWLSV LLLTLVCRVR GRLRCLTLPN
ALRVGWASSH SERGSNTDVS KHVELRIDRN DERHAGAGAG RLRRW
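
The protein sequence starts with M even though the gene sequence starts with GTG, which codes for valine internally. Under translation table 11 (the bacterial, archaeal and plant plastid code), GTG is one of the permitted start codons, and an initiator codon is read as methionine. The sketch below illustrates that rule with a hand-built codon table. The function and variable names are illustrative assumptions, and this is not the pipeline the database itself uses.

```python
from itertools import product

# Codons in NCBI order (T, C, A, G; third base varies fastest). Table 11 uses
# the same amino acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Start codons permitted by translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate a complete CDS, reading the initiator codon as M."""
    seq = "".join(dna.split()).upper()
    if not seq or len(seq) % 3 != 0:
        raise ValueError("CDS must be non-empty and a multiple of 3 bp")
    codons = [seq[i:i + 3] for i in range(0, len(seq), 3)]
    if codons[0] not in START_CODONS:
        raise ValueError(f"{codons[0]} is not a table 11 start codon")
    protein = ["M"]  # initiator codon is translated as methionine
    for codon in codons[1:]:
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence plus its terminal TGA stop codon.
print(translate_cds("GTGATCCGCT TCGGCGCCGC GCTGCTGGCG TGA"))  # MIRFGAALLA
```

The output matches the first ten residues of the protein sequence listed above.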