Gene Hoch_2269 details

Gene Information

Locus tag: Hoch_2269
Symbol:
ID: 8544655
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haliangium ochraceum DSM 14365
Kingdom: Bacteria
Replicon accession: NC_013440
Strand:
Start bp: 3156773
End bp: 3158158
Gene Length: 1386 bp
Protein Length: 461 aa
Translation table: 11
GC content: 69%
IMG OID: 646386974
Product: peptidase S8/S53 subtilisin kexin sedolisin
Protein accession: YP_003266705
Protein GI: 262195496
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG1404] Subtilisin-like serine proteases
TIGRFAM ID:
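
The gene and protein lengths in the record above follow from the start and end coordinates. A minimal sketch of that arithmetic, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon (the variable names are illustrative, not part of the record):

```python
# Coordinates copied from the record above.
start_bp = 3156773
end_bp = 3158158

# Assumption: coordinates are 1-based and inclusive,
# so the span covers end - start + 1 bases.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon, which does not
# contribute a residue to the protein.
protein_length = gene_length // 3 - 1

print(gene_length, "bp")   # 1386 bp
print(protein_length, "aa")  # 461 aa
```

Both values agree with the Gene Length and Protein Length fields listed in the record.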


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.778693
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.0483153
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCAAGCAA GGAATCTGTT TCGACTGATG GCGGTCGGCA TATTGGGCGG GTCGTTCGCG 
ACCGGCTGTG CCGCCGTCGA AGAGGACTTC GCGGAGTCGG TGGCGACGAC CACGGCGCCG
CTGCTGGGCG ACGCGTCGAC GGCGATCGAG GGCGAGTACA TCGTCAAGTT CAAGGACGAT
GTAGCCGCCA AGGGCGTGGG CATCGCGATG AACCGGGTGA GCCTCGCCAG CGCGAACAGC
CGCATCATGC GCGAGTACAA GACCTTCCCG GGCTTCGCCG CCAAGCTGGA CAAGCGCGAT
CTGCAGAATC TGCAGAAGAA CCCCGACGTC GAGTACCTCG AGTTCAACGG CCGCATGGAG
GCGCTGAAGA TCGAGAATGT GCAGGCCGAC GGCATCGACC GCATCGACCA GCGCAGCGGC
CGCAACGGGC AGTACAACGA CAGCGATCTG ACGGCCAGCG GTGTGACCGC GTACATCGTG
GACACGGGCA TCCGCTCGAC GCACAACGAG TTCACGGGCC GCGTGGCCGG CACCATCGAC
TTCGTGGGCG ACGGTCAGAC CGAGGACTGC AACGGTCACG GCTCGCACGT GGCCTCGACC
GTGGCCGGCA CCCAGTTCGG CGTGGCCGAC GGCGCGCAGA TCTACGGCGT GCGCGTGCTC
AACTGCTCGG GTTCGGGCTC GTTCGCGGGC GTGATCAGCG GCATCGACTT CGTGGCCCAG
GACTGCTCGG GCGACTGCGT GGCCAACATG AGCCTCGGCG GCGGTTTCTC GCAGGCGGTC
AACGACGCGG TGGAGGCCGC GGTGGCGGCC GGCATCCCGT TCGCGGTGGC CTCGGGTAAC
TCCAACGCCG ACTCGTGCGG CTTCTCGCCG GCGAGCGCGC CCTCGGCGAT CACGGTGGAC
GCGGCGGCCG ACAACGACAG CCGCGCGTCG TTCTCGAACT TCGGTAGCTG CTCGGACCTC
TACGCCCCGG GCGTGTCGAT CCTGGGCGCC GACATCGGCA GCGACAGCGA CACGCAGTCG
ATCTCGGGCA CCTCGATGGC GAGCCCGCAC GTGGCCGGCG TGATGGCCCA GATCCTCGAC
TGCAACCCCG GCGCGACGCC GGCCGAGGTC GAGGCGATCC TGAAGGCGGC GGCGACCTCG
GGCGCGATCA GCAACCCGAA CGGCACCGCC AACCTGCTGC TCTTCACCGG CGCCGAGCTG
TGCGACGAGT TGCCCGACCC GGATCCCGAC CCGGATCCGG ATCCGGAGCC CGATCCCGAC
CCGGAGCCCG ACCCGGCGAG CTGCGAGGGC CGCTGCGGCA GCTTCGATAG CGGCGCCGTC
TGCCAGTGCG ACCGCTTCTG CCAGTTCTTC GGCGACTGCT GCTCGGACTA CTTCAACGAG
TGCTGA
 
Protein sequence
MQARNLFRLM AVGILGGSFA TGCAAVEEDF AESVATTTAP LLGDASTAIE GEYIVKFKDD 
VAAKGVGIAM NRVSLASANS RIMREYKTFP GFAAKLDKRD LQNLQKNPDV EYLEFNGRME
ALKIENVQAD GIDRIDQRSG RNGQYNDSDL TASGVTAYIV DTGIRSTHNE FTGRVAGTID
FVGDGQTEDC NGHGSHVAST VAGTQFGVAD GAQIYGVRVL NCSGSGSFAG VISGIDFVAQ
DCSGDCVANM SLGGGFSQAV NDAVEAAVAA GIPFAVASGN SNADSCGFSP ASAPSAITVD
AAADNDSRAS FSNFGSCSDL YAPGVSILGA DIGSDSDTQS ISGTSMASPH VAGVMAQILD
CNPGATPAEV EAILKAAATS GAISNPNGTA NLLLFTGAEL CDELPDPDPD PDPDPEPDPD
PEPDPASCEG RCGSFDSGAV CQCDRFCQFF GDCCSDYFNE C
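
The protein sequence can be checked against the gene sequence by translating it with translation table 11, as listed in the record. The sketch below is a hand-rolled illustration, not the database's own method. The helper names (`clean`, `gc_content`, `translate`) are hypothetical, and the start-codon set is simplified to ATG plus the common alternatives GTG and TTG. `FIRST_LINE` holds only the first 60 bases of the gene sequence above.

```python
# Amino acids for translation table 11, codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO)
)

# Simplification: only the most common bacterial start codons are handled.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in the record."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Return the percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = clean(seq)
    codons = [seq[i:i + 3] for i in range(0, len(seq) - 2, 3)]
    protein = [CODON_TABLE[codon] for codon in codons]
    # An alternative start codon such as GTG is read as methionine
    # when it initiates translation.
    if codons and codons[0] in START_CODONS:
        protein[0] = "M"
    return "".join(protein).split("*")[0]


# First line of the gene sequence, copied from the record above.
FIRST_LINE = "GTGCAAGCAA GGAATCTGTT TCGACTGATG GCGGTCGGCA TATTGGGCGG GTCGTTCGCG"

print(translate(FIRST_LINE))  # MQARNLFRLMAVGILGGSFA
```

The printed fragment matches the first 20 residues of the protein sequence above. Passing the full gene sequence to `translate` and `gc_content` would allow the same comparison against the complete protein and the GC content field.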