Gene Hoch_2268 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_2268 
Symbol 
ID8544654 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp3154440 
End bp3155963 
Gene Length1524 bp 
Protein Length507 aa 
Translation table11 
GC content69% 
IMG OID646386973 
Productpeptidase S8/S53 subtilisin kexin sedolisin 
Protein accessionYP_003266704 
Protein GI262195495 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1404] Subtilisin-like serine proteases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.241267 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.0271027 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCAAGCAC GTAATCTGTT TCGGCTGATG GCGGTCGGCA TACTGGGCGG GTCGTTCGCG 
ACCGGCTGTG CCGCCGTCGA AGAGGACTTC GCGGAGTCGG TGGCGACGAC CACGGCGCCG
CTGCTGGGCG ACGCGTCGAC GGCGATCGAG GGCGAGTACA TCGTCAAGTT CAAGGACGAT
GTAGCCGCCA AGGGCGTGGG CATCGCGATG AACCGGGTGA GCCTCGCCAG CGCGAACAGC
CGCATCATGC GCGAGTACAA GACCTTCCCG GGCTTCGCCG CCAAGCTGGA CAAGCGCGAT
CTGCAGAATC TGCAGAAGAA CCCCGACGTC GAGTACCTCG AGTTCAACGG CCGCATGGAG
GCGCTGAAGA TCGAGAATGT GCAGGCCGAC GGCATCGACC GCATCGACCA GCGCAGCGGC
CGCAACGGGC AGTACAACGA CAGCGATCTG ACGGCCAGCG GTGTGACCGC GTACATCGTG
GACACGGGCA TCCGCTCGAC GCACAACGAG TTCACGGGCC GCGTGGCCGG CACCATCGAC
TTCGTGGGCG ACGGTCAGAC CGAGGACTGC AACGGTCACG GCTCGCACGT GGCCTCGACC
GTGGCCGGCA CCCAGTTCGG CGTGGCCGAC GGCGCGCAGA TCTACGGCGT GCGCGTGCTC
AACTGCTCGG GTTCGGGCTC GTTCGCGGGC GTGATCAGCG GCATCGACTT CGTGGCCCAG
GACTGCTCGG GCGACTGCGT GGCCAACATG AGCCTCGGCG GCGGTTTCTC GCAGGCGGTC
AACGACGCGG TGGAGGCCGC GGTGGCGGCC GGCATCCCGT TCGCGGTGGC CTCGGGTAAC
TCCAACGCCG ACTCGTGCGG CTTCTCGCCG GCGAGCGCGC CCTCGGCGAT CACGGTGGAC
GCGGCGGCCG ACAACGACAG CCGCGCGTCG TTCTCGAACT TCGGTAGCTG CTCGGACCTC
TACGCCCCGG GCGTGTCGAT CCTGGGCGCC GACATCGGCA GCGACAGCGA CACGCAGTCG
ATCTCGGGCA CCTCGATGGC GAGCCCGCAC GTGGCCGGCG TGATGGCCCA GATCCTCGAC
TGCAACCCCG GCGCGACGCC GGCCGAGGTC GAGGCGATCC TGAAGGCGGC GGCGACCTCG
GGCGCGATCA GCAACCCGAA CGGCACCGCC AACCTGCTGC TCTACACCGA TTCAGCCGAG
CTGTGCGGTG GCGCGCCGGA TCCCGATCCC GATCCCGATC CCGACCCGGA GCCCGACCCG
GATCCCGATC CCGACCCGGG GAGCTGCTCG GGTCGCTGCG GCAGCTTCGA TAGCGGCGCC
ATCTGCCAGT GCGACGACGC TTGTGAGTCG TTTGGCGACT GCTGCCCCGA CTTCGAGGAC
GAGTGCCAGG GCGGACCGAC GCCTGGCCCC GACACCTGCT TCGACGCCTG CGGCGTGTTC
GACAGCAGCC GCGAGTGCCA GTGCGACTCG TCCTGCGTGT TCTTCGGTGA CTGCTGCGCC
GACCTCGGCG ACTTCTGCTC ATAG
 
Protein sequence
MQARNLFRLM AVGILGGSFA TGCAAVEEDF AESVATTTAP LLGDASTAIE GEYIVKFKDD 
VAAKGVGIAM NRVSLASANS RIMREYKTFP GFAAKLDKRD LQNLQKNPDV EYLEFNGRME
ALKIENVQAD GIDRIDQRSG RNGQYNDSDL TASGVTAYIV DTGIRSTHNE FTGRVAGTID
FVGDGQTEDC NGHGSHVAST VAGTQFGVAD GAQIYGVRVL NCSGSGSFAG VISGIDFVAQ
DCSGDCVANM SLGGGFSQAV NDAVEAAVAA GIPFAVASGN SNADSCGFSP ASAPSAITVD
AAADNDSRAS FSNFGSCSDL YAPGVSILGA DIGSDSDTQS ISGTSMASPH VAGVMAQILD
CNPGATPAEV EAILKAAATS GAISNPNGTA NLLLYTDSAE LCGGAPDPDP DPDPDPEPDP
DPDPDPGSCS GRCGSFDSGA ICQCDDACES FGDCCPDFED ECQGGPTPGP DTCFDACGVF
DSSRECQCDS SCVFFGDCCA DLGDFCS