Gene Hoch_5489 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_5489 
Symbol 
ID8547902 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp7531591 
End bp7533120 
Gene Length1530 bp 
Protein Length509 aa 
Translation table11 
GC content69% 
IMG OID646390162 
Productpeptidase S8/S53 subtilisin kexin sedolisin 
Protein accessionYP_003269865 
Protein GI262198656 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1404] Subtilisin-like serine proteases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.62553 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.129463 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAGGGA GTCAAAGCAT GAAGCGCGTC CTCTTTGCCA CGCTCACCAG CGCGACCTTG 
TTCGCGTCCT GCGCCTCCGC AGATTTGACG GGAAGTGCCG GCTCCGAGCC CGCCACCATG
ACCCGCGATC AGATCGACGC GGTGATCTAC GAGCACCTGG AGACCCAGGG CGATTTCCAC
TGGAACGACG TCGACGACCG CACGCTGTGG AGCGCGGTGG TGCTCGGCGA CTACCACGTC
TCGGTCGGCT ACCGCGAGGA CGCCGAGGCC TCGGTCAAGG CCGTGGCCAG CGCCGAGGAT
GACGCCGTCG CGCGCCAGGC GACGCGCAAC GAGCTGCTCT CCTTCATCGC CGAGGAGCGC
GCCCAGGACG GCGCCGCGTT CAAGAGCGCG GCCGACCTGG TGCTGTTCGA GGAGGACACC
CTGGCGGTGC TCGACGTCGA GGTCGACAGC TACGAGACGC TCGTGGAGCT GCGCCAGCGC
GACGACGTGC GCTACGTCGA GCCGCTGAAC TACCTCATGG CCGAGGAGCT GCAAAAGTCC
TCCGCCGGCT GCAGCAACGA CGCGGATTTC AACATCCCGG CCGAGGACTT CGACGTGGTC
TCGGGCAACA TCGTGCCCTG GAACTTCGTC GAGATGAACA TCCCCCAGGC CTGGCAGCAG
AGCACGGGCA GCGGCGTCAC CGTGGCGCTC ATCGACTCCG GCACCAGCGC CGCCCAGGCC
AAGCTCAACG GCGCCTTCGC CTCGGGCTTC TCGGCCGATC GCTTCATCGA GCGCGTCGGC
ACCTTCCGCC CCGGCTCCAG CAGCACCCCC GACGGGCCCG ACGACGACTG CGGCCACGGC
ACCTACATGG CCGGCACCAT CGCCGCGCCG ATGAGCGGTG ACGGCTCCAT GGTCGGCGTG
GCCCACGGCG CCAATCTGCT GGCCATCCGC GGCATCGACG ACGTCATCAT CAACAGCTCG
GACGAGAAGA AGGGCGTGGC CGACGCCCTG ATCCTGGCCG CCGAGCGCAG CGACGTCAAG
GTCATCAGCA TGTCGCTCGG ACACGTGTTC TCGAGCAGCC GCGTCGCCGA CGCCGTCCGC
TACGCCCACG CTCGCGGCAA GCTGATCTTC GCCGCCGCCG GGACTTCGAC CTCGTTCACG
AACTGGGTCG GCGTGACCTT CCCCGCCAGC ATGGACGAGA CCGTCGCGGT CACCGGCATC
GAGACCGGCA ACGGCTTCGT GCGCTGCGAC GTGTGCCACG TCGGACGCGC GGTCGAATTC
GTCGTGGTCA TGCAGCGCGC CTCGGACGGC GATCGCACCT CGCTGTCGCT CACCATGAGC
GGCAACACCC CGGGCCGTGT CGGCGGCTCC TCGACCGCGA CCGCGACCAT GGCCGGCGTG
GCCGCGCTGG TGTGGGCGAA GAACCCATCG CTCAGCCGCG ACCAGGTGCT CGACATCCTG
CGCGCCTCGT CCTCGGAGTT CCCCTCGCGC GACAGCAAGT TCGGCTTCGG CACCGTGGAC
GCCGCCCTGG CCGTGTCGCT CGCCAACTGA
 
Protein sequence
MKGSQSMKRV LFATLTSATL FASCASADLT GSAGSEPATM TRDQIDAVIY EHLETQGDFH 
WNDVDDRTLW SAVVLGDYHV SVGYREDAEA SVKAVASAED DAVARQATRN ELLSFIAEER
AQDGAAFKSA ADLVLFEEDT LAVLDVEVDS YETLVELRQR DDVRYVEPLN YLMAEELQKS
SAGCSNDADF NIPAEDFDVV SGNIVPWNFV EMNIPQAWQQ STGSGVTVAL IDSGTSAAQA
KLNGAFASGF SADRFIERVG TFRPGSSSTP DGPDDDCGHG TYMAGTIAAP MSGDGSMVGV
AHGANLLAIR GIDDVIINSS DEKKGVADAL ILAAERSDVK VISMSLGHVF SSSRVADAVR
YAHARGKLIF AAAGTSTSFT NWVGVTFPAS MDETVAVTGI ETGNGFVRCD VCHVGRAVEF
VVVMQRASDG DRTSLSLTMS GNTPGRVGGS STATATMAGV AALVWAKNPS LSRDQVLDIL
RASSSEFPSR DSKFGFGTVD AALAVSLAN