Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dole_2815 |
Symbol | |
ID | 5695672 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfococcus oleovorans Hxd3 |
Kingdom | Bacteria |
Replicon accession | NC_009943 |
Strand | + |
Start bp | 3396688 |
End bp | 3398145 |
Gene Length | 1458 bp |
Protein Length | 485 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 641265429 |
Product | protease Do |
Protein accession | YP_001530695 |
Protein GI | 158522825 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | [TIGR02037] periplasmic serine protease, Do/DeqQ family |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.351815 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAACATT ACGCCACTCT CCCCACGGCA ATCAGCAGGC GGTGCGTGGC GGGCGCCGGA ATGGCCCTTG CCCTGCTTAC CCTGCTTTTT ACCATGAACT CGGCGGCCCT GGCCGCGATT CAAAAACAGC CGGCCCTGAT TCCCGAATCA TTCAGCGAAC TGGCCGACGC TATCGGTCCG GCCGTGGTCA ACATCCGCAC CGAAACAACA ACGGCCCAGA GCGACCGCCT GTCCCGCCAT TTTTTTAACT CCCCCTTTGG AGAAAACGAC CCGTTTAACG AGTTCTTCGA GCGGTTTTTC AATGCCCCCC ACGGCCGGCA GTTCAAGCAA CGCAGCCTGG GCTCGGGGTT TGTCATCGAT TCACGGGGGC TGATCGTCAC CAACAACCAC GTGGTGGAAA ACGCGGACAA GATTATTGTC AAGCTCAAAG ACGGTGATGA GTTTGACGCC ACGGTGGTGG GAACCGACGC CAACACCGAT CTGGCCCTGC TGGAAATCGA GGCAAAGCGG CCCCTTCCCA GCCTGGAGCT GGGCGATTCA GACGACCTGA AGGTGGGGGA ATGGGTGGTG GCCATCGGCA GCCCCTTCGG CCTGGAACAG ACCGTCACCG CGGGCATCGT CAGTGCCAAG GGCCGGGTGA TCGGGGCCGG TCCCTATGAC GACTTCATTC AGACCGACGC CTCCATCAAC CCCGGCAACA GCGGCGGCCC CCTGGTCAAC CTGGCCGGCG AGGTGGTGGG CATCAACACC GCCATCATTG CCAGCGGCCA GGGCATCGGG TTTGCCATTC CCGCCAACCT TGCCAACAAC ATTCTTGAGC AGCTGGAGAC CAAAGGCCAC GTGATTCGCG GCTGGCTGGG CGTGGGCATT CAGCCGGTGA GCAAAGAGAT GGCCGAGTAT TACAACCTGG AAAGCGGCAA AGGGGCACTG GTCACCGAGG TCTTTCCCGG GGATCCGGCG GACAAGGCCG GCATCAAGAC CCAGGACATC ATTCTTGAGG TCAACGGTAA AGAAATTAAA GACAGCCGGG ATCTTTCCGC CATGATCGCC AGCCTGCCCG TGGGAGAAAC CATCAAAGTA ATGCTGCTGC GGGACGGAAA GAAAAAAACC GTAACCGTCA AGATCAAGGA ACGGGACGAC ACCCGCGTGG CGGGCAAATC GGAAACCGGC ACCCAGAGCG CCATGGACCT GGAAGTGGCC GACATCACCG AAGAGGTGGC CAGAAAACTG AACCTGAACA GCACCGAGGG CGTGTACGTG TCTGAAGTGG CCCCCGGTGG AAAGGGCGAT CAGGCCGGCA TTCAGCCGGG AGATGTGATC AGAGAGATCA ACCGGCAGCG CATTCAGAAC ACGGCCGACT TTGAGGCCAT CCTGAAAGGT ATCAAGGAGG GAGACCCCCT GCTGATCCTG ATTCGAAAAT CCAACGGCAT GTTCATGGTG ATAAAAACCA CCCGGTAG
|
Protein sequence | MKHYATLPTA ISRRCVAGAG MALALLTLLF TMNSAALAAI QKQPALIPES FSELADAIGP AVVNIRTETT TAQSDRLSRH FFNSPFGEND PFNEFFERFF NAPHGRQFKQ RSLGSGFVID SRGLIVTNNH VVENADKIIV KLKDGDEFDA TVVGTDANTD LALLEIEAKR PLPSLELGDS DDLKVGEWVV AIGSPFGLEQ TVTAGIVSAK GRVIGAGPYD DFIQTDASIN PGNSGGPLVN LAGEVVGINT AIIASGQGIG FAIPANLANN ILEQLETKGH VIRGWLGVGI QPVSKEMAEY YNLESGKGAL VTEVFPGDPA DKAGIKTQDI ILEVNGKEIK DSRDLSAMIA SLPVGETIKV MLLRDGKKKT VTVKIKERDD TRVAGKSETG TQSAMDLEVA DITEEVARKL NLNSTEGVYV SEVAPGGKGD QAGIQPGDVI REINRQRIQN TADFEAILKG IKEGDPLLIL IRKSNGMFMV IKTTR
|
| |