Gene Dole_2815 details


Gene Information

Locus tag: Dole_2815
Symbol:
ID: 5695672
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfococcus oleovorans Hxd3
Kingdom: Bacteria
Replicon accession: NC_009943
Strand:
Start bp: 3396688
End bp: 3398145
Gene Length: 1458 bp
Protein Length: 485 aa
Translation table: 11
GC content: 59%
IMG OID: 641265429
Product: protease Do
Protein accession: YP_001530695
Protein GI: 158522825
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain
TIGRFAM ID: [TIGR02037] periplasmic serine protease, Do/DegQ family
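
The length fields above are mutually consistent, which can be checked with a short Python sketch. It assumes the start and end coordinates are 1-based and inclusive (the record does not state this, but the listed gene length agrees with it) and that the CDS ends in a single stop codon. The function names are hypothetical, introduced only for this illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS: number of codons minus the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(3396688, 3398145)
print(length, protein_length_aa(length))  # 1458 485
```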


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.351815
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAACATT ACGCCACTCT CCCCACGGCA ATCAGCAGGC GGTGCGTGGC GGGCGCCGGA 
ATGGCCCTTG CCCTGCTTAC CCTGCTTTTT ACCATGAACT CGGCGGCCCT GGCCGCGATT
CAAAAACAGC CGGCCCTGAT TCCCGAATCA TTCAGCGAAC TGGCCGACGC TATCGGTCCG
GCCGTGGTCA ACATCCGCAC CGAAACAACA ACGGCCCAGA GCGACCGCCT GTCCCGCCAT
TTTTTTAACT CCCCCTTTGG AGAAAACGAC CCGTTTAACG AGTTCTTCGA GCGGTTTTTC
AATGCCCCCC ACGGCCGGCA GTTCAAGCAA CGCAGCCTGG GCTCGGGGTT TGTCATCGAT
TCACGGGGGC TGATCGTCAC CAACAACCAC GTGGTGGAAA ACGCGGACAA GATTATTGTC
AAGCTCAAAG ACGGTGATGA GTTTGACGCC ACGGTGGTGG GAACCGACGC CAACACCGAT
CTGGCCCTGC TGGAAATCGA GGCAAAGCGG CCCCTTCCCA GCCTGGAGCT GGGCGATTCA
GACGACCTGA AGGTGGGGGA ATGGGTGGTG GCCATCGGCA GCCCCTTCGG CCTGGAACAG
ACCGTCACCG CGGGCATCGT CAGTGCCAAG GGCCGGGTGA TCGGGGCCGG TCCCTATGAC
GACTTCATTC AGACCGACGC CTCCATCAAC CCCGGCAACA GCGGCGGCCC CCTGGTCAAC
CTGGCCGGCG AGGTGGTGGG CATCAACACC GCCATCATTG CCAGCGGCCA GGGCATCGGG
TTTGCCATTC CCGCCAACCT TGCCAACAAC ATTCTTGAGC AGCTGGAGAC CAAAGGCCAC
GTGATTCGCG GCTGGCTGGG CGTGGGCATT CAGCCGGTGA GCAAAGAGAT GGCCGAGTAT
TACAACCTGG AAAGCGGCAA AGGGGCACTG GTCACCGAGG TCTTTCCCGG GGATCCGGCG
GACAAGGCCG GCATCAAGAC CCAGGACATC ATTCTTGAGG TCAACGGTAA AGAAATTAAA
GACAGCCGGG ATCTTTCCGC CATGATCGCC AGCCTGCCCG TGGGAGAAAC CATCAAAGTA
ATGCTGCTGC GGGACGGAAA GAAAAAAACC GTAACCGTCA AGATCAAGGA ACGGGACGAC
ACCCGCGTGG CGGGCAAATC GGAAACCGGC ACCCAGAGCG CCATGGACCT GGAAGTGGCC
GACATCACCG AAGAGGTGGC CAGAAAACTG AACCTGAACA GCACCGAGGG CGTGTACGTG
TCTGAAGTGG CCCCCGGTGG AAAGGGCGAT CAGGCCGGCA TTCAGCCGGG AGATGTGATC
AGAGAGATCA ACCGGCAGCG CATTCAGAAC ACGGCCGACT TTGAGGCCAT CCTGAAAGGT
ATCAAGGAGG GAGACCCCCT GCTGATCCTG ATTCGAAAAT CCAACGGCAT GTTCATGGTG
ATAAAAACCA CCCGGTAG
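
The gene sequence is printed in space-separated blocks of ten bases, so it needs light cleaning before any calculation. The following is a minimal Python sketch, not the method this database uses, that strips the block spacing and computes GC content as a percentage. The names `clean_sequence` and `gc_percent` are hypothetical. Applied to the full gene sequence, the result should round to the GC content listed in the record.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases in an already cleaned nucleotide sequence."""
    if not sequence:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in sequence if base in "GC")
    return 100.0 * gc_count / len(sequence)


# Usage: paste the block-formatted sequence lines into one string.
first_line = "ATGAAACATT ACGCCACTCT CCCCACGGCA ATCAGCAGGC GGTGCGTGGC GGGCGCCGGA"
print(round(gc_percent(clean_sequence(first_line)), 1))
```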
 
Protein sequence
MKHYATLPTA ISRRCVAGAG MALALLTLLF TMNSAALAAI QKQPALIPES FSELADAIGP 
AVVNIRTETT TAQSDRLSRH FFNSPFGEND PFNEFFERFF NAPHGRQFKQ RSLGSGFVID
SRGLIVTNNH VVENADKIIV KLKDGDEFDA TVVGTDANTD LALLEIEAKR PLPSLELGDS
DDLKVGEWVV AIGSPFGLEQ TVTAGIVSAK GRVIGAGPYD DFIQTDASIN PGNSGGPLVN
LAGEVVGINT AIIASGQGIG FAIPANLANN ILEQLETKGH VIRGWLGVGI QPVSKEMAEY
YNLESGKGAL VTEVFPGDPA DKAGIKTQDI ILEVNGKEIK DSRDLSAMIA SLPVGETIKV
MLLRDGKKKT VTVKIKERDD TRVAGKSETG TQSAMDLEVA DITEEVARKL NLNSTEGVYV
SEVAPGGKGD QAGIQPGDVI REINRQRIQN TADFEAILKG IKEGDPLLIL IRKSNGMFMV
IKTTR
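
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The sketch below is a simplified Python illustration that uses the amino acid assignments of NCBI translation table 11, which are the same as those of the standard code. It stops at the first stop codon, does not handle ambiguous bases, and does not apply the alternative start codon rule (this gene starts with ATG, so the rule is not needed here). The name `translate` is hypothetical.

```python
from itertools import product

# Codon order follows the NCBI convention: bases T, C, A, G, third position varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDDDEEEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a block-formatted DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]  # raises KeyError on non-ACGT characters
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene give the first 10 residues of the protein.
print(translate("ATGAAACATT ACGCCACTCT CCCCACGGCA"))  # MKHYATLPTA
```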