Gene Dbac_0671 details

Gene Information

Locus tag: Dbac_0671
Symbol:
ID: 8376324
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfomicrobium baculatum DSM 4028
Kingdom: Bacteria
Replicon accession: NC_013173
Strand:
Start bp: 745616
End bp: 747037
Gene Length: 1422 bp
Protein Length: 473 aa
Translation table: 11
GC content: 62%
IMG OID: 644999913
Product: protease Do
Protein accession: YP_003157210
Protein GI: 256828482
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain
TIGRFAM ID: [TIGR02037] periplasmic serine protease, Do/DeqQ family
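
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a stop codon that is not counted in the protein length; the function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of residues encoded by a CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # The terminal stop codon does not encode an amino acid.
    return codons - 1 if has_stop_codon else codons


length = gene_length_bp(745616, 747037)
print(length)                     # 1422
print(protein_length_aa(length))  # 473
```

Both values match the Gene Length and Protein Length fields reported for Dbac_0671.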


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.0470073
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAATATG CACTTCGACT CTTTACCATT ATGTTCGTGT TTCTGGCCTC GGCCTCCATG 
GCTGCGCAGT TGCCGGATTT CACGGAGCTC GCGGAAAAGT CGGGCCAGGC GGTGGTTAAC
ATCAGTACGG TCAAGCTCGT GAAAAATCAG GGCAACATGC AGCAGTTTTT CCCGAGGGGG
CCACAGGGAC AGCACCCGTT CGGAGATTTT TTCGACCAGT TCGAGCGTTT TTTCGGAGAG
CAGGGGCAAG GAACTCCGCG TGAACAGCGT TCTCTGGGAT CGGGTTTCGT CTTCTCCGCC
GACGGGTACA TCGTCACCAA CAATCACGTC ATCGAGGGCG CGGATTCCAT CAAGGTCAAC
CTCCAGGTCG ACAAGAACGG AGACCGTTCC TACGACGCCG AGGTCATCGG GACGGACAAG
GAGACGGATC TGGCGCTGCT GAAGATCAAG GCCGACAAGC CGTTGCCGTA CCTCGCCTTT
GGTGACTCCG ACGTGCTCAA GGTCGGGCAA TGGGTCATGG CCATCGGCAA CCCCTTTGGC
CTTGATCATA CCGTCACGGC CGGAATTGTC AGCGCCAAGG GGCGCACCAT CGGCGCCGGT
CCCTACGACA ACTTCATCCA GACCGACGCC TCCATCAACC CCGGCAACAG CGGTGGTCCG
CTCATCGACC TGGACGGAAA GGTCATCGGC ATCAATACGG CCATCGTTGC TTCGGGTCAG
GGCATCGGTT TTGCCATCCC CAGCGATCTG GCCAGACAGG TCATTGAGCA GCTCAAGGAA
TACAAGAGCG TGAAGCGCGG CTGGCTCGGC GTGTCCATCC AGAATGTGGA CGAGAACTCC
GCCAAGGCCT TGGGCCTTGA CCAGGCCAGC GGCGCCCTGG TCTCGTCCGT GACTGTCGGA
GACCCGGCGG AAAAGGCCGG AATCAAGGCA GGAGACGTCA TTGTCGCGGT GGATGGAGTG
TCGGTGGCCG ACGCCGGCGA TCTGACCCGC AAGATCGGCG ACCTCTTGCC CGGCGTGAAG
ATCACGCTTT CGGTCTGGCG CGAAGGCAAG ACCGTCACGA TCCCTCTGGT TCTGGGTGAG
CGCAGCGCGG AGAAGGTCGC TCAGGGCCGG CCCGGCGCTC CTGGCAGCCA GGGCGAGGAT
GTCCTGGGCC TGAGCGTTCG GCCCGTGGCC GAGGCCGAGG CGAAGGCGCT GGAACTCGAC
CGGGCCCAGG GGCTTCTGGT GGTTGAAGTG AGCGAGGGAT CCCCGGCTGC GCAAAACGAC
TTGAGCGCAG GGGATGTCAT CCTTGAAGCC AACGGCAAGG CCGTGAACAC GGTCAAGGCC
CTCAAGGACG TGATCGAAGG CGATGGCAAG GAAAAGGGCG TCGTCATGCT GCTGGTCAAG
CGCCAGGGTC GCAACGTGTT CCGCACCGTG CCCCTTTCCT AG
 
Protein sequence
MKYALRLFTI MFVFLASASM AAQLPDFTEL AEKSGQAVVN ISTVKLVKNQ GNMQQFFPRG 
PQGQHPFGDF FDQFERFFGE QGQGTPREQR SLGSGFVFSA DGYIVTNNHV IEGADSIKVN
LQVDKNGDRS YDAEVIGTDK ETDLALLKIK ADKPLPYLAF GDSDVLKVGQ WVMAIGNPFG
LDHTVTAGIV SAKGRTIGAG PYDNFIQTDA SINPGNSGGP LIDLDGKVIG INTAIVASGQ
GIGFAIPSDL ARQVIEQLKE YKSVKRGWLG VSIQNVDENS AKALGLDQAS GALVSSVTVG
DPAEKAGIKA GDVIVAVDGV SVADAGDLTR KIGDLLPGVK ITLSVWREGK TVTIPLVLGE
RSAEKVAQGR PGAPGSQGED VLGLSVRPVA EAEAKALELD RAQGLLVVEV SEGSPAAQND
LSAGDVILEA NGKAVNTVKA LKDVIEGDGK EKGVVMLLVK RQGRNVFRTV PLS
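
The gene and protein sequences above can be cross-checked by translating the nucleotide sequence and by computing its GC fraction. The following is a minimal sketch in plain Python, assuming the sequence is pasted as text with the 10-base blocks separated by whitespace. The helper names (`clean`, `translate`, `gc_fraction`) are hypothetical. Translation table 11 uses the same codon-to-amino-acid assignments as the standard code (it differs in permitted start codons), so the standard table is used here.

```python
from itertools import product

BASES = "TCAG"
# Standard codon table in TCAG order; table 11 shares these assignments.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove whitespace between sequence blocks and upper-case the bases."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a CDS in frame, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First three 10-base blocks of the gene sequence.
print(translate("ATGAAATATG CACTTCGACT CTTTACCATT"))  # MKYALRLFTI
```

Applied to the full gene sequence, `translate` can be compared against the protein sequence listed above, and `gc_fraction` against the reported GC content.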