Gene Dbac_1100 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDbac_1100 
Symbol 
ID8376764 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfomicrobium baculatum DSM 4028 
KingdomBacteria 
Replicon accessionNC_013173 
Strand
Start bp1208447 
End bp1209787 
Gene Length1341 bp 
Protein Length446 aa 
Translation table11 
GC content60% 
IMG OID645000333 
Productheat shock protein HslVU, ATPase subunit HslU 
Protein accessionYP_003157620 
Protein GI256828892 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1220] ATP-dependent protease HslVU (ClpYQ), ATPase subunit 
TIGRFAM ID[TIGR00390] ATP-dependent protease HslVU, ATPase subunit 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATACCT TGACCCCACG TGAGATCGTG TCCGAACTGG ACAAATACAT TGTCGGCCAG 
ACCCAGGCCA AGCGCATGGT GGCCATCGCC CTGAGAAACC GCTGGCGTCG TCGTCAGCTC
GATCCCGAGC TGGCCGAGGA GATCGCGCCC AAGAACATTC TCATGATCGG CCCCACGGGC
GTTGGCAAGA CCGAGATCGC CCGCCGCCTG GCGAAACTTG CCGGTTCGCC CTTCATCAAG
GTCGAGGCCA CCAAGTTCAC CGAAGTGGGC TATGTCGGCC GCGACGTGGA ATCCATCATC
CGCGACCTGA TGGAGATCGG CGTCAACCTG GTCCGCCAGG AAGAAGAGGC CAGCGTGCGC
ATCAAGGCCG AGGTTTCGGC CGAAGAGCGC CTGCTGGACC TGCTGCTGCC TACCAAACCC
CTGGAGTCGG CCGGAATCGA TTATATCGGT CCCGAATCCC AGGCCGAAGG CTCCACCCGC
GAGAAGCTTC GGCAGCTGTG GCGGGCGGGC AAGCTCGACG ACCGCATGGT GGAGGTGGAG
GTTGCCACCG GGGGCGGCGT GCAGGTCATG GGCGTTCCGG GCATGGAAGG CATGGAAATG
CAGATGCAGG ACATGTTCTC CAAGGTCTTT CCCAAGAAAA AGAAGACCAA GAAGGTGGCG
GTGAAGAGCG CCTACGATAT CCTCATCCAG TCCGAATGCG AGCGCCTCAT CGATATGGAC
AAGGTGCACG AGACCGCCCG CGAAAGGGTG CAGGAATCAG GCATTGTCTT TCTGGACGAG
ATCGACAAGA TCTGCGGCGC GAACAGCTCC GGCAAGGCCG ACGTATCGCG CGAGGGCGTG
CAGCGCGACC TCCTGCCCAT CGTCGAAGGC AGCACCGTCA ACACCAAATA CGGCATGGTC
CGCAGCGATC ATATCCTCTT CATTGCCGCC GGGGCGTTCC ACATGTCCAA GCCCTCGGAC
CTGGTGCCCG AGTTGCAGGG ACGCTTCCCC TTGCGCGTGG AGCTTTCGGC CCTGACCAAG
GAGGATTTTT ACCGCATCCT GACCGAACCC AAAAACGCCC TGACCGTGCA GTACAAGGCG
CTTCTTGGCA CCGAAAAGGT CGAGATCACC TACACCGACG AGGCGCTGCT CGAAATCGCC
CGCTTCGCCC AGAAGATCAA CGAAGAGACC GAGAACATCG GCGCGCGCAG GCTCTATACC
ATCATGGAGA AAATCGTTTC TGACCTGTCC TTCGACGCTC CGGACATGGA ACAGGCCACT
GTGACCATCG ACAAGGACTA TGTGGCCAAG GCCCTGCTAG ATGTGCAGGA AGACCGGGAT
CTTACGCGCT ACATCCTGTA G
 
Protein sequence
MNTLTPREIV SELDKYIVGQ TQAKRMVAIA LRNRWRRRQL DPELAEEIAP KNILMIGPTG 
VGKTEIARRL AKLAGSPFIK VEATKFTEVG YVGRDVESII RDLMEIGVNL VRQEEEASVR
IKAEVSAEER LLDLLLPTKP LESAGIDYIG PESQAEGSTR EKLRQLWRAG KLDDRMVEVE
VATGGGVQVM GVPGMEGMEM QMQDMFSKVF PKKKKTKKVA VKSAYDILIQ SECERLIDMD
KVHETARERV QESGIVFLDE IDKICGANSS GKADVSREGV QRDLLPIVEG STVNTKYGMV
RSDHILFIAA GAFHMSKPSD LVPELQGRFP LRVELSALTK EDFYRILTEP KNALTVQYKA
LLGTEKVEIT YTDEALLEIA RFAQKINEET ENIGARRLYT IMEKIVSDLS FDAPDMEQAT
VTIDKDYVAK ALLDVQEDRD LTRYIL