Gene Daro_0204 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDaro_0204 
SymbolhslU 
ID3569899 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDechloromonas aromatica RCB 
KingdomBacteria 
Replicon accessionNC_007298 
Strand
Start bp219783 
End bp221123 
Gene Length1341 bp 
Protein Length446 aa 
Translation table11 
GC content57% 
IMG OID637678642 
ProductATP-dependent protease ATP-binding subunit HslU 
Protein accessionYP_283433 
Protein GI71905846 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1220] ATP-dependent protease HslVU (ClpYQ), ATPase subunit 
TIGRFAM ID[TIGR00390] ATP-dependent protease HslVU, ATPase subunit 


Plasmid Coverage information

Num covering plasmid clones46 
Plasmid unclonability p-value0.337304 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGACGA TGACGCCGCA GGAAATCGTT TCCGAACTCG ACAAGCACAT CGTCGGCCAG 
AAAAATGCCA AGAAGGCTGT TGCCATTGCC TTGCGTAATC GTTGGCGCCG TTCGCAAGTG
GCCGAGCCCT TGCGCCAGGA AATTACGCCG AAGAATATCC TGATGATCGG CCCGACAGGT
GTCGGCAAGA CCGAGATTGC TCGCCGCCTG GCCCGTCTGG CCAATGCGCC GTTCATCAAG
ATCGAGGCGA CCAAGTTTAC CGAAGTCGGC TACGTCGGTC GTGACGTCGA GACGATCATC
CGCGATTTGG TCGAAATGGC CATCAAATCG CATCGCGAAC GGGCCATGAA GGCAATGCGT
GCTCGGGCCG AAGACGCGGC CGAGGAGCGC ATTCTCGATG TGTTGCTGCC AACGGTGCGC
GGTCCAAACT TTTTTGCCGA GAATTCCGAG TCGACCGCAG CGGAAAACAC GACGCGCCAG
AAATTCCGCA AAAAACTGCG CGAAGGCGAA CTGGACGACA AGGAAGTTGA CATTGAAGTG
GCTGCGCCAA GTCTCCAGGC CGAGATATTT GCCCCGCCGG GGATGGAAGA GCTGACCCAG
CAGATCCAGG GCATGTTTCA AAGCGTCGGC GGCGGCAAGA AAAAGTCACG CAAGCTGTCG
ATCAAGGAGG CATTAAAGCT GCTGACCGAC GAGGAAGCTG CCAAGTTGGT CAATGATGAC
GACGTCAAGC AGGAGGCGGT CAAGGCGGTC GAGCAGAACG GCATCGTTTT TCTCGACGAG
CTGGACAAGA TCGCCAGTCG CTCCGAGATG CACGGCGCCG ATGTTTCGCG CCAAGGCGTG
CAGCGCGACT TGCTGCCGCT GGTCGAAGGG ACGACGGTGT CGACCAAGTA CGGGATGATC
AAGACCGACC ACATCCTGTT TATTGCCTCC GGTGCCTTCC ACCTATCCAA ACCGTCCGAC
CTGATTCCCG AACTGCAGGG CCGCTTTCCG ATTCGGGTCG AGCTGGATTC GCTGTCAGTA
GCCGATTTTG AGTGCATCCT GACGCAGACT GATGCCTGTC TGACGCGTCA ATACCAGGCG
CTGCTCGAGA CGGAGGGAGT CCAGCTTGAG TTTGTTGAGG ACGGTATCCG GCGTCTCGCT
GAAATCGCCT TCCAGGTGAA TGAAAAAACC GAGAACATCG GTGCCCGTCG TCTGCATACG
GTCATGGAAA AGCTGCTTGA AGAGGTGTCC TTCGATGCAG GCAAGGTAGG CCTCGACAAG
GTTCTGATCG ATGCAGCCTA CGTCAATACC AAGCTGGGCG AACTGGCTGC CGACGAAGAC
CTGTCGCGTT ACGTTCTGTA A
 
Protein sequence
MTTMTPQEIV SELDKHIVGQ KNAKKAVAIA LRNRWRRSQV AEPLRQEITP KNILMIGPTG 
VGKTEIARRL ARLANAPFIK IEATKFTEVG YVGRDVETII RDLVEMAIKS HRERAMKAMR
ARAEDAAEER ILDVLLPTVR GPNFFAENSE STAAENTTRQ KFRKKLREGE LDDKEVDIEV
AAPSLQAEIF APPGMEELTQ QIQGMFQSVG GGKKKSRKLS IKEALKLLTD EEAAKLVNDD
DVKQEAVKAV EQNGIVFLDE LDKIASRSEM HGADVSRQGV QRDLLPLVEG TTVSTKYGMI
KTDHILFIAS GAFHLSKPSD LIPELQGRFP IRVELDSLSV ADFECILTQT DACLTRQYQA
LLETEGVQLE FVEDGIRRLA EIAFQVNEKT ENIGARRLHT VMEKLLEEVS FDAGKVGLDK
VLIDAAYVNT KLGELAADED LSRYVL