Gene Dole_2481 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDole_2481 
Symbol 
ID5695331 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfococcus oleovorans Hxd3 
KingdomBacteria 
Replicon accessionNC_009943 
Strand
Start bp3008228 
End bp3010483 
Gene Length2256 bp 
Protein Length751 aa 
Translation table11 
GC content61% 
IMG OID641265089 
ProductATP-dependent Clp protease, ATP-binding subunit clpA 
Protein accessionYP_001530362 
Protein GI158522492 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0542] ATPases with chaperone activity, ATP-binding subunit 
TIGRFAM ID[TIGR02639] ATP-dependent Clp protease ATP-binding subunit clpA 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATCAGCA AGGAACTCAG CGCCACCCTC GGCTTTGCCG TGAGCGAGGC AAAAAAACAC 
CGTCATGAAT TCGTGTGCCT GGAGCATATC CTGTATGCGA TTCTGCACGA GGCCACCGGC
GTCGAGATCA TCTACGGGTG CGGCGGCAAT GTGGATGCGT TGAAGGAGGC GCTTCAGGAT
TTCTTCCGGC ACAAGATCGA CCGGGTGTCC GAGGAAGATG AATACGTGCT GCAGCAGACC
ATCGGGTTTC AGCGGGTCAT TCAACGGGCC GTGAACCACG CGCGGTCGGC GGAAAAGGAA
AAGGTGGCCG TGGGCGATAT TCTGGCGGCC CTTTTTGAGG AGAAGTCCTC CCACGCGGTC
TATTTTCTGG AGTCCGAAGG GGTCACCCGG CTGGAGGTGC TGAAGTATAT CTCCCATGGA
ACCGATGACG GGACGCTTTC GCCGGATGTG GAAGAGGGCA CAAGGACCGG CCGGCCGGAA
AAGAAAAAGG TCACCGACCC CCTGGAGGTA TTCACCGTTG AACTGGTCCG GCGGGCCGCC
GAAGGCAAGC TTGATCCGGT GATCGGCCGG GAGATGGAGC TGCGGCGGGC CATACAGGTG
CTCTGCCGGC GGCGCAAGAA TAACCCGGTG TTTGTGGGAG AGCCGGGCGT GGGCAAAACC
GCCATTGCCG AGGGGCTGGC CCAGAAGATT TCCGATGGGG ATGTGCCGGA TATGCTGGCC
GACACCCGGA TTTACTCGCT GGACCTGGGA TCCCTGCTGG CCGGCACCAA GTTCCGGGGC
GATTTTGAGC AGCGGCTGAA GAAGGTGGTG CTGGCCCTTC AGAAACAGCC CGGCGCGATT
CTGTTCATCG ACGAGATTCA TACCATCGTG GGCGCCGGCG CCACCAGCAG CGGGTCCATG
GATGCCTCCA ACATTCTCAA GCCGGTGCTG GCCACCGGTG ACATTCGCTG CATCGGTTCC
ACCACCTACG AGGAGTACAC CAACCATTTT GCCAAGGACC GTGCCCTGTC CCGCCGGTTT
GAAAAGATCG AGATCGGCGA ACCGTCGGTG CCCGCCTGCG TAAAGATTCT TCGGGGGTTA
AAGTCCCGTT ACGAGGAGCA CCATCATATC ACCTTCACCG ATGCCGCCAT TAAGAGCGCG
GCGGACCTTT CCGCCCGGTA CCTGAATGAC CGGTACCTGC CGGACAAGGC CATTGACGTG
ATTGACGAAG CCGGGGCCGC CATTCGCCTC TCCGGCGGGG CTCACCGGAC CAAAGTCCAT
TCCACGGACA TTGAAAAGAT CGTGTCCGAC ATGGCCCGTG TGCCGGTGCG CAGCGTGTCG
AAAAATGACC GGCAGCGGCT GGAGGGCCTG GAACGGGAGC TGAAACAGGT GGTTTTCGGC
CAGGACGAGG CCATCGCGTT TCTGACCACC GCCATCAAGC GGAGCCGGGC CGGGCTGGGC
AAGCCGGAAA AGCCCATCGG TTCTTTTCTG TTTACCGGTC CCACAGGCGT GGGCAAGACC
GAAATCGCCC GGCAGATGGC GGCGATTCTG GGCGTGGCGT TTATTCGGTT TGACATGAGC
GAATACATGG AAAAACACGC CGTGGCCCGG CTGATCGGCG CGCCCCCCGG CTATGTGGGG
TTTGACCAGG CCGGCCTGCT CACCGACCGG ATTCGCAAGC ATCCTTACAG CGTGCTCCTG
CTTGATGAAA TCGAGAAGGC CCACGCCGAT GTTTACAATA TTTTGCTTCA GGTGATGGAT
CATGCCACCC TTACCGACAA CAACGGCAAG GAGGCCGATT TTAGAAACGT GATCCTGATC
ATGACCTCCA ACGTGGGGTC CCGGGAGATC AGCAGCCAGG CCATCGGCTT TTCCGGCGAC
ACCGGCAGTC CGGCGGGCCG GGGGAAAAAG GCGGTGGAAA ATTTCTTCTC ACCGGAGTTT
CGCAATCGGC TGGACGGCGT CATCGGGTTC AACCGGCTGG CGCCGGCGAT CATGGAAAAA
GTGGTGGACA AGTTTGTCGG CCAGCTGGCG GCCCAGCTGG AAGCGCGCCG GATCACCCTT
GCCATGGATG CCGATGCCCG GCAGTGGCTG GCCCGGCACG GGCATGACCC GGCATATGGC
GCGCGTCCCC TGGAGCGGCT CATTCAGGCG GCGATCATGG ACGTGCTGGC CGACGAGATC
CTGTTCGGCC GCCTTGAAAA GGGAGGGGCC GTGCGTGTGG GGGTTGCGGG CGAAGAACTC
TCTTTTGCCT ATGATTCGCC GGATTCCCTT CACTGA
 
Protein sequence
MISKELSATL GFAVSEAKKH RHEFVCLEHI LYAILHEATG VEIIYGCGGN VDALKEALQD 
FFRHKIDRVS EEDEYVLQQT IGFQRVIQRA VNHARSAEKE KVAVGDILAA LFEEKSSHAV
YFLESEGVTR LEVLKYISHG TDDGTLSPDV EEGTRTGRPE KKKVTDPLEV FTVELVRRAA
EGKLDPVIGR EMELRRAIQV LCRRRKNNPV FVGEPGVGKT AIAEGLAQKI SDGDVPDMLA
DTRIYSLDLG SLLAGTKFRG DFEQRLKKVV LALQKQPGAI LFIDEIHTIV GAGATSSGSM
DASNILKPVL ATGDIRCIGS TTYEEYTNHF AKDRALSRRF EKIEIGEPSV PACVKILRGL
KSRYEEHHHI TFTDAAIKSA ADLSARYLND RYLPDKAIDV IDEAGAAIRL SGGAHRTKVH
STDIEKIVSD MARVPVRSVS KNDRQRLEGL ERELKQVVFG QDEAIAFLTT AIKRSRAGLG
KPEKPIGSFL FTGPTGVGKT EIARQMAAIL GVAFIRFDMS EYMEKHAVAR LIGAPPGYVG
FDQAGLLTDR IRKHPYSVLL LDEIEKAHAD VYNILLQVMD HATLTDNNGK EADFRNVILI
MTSNVGSREI SSQAIGFSGD TGSPAGRGKK AVENFFSPEF RNRLDGVIGF NRLAPAIMEK
VVDKFVGQLA AQLEARRITL AMDADARQWL ARHGHDPAYG ARPLERLIQA AIMDVLADEI
LFGRLEKGGA VRVGVAGEEL SFAYDSPDSL H