Gene Dret_1366 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDret_1366 
Symbol 
ID8419195 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfohalobium retbaense DSM 5692 
KingdomBacteria 
Replicon accessionNC_013223 
Strand
Start bp1593073 
End bp1595316 
Gene Length2244 bp 
Protein Length747 aa 
Translation table11 
GC content56% 
IMG OID645037942 
ProductATP-dependent Clp protease, ATP-binding subunit clpA 
Protein accessionYP_003198232 
Protein GI258405490 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0542] ATPases with chaperone activity, ATP-binding subunit 
TIGRFAM ID[TIGR02639] ATP-dependent Clp protease ATP-binding subunit clpA 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000942242 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.444853 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTCAGTA AGGAACTTGA GAGAATATTC GCGTCGGCAG TTCACGAAGT AAAAAGGCGT 
CAGCATGAGT TCCTGACGCT CGAACATATA TTGTATGCAT ATGTCCAAGA TACGAACGGG
AAGGCCTTTT TGCACCATTG CGGGGCGAAT CTCAAAGAAT TGAAGACTGA ACTTGAGCAG
TTCTTTGTGA ACCACCTCGA GGTCTATCCG AGTGGACAAT CCAGAGAAGT CATCCAGACC
TTAAGCGTCC AGCGGGTTTT GCAACGCGCG ATCATGCACA TGCAATCCGC GGGCAAGGAA
GCGGTTCAGG TCGGCGATCT TCTGGCCTCG CTTCTCGAGG AAGAGGAGTC CTACGCAGCC
TACTATTTGC GCCATCAGGG CGTCGAGCGA TTGGATGTAC TCGAATACCT CTCCCACGGC
CAGGAACACC AGACTGCTGA GGAAAGCGCC GAGAGCAAGG GCAGTTCAGC CCTGGACCAG
TATGCGGTGG ATCTGATCCA GCGCGCCAGG GACGGCATGA TCGACCCCTT GATCGGTCGC
GAAGGAGAGC TGCAGCGGAC CGTGCAAGTG CTGGCCCGGC GGCGGAAAAA CAACCCCATT
TATGTCGGGG ATCCCGGCGT GGGCAAGACC GCCTTGGCCG AGGGGCTGGC TCGGAACATC
GTGAATAAAG AGGTGCCGGA GTCGTTTCAG GACGCGCGCC TCTTCAGCCT CGATATGGGG
GCCTTGTTGG CCGGCAGTAA ATACCGCGGC GATTTTGAGG CGCGTTTGAA GGCTGTGATC
AAGGAATTGA CCCAGATGGA CAAGGCCATC TTGTTCATCG ATGAGATCCA CACCATTGTT
GGGGCCGGCG CTACCAGCGG CGGCAGCATG GATGCCTCCA ATATCCTCAA GCCGGTCTTG
GCCTCCGGGG AGTTGCGGTG TATTGGATCG ACCACATATG AAGAGTACAA AAACCATTTT
GAAAAAGACC GCGCCCTGTC CAGGCGATTT CAGAAGATTG AGGTCCCGGA GCCGTCCGAA
GCCGACTCGA TCGCGATCCT TCAAGGGTTG CGTCCCTATT ACGAACGGCA CCACGGTATC
CAGTACACGA CCGCAGCCAT CCGGGCCGCC GTGGAACTGA CCTCACGGTA TGTTTCAGAC
AAATATCTTC CGGACAAGGC CATAGATGTT GTCGATGAGG CCGGCGCTTT GTTCCGTCTG
GAGACGGGGA AGAAGCGGCG CAAGCGTGTG ACCCCGAAGG ATATCGAGAA AGTTGTGGCC
AGTGTCGCCA AGGTCCCGAT CAAAAGTGTC AGCCATTCGG ACAAGGAGCG GTTGGCGCAC
CTCGACTCTG AACTCAAAGG GGTCGTCTTT GGTCAAGACA AGGCGGTTTC GGTCATCGGC
CAGTCGATCA AGCGTTCCCG GGCCGGATTG CGGGAAGCTG GAAAGCCGGT GGGCAATTTC
CTGCTGGTCG GCCCCACGGG TGTGGGCAAG ACCGAATTGG CCAAGCAGCT GTCCAACGTG
CTGGGCATTC ACTTTATGCG CTTTGACATG AGTGAATACA TGGAAAAGCA CGCGGTGGCC
CGGCTTATCG GGGCGCCCCC TGGCTATGTG GGCTTTGATC AGGGCGGTTT GTTGACCGAC
GCGGTGCGCA AGCATCCCCA TTGCGTCCTG CTTTTGGACG AGGTCGAAAA AGCGCATCCG
GACCTGTTTA ATATCTTGCT TCAGGTCATG GATTACGCGA CATTGACGGA CAATAACGGC
CGTAAGGCGG ATTTTCGTCA CGTCGTTTTG CTTATGACCT CGAATGCCGG GGCCCGGGAA
ATGAGCAGCA AGAGCATCGG TTTTGGCCAG GAATCGGCCG GCGAAAAGAC CAACAAAGGG
GTCCGGGCTG CAGAACAATT GTTCAGCCCC GAATTCCGTA ACCGCCTGGA TGCCATTGTC
CCCTTTGCTC CGCTGAGTCA GGCGCTGATG GAACAGATCG TGGACAAATT CATTGCCGAA
CTCAATGCCC AACTCGCTGA GAAACGAGTT CATATCCATC TCCAACCCAA GGCGAAAGCG
GTACTGGCCC GCAAAGGATA TGATCCCGAC TATGGCGCTA GGCCGCTTGG GCGCGTTCTC
CAGGAAGAGA TCAAGGACAA GCTTGCTGAC GCCATGCTGT TCGGTAAGCT GCAGGACGGG
GGAGAAGTCG TTGTTGGGAC ACGCAAATCC GGAACAGAGG ATCAACTGAG TTTGCGTTTC
CAGTCGCGCG ATTCCAAGAG CTAA
 
Protein sequence
MLSKELERIF ASAVHEVKRR QHEFLTLEHI LYAYVQDTNG KAFLHHCGAN LKELKTELEQ 
FFVNHLEVYP SGQSREVIQT LSVQRVLQRA IMHMQSAGKE AVQVGDLLAS LLEEEESYAA
YYLRHQGVER LDVLEYLSHG QEHQTAEESA ESKGSSALDQ YAVDLIQRAR DGMIDPLIGR
EGELQRTVQV LARRRKNNPI YVGDPGVGKT ALAEGLARNI VNKEVPESFQ DARLFSLDMG
ALLAGSKYRG DFEARLKAVI KELTQMDKAI LFIDEIHTIV GAGATSGGSM DASNILKPVL
ASGELRCIGS TTYEEYKNHF EKDRALSRRF QKIEVPEPSE ADSIAILQGL RPYYERHHGI
QYTTAAIRAA VELTSRYVSD KYLPDKAIDV VDEAGALFRL ETGKKRRKRV TPKDIEKVVA
SVAKVPIKSV SHSDKERLAH LDSELKGVVF GQDKAVSVIG QSIKRSRAGL REAGKPVGNF
LLVGPTGVGK TELAKQLSNV LGIHFMRFDM SEYMEKHAVA RLIGAPPGYV GFDQGGLLTD
AVRKHPHCVL LLDEVEKAHP DLFNILLQVM DYATLTDNNG RKADFRHVVL LMTSNAGARE
MSSKSIGFGQ ESAGEKTNKG VRAAEQLFSP EFRNRLDAIV PFAPLSQALM EQIVDKFIAE
LNAQLAEKRV HIHLQPKAKA VLARKGYDPD YGARPLGRVL QEEIKDKLAD AMLFGKLQDG
GEVVVGTRKS GTEDQLSLRF QSRDSKS