Gene Dret_1543 details


Gene Information

Locus tag: Dret_1543
Symbol:
ID: 8419372
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfohalobium retbaense DSM 5692
Kingdom: Bacteria
Replicon accession: NC_013223
Strand:
Start bp: 1786758
End bp: 1789697
Gene Length: 2940 bp
Protein Length: 979 aa
Translation table: 11
GC content: 45%
IMG OID: 645038117
Product: type III restriction protein res subunit
Protein accession: YP_003198407
Protein GI: 258405665
COG category: [K] Transcription; [L] Replication, recombination and repair
COG ID: [COG1061] DNA or RNA helicases of superfamily II
TIGRFAM ID:
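
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the protein length excludes the stop codon; neither convention is stated in the record itself. The variable and function names are illustrative only.

```python
# Values copied from the record above.
start_bp = 1786758
end_bp = 1789697
gene_length_bp = 2940
protein_length_aa = 979

# Assumption: coordinates are 1-based and inclusive, so length = end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the CDS is complete, one codon per residue plus one stop codon.
codons = gene_length_bp // 3


def record_is_consistent() -> bool:
    """Check that coordinates, gene length and protein length agree."""
    return (
        span == gene_length_bp
        and gene_length_bp % 3 == 0
        and codons - 1 == protein_length_aa
    )


print(span, codons - 1, record_is_consistent())  # prints: 2940 979 True
```

Under these assumptions the three fields agree: 2940 bp is 980 codons, which is 979 residues plus a stop codon.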


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCAAACG ACTCAGGCTC ACATCCTATC TATTACCATA AGCTTGTCCG GGACAAGATT 
CCTTCCATCA TTCGGGATAA CGATCACCAA CCTACTGTCT TAAGTCTTTC GGGCCAGGAT
TTGACCCAGG CAGCAAGTCA AAAGCTTTTG GAAGAGGCTT ATGAACTGTT CACAGAGGTT
CAGGTGGGAG AGAAGCCCTC TGTGCTTAAG GAGTCAGCCG ATGTCCTGGA AGTTGTATTA
ACCATCCTGA AGCAGTTGGG CTATAGCTTT GATGACCTGA TTTCTGAAAT GGAACTACGA
CGGGAGCAAA GAGGCGGTTT TGAGCAGGGC CTTTTTCTGG AAAGCGTTGA CGGGCAGTTT
CTCAGTCACA GATTACAGCA GAGCCCGAGT TATGTATGTT CTCACTTGGA AATAGACTCC
TTATTGGATC TTTTTCGAGG AGAAATGGAA CGTAGTGACA AGGTTTGGAT AGCATCGGCC
TTTTACTCCC CGGCAATAAC CAATATTTTG ATCAGTGAAT TTGAACGCTT TATCTGCAAA
GGAGGAGAGG CCAGAGTCAT CTTGTCTACA ATGAACGGTT TTATTAAGCC CGAGTATCTG
ACCCATCTGC GTGATCATGT GCCTTACTTG AATGTTAAGG TATTCCATCC CCCTGATATC
CCTTTTCATA TCCAGCCCAA GCGAGATTTT CATGTCAAAG CCTATATATT CAAGCACAGA
ACTGGCAAAG GCTCGGCTAT CATTGGATCT TCCAACTTAT CTCAGGGTGG GTTTTCCGAT
AACATTGAAT GGAACTATTA CTCAGCCGGG GAGATCAACC TTCCTTTTGA AAATAAGCAG
ACTCCATGGG AAAAGATAGT TCATGAGTTT GAATCCCTGT GGGCCAATGA GTGTGTGCAT
GTTACCGATG ATTTTTTGGC CGGTTACAGG AAGCGACACA GGGATGTTTT TCAGGAAAGA
GAGAAGCCGT CTGATGCGTA TGGAAGCGAG GAGCAAAGCC CAGTTCAGCC GGGCATTGGG
GCAGACAGCG CTGCTGCATA CGGGAAAAGA AAGCAATCCA AAAGTGAATC AGAATCAGTG
AGCCCCAACA TCGCCCAAGG TGAGGCTCTG GAAGGGCTTT TGAAACTCAG AAACAGAAAA
GCCAGGTCCG GAGCGGTTAT TGCAGCCACC GGGGTGGGCA AGACCTATTT GGCTGCATTT
GACTTCATTC AAAGCGGCAA AGACAAATGC CTTTTTATCG CCCACAGAGA AGACATCCTC
AGAAAAGCCA AGGAAAGTTT TTCCCATGTC CTTGGCCCAG AGGGGTTGGA GATCTTCAGC
GGCAGGAGCA AGGAGATCTC ACATGGGTCA AGGGCTGTTT TTGCTATGAT CCAGACCTTG
GGACGGCAGG ACAATATGGA ACGTTTTCAT CCCGAGGAGT TTGACTACAT TGTCATGGAT
GAGTTCCACC ATGCCATGGC TGCTACCTAT CGCAGGGTCT TGGATTATTT TCAGCCCGAT
TTCCTGTTAG GTCTCACAGC TACTCCTGAG CGCATGGATG GACGGGATGT CCTCTGGCTA
TGCGATTACA ATATTGCATA TGAAATGAGG CTTTTCCAGG CCATTGACAA AGAATTGCTG
GCCCCTTTTC AGTACTTCGC TGTCCATGAT CCGACTGATT ATGCCCAGAT TTCCTGGAAA
CGAACAGATT ATGATCAAGA AGAACTGACC AAGGCCCTGG CAAATGACAC CAGAACAACC
ATTATTGCCA ATAACTTGAA AAAATTTCTC CCGTACCAGG GTAAGATAAA GGCCTTGGCA
TTTTGTAGCT CAGTTGACCA TGCCCAGTAT ACAGCTGCGC GTTTGACCCA GGAGCATGAT
TTTGAGGCCA TGGCTTTGGT AGGCGATTCT TCGCAGGATC AAAGAGAAGA AGCCGTGGCC
AGGCTTGAGG AGGAAAATGA TCCTCTCAAG CTTATATGTT GCGTGGATAT CTTCAATGAG
GGCATCGATA TCCCTAAGCT CAGCCATGTA CTGCTCCTTA GGCCTACCCA GTCTTTTACA
GTCTTTCTCC AGCAACTTGG CCGAGGGCTT CGAAAGATAC AGAGCGAAGA TGTGGAAAAG
CATCTTGTGG TCATAGATTT TGTCGGTAAT TTTCGAACTG CACATGTAGC GCCCTTGGCC
TTGGCCGGCT ATACCTCAAT TCAAGAATTT ACCCAGGATA GTGAAGGTTC AAAGGAAAGC
AAACTTGATT TAAGCAATCC GCCTAAGGGA TGTTTTGTTT CACCTGATCT GGAAGTGCAA
AGAATATGGG AGAGTAAGCT GAGGGAGATT GCTCCCATGT CCAGAGCAGA GCAGTTACGA
GCCCTTTATG ACGAGGTGGT ACAGGATTTA GGGCTTATTT CTCCAGGACT TTGTGAATTT
TATGCCGATC CCCAAAAAGC AGACCCCCAT GCATTCATTA AGTATTTTGG AAGCTGGATT
AAGACCAAAA AAGCATTTAA AGACCTGTTG GATTTTGAAC AGAACTTATT AGGAACTCAA
GGCGAGTCGT TTCTGGAGTA TCTGGAAAAA GATTTGAACC CGGTCAAGTC TTATAAGATG
GTTGTTCTCA AAACATTGCT CTCATTTGAG GGAGTCTCAT GGAATGTCTC TGAAATTGCT
CAAGGGTTTT TAAGTTATTA CCTTAATCAC CCTGAATATC TATCAGATTA TGATGATTTG
GATAGGAAAG AGAACCCAGA GGAGATTTCT CTGCAAAGAG TAGAAAGGCA TATCATGAAT
ATGCCTTTGA AGTACTTGAG CAATAAAGGC AAAGACTGGT TTGTATTAGA TAGAGAAAAA
AAAGTCTTTT CAGTAAAAAA TGATCTTATT GATTATTGGA GTGCAGCATT CTATAAGCAA
CTTATGCTCG ATAGAGCTGA TTATGCATTA GCCAGATATT TTTATCGGAA AACACAATAG
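
The gene sequence is printed in blocks of 10 bases, 60 per line, so it needs to be joined before any length or composition check. The following is a minimal sketch of that step. The helper names `unblock` and `gc_fraction` are hypothetical, and only the first printed line is used as sample input.

```python
def unblock(text: str) -> str:
    """Join a sequence printed in whitespace-separated blocks into one string."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence (0.0 for empty input)."""
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence above, exactly as printed.
first_line = "ATGCCAAACG ACTCAGGCTC ACATCCTATC TATTACCATA AGCTTGTCCG GGACAAGATT"
seq = unblock(first_line)

print(len(seq), seq[:3])  # prints: 60 ATG
```

Passing the whole block of lines to `unblock` and then to `gc_fraction` would give a value to compare against the GC content field of the record; rounding conventions used by the source database are not stated, so small differences are possible.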
 
Protein sequence
MPNDSGSHPI YYHKLVRDKI PSIIRDNDHQ PTVLSLSGQD LTQAASQKLL EEAYELFTEV 
QVGEKPSVLK ESADVLEVVL TILKQLGYSF DDLISEMELR REQRGGFEQG LFLESVDGQF
LSHRLQQSPS YVCSHLEIDS LLDLFRGEME RSDKVWIASA FYSPAITNIL ISEFERFICK
GGEARVILST MNGFIKPEYL THLRDHVPYL NVKVFHPPDI PFHIQPKRDF HVKAYIFKHR
TGKGSAIIGS SNLSQGGFSD NIEWNYYSAG EINLPFENKQ TPWEKIVHEF ESLWANECVH
VTDDFLAGYR KRHRDVFQER EKPSDAYGSE EQSPVQPGIG ADSAAAYGKR KQSKSESESV
SPNIAQGEAL EGLLKLRNRK ARSGAVIAAT GVGKTYLAAF DFIQSGKDKC LFIAHREDIL
RKAKESFSHV LGPEGLEIFS GRSKEISHGS RAVFAMIQTL GRQDNMERFH PEEFDYIVMD
EFHHAMAATY RRVLDYFQPD FLLGLTATPE RMDGRDVLWL CDYNIAYEMR LFQAIDKELL
APFQYFAVHD PTDYAQISWK RTDYDQEELT KALANDTRTT IIANNLKKFL PYQGKIKALA
FCSSVDHAQY TAARLTQEHD FEAMALVGDS SQDQREEAVA RLEEENDPLK LICCVDIFNE
GIDIPKLSHV LLLRPTQSFT VFLQQLGRGL RKIQSEDVEK HLVVIDFVGN FRTAHVAPLA
LAGYTSIQEF TQDSEGSKES KLDLSNPPKG CFVSPDLEVQ RIWESKLREI APMSRAEQLR
ALYDEVVQDL GLISPGLCEF YADPQKADPH AFIKYFGSWI KTKKAFKDLL DFEQNLLGTQ
GESFLEYLEK DLNPVKSYKM VVLKTLLSFE GVSWNVSEIA QGFLSYYLNH PEYLSDYDDL
DRKENPEEIS LQRVERHIMN MPLKYLSNKG KDWFVLDREK KVFSVKNDLI DYWSAAFYKQ
LMLDRADYAL ARYFYRKTQ