Gene Dret_2402 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDret_2402 
Symbol 
ID8420262 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfohalobium retbaense DSM 5692 
KingdomBacteria 
Replicon accessionNC_013223 
Strand
Start bp2747212 
End bp2750295 
Gene Length3084 bp 
Protein Length1027 aa 
Translation table11 
GC content64% 
IMG OID645039003 
ProductUvrD/REP helicase 
Protein accessionYP_003199262 
Protein GI258406520 
COG category[S] Function unknown 
COG ID[COG1379] Uncharacterized conserved protein 
TIGRFAM ID[TIGR00375] conserved hypothetical protein TIGR00375 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTGAGA CACAATTGTA TTACGCGGAC CTGCACATCC ATTCCAAATA TTCCCGGGCG 
ACCAGCAGAC AACTCACCCC GCGCCATCTC GCCGCCTGGG GCTGGGTCAA AGGGATCGAC
CTCCTGGCCA CCGGGGATTT CACGCACCCG GCCTGGTTGG CCTTGCTCCA GGAACAACTC
GAGGAGGACG GCCGCGGCCT GTTACGTCTC AAAGAGCCAA CTCGCCTGGA ACAGGAACTG
CCCTGGCTGG AAACTCCGGT ACCGAAAGCG CCGCGGTTCA TGCTCGGTAC GGAGATCAGC
TCCATCTACA AGCGCGGCGG CGTGGTGCGC AAAGTCCACA ACCTGGTCTA TCTCCCGAAC
TTCGACGCCG TCCGCCGCTT CAACGAACGC CTGGACCAGG TCGGCAATCT TGAGGCCGAT
GGCCGCCCCA TCCTGGGCCT CGATTCAGAG CACCTTTTGG AAATGGTCCT GGAGACCGAT
CCCCTCGGTT TCGTCATCCC CGCCCATATC TGGACGCCCT GGTTTTCCCT GTTCGGCTCC
AAATCCGGCT TCAACGCTCT TGAGGAGTGT TTCGGGAGCC TTTCCCAGCA TATCTTTGCC
GCCGAAACCG GGTTGTCCTC GGACCCGGCC ATGAATTGGC ACTGGTCCGC CCTGGACCAC
CTGACCCTGG TCTCCAATTC CGACGCCCAT TCCGGGGCCA ACCTCGCCCG GGAGGCGACC
CTGTTCAGCG GCACCATGGA TTTCGCCACG ATTCGCGAAG GACTGCGCGA CCGCAGTACA
GGGACGTTTC AGGGCACCTT GGAGTTTTTC CCCGAAGAAG GGAAATACCA CCACGACGGC
CACCGTAAAT GCGGCCTCTC CTGGGACCCG CGGCAGACCG AGGACCACGG CGGTATCTGT
CCGGTCTGCG GACGTCCGGT GACAGTCGGG GTATTGAACC GCATCCTGGA ACTGGCCGAC
CGCAAAGAGC CCCTCCGGCC TCCGGACCAC CCCGATTTTG TCTCCCTGGT CCCCCTTCCG
GAACTGCTCT CGGAACTGCT GAGCGTCGGG CCCAAAAGCA AGACCGTGCA ACGCCGCTAT
TGCCGCCTCC TGGCCCAATT CGGCCCAGAA CTGACCATCC TGCGCCACAC CCCAGTTGAG
GAACTCCGCC AGCATTGGTC CCTTCTCGGC GAGGCTATCG GTCGCATGCG CTCGGGCCGC
GTCCAGCGTC AGCCCGGGTT TGACGGGCAA TACGGGGTCA TCCGCGTCTT CAGCGACCGG
GAACGCCGCG AGTGGCGCCA CGGCCCGAGC CTGGTCCTGC CCACAGATCT GGACCACCCG
TCCTGCACCG CCCAGACCGC CCCTGGTGGC GCTGGGGCCG GGAGGGAAGA AGGCGATGTC
GAAAAGCACC CGACCCCGAA CCCCGAACAG GAAGCCGCTG TCCAGGCCGG ACCGGAGCCA
GTGCTCGTCC AGGCCGGACC GGGCACCGGA AAAACCAGAA CCCTTCTGGC CCGGGTTCGC
CGGCTGCTCG AGCAGGGAAC GCCAGCCAAG GAGATTTTGC TTCTGACCTT CACCCGGGCC
ACGGCCGAGG AACTCCGAAA CCGCCTCCAG CTGGAAACGT CGGCTGCCGC CGAAATCCAG
GCCGGGACCC TGCACAGCTT GGCCTATGCC CATTTTGTCC AGCACCACGG CCGGGACCCG
GTCCTTCTCT CCGAAGAAGA CGCCAGGAGC CTCTTTGTGA AAGCTGTGGC CGCCGATCGC
CACGAGGGCC AACAGATGTG GACCCGCTGC CAGTTGGCAC GGGAATCCGG TATCCCCTCC
CGGGTACCCG CCCCCGGAGA GGCCGAGTAC CAGGAGGCCA AGGCCCGGCG CGGTGTCGTC
GATTTCACCG ACCTGCTCCA AAAATGGCTC CAGGGACTCA CGAACGGGGC TCGGCCGGAG
TGGACCCAGG TCCTGATCGA TGAAATTCAG GACCTGACCG GGCTCCAGCT GAACCTGGTC
CAGGCCCTTT GCCCGCCCCA GGGCCACGGT TTCTTCGGTA TCGGCGACCC GGATCAATCC
ATTTATGGCT TCCGCGGGGC GGTCAGCACC ACCGCCGAGT GGCTCAGCCG GACCTGGCCC
GAACTGACAC CCCTCTCCCT GCAGCGCAAT TACCGCGCCG CCGCGCCCCT GGTCCAAGCC
GCCGGAACGG TTTTCCCTGA ACGGCCGCCG CTACGCCCCC AACGTTCAGT CCAAGGACAG
ATCATTGTCT GTGAAACACC CAGCGCCTGG CGAGAAGCCT CCTGGATCGC CACCCAGGCC
CGGACTCTGA TCGGGGCCAC AGCCCACAGC CAGGCCGATA GTGGCGACAC CGGCGAGACC
TCCCCCGGGG ACATCGCCGT GCTCGTACGC ACCAGAGCCT TGCTGCGGCC GCTGGCGGAA
AAACTGGACC AGGCCGGAGT GCCGTGTTCG GTGCCCGAGG AAGAGCCGTT CTGGCGCGAC
AACCGGGTCC AGGACCTCCT GGAAACGGTG CGCGCGGCCT TCGACAGGGC CGTCACCCCG
CCGTGGGATT GTCCTGAGGC CGTTCTCGGG CAAGGGCCGG CGGGTATGGC CGCGTATTGG
GCCCAATCGG GGCCCGTCGA TCCGCTTTTC TGGCAAAGCC GCGCCTTTCA GCAATTGCAG
CAGGCCTTTG CCGACCAGGG GGATTGGGAG GCCCTGCTGA CTTGGTTGGC TCGACAAGAC
ACCCTTGATA CGGTCCGGCA TCGGGTGGAA AAAGTCCGCC TCATGACATT GCATGCCAGT
AAGGGGCTCG AATTCACTGC GGTTTTCCTC CCAGCCCTCG AAGAAGGACT GCTCCCGTAT
GGCGCAGGCG ACTTCGGTTC AAACGAGGAA AATACGGCCT TGACGGCGGA CCGTCTGGCG
GAGGAAAAAA GACTTTTCTA CGTCGGCTTG ACCCGAGCCA AACGCTATCT CTATCTCAGT
CATAGCCAGG AACGACGCCT CTACGGTCAA ACGCTCGCCG GACGGCCGAG CCGTTTTCTG
CAATCGCTGC CACCGTCTCT GGTGCAGACC AAAACCGTGG TCGCCAGGAC GAAGCAGAGA
AGCACCCAAT TGACGCTGAT GTAA
 
Protein sequence
MSETQLYYAD LHIHSKYSRA TSRQLTPRHL AAWGWVKGID LLATGDFTHP AWLALLQEQL 
EEDGRGLLRL KEPTRLEQEL PWLETPVPKA PRFMLGTEIS SIYKRGGVVR KVHNLVYLPN
FDAVRRFNER LDQVGNLEAD GRPILGLDSE HLLEMVLETD PLGFVIPAHI WTPWFSLFGS
KSGFNALEEC FGSLSQHIFA AETGLSSDPA MNWHWSALDH LTLVSNSDAH SGANLAREAT
LFSGTMDFAT IREGLRDRST GTFQGTLEFF PEEGKYHHDG HRKCGLSWDP RQTEDHGGIC
PVCGRPVTVG VLNRILELAD RKEPLRPPDH PDFVSLVPLP ELLSELLSVG PKSKTVQRRY
CRLLAQFGPE LTILRHTPVE ELRQHWSLLG EAIGRMRSGR VQRQPGFDGQ YGVIRVFSDR
ERREWRHGPS LVLPTDLDHP SCTAQTAPGG AGAGREEGDV EKHPTPNPEQ EAAVQAGPEP
VLVQAGPGTG KTRTLLARVR RLLEQGTPAK EILLLTFTRA TAEELRNRLQ LETSAAAEIQ
AGTLHSLAYA HFVQHHGRDP VLLSEEDARS LFVKAVAADR HEGQQMWTRC QLARESGIPS
RVPAPGEAEY QEAKARRGVV DFTDLLQKWL QGLTNGARPE WTQVLIDEIQ DLTGLQLNLV
QALCPPQGHG FFGIGDPDQS IYGFRGAVST TAEWLSRTWP ELTPLSLQRN YRAAAPLVQA
AGTVFPERPP LRPQRSVQGQ IIVCETPSAW REASWIATQA RTLIGATAHS QADSGDTGET
SPGDIAVLVR TRALLRPLAE KLDQAGVPCS VPEEEPFWRD NRVQDLLETV RAAFDRAVTP
PWDCPEAVLG QGPAGMAAYW AQSGPVDPLF WQSRAFQQLQ QAFADQGDWE ALLTWLARQD
TLDTVRHRVE KVRLMTLHAS KGLEFTAVFL PALEEGLLPY GAGDFGSNEE NTALTADRLA
EEKRLFYVGL TRAKRYLYLS HSQERRLYGQ TLAGRPSRFL QSLPPSLVQT KTVVARTKQR
STQLTLM