Gene Dret_2034 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDret_2034 
Symbol 
ID8419879 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfohalobium retbaense DSM 5692 
KingdomBacteria 
Replicon accessionNC_013223 
Strand
Start bp2333408 
End bp2335123 
Gene Length1716 bp 
Protein Length571 aa 
Translation table11 
GC content57% 
IMG OID645038622 
Productbifunctional sulfate adenylyltransferase subunit 1/adenylylsulfate kinase protein 
Protein accessionYP_003198896 
Protein GI258406154 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG2046] ATP sulfurylase (sulfate adenylyltransferase) 
TIGRFAM ID[TIGR00339] ATP sulphurylase
[TIGR00455] adenylylsulfate kinase (apsK) 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000000000871956 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0000299137 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGCTCGAAG GGGATTCCGG CCGGCGGTAC GCTGAAAGCC TTATGGTCCA CTTCAGACGC 
GTGGAGCGTT TGAAGCAAGA GGCCGTGCAA TGTCCCTCGG TGGATTTGAA TGCGCGCCAG
GTCTGTGATC TGGAACTGCT TCTCAACAGG GCCTTGTATC CCCTCGAAGG GTTCATGGGC
CGGGGCGATT ACGAGTGCGT TCTCGATTCC ATGCGCCTGG AGGAGGGGAC AGTCTGGCCT
GTACCGATTT GTCTTGATCT GCCCGAGGCA GTGGCAAGCC GCCTGGAGCT TGGTGGGCTC
CTGGGACTGC GAGATCCGGA GGGCTTTTTA CTGGCGATTT TGCAGGTCAG CGATGTCTGG
GCCCCGGACA AAACACGGGA AGCCGAGGCT GTGTTCGGCA CGAGTGACCC TGAGGTCCAT
CCCGGTGTGC GGAGGTTTTT GGAGCAAACA CACCCTTTTT ATGTCGGAGG GCGTATTGAA
GGGCTCCATC TGCCACAACA TTATGATTTT TCCCAGTTGC GTCTTCCTCC TTCAGAGACC
CACCGCCGTT TTTCGCAAAA CGGCTGGCGC CGGGTGATCG GGTTTCAAAC CGAGCGTCCG
TTGCATTGTG CCCATAAAGA GATGATTCAA CATGCAGCAC GTGAAGTGGG GGCCTCGATT
TTTTTGCAGC CCGTGGTTGG GCATGGGGTC TGGGGCCGAG TAGACCATTT CACCCGGGTG
CGGTGTTTCC AAGAATTTGC CGCGCGGTTT CCCAGGAATA TGATCGATGT CGGATTGCTC
CCCATGGCGC TGCGCCACGC CGGGCCCCGG GAGGCGCTGC TCCAGGCCAT CGTTCGCCGC
AATTTCGGGT GTACCCATTT CATGGTCGCG GACGACCAGG CTGACCCATA CGCGTGCCAA
AACGGAGCAG AACGGTTTTA TCCGCAATAC AAGGCCCAGC ACCTGGTTCA GGAGTATGCC
GGCGAAACTG GGATCGACAT GGTGCCGCTC AAGCATATGG TCTATGTGGA GGACAAGGCT
CAGTATCTGC CCCAGGACGA GGTGCCCGAG GGGATGCGGG TCAAAGAGAT CAGTTCCCGG
GAGTTGGAAC GGCGACTAGA ATTCGATCTT CAGATCCCGG AGTGGTTCTC TTTCCCTGAA
GTTGTCCGGG AGTTGCGAGT CGCCCACCCC CCGCGGCACA AGCAGGGGTT CACCGTTTTT
TTGACCGGGC TCTCCGGGGC CGGCAAGTCG ACATTGGCCA AAGTTCTGTA TGTCCGGTTC
ATGGAAATGC GCGACCGTCC GGTGACTTTG CTCGACGGGG ACATCGTGCG CAAAAATCTT
TCCAGTGAGT TGAGTTTCAC CCGCGAGCAC CGGGAACTCA ATGTGCGCCG GATTGGTTTC
GTGGCCAGCG AGATCACGAA AAACGGCGGT ATTGCCGTAT GCGCCCCCAT CGCACCTTAT
GAAGATTCGC GACGGCTGAA CAGGGAACTT ATCGAAGGGT ATGGCGGGTA TATTGAAATC
TTTATGGCCA CTCCACTGAC TGTGTGTGAA CAACGCGACA GGAAAGGGCT GTATGCCAAG
GCGCGAGCCG GGGTGGTTCA GGGCGTGACT GGTATTGATG ATCCCTATAT CCCTCCTTCG
GATCCGGAAT TGGAGATCGA CACCTCCGAG ATGACACCGA CTGAAGCCGC TCAGGAGGTC
CTTCTCTATC TTGAGGAGCA GGGATATATT CGCTGA
 
Protein sequence
MLEGDSGRRY AESLMVHFRR VERLKQEAVQ CPSVDLNARQ VCDLELLLNR ALYPLEGFMG 
RGDYECVLDS MRLEEGTVWP VPICLDLPEA VASRLELGGL LGLRDPEGFL LAILQVSDVW
APDKTREAEA VFGTSDPEVH PGVRRFLEQT HPFYVGGRIE GLHLPQHYDF SQLRLPPSET
HRRFSQNGWR RVIGFQTERP LHCAHKEMIQ HAAREVGASI FLQPVVGHGV WGRVDHFTRV
RCFQEFAARF PRNMIDVGLL PMALRHAGPR EALLQAIVRR NFGCTHFMVA DDQADPYACQ
NGAERFYPQY KAQHLVQEYA GETGIDMVPL KHMVYVEDKA QYLPQDEVPE GMRVKEISSR
ELERRLEFDL QIPEWFSFPE VVRELRVAHP PRHKQGFTVF LTGLSGAGKS TLAKVLYVRF
MEMRDRPVTL LDGDIVRKNL SSELSFTREH RELNVRRIGF VASEITKNGG IAVCAPIAPY
EDSRRLNREL IEGYGGYIEI FMATPLTVCE QRDRKGLYAK ARAGVVQGVT GIDDPYIPPS
DPELEIDTSE MTPTEAAQEV LLYLEEQGYI R