Gene Dret_1968 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDret_1968 
Symbol 
ID8419813 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfohalobium retbaense DSM 5692 
KingdomBacteria 
Replicon accessionNC_013223 
Strand
Start bp2254963 
End bp2256267 
Gene Length1305 bp 
Protein Length434 aa 
Translation table11 
GC content58% 
IMG OID645038556 
Productsulfate adenylyltransferase 
Protein accessionYP_003198830 
Protein GI258406088 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG2046] ATP sulfurylase (sulfate adenylyltransferase) 
TIGRFAM ID[TIGR00339] ATP sulphurylase 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.678215 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.0535908 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCAAAT TGGTACCACC GCACGGAGGC AAAGGCCTGA CCGAATGCCT GCTTGAGGGC 
GCGCAGCGCG AAGAGGAACT TAAGAAAGCC CAGGGGTTGA AGAAGATTGA AATCCATCCC
CAGGAAAAGG GCGACCTGAT CATGATGGGG ATCGGGGGCT TCAGCCCGCT GACCGGCTTC
ATGACCCGCG CTGACTGGAA GTCCGTTTGT GAAAAATTTA CCTTGGCCGA TGGGACCTTC
TGGCCCGTGC CGGTGATGCT CTCCGTTGGC AAGGACGACG CCGCGGACAT CAAGGAAGGC
GACGAGATCA CCCTGGAGTG CGAGGGCGAG ATCTACGCCA CCATGAAGGT GGAAGAAAAG
TACGCCATGA GTGAGGAAGA CAAGAAGTGG GAAGCCGAGA TGGTCTACAA GGGCAACGGC
GAAGACTCCC AGGGCGACTT CCTCAAAACC GCCCTCGAAG ACCACGTCGG TGTCCAGAAG
GTCATGCAGC GCAAGGAATT CTGCCTGGCC GGCCCGGTGA AGGTCCTCTC CGAAGGTGAC
TATCCGACCA ATCCGAAGTA TCGTGGCTCT TATATGCGCC CGTCGGAAAC CCGCAAGATT
TTCGAAGAAC GCGGGTGGTC CAAAGTCGCC GCGCTGCAGC TGCGTAATCC CATGCACCGC
TCCCACGAGC ACCTGGCCAA GATCGCCATC GACGTCTGCG ACGGCGTGCT CATCCATTCC
CTGATCGGGA ACCTGAAGCC CGGCGACATT CCGGCTGACG TGCGCCTGAA GTGCATCGAC
ACCCTTATCG ACGGCTACTT CGTCAAGGAA AACATCCTTC AGGGTGGCTA TCCTCTGGAC
ATGCGTTACG CTGGTCCGCG TGAAGCCCTG TTGCACGCCA CGTTCCGTCA GAACTACGGC
TGCAGCCAGA TGATCATCGG CCGCGACCAT GCTGGCGTTG GCGACTTCTA CACCTTGTTT
GAAGCCCAGG AAATCTTCGA TCGCATCCCG TATGCCACCG AAGAAGAGCG CTGCAATGTC
GAGCCCGGCA AGGCGCTGCT GTGCGAGCCG ATGAAGATCG ACTGGACGTT CTATTGCTTC
AAGTGCGACG GCATGGCCTC CATGAAGACC TGCCCGCACA CGAAGGAAGA CCGCGTCATC
CTGTCCGGCA CCAAGCTGCG CAAGGCCCTG TCCGAAGGCC AGCCGGTTCC GGATCACTTC
GGCCGCGAAG AAGTCCTGAA TATCCTGCGT GAATACTACG AAGGTCTGAC CGAAAAGGTC
GAGATCAAGA TGCAGAAAGC CGCTTCCGGC GATGAAATGA AATAG
 
Protein sequence
MSKLVPPHGG KGLTECLLEG AQREEELKKA QGLKKIEIHP QEKGDLIMMG IGGFSPLTGF 
MTRADWKSVC EKFTLADGTF WPVPVMLSVG KDDAADIKEG DEITLECEGE IYATMKVEEK
YAMSEEDKKW EAEMVYKGNG EDSQGDFLKT ALEDHVGVQK VMQRKEFCLA GPVKVLSEGD
YPTNPKYRGS YMRPSETRKI FEERGWSKVA ALQLRNPMHR SHEHLAKIAI DVCDGVLIHS
LIGNLKPGDI PADVRLKCID TLIDGYFVKE NILQGGYPLD MRYAGPREAL LHATFRQNYG
CSQMIIGRDH AGVGDFYTLF EAQEIFDRIP YATEEERCNV EPGKALLCEP MKIDWTFYCF
KCDGMASMKT CPHTKEDRVI LSGTKLRKAL SEGQPVPDHF GREEVLNILR EYYEGLTEKV
EIKMQKAASG DEMK