Gene Dole_1002 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDole_1002 
Symbol 
ID5693837 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfococcus oleovorans Hxd3 
KingdomBacteria 
Replicon accessionNC_009943 
Strand
Start bp1177046 
End bp1178320 
Gene Length1275 bp 
Protein Length424 aa 
Translation table11 
GC content59% 
IMG OID641263599 
Productsulfate adenylyltransferase 
Protein accessionYP_001528889 
Protein GI158521019 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG2046] ATP sulfurylase (sulfate adenylyltransferase) 
TIGRFAM ID[TIGR00339] ATP sulphurylase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000000329469 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTAACT TGATTGCACC GCATGGCGGT AAAGGTTTGA CCGTCTGCCT TCTGGAAGGC 
AGCGAGCTTG AAGCGGAAAA GAAAAAGGCC GAAGGCCTGA AAAAGATTCC CCTGAACCCC
CGGGCCAAAG GCGACCTGAT CATGCTCGGC ATCGGCGGTT TTTCCCCGCT GACCGGTTTC
ATGACCAAGG CTGACTGGAA GGGTGTCTGT GACAACTTCC TGTTGGCCGA CGGCACCTTC
TGGCCGGTTC CGGTCTGCCT GGATGCCGCC AAGGCCGACG CCGACGCCAT CGCCGAAGGC
GAAGAGATCG CCCTGGTGGA CCCGACGAAC AACGAAATCA TGGCCACCAT GAAGGTCACC
GAAAAGTACG AGATGACCGA GGCGGACAAG AAGTACGAAT GTGAAAAAGT TTTCATGGGT
GAAGGCACCC CCACGGCCGA CGACTTCTGG AAGATCGCCA AGGATGACCA TCCCGGCGTC
CAGATGGTCA TGGGCCAGAA GGAAGTGAAC CTGGCCGGCC CGGTCAAGGT TCTGTCCGAG
GGCGAGTATC CCGTGAAGTA CAAGGGCATC TATCACCGTC CCGCCGAGTC CCGAAAGATC
TTCGAGGAAA GAGGCTGGAA AGAGATCGCG GCCCTGCAGC TGAGAAACCC CATGCACCGC
TCCCATGAGC ACCTGTGCAA AATCGCCGTG GACGTGTGCG ACGGTGTTTA CATTCACTCC
CTGGTGGGCA ACCTCAAGCC CGGCGACATT CCCGCGGAAG TTCGCGTCCG GTGCATCGAT
GCCCTGGTGA AAAACTACTT CGTGGAAAAG AACGTTGTTC AGGGTGGCTA TCCCCTTGAC
ATGCGCTATG CCGGTCCCCG GGAAGCCCTC CTGCACGCTA CCTTCCGCCA GAACTATGGC
TGCTCCCGCA TGATCATCGG CCGCGACCAT GCCGGCGTGG GTGATTTCTA CGGCATGTTT
GAAGCCCAGA CCATCTTTGA CAAAATTCCC GCCCCGGCCG AGCCCGGCAA AGCCCTGCTC
TGCACCCCGC TGAAGATCGA CTGGACCTTC TACTGCATGA AGTGCGACGG CATGGCCTCC
CTGAGAACCT GCCCCCATTC CAAGGAAGAC CGGGTTATGC TCTCCGGCAC CATGCTGCGC
AAGGGCCTTT CCGAAGGCAC CCCGATTCCC GATCACTTTG GTCGTGAAGA GGTCCTTGAC
ATTCTGCGCG AGTACTATGC CGGCCTGACC GAAAAGGTCG CCATCAAGAC CCACAAGGCC
GCCACCGGTA ATTAA
 
Protein sequence
MSNLIAPHGG KGLTVCLLEG SELEAEKKKA EGLKKIPLNP RAKGDLIMLG IGGFSPLTGF 
MTKADWKGVC DNFLLADGTF WPVPVCLDAA KADADAIAEG EEIALVDPTN NEIMATMKVT
EKYEMTEADK KYECEKVFMG EGTPTADDFW KIAKDDHPGV QMVMGQKEVN LAGPVKVLSE
GEYPVKYKGI YHRPAESRKI FEERGWKEIA ALQLRNPMHR SHEHLCKIAV DVCDGVYIHS
LVGNLKPGDI PAEVRVRCID ALVKNYFVEK NVVQGGYPLD MRYAGPREAL LHATFRQNYG
CSRMIIGRDH AGVGDFYGMF EAQTIFDKIP APAEPGKALL CTPLKIDWTF YCMKCDGMAS
LRTCPHSKED RVMLSGTMLR KGLSEGTPIP DHFGREEVLD ILREYYAGLT EKVAIKTHKA
ATGN