Gene Dole_1013 details


Gene Information

Locus tag: Dole_1013
Symbol:
ID: 5693848
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfococcus oleovorans Hxd3
Kingdom: Bacteria
Replicon accession: NC_009943
Strand:
Start bp: 1191768
End bp: 1193480
Gene Length: 1713 bp
Protein Length: 570 aa
Translation table: 11
GC content: 60%
IMG OID: 641263610
Product: sulfate adenylyltransferase
Protein accession: YP_001528900
Protein GI: 158521030
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0529] Adenylylsulfate kinase and related kinases
        [COG2046] ATP sulfurylase (sulfate adenylyltransferase)
TIGRFAM ID: [TIGR00339] ATP sulphurylase
            [TIGR00455] adenylylsulfate kinase (apsK)
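
The gene and protein lengths listed above follow from the start and end coordinates. The sketch below reproduces that arithmetic under two assumptions that the record does not state explicitly: the coordinates are 1-based and inclusive, and the final codon is a stop codon that is not counted in the protein length. The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Number of residues, assuming an unspliced CDS ending in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


length = gene_length_bp(1191768, 1193480)
print(length)                     # 1713
print(protein_length_aa(length))  # 570
```

Both values agree with the Gene Length and Protein Length fields of this record.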


Plasmid Coverage information

Num covering plasmid clones: 29
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAAACG GCATTGCCCC CCCTCACGGC GGACGGCTGA TCAATCTGGT GGCCGATGAA 
TCAAGGGTTG CCGGGCTGAA GGAGCAGGCC CTGAACCTGG AAGAGATCGT ATTGAGCGGC
CGCCAGCTAT GTGATTTCGA CATGCTGGCC ACCGGTGTTT TTTCTCCGCT CTCCAGGTTT
ATGACCCGCA CCGATTATGA GGCTGTTGTG GCGCAAATGC GGCTGGCATC GGGAGAGCTG
TTCCCCCTGC CGGTGTGCCT GGATGTTTCA AACGAACGTG CCAGAACCCT GGCGCCGGGA
CAGGCCGTGG CCCTGCGGGA CCAGGAGGGG CTGCTGCTGG CGGTGATGCA CATTACCGAC
GTCTGGCAGC CGGACCGGGA ACACGAGGCC CGGACGGTTT ATGGTACCAC CGACTGCGCC
CATCCAGGCG TTCACCACCT GCTGCGATGT TGTGGTGAAT ATTACGTGGG CGGTACCATG
GAGGTGGTGC AACTGCCCAT CTACCTGGAC TCCCGGCAGC TTCGCAAAAG CCCGGCAGAG
GTCCGGGAAC TTTACCGAAA GCTCTCGTGG CACCGGGTGA TCGGATTTCA CACGCGCCAG
CCCGTTCACC GGCTTCAGTT CGAGATGACC ATTCGGGCCA TGCGGACGGC AAGAGCCAAC
CTGCTCTGCC TGCCCAGCGT GGGCACGATT GATCCTGACG ACGTGGATCA CTATACACGG
GTCCGGTGTT ATAAGCGGGT GGCCGAACGC TATCCGCCCG ATTCCTTTCT GCTCAACCTG
GTGCCCATGG CCATGCGCAT GGCCGGGCCC AGGGAAGCCC TTCTGCATCT GATTGTGGCC
AGGAACTTCG GCTGCTCCGG ATTTATTATC GGTCCGGACC ATGCCAGCCC CCCCCATCCG
GATAATGGCG TGGCGTTTTA TGAAAAGGAC CGTGCCTGCC GACTGCTTGA AACCCATGCC
GGAGAACTGG GCATGGAGGT GATCGTATTT AACGAGATCG TCTATCTTCC TTTTGAGGAT
GAGTTTGTTC CCGTGGACCA GGTGGCCGAT GGCACACAGT ATGTTTCCAT GCCGGGGTCT
TTGATCCGGC GGCGGATGCG GGCCGGCAAA ACCGTTCCCG ACTGGGCCAC TTTTCCGGAG
GTGCTTGAGG AGCTGGAGCG GGCCTGCCCG CCACCGGCCC GGCAGGGGTT TACCGTGTTT
TTTACCGGCC TTTCCGGCGC CGGTAAGTCC ACCATTGCCA GGATCCTTTA CTCCCGGTTT
TTAGAGATCG GTACCCGGCC GGTGACCCTT CTGGACGGCG ACATCGTGCG GCACAACCTA
TCCAGTGAGC TGACCTTTTC AAAGGAACAC CGGGATATCA ACGTGAAGCG TATCGGGTTT
GTGGCCAGTG AGATCACCAA GAACCGGGGC GTCGCCATCT GCGCGCCTAT TGCGCCCTAT
GCCGCCACCC GGGCCGATGT GCGGCGGGTC ATTGAGGCGT ATGGCGGATT CTTCGAGGTC
CACGTGGCAA CGCCCATTGA GGAGTGTGAA AAACGGGATC GCAAGGGGAT GTACGCCAAG
GCCAGGGCCG GAAAGATCAA GGGGTTTACC GGCGTGGATG ATCCTTACCA GGCGCCGGTC
TGCCCGGACC TGAAGATCGA CACCACGGAC ATGACCCCGG ACGAGGCGGC CCAGGCCATT
CTGCTGTTTC TCGGCGAAAA GGGATATATA TAA
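
The gene sequence is laid out in blocks of 10 bases, 60 per row, so it has to be joined into one string before any analysis. The following is a minimal sketch of that clean-up, plus a GC-content calculation of the kind that presumably underlies the GC content field. The helper names are hypothetical, and how the source database rounds its percentage is not known.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First row of the gene sequence, as displayed in this record.
first_row = "ATGAAAAACG GCATTGCCCC CCCTCACGGC GGACGGCTGA TCAATCTGGT GGCCGATGAA"
row_seq = clean_sequence(first_row)
print(len(row_seq))  # 60 bases per displayed row
print(round(gc_percent(row_seq), 1))
```

Passing the full pasted sequence through `clean_sequence` should yield a string of 1713 bases if the copy is complete.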
 
Protein sequence
MKNGIAPPHG GRLINLVADE SRVAGLKEQA LNLEEIVLSG RQLCDFDMLA TGVFSPLSRF 
MTRTDYEAVV AQMRLASGEL FPLPVCLDVS NERARTLAPG QAVALRDQEG LLLAVMHITD
VWQPDREHEA RTVYGTTDCA HPGVHHLLRC CGEYYVGGTM EVVQLPIYLD SRQLRKSPAE
VRELYRKLSW HRVIGFHTRQ PVHRLQFEMT IRAMRTARAN LLCLPSVGTI DPDDVDHYTR
VRCYKRVAER YPPDSFLLNL VPMAMRMAGP REALLHLIVA RNFGCSGFII GPDHASPPHP
DNGVAFYEKD RACRLLETHA GELGMEVIVF NEIVYLPFED EFVPVDQVAD GTQYVSMPGS
LIRRRMRAGK TVPDWATFPE VLEELERACP PPARQGFTVF FTGLSGAGKS TIARILYSRF
LEIGTRPVTL LDGDIVRHNL SSELTFSKEH RDINVKRIGF VASEITKNRG VAICAPIAPY
AATRADVRRV IEAYGGFFEV HVATPIEECE KRDRKGMYAK ARAGKIKGFT GVDDPYQAPV
CPDLKIDTTD MTPDEAAQAI LLFLGEKGYI
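
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The sketch below builds the codon table from the amino-acid string used by NCBI translation table 11, whose codon-to-residue assignments match the standard code. It ignores the table's alternative start codons, which is sufficient here because the gene begins with ATG. The function name `translate` is illustrative and not part of any particular library.

```python
from itertools import product

# Amino acids for codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(raw: str) -> str:
    """Translate a nucleotide sequence, stopping at the first stop codon."""
    seq = "".join(raw.split()).upper()  # tolerate block-formatted input
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


print(translate("ATGAAAAACG GCATTGCC"))  # MKNGIA, the first six residues
print(translate("GGATATATA TAA"))        # GYI, the last three residues
```

Translating the full gene sequence this way should reproduce the 570-residue protein listed above.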