Gene Msed_0963 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsed_0963 
Symbolsat 
ID5104515 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMetallosphaera sedula DSM 5348 
KingdomArchaea 
Replicon accessionNC_009440 
Strand
Start bp890022 
End bp891131 
Gene Length1110 bp 
Protein Length369 aa 
Translation table11 
GC content47% 
IMG OID640506865 
Productsulfate adenylyltransferase 
Protein accessionYP_001191058 
Protein GI146303742 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG2046] ATP sulfurylase (sulfate adenylyltransferase) 
TIGRFAM ID[TIGR00339] ATP sulphurylase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000000704233 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.355944 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTCGCAT CTCCATATGG GGGTAGGCTA ATTCAGAACG TGATAGAGGA GCCGCACGAG 
GATCTTCCGA TGCTGGAAAT TGGGAGAAGG TATGCATTGG ATGCTGAAAA GATAGGTATT
GGGGCATACT CTCCCCTGGA AGGTTTCATG GGATCCTCTG ATTTAGAGAA CGTGCTTTAT
AAAAACGAGC TAAATAATGG CCTGCCGTGG ACGATACCTA TAATTCTTCC AGTCATGGCT
CCTCCAGAGG AGGGGGAGAG GGTATATCTC AACCTTAATG GGAACAGGTT TGGATTCCTT
GAGGTCGAGG AAGTATTTCG TTTTAACAAG AAGGAGATAG CGGAGAAGGT ATACTCAACC
CTTTCTCCTG AACATCCGGG GGTAGCTCAG GTAATGAGTG AACCAGAAAC GGCAGTCTCG
GGCAAGGTGT GGATATTTAG AAGGGTTAGT AGAGATAAGA CTCCTGCTGA AACTAGGGAG
ATCTTCAAGA AACTAGGATG GAGGGATGTT GCCGGTTACC AAACCAGAAA TCCGCCTCAT
AGGGCACACG AGTATGTGAT AAGGGTTGCC ATGGAGTTTG TAGATGGAGT CTTTATTCAT
CCAGTGGTGG GGGAACTAAA GAATGACGAC TTTCCACCAG AGGCAATTGT GGAGGCATAT
GACTACTTTG TGAAAAATTA CCTCCCCAAG AACAGAGCTC TCCTGGACAC TCTGACAATA
CCCATGAGAT ATGCTGGCCC AAAGGCCGCA GTATTCTACG CCATCATAAG GAGAAACTAC
GGCTGCACCC ATTTCGTGGT GGGCAGGGAT ATGGCAGGTG TAGGCAACTT TTACGATCCC
TATGGGGCAC AGAAAATGTT AAGGGAGATG GATTTGGGAG TGGAGATAAT ACCTGTAGGA
GAAGCATTTT ATTGTGACAT CTGCGAGGGA ATTGTGAGTG AAAGGAGCTG CGACCATAAC
GCTCGCAAGA AGATATCCAT GACTCTCATA AGAAAACTAC TAAGTCAGGG CGAAGAGCCT
CCAAGGGAAA TCATTAGGCC ACAGATAGCT TCCATACTAA AAAGATATTA CAAAAATACT
GAAAGCCTCG CCTCTTCGAG GCGAGGATGA
 
Protein sequence
MVASPYGGRL IQNVIEEPHE DLPMLEIGRR YALDAEKIGI GAYSPLEGFM GSSDLENVLY 
KNELNNGLPW TIPIILPVMA PPEEGERVYL NLNGNRFGFL EVEEVFRFNK KEIAEKVYST
LSPEHPGVAQ VMSEPETAVS GKVWIFRRVS RDKTPAETRE IFKKLGWRDV AGYQTRNPPH
RAHEYVIRVA MEFVDGVFIH PVVGELKNDD FPPEAIVEAY DYFVKNYLPK NRALLDTLTI
PMRYAGPKAA VFYAIIRRNY GCTHFVVGRD MAGVGNFYDP YGAQKMLREM DLGVEIIPVG
EAFYCDICEG IVSERSCDHN ARKKISMTLI RKLLSQGEEP PREIIRPQIA SILKRYYKNT
ESLASSRRG