Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msed_0963 |
Symbol | sat |
ID | 5104515 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | + |
Start bp | 890022 |
End bp | 891131 |
Gene Length | 1110 bp |
Protein Length | 369 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 640506865 |
Product | sulfate adenylyltransferase |
Protein accession | YP_001191058 |
Protein GI | 146303742 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG2046] ATP sulfurylase (sulfate adenylyltransferase) |
TIGRFAM ID | [TIGR00339] ATP sulphurylase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.0000000704233 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.355944 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTCGCAT CTCCATATGG GGGTAGGCTA ATTCAGAACG TGATAGAGGA GCCGCACGAG GATCTTCCGA TGCTGGAAAT TGGGAGAAGG TATGCATTGG ATGCTGAAAA GATAGGTATT GGGGCATACT CTCCCCTGGA AGGTTTCATG GGATCCTCTG ATTTAGAGAA CGTGCTTTAT AAAAACGAGC TAAATAATGG CCTGCCGTGG ACGATACCTA TAATTCTTCC AGTCATGGCT CCTCCAGAGG AGGGGGAGAG GGTATATCTC AACCTTAATG GGAACAGGTT TGGATTCCTT GAGGTCGAGG AAGTATTTCG TTTTAACAAG AAGGAGATAG CGGAGAAGGT ATACTCAACC CTTTCTCCTG AACATCCGGG GGTAGCTCAG GTAATGAGTG AACCAGAAAC GGCAGTCTCG GGCAAGGTGT GGATATTTAG AAGGGTTAGT AGAGATAAGA CTCCTGCTGA AACTAGGGAG ATCTTCAAGA AACTAGGATG GAGGGATGTT GCCGGTTACC AAACCAGAAA TCCGCCTCAT AGGGCACACG AGTATGTGAT AAGGGTTGCC ATGGAGTTTG TAGATGGAGT CTTTATTCAT CCAGTGGTGG GGGAACTAAA GAATGACGAC TTTCCACCAG AGGCAATTGT GGAGGCATAT GACTACTTTG TGAAAAATTA CCTCCCCAAG AACAGAGCTC TCCTGGACAC TCTGACAATA CCCATGAGAT ATGCTGGCCC AAAGGCCGCA GTATTCTACG CCATCATAAG GAGAAACTAC GGCTGCACCC ATTTCGTGGT GGGCAGGGAT ATGGCAGGTG TAGGCAACTT TTACGATCCC TATGGGGCAC AGAAAATGTT AAGGGAGATG GATTTGGGAG TGGAGATAAT ACCTGTAGGA GAAGCATTTT ATTGTGACAT CTGCGAGGGA ATTGTGAGTG AAAGGAGCTG CGACCATAAC GCTCGCAAGA AGATATCCAT GACTCTCATA AGAAAACTAC TAAGTCAGGG CGAAGAGCCT CCAAGGGAAA TCATTAGGCC ACAGATAGCT TCCATACTAA AAAGATATTA CAAAAATACT GAAAGCCTCG CCTCTTCGAG GCGAGGATGA
|
Protein sequence | MVASPYGGRL IQNVIEEPHE DLPMLEIGRR YALDAEKIGI GAYSPLEGFM GSSDLENVLY KNELNNGLPW TIPIILPVMA PPEEGERVYL NLNGNRFGFL EVEEVFRFNK KEIAEKVYST LSPEHPGVAQ VMSEPETAVS GKVWIFRRVS RDKTPAETRE IFKKLGWRDV AGYQTRNPPH RAHEYVIRVA MEFVDGVFIH PVVGELKNDD FPPEAIVEAY DYFVKNYLPK NRALLDTLTI PMRYAGPKAA VFYAIIRRNY GCTHFVVGRD MAGVGNFYDP YGAQKMLREM DLGVEIIPVG EAFYCDICEG IVSERSCDHN ARKKISMTLI RKLLSQGEEP PREIIRPQIA SILKRYYKNT ESLASSRRG
|
| |