Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msed_0241 |
Symbol | |
ID | 5104107 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | - |
Start bp | 201295 |
End bp | 202449 |
Gene Length | 1155 bp |
Protein Length | 384 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 640506147 |
Product | UDP-sulfoquinovose synthase |
Protein accession | YP_001190342 |
Protein GI | 146303026 |
COG category | [G] Carbohydrate transport and metabolism [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0451] Nucleoside-diphosphate-sugar epimerases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.603555 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAGCTC TTATCCTTGG CATTGATGGG TACATTGGGT GGGCCTTGGC CCTGAGGCTA GTGGCCAAAG GGCATGAGGT TGCAGGGATT GACAACCTTT CCACGAGGAG ATTTTCGGCC GAAGTGGGAT CAGACTCCGC ATTTCCATTG CCGTCCCCAA AGGAAAGGGT AGAAGCGGTG AAGAGGAAAC TGGGCGCTGA CATTAAGTTC ATTGTAGGAG ACGCAAAGGA TAAGGCATTA CTGGAGGAGA CCATTAGGGA TTTCAAACCT GATGTGATAG TTCATTTTGC CGAGCAAAGA TCGGCTCCCT ACTCAATGAA GGACTACGAA CATGCATGGT ACACTCTCGA AAATAACCTG AAGTCCACCC TTAGCCTATT GTACGCAGTA AGCGAGATAG ATCCTTCAAT TCACATACTC AAGATGGGTA CCATGGGTGA GTATGGTACC CCAAACTTCG ACATACCTGA ATCGGCATTC GTCAAAGCCA TTATCCAAGG GAAAGAGGAT ACAATCCCAA CCCCGAAGTG GGGTGGCTCA TACTATCACT GGAGCAAGAT CTTCGACACA TTTCTTATCC TGTTTAAGGG AAAGCTCTCC AATCTCACCG TAACTGACAT AATGCAGGGA CCCGTTTATG GAACAAGGAC GGAAGAGATA ACAGACGAGG AGCTACGCAC TAGGTTTGAC TTCGACGAAA CGTGGGGGAC AGTGATAAAC AGGTACTGTG TGGAGGCAGT ACTGGGTTTA CCCTTAACAC CTTACGGTAA AGGGAAACAG ACTAGGGGTT TCATTTCCCT TGAAGATAGT GTAGAGGCCT TAAGATTGCT CATAGAAAAT CCACCCAAGG ATGGCGAGTA TAGGGTCGTG AATCAGTTCG CTGAGGTTTA CAATGTGAGA CAACTTGCCG AGATTGTAAA GAACGCTGCG GAGGAATTGG GGCTGAAGAC TGATATAACA CACGTGAAGA ACCCTCGAGT TGAGGCCGAG GAACACTACT ACAACCCTGA GGTAAAAGTT CTACCCTCAC TTGGATTCAA ACCTAAAAGG AACATTAGAG ATGAGACCAA GGTTATGATA AAGGATCTCC TCCCATATAA GGAGAGACTG GAAAGCTTCA AGCATGTTAT AATGCCAAAG ACGGTGTGGA AGTAA
|
Protein sequence | MKALILGIDG YIGWALALRL VAKGHEVAGI DNLSTRRFSA EVGSDSAFPL PSPKERVEAV KRKLGADIKF IVGDAKDKAL LEETIRDFKP DVIVHFAEQR SAPYSMKDYE HAWYTLENNL KSTLSLLYAV SEIDPSIHIL KMGTMGEYGT PNFDIPESAF VKAIIQGKED TIPTPKWGGS YYHWSKIFDT FLILFKGKLS NLTVTDIMQG PVYGTRTEEI TDEELRTRFD FDETWGTVIN RYCVEAVLGL PLTPYGKGKQ TRGFISLEDS VEALRLLIEN PPKDGEYRVV NQFAEVYNVR QLAEIVKNAA EELGLKTDIT HVKNPRVEAE EHYYNPEVKV LPSLGFKPKR NIRDETKVMI KDLLPYKERL ESFKHVIMPK TVWK
|
| |