Gene Msed_0241 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsed_0241 
Symbol 
ID5104107 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMetallosphaera sedula DSM 5348 
KingdomArchaea 
Replicon accessionNC_009440 
Strand
Start bp201295 
End bp202449 
Gene Length1155 bp 
Protein Length384 aa 
Translation table11 
GC content47% 
IMG OID640506147 
ProductUDP-sulfoquinovose synthase 
Protein accessionYP_001190342 
Protein GI146303026 
COG category[G] Carbohydrate transport and metabolism
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0451] Nucleoside-diphosphate-sugar epimerases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.603555 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAGCTC TTATCCTTGG CATTGATGGG TACATTGGGT GGGCCTTGGC CCTGAGGCTA 
GTGGCCAAAG GGCATGAGGT TGCAGGGATT GACAACCTTT CCACGAGGAG ATTTTCGGCC
GAAGTGGGAT CAGACTCCGC ATTTCCATTG CCGTCCCCAA AGGAAAGGGT AGAAGCGGTG
AAGAGGAAAC TGGGCGCTGA CATTAAGTTC ATTGTAGGAG ACGCAAAGGA TAAGGCATTA
CTGGAGGAGA CCATTAGGGA TTTCAAACCT GATGTGATAG TTCATTTTGC CGAGCAAAGA
TCGGCTCCCT ACTCAATGAA GGACTACGAA CATGCATGGT ACACTCTCGA AAATAACCTG
AAGTCCACCC TTAGCCTATT GTACGCAGTA AGCGAGATAG ATCCTTCAAT TCACATACTC
AAGATGGGTA CCATGGGTGA GTATGGTACC CCAAACTTCG ACATACCTGA ATCGGCATTC
GTCAAAGCCA TTATCCAAGG GAAAGAGGAT ACAATCCCAA CCCCGAAGTG GGGTGGCTCA
TACTATCACT GGAGCAAGAT CTTCGACACA TTTCTTATCC TGTTTAAGGG AAAGCTCTCC
AATCTCACCG TAACTGACAT AATGCAGGGA CCCGTTTATG GAACAAGGAC GGAAGAGATA
ACAGACGAGG AGCTACGCAC TAGGTTTGAC TTCGACGAAA CGTGGGGGAC AGTGATAAAC
AGGTACTGTG TGGAGGCAGT ACTGGGTTTA CCCTTAACAC CTTACGGTAA AGGGAAACAG
ACTAGGGGTT TCATTTCCCT TGAAGATAGT GTAGAGGCCT TAAGATTGCT CATAGAAAAT
CCACCCAAGG ATGGCGAGTA TAGGGTCGTG AATCAGTTCG CTGAGGTTTA CAATGTGAGA
CAACTTGCCG AGATTGTAAA GAACGCTGCG GAGGAATTGG GGCTGAAGAC TGATATAACA
CACGTGAAGA ACCCTCGAGT TGAGGCCGAG GAACACTACT ACAACCCTGA GGTAAAAGTT
CTACCCTCAC TTGGATTCAA ACCTAAAAGG AACATTAGAG ATGAGACCAA GGTTATGATA
AAGGATCTCC TCCCATATAA GGAGAGACTG GAAAGCTTCA AGCATGTTAT AATGCCAAAG
ACGGTGTGGA AGTAA
 
Protein sequence
MKALILGIDG YIGWALALRL VAKGHEVAGI DNLSTRRFSA EVGSDSAFPL PSPKERVEAV 
KRKLGADIKF IVGDAKDKAL LEETIRDFKP DVIVHFAEQR SAPYSMKDYE HAWYTLENNL
KSTLSLLYAV SEIDPSIHIL KMGTMGEYGT PNFDIPESAF VKAIIQGKED TIPTPKWGGS
YYHWSKIFDT FLILFKGKLS NLTVTDIMQG PVYGTRTEEI TDEELRTRFD FDETWGTVIN
RYCVEAVLGL PLTPYGKGKQ TRGFISLEDS VEALRLLIEN PPKDGEYRVV NQFAEVYNVR
QLAEIVKNAA EELGLKTDIT HVKNPRVEAE EHYYNPEVKV LPSLGFKPKR NIRDETKVMI
KDLLPYKERL ESFKHVIMPK TVWK