Gene Msed_0649 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsed_0649 
Symbol 
ID5103809 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMetallosphaera sedula DSM 5348 
KingdomArchaea 
Replicon accessionNC_009440 
Strand
Start bp593630 
End bp594895 
Gene Length1266 bp 
Protein Length421 aa 
Translation table11 
GC content46% 
IMG OID640506553 
Productglycosyl transferase family protein 
Protein accessionYP_001190748 
Protein GI146303432 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1215] Glycosyltransferases, probably involved in cell wall biogenesis 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.113034 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTTTTTA TTGATATTAT CATCGTTTTA TCTGCAATAC TTTCATCCTT GTGGATACTT 
CTGGAGTCGT TCTACTATAC CCGGGACAAA CCTCCTGTCC CTAGGACAGA CGGTCCCAGG
TACAAGGCGT CCATAGTGGT GGCCATAAAA AATGAAGATC CTGAAGTAGT GAAGGGGTTA
GTTGAGAACC TTTCAAGGCT AGACTACCCT GATTACGAGG TAATACTGGT TTCAGATGAT
TCAGAGCAGG ACTTTGAGCG ATTAAGGCAA ATAGAGCTAC CGGAAAAATT CAAACTGGTT
AGGAGGGATG TCCCCCAAGG TAGGAAGGCT GGTGCCCTCA ATTACGGCGT TTCCCTTTCA
ACGGGGGAAA TCCTTGTTTT TTTAGATGCC GAGGCCAGAG TGGATCCTAC TATCTTAACT
AGGATATCTG CCCACTTGAG CCAGGCCGAG GCGATGGCCC TTAGACTAAG GGTCAGAGAT
CCGAAGAACA AGCTTCAGGT ACTCTATTCC GAGATAACTG AGTTTTCCAT GGACTCGCTA
TTTAGGGGAA GATACCTCAA GGGTCTTCCA GTATTTCCCA ACGGATCAGC CTTTGCCATT
AGATCCTCTA CCCTGAAGAG GATAGGAGGG TGGAGAGAGG GAATGGTCGC TGAGGACTTG
GAAATAGGGA TGAGGCTATT CCTGAATGGA GTGAAAGTTG GTTACGCCGA CGACGTTGTT
GTGGAGACGT TAGCTCCCTA TACCTGGAAG GATCTTTTCC AACAGATGAA ACGATGGGCC
TACGGATCTG GACAACTTTT CCCCTACAGT CTTTCACTGT TGAGAAGAGG TATGAGCGGT
ATAGAAGGTG CAATATATGC GAATCAGTGG GGAATTTATC CTGCATACTT TGTTATGTTA
CTGATTGCAG GTATTGTCTC TCCAGTCTTC TCCTCCTCGC TTCTTTCCTG GGTCTTGTCC
TTGACACTGT TTCTAATCTC GTCCCTGGTC TTCTCATGGA GATCAAGAAC TAGGGAGTAT
GACCTAAGGA TTCCAGCTCT CATGATCTCA GCCTTCCTAA CTGGTTATCT CCTAGGACTT
CTTAACGCAA AATTTAGCTG GAAGGTCACA CCCAAGGTAG AAAGAGAACA GGGATTATGG
ATACCGCTCG AGTCCAATAT TATCTCCTAT CTTTTTCTCT TGAGCGGAAT ATTAGCCCTA
AAAAGCTATT TAGTTCAGGG AACGATTCTC CTGGCGATTT CCCTTATCTT ACTAATCATA
CCGTGA
 
Protein sequence
MLFIDIIIVL SAILSSLWIL LESFYYTRDK PPVPRTDGPR YKASIVVAIK NEDPEVVKGL 
VENLSRLDYP DYEVILVSDD SEQDFERLRQ IELPEKFKLV RRDVPQGRKA GALNYGVSLS
TGEILVFLDA EARVDPTILT RISAHLSQAE AMALRLRVRD PKNKLQVLYS EITEFSMDSL
FRGRYLKGLP VFPNGSAFAI RSSTLKRIGG WREGMVAEDL EIGMRLFLNG VKVGYADDVV
VETLAPYTWK DLFQQMKRWA YGSGQLFPYS LSLLRRGMSG IEGAIYANQW GIYPAYFVML
LIAGIVSPVF SSSLLSWVLS LTLFLISSLV FSWRSRTREY DLRIPALMIS AFLTGYLLGL
LNAKFSWKVT PKVEREQGLW IPLESNIISY LFLLSGILAL KSYLVQGTIL LAISLILLII
P