Gene Msed_0183 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsed_0183 
Symbol 
ID5103927 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMetallosphaera sedula DSM 5348 
KingdomArchaea 
Replicon accessionNC_009440 
Strand
Start bp146836 
End bp148023 
Gene Length1188 bp 
Protein Length395 aa 
Translation table11 
GC content42% 
IMG OID640506088 
Productglycosyl transferase family protein 
Protein accessionYP_001190284 
Protein GI146302968 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1215] Glycosyltransferases, probably involved in cell wall biogenesis 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.00790444 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.140907 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTTGATG ACATTGTAAT TGGACTTAGT ATCATAGTAT CCATATGGAG CGTCTATAAC 
TCTGCCTTCG CTATCTACGG GTTGTCCTGG AAATCCGATG AGCCCAAAAC CTCCTCAGGC
CCATCCTTTT CCTTGTTAGT TCCGGTGAGG AACGAAGAGA AAGTCCTAGG GAGACTCCTT
GAAAGGCTAG TTAACCAAGA ATATGATAGG TCAAAGTATG AGATAATTGT CCTAGAGGAC
GGATCTACAG ACAACACGTT AGGGGTATGC AATAAATTTT CAGAAATGTA TAGTATTATC
AAATGTGTCC ATCTGGAAAA GAGCAATGTC GTTAATGGGA AGAGCAGAGC CCTCAATTAT
GGATTGAAAA TATCAAGGGG AGATATTATA GGCGTATTTG ACGCCGATAC TGTACCTAGA
CTTGACGTGT TAGGTTATGT AGCCCAGAAG TTTATTTCTA ATTCTAGAGT AGGAGGCGTA
CAGGGAAGGT TAGTCCCCAT CAATGTTAGG GAAAGCATAG TGGCTAGGTT AGCCTCGCTA
GAAGAGTTGT TCAGTGAGTA CTCGATTTCA GGAAGGGCCA GAGCAGGCCT TTTCGTACCA
CTTGAGGGTA CATGTAGTTT CGTTAGGAGA GATGCCTTGG AGAAAGTGGG AGGTTGGAAC
GAGAATGTAC TTACAGAGGA CCTAGATCTC AGCCTAAAAC TAACAAGCTT GAACTATTTG
ATCGTTTACT CACCTTCTGT TCAGAGCTGG AGGGAAGTCC CGGTTACATT CAGTTCACTA
GTTAGGCAGA GATTAAGGTG GTACAGGGGT AACTTTGAGC TTACCATGAG GATCTCTAGG
TTTAAGTTTA CTTGGAGGTT GGTAGATGCA GCTATGTTAG TAGGCACTCC AGTATTCATG
GTTTTAAGCT TGGCGAACTA TTCCCTTGTC TTTATTTACT CATATCAATT GCACGTCCTT
ATAGCTGCTA TTATCTCGTT TTCGTCCATG ATGACTCTTC TTCTAATAAT TATGATATCC
AGGAGACATA TGATTGAAAC AATTTATATA ATTCTATCCG CATTATATCT TAATTTTACC
ATAAGTCTCC ATTTAATTTC CATTGTTCTA GAATTGGCTG GCGCACCTAA GGGATGGAGT
AAGACGGAAA GGTCTGGTAA GATCACGGTA GATGTGCCGA GACCATAG
 
Protein sequence
MLDDIVIGLS IIVSIWSVYN SAFAIYGLSW KSDEPKTSSG PSFSLLVPVR NEEKVLGRLL 
ERLVNQEYDR SKYEIIVLED GSTDNTLGVC NKFSEMYSII KCVHLEKSNV VNGKSRALNY
GLKISRGDII GVFDADTVPR LDVLGYVAQK FISNSRVGGV QGRLVPINVR ESIVARLASL
EELFSEYSIS GRARAGLFVP LEGTCSFVRR DALEKVGGWN ENVLTEDLDL SLKLTSLNYL
IVYSPSVQSW REVPVTFSSL VRQRLRWYRG NFELTMRISR FKFTWRLVDA AMLVGTPVFM
VLSLANYSLV FIYSYQLHVL IAAIISFSSM MTLLLIIMIS RRHMIETIYI ILSALYLNFT
ISLHLISIVL ELAGAPKGWS KTERSGKITV DVPRP