Gene Msed_1306 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsed_1306 
Symbol 
ID5104557 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMetallosphaera sedula DSM 5348 
KingdomArchaea 
Replicon accessionNC_009440 
Strand
Start bp1285192 
End bp1286355 
Gene Length1164 bp 
Protein Length387 aa 
Translation table11 
GC content52% 
IMG OID640507195 
Productglycosyl transferase, group 1 
Protein accessionYP_001191388 
Protein GI146304072 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.647455 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGAGAGTTC TCGCAGTGGT GGACTTCGGC CTCGTGGAGC ATAGTGGTGG GTACAAGAGA 
AACCTGGAAA TTATGAGGAG GTTACCCTCC TACCTCGAGG TGGATATAGT TCCCTCATTG
AGGAATGTGA GGCTCGCCTT AAGCGGGCAT AGGGATGAGT TGATCTCCTT ACTGAAGGAC
TTGAAGGCTA CCGTCTCCCT CGATGCGTTA AGGGATGCAA ACTCACTGGA GGAGTTCCTT
AAACTCCTGA GACCAGGTAA ATACGATCTC GCCCTCGTGT ATAGTAACTC TGGTGAAAAC
GTGGAGTTAG CCTCGACCCT GACCAATTCC CCCGTCGGCG TTCAACTGCA ACTGGAACCC
TTCTATAGGG ATCTCTCGAC CCTCTTCAGG ATCAAGTTCA GGGGGTCAAC AGGGAGAGCC
CTCACCAGGT TTCAGAGGGC AGTTCAGGAG TCCAAGGAGG AGGAGAGAAC CTGGCTATCC
CTGATTAGGG CGGGAAAGTT GAACTTCGCC ATTTCCGTGA GCAGGATTCC CTTAACTTAT
TCAGGCCTAG ATCAGATGAT CCCATATGAC GTCACGACCC CGGGGAATGC CATAGATCCA
GAGGTGAGCA AGGTGAGGAG GGAGAAGGAG GATTACGCAG TCTACTTCAC CAGGTTAATT
CCCGAGAAGG GACTCTTTGA CGTCCCCCTA ATCTGGAAGA GGGTAAACGA GCATAGGGAC
ATGAGGCTTT ACGTGATGGG ACAGTTCCTG AGCGAGGATG ACCGGGAGGA GTTTACCAGG
CTCGTCCGCA GACTTAACGT TAACGTGGAG TACCTGGGCT TCAGGAGTGG GGAGGATTTG
TACAAGGTCG TGGCTGGGGC CCAGTTCACG CTTTATCCCT CGCACTACGA CAGCTTCTCC
CTTGTGGTGT TGGAATCGTT GGCCCTGAAC ACTTCCGTAA TAGCTTATGA TACACCCGCA
ATCAGGGAGA TATATCGCGG TGTTAAGGGC GTTCACACCG TGGAGGAGGA CAACTTAAGC
GGTATGGCCT CCCTGTGCCT GAAGCAGGAG AGGAGTGAGG TTAACGTCCC TGAGATGTAT
TCCTCCTGGG ACAAGGTAGC CCTCGCTGAA CTCGCCTCAA TTAATAGAAT GGCCAGGACG
TTCTCGATTA ACGGAATTTT CTAG
 
Protein sequence
MRVLAVVDFG LVEHSGGYKR NLEIMRRLPS YLEVDIVPSL RNVRLALSGH RDELISLLKD 
LKATVSLDAL RDANSLEEFL KLLRPGKYDL ALVYSNSGEN VELASTLTNS PVGVQLQLEP
FYRDLSTLFR IKFRGSTGRA LTRFQRAVQE SKEEERTWLS LIRAGKLNFA ISVSRIPLTY
SGLDQMIPYD VTTPGNAIDP EVSKVRREKE DYAVYFTRLI PEKGLFDVPL IWKRVNEHRD
MRLYVMGQFL SEDDREEFTR LVRRLNVNVE YLGFRSGEDL YKVVAGAQFT LYPSHYDSFS
LVVLESLALN TSVIAYDTPA IREIYRGVKG VHTVEEDNLS GMASLCLKQE RSEVNVPEMY
SSWDKVALAE LASINRMART FSINGIF