Gene Msed_2112 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsed_2112 
Symbol 
ID5104405 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMetallosphaera sedula DSM 5348 
KingdomArchaea 
Replicon accessionNC_009440 
Strand
Start bp2031879 
End bp2033267 
Gene Length1389 bp 
Protein Length462 aa 
Translation table11 
GC content41% 
IMG OID640508001 
Productglycosyl transferase family protein 
Protein accessionYP_001192175 
Protein GI146304859 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1215] Glycosyltransferases, probably involved in cell wall biogenesis 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATCCAT TGATACAAGC AATTCTGACT ACTGCCATAT TTATTATTCC TAGCTTCCTT 
TTGCTTTACC AATACATATT GTTTCGTAAT GGCATGAAAT TTAGGGATAG TCTAGAGCCA
CTCTTCGCCG AGGAATTACC TTCCCTTTCA GTTTTGGTCC CTATTAAGGG AGAAAAACCA
GAGACGCTTC AGGGCTTGCT GGATAATTTA GCTACAGTAG AATGGGATAA AAACAAGCTT
GAGATCATTG TAGTTTCCGA TGACTCTCCA GAGTATTTTG AAAATCTCAT CAGAAAAATC
TCGATACCAC AAGGGCTCAA AGTCAAGATC GTTAGAAGAG AGAAAAAGGT AGGTTACAAG
AGTGGGGCTT TAGCCTATGC ATACTCCTTA TCTAGCGGAG ACCTAATAAT TACCCTCGAT
GTAGATGCCA GGTTAGAGAA AACCTCACTG ATAAAGGCGT TCAATAGGTT AAGAATACAC
GGATGCGATG CTGTAACCAT GAACTGGATT GGATATTCAC AGAAGCCATA TTCTACTCTC
GCCAAGGGAA TAATGATCTC AACCGTTATT GCAGATACAG CCCTTCTGAA CGGAAGGGAC
AACAGTAATC TCAGGATCTT TCCCGTGGGT TGCGGAACAA TGTTCAAGAG AGATGCAATC
GAATCGGTAG GACCATGGGA TCCCTCAATG ATCCAAGACG ACCTAGAAAT AGGGGCTAGG
CTGATTAAGA ATGGGAAAAG GATTTGCTCT TCTACCTCTC CGGTCTACGT AGAAGTCCCA
GATAATCTCG TGGCATTTTA CGTGCAACAA ACTAGGTGGG CCATGGGAAG TATAGAGGTT
TTAACCAGGA GATTTAAGGA GATAATGAGC AGAAATATAT CCTTAAAGCA GAAGATTGAC
ATTCTAATTT TCCTTCTTCA GTATGTTCCC ATAGGCCTGA CATTTTTAGC AGCTTTGGGA
CTAGCATTAA TGTCCTTATT GGGACTAAAT CACGTTTACG ACTATCTGAG AACTCCCATA
ATCCTCATCT GGATTCTTTC GCTTTCGATT TATGGGTACA ATTTCATAAA GACTGCATTG
GGAAAAGGAT ACAAACTTGT AGAGGCTATG AGGGCCTTAG GAAAGGTTTC ATCTTACACT
GTGGCTATTT CACCCTTTAT TCTAGTGGGG CTTCTATCTG GTCTAAGGAA GAACAGGAAA
TACGTTGTCA CTCCTAAGGG AGTCAAAGTA GATACATGGA TCCAGTACCC AGTACTTCTT
TTTGGCATTT TGTTCTTAAC ATCCTCCATT ATCTATCTTA TACACGGAGC CCCAGTAACC
GGTCTTTGGC TCCTTTATTA CTCCATGGGG TATCTGTTCA CTGTCGCAAC TTTCAAAAGA
GAGCTTTAG
 
Protein sequence
MNPLIQAILT TAIFIIPSFL LLYQYILFRN GMKFRDSLEP LFAEELPSLS VLVPIKGEKP 
ETLQGLLDNL ATVEWDKNKL EIIVVSDDSP EYFENLIRKI SIPQGLKVKI VRREKKVGYK
SGALAYAYSL SSGDLIITLD VDARLEKTSL IKAFNRLRIH GCDAVTMNWI GYSQKPYSTL
AKGIMISTVI ADTALLNGRD NSNLRIFPVG CGTMFKRDAI ESVGPWDPSM IQDDLEIGAR
LIKNGKRICS STSPVYVEVP DNLVAFYVQQ TRWAMGSIEV LTRRFKEIMS RNISLKQKID
ILIFLLQYVP IGLTFLAALG LALMSLLGLN HVYDYLRTPI ILIWILSLSI YGYNFIKTAL
GKGYKLVEAM RALGKVSSYT VAISPFILVG LLSGLRKNRK YVVTPKGVKV DTWIQYPVLL
FGILFLTSSI IYLIHGAPVT GLWLLYYSMG YLFTVATFKR EL