Gene Information |
Locus tag | Msed_2112 |
Symbol | |
ID | 5104405 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | + |
Start bp | 2031879 |
End bp | 2033267 |
Gene Length | 1389 bp |
Protein Length | 462 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 640508001 |
Product | glycosyl transferase family protein |
Protein accession | YP_001192175 |
Protein GI | 146304859 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1215] Glycosyltransferases, probably involved in cell wall biogenesis |
TIGRFAM ID | |
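The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes that Start bp and End bp are 1-based inclusive coordinates and that Gene Length counts the stop codon, as is typical for an unspliced CDS. The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Amino-acid count for an unspliced CDS whose length includes the stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len // 3 - 1


# Values taken from the record for Msed_2112.
start_bp, end_bp = 2031879, 2033267
length_bp = gene_length(start_bp, end_bp)
print(length_bp, protein_length(length_bp))  # 1389 462
```

Both results agree with the Gene Length (1389 bp) and Protein Length (462 aa) fields listed in the record.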
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATCCAT TGATACAAGC AATTCTGACT ACTGCCATAT TTATTATTCC TAGCTTCCTT TTGCTTTACC AATACATATT GTTTCGTAAT GGCATGAAAT TTAGGGATAG TCTAGAGCCA CTCTTCGCCG AGGAATTACC TTCCCTTTCA GTTTTGGTCC CTATTAAGGG AGAAAAACCA GAGACGCTTC AGGGCTTGCT GGATAATTTA GCTACAGTAG AATGGGATAA AAACAAGCTT GAGATCATTG TAGTTTCCGA TGACTCTCCA GAGTATTTTG AAAATCTCAT CAGAAAAATC TCGATACCAC AAGGGCTCAA AGTCAAGATC GTTAGAAGAG AGAAAAAGGT AGGTTACAAG AGTGGGGCTT TAGCCTATGC ATACTCCTTA TCTAGCGGAG ACCTAATAAT TACCCTCGAT GTAGATGCCA GGTTAGAGAA AACCTCACTG ATAAAGGCGT TCAATAGGTT AAGAATACAC GGATGCGATG CTGTAACCAT GAACTGGATT GGATATTCAC AGAAGCCATA TTCTACTCTC GCCAAGGGAA TAATGATCTC AACCGTTATT GCAGATACAG CCCTTCTGAA CGGAAGGGAC AACAGTAATC TCAGGATCTT TCCCGTGGGT TGCGGAACAA TGTTCAAGAG AGATGCAATC GAATCGGTAG GACCATGGGA TCCCTCAATG ATCCAAGACG ACCTAGAAAT AGGGGCTAGG CTGATTAAGA ATGGGAAAAG GATTTGCTCT TCTACCTCTC CGGTCTACGT AGAAGTCCCA GATAATCTCG TGGCATTTTA CGTGCAACAA ACTAGGTGGG CCATGGGAAG TATAGAGGTT TTAACCAGGA GATTTAAGGA GATAATGAGC AGAAATATAT CCTTAAAGCA GAAGATTGAC ATTCTAATTT TCCTTCTTCA GTATGTTCCC ATAGGCCTGA CATTTTTAGC AGCTTTGGGA CTAGCATTAA TGTCCTTATT GGGACTAAAT CACGTTTACG ACTATCTGAG AACTCCCATA ATCCTCATCT GGATTCTTTC GCTTTCGATT TATGGGTACA ATTTCATAAA GACTGCATTG GGAAAAGGAT ACAAACTTGT AGAGGCTATG AGGGCCTTAG GAAAGGTTTC ATCTTACACT GTGGCTATTT CACCCTTTAT TCTAGTGGGG CTTCTATCTG GTCTAAGGAA GAACAGGAAA TACGTTGTCA CTCCTAAGGG AGTCAAAGTA GATACATGGA TCCAGTACCC AGTACTTCTT TTTGGCATTT TGTTCTTAAC ATCCTCCATT ATCTATCTTA TACACGGAGC CCCAGTAACC GGTCTTTGGC TCCTTTATTA CTCCATGGGG TATCTGTTCA CTGTCGCAAC TTTCAAAAGA GAGCTTTAG
|
Protein sequence | MNPLIQAILT TAIFIIPSFL LLYQYILFRN GMKFRDSLEP LFAEELPSLS VLVPIKGEKP ETLQGLLDNL ATVEWDKNKL EIIVVSDDSP EYFENLIRKI SIPQGLKVKI VRREKKVGYK SGALAYAYSL SSGDLIITLD VDARLEKTSL IKAFNRLRIH GCDAVTMNWI GYSQKPYSTL AKGIMISTVI ADTALLNGRD NSNLRIFPVG CGTMFKRDAI ESVGPWDPSM IQDDLEIGAR LIKNGKRICS STSPVYVEVP DNLVAFYVQQ TRWAMGSIEV LTRRFKEIMS RNISLKQKID ILIFLLQYVP IGLTFLAALG LALMSLLGLN HVYDYLRTPI ILIWILSLSI YGYNFIKTAL GKGYKLVEAM RALGKVSSYT VAISPFILVG LLSGLRKNRK YVVTPKGVKV DTWIQYPVLL FGILFLTSSI IYLIHGAPVT GLWLLYYSMG YLFTVATFKR EL
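The sequences above are displayed in space-separated blocks of 10 characters, so the spaces must be removed before any analysis. The following is a minimal sketch, using only the Python standard library, of how one might clean such a string, estimate its GC fraction, and translate it. The helper names are hypothetical. The codon table is the standard amino-acid assignment, which translation table 11 shares for elongation codons (the tables differ only in permitted start codons).

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (NCBI standard layout).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(raw: str) -> str:
    """Remove the whitespace used for display grouping and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons from position 0, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First two display blocks of the gene sequence in this record.
prefix = clean_sequence("ATGAATCCAT TGATACAAGC")
print(translate(prefix))  # MNPLIQ
```

The translated prefix matches the first six residues of the protein sequence in the record. Applying the same functions to the full gene sequence would be the natural way to cross-check the Protein sequence and GC content fields.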
|
| |