Gene Information |
Locus tag | Msed_0358 |
Symbol | |
ID | 5103601 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | + |
Start bp | 311557 |
End bp | 312933 |
Gene Length | 1377 bp |
Protein Length | 458 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 640506264 |
Product | glycosyl transferase family protein |
Protein accession | YP_001190459 |
Protein GI | 146303143 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1215] Glycosyltransferases, probably involved in cell wall biogenesis |
TIGRFAM ID | |
|
|
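The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that relationship, assuming the start and end coordinates are 1-based and inclusive (the reported gene length is consistent with this) and that the final codon of the CDS is a stop codon. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(311557, 312933)
print(length)                     # 1377
print(protein_length_aa(length))  # 458
```

Both results agree with the Gene Length and Protein Length fields of this record.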
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATCAAC TAGATGCATT TTATTTAATA TTTGGATCGC TAACCTTATT CCTTTCACTG GTTTACTTTA TTCTAAATTC CTACTTCTCT ATTAACGTAA GGATGCCCAG AAGAGTAGAC TCGAACAAAT TATCTGACGT GACCGTTCTA ATCCCCGTGT ATGGGGAAAA GGCGTCAGTG TTTGAGAGAG TGATCTCTGC AGTTGCGGAG CAGGACGTGA AGTTTCTAGT TGTGGGAGAC GGTTGCGATG AGCCCTATAG GTCAATCACC TTTAGGTATG GTGGAAGATT TATCAAGACA CCTGCCAGGT CAGGGAAGAG GAATGCCCTA GCCACTGGTA TATCCCACGT GGACACGAAG TTTGTTCTCT TCCTGGATAG CGACACCGTA TTGCCCAGTG ACGGAGTAAG AAGGATGTTG TCCCTCATGG ACGAGGGAGT AGGTGGGGTA AGCGTTAACG TGAGGAATGT AAAAACTGGA AATACCTTCT ACTTAGCAGA ACTGATTGAA AGACTAAAGG AGGCCACAAT GAGGGCCGTA AATAGGTCGG GCTATGCAGT TCTTCTGAAT GGAAAATGTT CCCTTTATAG GACCGAGTTG GTTAGGCCAT TTATCCTTAG TGAAGAGTTC AGAAACCCGA GGTTCATGGG AAGAAGGGCA TTAATAGGAG ATGACAAACA ACTTACAAAC TACGTCATCT CGAGAGGTTA TAAGGCATTG CTGGACTTTG AGACCACCGT GTTGACTTAT CCTCCAGAGA GTGTTAAGAA ACTGTACAGA CAACTAATTA GATGGTCTAG GGCAAACTAC TACTTTTTCT ATAGGGAACT GAGGGATGGA ACCATGTTCA AACGAGGTCC CCTTTACGTT TTCAATTTTC TATACACAAC GATTCTTCCA TTCCTCGTGA TGGGGGTTTC AATCTTTGAT ATGATTTTCT TGGGCTCCAC AATTGTGGAT ACGAATCCAT CAGATTACGA AGTAGCTCTT CTTCACGGGG GCAACTTCCT CCTTCACTTA CCCATCATCT TGGCAAAGAG GATAGTATTC TCCCTTCTGC TGGGAACTAT GACAACTCAT TACCCATTTT CCGTGACCCA GTCAGCGTTT ATCACTCCGT TCCACTCTGC ATTTCACTCA ACCCCTACTT TCCTAGGCCT CCACTTTCCT AGGCTAGGCT TCAAGTATTC TGTTATCTTA ATGCATGTGG CAAGCTACCT TACGGCAATT CCCTTCATTT ACGCTCTGTG GAGACTTCTG CACGAGGAAA AGTTGAAGAC GCTGGTGATT GGTTCCCTAG CCTTAGCAAT TCAACTCGTT GTGAGCATTT ACGCGTTACT CACGATTTGG GATCAGGACA AGTGGCTGAC TAGGTAA
|
Protein sequence | MNQLDAFYLI FGSLTLFLSL VYFILNSYFS INVRMPRRVD SNKLSDVTVL IPVYGEKASV FERVISAVAE QDVKFLVVGD GCDEPYRSIT FRYGGRFIKT PARSGKRNAL ATGISHVDTK FVLFLDSDTV LPSDGVRRML SLMDEGVGGV SVNVRNVKTG NTFYLAELIE RLKEATMRAV NRSGYAVLLN GKCSLYRTEL VRPFILSEEF RNPRFMGRRA LIGDDKQLTN YVISRGYKAL LDFETTVLTY PPESVKKLYR QLIRWSRANY YFFYRELRDG TMFKRGPLYV FNFLYTTILP FLVMGVSIFD MIFLGSTIVD TNPSDYEVAL LHGGNFLLHL PIILAKRIVF SLLLGTMTTH YPFSVTQSAF ITPFHSAFHS TPTFLGLHFP RLGFKYSVIL MHVASYLTAI PFIYALWRLL HEEKLKTLVI GSLALAIQLV VSIYALLTIW DQDKWLTR
|
| |
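The sequences above are printed in space-separated blocks of 10 characters. The sketch below shows one way to strip that formatting, estimate GC content, and translate the gene sequence for comparison with the protein sequence. It assumes plain A/C/G/T input with no ambiguity codes. Translation table 11 uses the same codon-to-amino-acid assignments as the standard code (it differs in the permitted start codons), so a standard codon table is used here. All names in the block are hypothetical helpers, not part of any library.

```python
from itertools import product

# Standard codon assignments in TCAG order; shared by translation table 11.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and any line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three and last three blocks of the gene sequence in this record.
print(translate("ATGAATCAAC TAGATGCATT TTATTTAATA"))  # MNQLDAFYLI
print(translate("GATCAGGACA AGTGGCTGAC TAGGTAA"))     # DQDKWLTR
```

The two fragments translate to the first and last residues of the protein sequence listed above. Passing the full gene sequence to `gc_fraction` and `translate` would allow the GC content and Protein Length fields to be checked in the same way.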