Gene Information |
Locus tag | Msed_0649 |
Symbol | |
ID | 5103809 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | + |
Start bp | 593630 |
End bp | 594895 |
Gene Length | 1266 bp |
Protein Length | 421 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 640506553 |
Product | glycosyl transferase family protein |
Protein accession | YP_001190748 |
Protein GI | 146303432 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1215] Glycosyltransferases, probably involved in cell wall biogenesis |
TIGRFAM ID | |
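
The Start bp, End bp, Gene Length, and Protein Length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only and is not part of the database record: the function names are hypothetical, and it assumes 1-based inclusive coordinates and a single terminal stop codon on an unspliced CDS.

```python
def gene_length_bp(start_bp, end_bp):
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp):
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(593630, 594895)
print(length)                     # 1266
print(protein_length_aa(length))  # 421
```

Under these assumptions the values agree with the record: 1266 bp is 422 codons, of which 421 encode amino acids.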
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.113034 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTTTTTA TTGATATTAT CATCGTTTTA TCTGCAATAC TTTCATCCTT GTGGATACTT CTGGAGTCGT TCTACTATAC CCGGGACAAA CCTCCTGTCC CTAGGACAGA CGGTCCCAGG TACAAGGCGT CCATAGTGGT GGCCATAAAA AATGAAGATC CTGAAGTAGT GAAGGGGTTA GTTGAGAACC TTTCAAGGCT AGACTACCCT GATTACGAGG TAATACTGGT TTCAGATGAT TCAGAGCAGG ACTTTGAGCG ATTAAGGCAA ATAGAGCTAC CGGAAAAATT CAAACTGGTT AGGAGGGATG TCCCCCAAGG TAGGAAGGCT GGTGCCCTCA ATTACGGCGT TTCCCTTTCA ACGGGGGAAA TCCTTGTTTT TTTAGATGCC GAGGCCAGAG TGGATCCTAC TATCTTAACT AGGATATCTG CCCACTTGAG CCAGGCCGAG GCGATGGCCC TTAGACTAAG GGTCAGAGAT CCGAAGAACA AGCTTCAGGT ACTCTATTCC GAGATAACTG AGTTTTCCAT GGACTCGCTA TTTAGGGGAA GATACCTCAA GGGTCTTCCA GTATTTCCCA ACGGATCAGC CTTTGCCATT AGATCCTCTA CCCTGAAGAG GATAGGAGGG TGGAGAGAGG GAATGGTCGC TGAGGACTTG GAAATAGGGA TGAGGCTATT CCTGAATGGA GTGAAAGTTG GTTACGCCGA CGACGTTGTT GTGGAGACGT TAGCTCCCTA TACCTGGAAG GATCTTTTCC AACAGATGAA ACGATGGGCC TACGGATCTG GACAACTTTT CCCCTACAGT CTTTCACTGT TGAGAAGAGG TATGAGCGGT ATAGAAGGTG CAATATATGC GAATCAGTGG GGAATTTATC CTGCATACTT TGTTATGTTA CTGATTGCAG GTATTGTCTC TCCAGTCTTC TCCTCCTCGC TTCTTTCCTG GGTCTTGTCC TTGACACTGT TTCTAATCTC GTCCCTGGTC TTCTCATGGA GATCAAGAAC TAGGGAGTAT GACCTAAGGA TTCCAGCTCT CATGATCTCA GCCTTCCTAA CTGGTTATCT CCTAGGACTT CTTAACGCAA AATTTAGCTG GAAGGTCACA CCCAAGGTAG AAAGAGAACA GGGATTATGG ATACCGCTCG AGTCCAATAT TATCTCCTAT CTTTTTCTCT TGAGCGGAAT ATTAGCCCTA AAAAGCTATT TAGTTCAGGG AACGATTCTC CTGGCGATTT CCCTTATCTT ACTAATCATA CCGTGA
|
Protein sequence | MLFIDIIIVL SAILSSLWIL LESFYYTRDK PPVPRTDGPR YKASIVVAIK NEDPEVVKGL VENLSRLDYP DYEVILVSDD SEQDFERLRQ IELPEKFKLV RRDVPQGRKA GALNYGVSLS TGEILVFLDA EARVDPTILT RISAHLSQAE AMALRLRVRD PKNKLQVLYS EITEFSMDSL FRGRYLKGLP VFPNGSAFAI RSSTLKRIGG WREGMVAEDL EIGMRLFLNG VKVGYADDVV VETLAPYTWK DLFQQMKRWA YGSGQLFPYS LSLLRRGMSG IEGAIYANQW GIYPAYFVML LIAGIVSPVF SSSLLSWVLS LTLFLISSLV FSWRSRTREY DLRIPALMIS AFLTGYLLGL LNAKFSWKVT PKVEREQGLW IPLESNIISY LFLLSGILAL KSYLVQGTIL LAISLILLII P
|
| |
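
The relationship between the gene sequence and the protein sequence above can be reproduced with a short translation routine. This is a minimal sketch, not the tool the database used: the function names are hypothetical, and the codon table is built from the standard NCBI ordering. Translation table 11 assigns the same amino acids to codons as the standard code (it differs in its permitted start codons), so the same table is assumed here. The helper also strips the 10-base spacing used in the record.

```python
from itertools import product

# Standard genetic code in NCBI order (bases T, C, A, G; third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove whitespace (such as the 10-base grouping) and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate a coding sequence from its first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna):
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases of the gene sequence in this record.
print(translate("ATGCTTTTTA TTGATATTAT CATCGTTTTA"))  # MLFIDIIIVL
```

Passing the full gene sequence to `translate` and `gc_fraction` would allow a comparison with the Protein sequence and GC content fields of the record.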