Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msed_0183 |
Symbol | |
ID | 5103927 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | + |
Start bp | 146836 |
End bp | 148023 |
Gene Length | 1188 bp |
Protein Length | 395 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 640506088 |
Product | glycosyl transferase family protein |
Protein accession | YP_001190284 |
Protein GI | 146302968 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1215] Glycosyltransferases, probably involved in cell wall biogenesis |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.00790444 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.140907 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTTGATG ACATTGTAAT TGGACTTAGT ATCATAGTAT CCATATGGAG CGTCTATAAC TCTGCCTTCG CTATCTACGG GTTGTCCTGG AAATCCGATG AGCCCAAAAC CTCCTCAGGC CCATCCTTTT CCTTGTTAGT TCCGGTGAGG AACGAAGAGA AAGTCCTAGG GAGACTCCTT GAAAGGCTAG TTAACCAAGA ATATGATAGG TCAAAGTATG AGATAATTGT CCTAGAGGAC GGATCTACAG ACAACACGTT AGGGGTATGC AATAAATTTT CAGAAATGTA TAGTATTATC AAATGTGTCC ATCTGGAAAA GAGCAATGTC GTTAATGGGA AGAGCAGAGC CCTCAATTAT GGATTGAAAA TATCAAGGGG AGATATTATA GGCGTATTTG ACGCCGATAC TGTACCTAGA CTTGACGTGT TAGGTTATGT AGCCCAGAAG TTTATTTCTA ATTCTAGAGT AGGAGGCGTA CAGGGAAGGT TAGTCCCCAT CAATGTTAGG GAAAGCATAG TGGCTAGGTT AGCCTCGCTA GAAGAGTTGT TCAGTGAGTA CTCGATTTCA GGAAGGGCCA GAGCAGGCCT TTTCGTACCA CTTGAGGGTA CATGTAGTTT CGTTAGGAGA GATGCCTTGG AGAAAGTGGG AGGTTGGAAC GAGAATGTAC TTACAGAGGA CCTAGATCTC AGCCTAAAAC TAACAAGCTT GAACTATTTG ATCGTTTACT CACCTTCTGT TCAGAGCTGG AGGGAAGTCC CGGTTACATT CAGTTCACTA GTTAGGCAGA GATTAAGGTG GTACAGGGGT AACTTTGAGC TTACCATGAG GATCTCTAGG TTTAAGTTTA CTTGGAGGTT GGTAGATGCA GCTATGTTAG TAGGCACTCC AGTATTCATG GTTTTAAGCT TGGCGAACTA TTCCCTTGTC TTTATTTACT CATATCAATT GCACGTCCTT ATAGCTGCTA TTATCTCGTT TTCGTCCATG ATGACTCTTC TTCTAATAAT TATGATATCC AGGAGACATA TGATTGAAAC AATTTATATA ATTCTATCCG CATTATATCT TAATTTTACC ATAAGTCTCC ATTTAATTTC CATTGTTCTA GAATTGGCTG GCGCACCTAA GGGATGGAGT AAGACGGAAA GGTCTGGTAA GATCACGGTA GATGTGCCGA GACCATAG
|
Protein sequence | MLDDIVIGLS IIVSIWSVYN SAFAIYGLSW KSDEPKTSSG PSFSLLVPVR NEEKVLGRLL ERLVNQEYDR SKYEIIVLED GSTDNTLGVC NKFSEMYSII KCVHLEKSNV VNGKSRALNY GLKISRGDII GVFDADTVPR LDVLGYVAQK FISNSRVGGV QGRLVPINVR ESIVARLASL EELFSEYSIS GRARAGLFVP LEGTCSFVRR DALEKVGGWN ENVLTEDLDL SLKLTSLNYL IVYSPSVQSW REVPVTFSSL VRQRLRWYRG NFELTMRISR FKFTWRLVDA AMLVGTPVFM VLSLANYSLV FIYSYQLHVL IAAIISFSSM MTLLLIIMIS RRHMIETIYI ILSALYLNFT ISLHLISIVL ELAGAPKGWS KTERSGKITV DVPRP
|
| |