Gene Information |
Locus tag | Msed_1844 |
Symbol | |
ID | 5104115 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | + |
Start bp | 1789472 |
End bp | 1790518 |
Gene Length | 1047 bp |
Protein Length | 348 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 640507732 |
Product | glycosyl transferase, group 1 |
Protein accession | YP_001191911 |
Protein GI | 146304595 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
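The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal Python sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the final codon of the CDS is a stop codon not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming the last codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(1789472, 1790518)
    print(length)                  # 1047
    print(protein_length(length))  # 348
```

Under these assumptions the results agree with the Gene Length (1047 bp) and Protein Length (348 aa) fields of this record.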
| |
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.0112136 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.849327 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAGGCTTC TTTTCATAAA TCATAGGGAC ATCTACCATC CGCAGGCGGG TGGGGCCGAG AGGGTCATCC TGGAGGTTGC GAGGAGACTG GCGAAGCGCG GGATTGACGT TTCCTGGTTG AGCGAATCCG TCAACACCTC ACGCGATTAC GAGGATGGGG TGAAACTTCT CCACGCCGGT AATCGTTTTT CGCTCCATCT TTACTCCTTG CTCCAAGCGG GTAAGTATGA CGTTGTCATA GATAGCGTGG CTCATGCCGT TCCCTTTTTC TCATACCTGG TTAACAGAAG ATCCATAGCC CTAGTTTATC ACGTTCATCA GGAGGTGGTC AAGTATGAGC TTGACCCAGT CACCGCGACC CTCGTGAGAC AACTCGAGAA GGGGGTTAAG AACTATCCTT GGATTATCTC TATTTCTCAC ACAACAAAGC GCGACCTAGT CTCCTTGGGG GTAGATCCCA GGAAGATAAC AGTGATCCAT AACGGGATTG ACCACTCGCT TTATCAACCA GGCGAGAAGT CTCCCACACC CATGATCCTA TGGATAGGCA GGATGAAGAA TTATAAGAAC CCACTAGACC CCATCAAGGT GTTTAAGCGA CTTAAGACGA GGGCTACCCT GGTTATCGTG GGGAGTGGCG ACCTTGAGGA GGAAGTAAAG AGGGCCACCC TCGGAGAGAG AGACATCATC TACCTGGGAA GGGTGTCCGA GGCCAAGAAG GTGGAGCTGT ATCAAAGGGC ATGGGTTACG CTGTCAACCT CGTTCATTGA GGGGTGGGGA ATGACCGTGG TGGAGGCTAA CGCCTGTGGG ACCCCCGTTC TTGGCTACGC TACTGGCTCT CTCCCCGAGA TCGTTGAGGA AGGGGTTAAC GGGTTCCTTG TGGGTTACAA GGACTTGGAT GGGATGGCTC AGAGGCTGGA GTACATGTTG AACGAGGATG TGATGAAGAG CCTTTCTAAG TCTAGCTACG TGAGTTCGCT TAAGTACGAC TGGGATAAGA CTGCAGATCA ATATTATGCG AAAGTAAAGG AGGTTCTCCA GACCTAA
|
Protein sequence | MRLLFINHRD IYHPQAGGAE RVILEVARRL AKRGIDVSWL SESVNTSRDY EDGVKLLHAG NRFSLHLYSL LQAGKYDVVI DSVAHAVPFF SYLVNRRSIA LVYHVHQEVV KYELDPVTAT LVRQLEKGVK NYPWIISISH TTKRDLVSLG VDPRKITVIH NGIDHSLYQP GEKSPTPMIL WIGRMKNYKN PLDPIKVFKR LKTRATLVIV GSGDLEEEVK RATLGERDII YLGRVSEAKK VELYQRAWVT LSTSFIEGWG MTVVEANACG TPVLGYATGS LPEIVEEGVN GFLVGYKDLD GMAQRLEYML NEDVMKSLSK SSYVSSLKYD WDKTADQYYA KVKEVLQT
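Both sequences above are printed in space-separated blocks of ten characters, so the whitespace has to be removed before any analysis. The sketch below is one possible way to do that in Python, then compute the GC fraction and translate the CDS. It assumes the standard amino-acid assignments, which translation table 11 shares with the standard code, and it assumes the first codon is a start codon read as Met (here GTG). The helper names `clean`, `gc_fraction`, and `translate` are hypothetical and do not come from the source database.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces between ten-character blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence (0.0 to 1.0)."""
    dna = clean(dna)
    if not dna:
        raise ValueError("empty sequence")
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate a CDS up to the first stop codon.

    Assumption: the first codon is a start codon and is read as Met.
    Codons with ambiguous bases (for example N) raise KeyError.
    """
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append("M" if i == 0 else aa)
    return "".join(protein)


if __name__ == "__main__":
    first_blocks = "GTGAGGCTTC TTTTCATAAA TCATAGGGAC"
    print(translate(first_blocks))  # MRLLFINHRD
```

Applied to the first three blocks of the gene sequence, the translation reproduces the first ten residues of the protein sequence. Running `gc_fraction` on the full gene sequence gives a value that can be compared with the reported GC content.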