Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msed_1808 |
Symbol | |
ID | 5105371 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | + |
Start bp | 1750058 |
End bp | 1751308 |
Gene Length | 1251 bp |
Protein Length | 416 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 640507707 |
Product | glycosyl transferase, group 1 |
Protein accession | YP_001191886 |
Protein GI | 146304570 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGAGTAG GCCTAGTTAC TTTTTCCTTG ATGAACCTAC GGGAAGGCGG AGCAGAGAGG CATGTTAGAG AATTTGTTAA TATTGCTAAG ACTAGATTTG AACTTGTTCT CTTTCCCACT TTAAACACGT ATTTACAAGT TGAAAACGAG GAAGATAAAA ATGCCCTAAT CAAGAGAGCT CAGGAGCTAG AGAAGGAGGG TATTACACTA GCCTCAGAGT TTTACTCCTT GGTTGACCGC TCGATTACCA GGAAAGAGCG TCTCTTGAAT TTTGTGGACT TTCGTTTACT GCGCGAGTTA AGTAAGGCCT ACTATTCCGA TCTAAACAAG ATTGATTTTC TTTTCTCCCC GAACTTTATA TCTCCAGACG TAGTTCTCAT GGCTAGCCAC TCAGGAAAAG GTTATGGAAT TCTGATAAAC GGGTATATAG CGCCCCTTCA CATGGATCCC CTCTTATATT CCATTTATAA GTTTAGGATT GGTCAGGAGA GCTTTCTTAG CTCATTGCCT AAAAACCTTC TAATTTCTAA TTATTGGGCT AAAGCCATTA AGATAATGAA AAATAATCCA CCTAAGTTCG TGGCAGGCGT AAATAGGGTA GCTATTGAAG GAGTGATAGG TAAATTGCCA ACTAATCACG TCATTCTGGA TCCTGGATTT GCAATTGATC CATCTATCGT GAAGTATAGG TCTGAAGCTA AGGATAACTA TGCAATCTTC GCTGCAGCTA GAGTAGAGCC CAGTAAGGGG ATGCTAGATC TGTTGAGGAT CATGAAGCTC CTTAAAAGTG CAGACGTGAA ACTAAAGATC ATGGGAAGGT TAAAATCAGA TTCTTCCAAT TTTTATCGAT TAGCGGAACG TTATGGTGTA AGGGATAAGA TAGAATACCT AGGATTTCTA TCTGGAGAGG AAAAGTACAA GGTCATGAGC TCTGCAAGGG TCATGGTTTA CCCGTCTCAT GACGATACCA ATGCACTGGT TGTGATGGAA TCCCTTGCAG TTGGGACTCC TGTCGTTACC TATGCTATTC CTGGCATTAA GTACGTGTAT GATGGTGTTC CTGGTGTAAC GCTAGTGAAA GAGTTTAACT ATGACGGAAT GGCGAAAGAG ATCAACAAGA TCATGCATAC TCAATATTCG TTAAATGATG AGAAGCTACA GAAATTACTC TCCCATCATA GTTCTTGGGC CAAGGTAGTC GAACAAGAGG TTAACCTGAT ACTTAAATAT GGTATTAAAG ATATGAAATA A
|
Protein sequence | MRVGLVTFSL MNLREGGAER HVREFVNIAK TRFELVLFPT LNTYLQVENE EDKNALIKRA QELEKEGITL ASEFYSLVDR SITRKERLLN FVDFRLLREL SKAYYSDLNK IDFLFSPNFI SPDVVLMASH SGKGYGILIN GYIAPLHMDP LLYSIYKFRI GQESFLSSLP KNLLISNYWA KAIKIMKNNP PKFVAGVNRV AIEGVIGKLP TNHVILDPGF AIDPSIVKYR SEAKDNYAIF AAARVEPSKG MLDLLRIMKL LKSADVKLKI MGRLKSDSSN FYRLAERYGV RDKIEYLGFL SGEEKYKVMS SARVMVYPSH DDTNALVVME SLAVGTPVVT YAIPGIKYVY DGVPGVTLVK EFNYDGMAKE INKIMHTQYS LNDEKLQKLL SHHSSWAKVV EQEVNLILKY GIKDMK
|
| |