Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msed_0129 |
Symbol | |
ID | 5104982 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | + |
Start bp | 105029 |
End bp | 106222 |
Gene Length | 1194 bp |
Protein Length | 397 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 640506030 |
Product | glycosyl transferase, group 1 |
Protein accession | YP_001190230 |
Protein GI | 146302914 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.730138 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 4 |
Fosmid unclonability p-value | 0.000955625 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGATAGAAA AATACGCTGA AATAATAGGA GAAAACGAGC TAGATGCAAT CTACAGAATA GCTGAGAAGT TGAAGGGATT GTCAGTTCTT CATGTTAATT CAACACCTAA GGGCGGTGGA GTAGCTGAAA TCTTGAGCAA ACTTGTCCCT ATGATGAACG AACTGGGGAT AAACACTCAA TGGAAGGTCA TAAGGGGGGA CAATGAGTTC TTTAACGTAA CAAAGTCATT TCATAACTCC CTACAGAACG GTTCGGGAAA TGTAAATGAT GAGACTTTCG CAGTTTACGA GAAATGGCAA GATATTAATG CATCGGAAAT TCCGCTGGAC TACGATGTGG TCTTCATACA CGATCCTCAG CCTGCAGGAT TAATCAAGGT AAGAAAGAAG GGAAAGTGGA TATGGAGATG TCATATTGAT ATTTCCAATC CATATCCAAA GGTTTGGGAA TTCCTTCGTA AGTATGTTCA GCAGTATGAC GCCATGATAA TATCGTCATC GGTATTTGGA AGAGAGGACC TTACCATCCC ACAGTACGTG ATTCCGCCGT CGATAGATCC CCTAAGCGTT AAGAACAGAG AGATATCACG ATTTACTGTC GAGAGGATCC TAAGGAAGTT TGAGGTTGAT CCTGATAGGC CACTGGTGAC TCAGATTAGC AGATTTGACC GGGCAAAGGA TCCAATAGGA GTAATATCTG CCTTTAAGGG GCTAAAGAAG CACATAGATG CTCAATTGGT CTACTTGGGA AGCCCCGCAT CTGACGACCC CGAAGGGGAT CAGGTATATC AGGAGACGGT AAAGGCAGCT GAAGGAGTGA AAGATGTTCA TCTCTTAATG CTACCCCCCG ACAGTGATCT GGAGGTTAAT GCGTTCCAAA GGGGAGCAGA CGTCGTAATG CAGAAATCGA TTAAGGAAGG CTTCGGTTTA ACCGTAAGCG AGGCCATGTG GAAAAGGAGA GCAGTAATTG GTGGTAATAC GGGAGGAATT CCCTTACAGA TAATCCATGG TTATACGGGA TTCCTAGTGA ATACGCCCGA GGAGGCCACA CATTATCTCA TTTACTTACT GCGAAATAGA CAAATAAGGG AGAAGATAGG TCAGGACGCT AGGGAGCATG TTAGAAATAA CTTCCTGATG ACTAGGGAAC TTCGGGATTA CCTAATGGTA ATGTTATTGT CAATTCAAGG CTGA
|
Protein sequence | MIEKYAEIIG ENELDAIYRI AEKLKGLSVL HVNSTPKGGG VAEILSKLVP MMNELGINTQ WKVIRGDNEF FNVTKSFHNS LQNGSGNVND ETFAVYEKWQ DINASEIPLD YDVVFIHDPQ PAGLIKVRKK GKWIWRCHID ISNPYPKVWE FLRKYVQQYD AMIISSSVFG REDLTIPQYV IPPSIDPLSV KNREISRFTV ERILRKFEVD PDRPLVTQIS RFDRAKDPIG VISAFKGLKK HIDAQLVYLG SPASDDPEGD QVYQETVKAA EGVKDVHLLM LPPDSDLEVN AFQRGADVVM QKSIKEGFGL TVSEAMWKRR AVIGGNTGGI PLQIIHGYTG FLVNTPEEAT HYLIYLLRNR QIREKIGQDA REHVRNNFLM TRELRDYLMV MLLSIQG
|
| |