Gene Information |
Locus tag | Msed_0790 |
Symbol | |
ID | 5105113 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | + |
Start bp | 723153 |
End bp | 724181 |
Gene Length | 1029 bp |
Protein Length | 342 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 640506695 |
Product | glycosyl transferase family protein |
Protein accession | YP_001190889 |
Protein GI | 146303573 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1215] Glycosyltransferases, probably involved in cell wall biogenesis |
TIGRFAM ID | |
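The coordinates and lengths listed above are mutually consistent, which can be checked with simple arithmetic. The following is a minimal sketch in Python, not part of the record itself. The function names are hypothetical, and it assumes 1-based inclusive coordinates and that the terminal stop codon counts toward the gene length but not the protein length.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # the stop codon encodes no residue


length = gene_length(723153, 724181)
print(length, protein_length(length))  # 1029 342
```

Both values match the "Gene Length" and "Protein Length" fields of this record.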
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.68906 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | GTGTTACTCC AGTTCCTCCT GGTGTTGCCT GCACTGGCAG ATCTAGTCCT CCTTTTACAG ATCTGGAGGG AAAACTCAAT CTTCAAATTT GACGGTAAGT TCTGCGCTCC TGCATCCATC ATTGTTCCCG TGAGGGGTCT CGACCCCGAA CTAGAGAGGA ACGTGGAATC GCTCAGGAAC CAGGACTTTC CCTGTCCCTT CGAGATAATA TACGTGGTGG ATCCTGATCA ACCATGGTTG GCTGAACGTC TGAGACGGCT TGGAGTGAAG GTCGTGATCA CCAGTTTTAC CTGTTCATGT AGCGGTAAAA TAAGGGCACA ACTCTCAGGG CTAAGGGAGT CCGCGAATGA AGTGGTGGTC TTCGCCGACT CCGACACGCT CTATCCTAGG AACTGGTTGA GGGAGATGGT GGGGAACCTT GACAGGCACA TGGCTGTAAC CACGTTTTCA TGGCCCGCCC CCCTCAAAAT AACGTGGAGA AACCTGATCA GGGCTGGCTT CTGGACATTG GGATTCGAGT CTCAGGCCTC TGGTGGGACC TTCCTCTGGG GAGGCTCCAT GGCCTTCAGA AGAGATTTCT TTGATAGTGA GGTCCTGGAA GAGCTTTCGC GTGAATGGTG TGACGACTGC ACCCTCACTA GGATAGTGAA AAAGCGAGGA GTAAGTATCG CCTTCGACGG TAAAGCCATC CCACTCAACA TTTATGACGA GAGAGACCTA TGGAAATGGT CCACAAGGCA GGTCGTCACG ATCATCAAGT ACTCTAGCAG AGGAGCCAAG GCCTTCCTGG TGATAGGTGC TCTCATGCTT GCCTTTCCAA TCCTTTTCCT TGTCTTCTTG AACCCATTCT ACCTGTCTCC TCTGCTTCTA TGGATTCTGA AAAATTTCTC CAGAAGTAGA AATCTGGGGA AATATTCATA TACCCCATCT GTCATGTCAA TTTTAGGTGT ATATTACGGG TGGATCAAGC TAATCCTTGA CTACAGGAAA AGGACAGTCG TTTGGAGAGA CAGGGTCTAT AATCTTTAA
Protein sequence | MLLQFLLVLP ALADLVLLLQ IWRENSIFKF DGKFCAPASI IVPVRGLDPE LERNVESLRN QDFPCPFEII YVVDPDQPWL AERLRRLGVK VVITSFTCSC SGKIRAQLSG LRESANEVVV FADSDTLYPR NWLREMVGNL DRHMAVTTFS WPAPLKITWR NLIRAGFWTL GFESQASGGT FLWGGSMAFR RDFFDSEVLE ELSREWCDDC TLTRIVKKRG VSIAFDGKAI PLNIYDERDL WKWSTRQVVT IIKYSSRGAK AFLVIGALML AFPILFLVFL NPFYLSPLLL WILKNFSRSR NLGKYSYTPS VMSILGVYYG WIKLILDYRK RTVVWRDRVY NL
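The protein sequence relates to the gene sequence through translation table 11, which uses the standard codon assignments but permits alternative start codons such as GTG; an initiator codon is read as methionine. The sketch below illustrates this on the first 30 bases of the gene sequence. The helper names `clean`, `translate_cds`, and `gc_fraction` are hypothetical. The code assumes the input begins at the start codon and is in frame.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order. Table 11 shares these
# assignments with the standard code; it differs in the permitted start codons.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate_cds(seq: str) -> str:
    """Translate an in-frame CDS, stopping at the first stop codon.

    Assumption: the first codon is a valid initiator, so it is read as 'M'
    even when it is GTG or TTG rather than ATG.
    """
    dna = clean(seq)
    codons = [dna[i:i + 3] for i in range(0, len(dna) - len(dna) % 3, 3)]
    if not codons:
        return ""
    residues = ["M"] + [CODON_TABLE[c] for c in codons[1:]]
    return "".join(residues).split("*")[0]


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence (0.0 for empty input)."""
    dna = clean(seq)
    if not dna:
        return 0.0
    return (dna.count("G") + dna.count("C")) / len(dna)


first_30 = "GTGTTACTCC AGTTCCTCCT GGTGTTGCCT"
print(translate_cds(first_30))  # MLLQFLLVLP
```

Passing the full gene sequence to `translate_cds` and `gc_fraction` would allow a comparison against the "Protein sequence" and "GC content" fields.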