Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msed_1835 |
Symbol | |
ID | 5104185 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | - |
Start bp | 1779949 |
End bp | 1781346 |
Gene Length | 1398 bp |
Protein Length | 465 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 640507729 |
Product | polysaccharide biosynthesis protein |
Protein accession | YP_001191908 |
Protein GI | 146304592 |
COG category | [R] General function prediction only |
COG ID | [COG2244] Membrane protein involved in the export of O-antigen and teichoic acid |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 32 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.963545 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATATATC AGTTTAAGAT CATGAATCCA TTAACTGGTT CATTGAAGTT TCTAGCGACT ACACTTCTAA ACTCAATCAC CGCTCTACTG TTCTTTCTCA TTGTCGCCCA TTTTTCTAGT CCGTCATTTG TTGGGAAGGT AGCAATAATA CAGCTTATAG AGACTATTAC AGGATCGTTC TTTGCTTTAC TACCGTTCAA TCTTGTCACG AGGGATATCT CACATAAATA TGCCTCCTCT CAGGACCATA GGAAGGTAGT TTCTACTTCA CTCTCGTACT CCCTTTTGGT TTCACCTTTT CTCCTTTTTC TGTTTCTATT TCCCTCATAC GTATGGTTGT CAATACCGTA CTTTGTTTTA TATTTATTTT CCACTTATCA ATATCAGATT TTATCGGGAT TGGGAAAGTT CTCTGAAACG AATTTAGGCA ACGTTATCTT TACCGTAACG AGATGGGGGA TATCCTCGGT CGCAGTGTTT TATCACAGCA TATCACTCTT GATCCTAATT TGGACTTTAG GTGCCCTGGT AAGGGTTATC TACTATAACC ACTATCTTCC GTTTAAATTT CACTTCGATT TTCAGGTTGC AAAGGAAATA GCCAAGATAG GGGTTCCAAT TTATTTGTCA GGAATAGTGT CTTTTATTTC CGGACAAGGG GATAGAGTTG TTACAGCATT TTTACTGGGA TCGTATAGTC TAGGTATTTA TCAATTGGTA GCATTGATTT CTGTTGTACC AAATACGCTG ATTTGGTCCT TGACCTCTGC CCTACTACCT TCCTCTACCT ACTATTACAC TAAGGGCGTC GAGATGAGGG AGATGGCCTC CGGTGCCTTT AGACTCTTGA CCTTCCTCTC CCTTCTTCTA GGGGTATCTA GTTACGCAAT TGCCCCATAT CTAGTTCTCA AGCTTTTCCC TGAGTATTCA CCTGGAGTCG AGGTGTTGAA GATCCTAGTT CTATTCATTA CAGTTACAAT GCCCTTTCAA ATTCTCTCAA CGTTCTTGAT TGCACTCAAC AAGAATTACA GACCCTTCCT GGTAATTGGG AGCGCGAGCG CCATTGAAGT GGTTCTGGTC TCCTTCCTCC TGATCCCGCG AATGGGGATT TTGGGTGCGG GGATAGCCCA GGCTGGGAAT GCCATAGTAA CCAGTATTCT TTATGTAATT TTCTCCCTAA AACAGGGAAT AATAACACTC GATAGAAAGA CTATATATTC CATTTTGTTG ATATCTCTTT CTTCGATTTC CCTCTTCTCC TGGGTGATTG GGGCACTTGT GATTATTCTA GGATTGAAGT TTCTTGGAAT CATAACTAAT AAGGAGATGG CCTTAATACA AAAATTCATT CCACCTCAGC TCAGATTCTT TATAAGGATA CTTAATCTCT TCATTTAA
|
Protein sequence | MIYQFKIMNP LTGSLKFLAT TLLNSITALL FFLIVAHFSS PSFVGKVAII QLIETITGSF FALLPFNLVT RDISHKYASS QDHRKVVSTS LSYSLLVSPF LLFLFLFPSY VWLSIPYFVL YLFSTYQYQI LSGLGKFSET NLGNVIFTVT RWGISSVAVF YHSISLLILI WTLGALVRVI YYNHYLPFKF HFDFQVAKEI AKIGVPIYLS GIVSFISGQG DRVVTAFLLG SYSLGIYQLV ALISVVPNTL IWSLTSALLP SSTYYYTKGV EMREMASGAF RLLTFLSLLL GVSSYAIAPY LVLKLFPEYS PGVEVLKILV LFITVTMPFQ ILSTFLIALN KNYRPFLVIG SASAIEVVLV SFLLIPRMGI LGAGIAQAGN AIVTSILYVI FSLKQGIITL DRKTIYSILL ISLSSISLFS WVIGALVIIL GLKFLGIITN KEMALIQKFI PPQLRFFIRI LNLFI
|
| |