Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msed_0372 |
Symbol | |
ID | 5103615 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | - |
Start bp | 322664 |
End bp | 323911 |
Gene Length | 1248 bp |
Protein Length | 415 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 640506278 |
Product | protein of unknown function DUF395, YeeE/YedE |
Protein accession | YP_001190473 |
Protein GI | 146303157 |
COG category | [R] General function prediction only |
COG ID | [COG2391] Predicted transporter component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 33 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGGCAACTC CCTCAATGAT ACTTCAACCT GTTTTTTCTG AGTCCTGGGC TCTCATATTT GGATTGTTTG GGTTAAGCGG GTTTATCCTG GGCTGGGTAG CCCAAAGGGG GAACTATTGC TTTGTGAACG CGATGACGTC GATCTTTACG ACAAAGAGCT TCGAGAGATT TGGGGCACTT CTCATCCTTT TTGGTCTCAC TGCCCTAGGA ACTGGACTTC TCGTGGCCTT TGGGCTAATT CCTCCCGTAG ATCAATACTT CAATAACTAC TTCGCAGGAT GGTACATACT GGTTGGATCA TTCATATTCG GCTTTGGTGC AGCACTGGCT GGTGGATGCA ACCTTTCCAT GCTATACAGG GCCAGTAGCG GTTACGTCCA GAACTGGATT GAGCTATTTG GAATGATGAT CGGAACTTAC ATTTTCGCAG TTGCAATCTG GCCCTTCCAG TCCTATACAA TGCAGAGCGG GATTCTCTCC ACTAGCTCTG GCGGATACGT AGAGTATTTG CCTTACGTCC TATTCCATTC CGTCTCCAAC ATGTCAGTTT ACATTACCAC AATGATAGTG GCGCTTCCAC TTCTCTTCTT AGGGATATAT CTGCAAATGA GGACGAGAAG CAAGTGGGAT AAGTCAGTGA GCATGAAGGG ACCTGGTCTA AGTGCGGTTG GAATGAAGGC ACTACCAGGT TTACCAGGCC CCTCTTCACC GTCAATCCCC TCTGGACTTA GGCTGAGGCA GGAGGCTAAG GACATGCTCC TTCTCAGGAA GCCCTACGGA ACCAATCTCT CCACCGTGAT ATTAGCACTG GACATGATCT TCGTCTTCAT CGTTGGTGCT GGGTACACGT TTAACTACCT GGTAATTACC TCGTCAGACG GGGGAAGGTT CTTCGAATAC ATCCTCATGC CCTTGGGAAT CAACCTATTT ACCAACACCC CATGGTTCAA CAGTTCCTTG CCTATTGTTG ACCCCAGTAC CTTAATGGTG GTAATGCTTT GCGTGGGGGC ATTTTCTGCC TCATTCTTGA GCGGAGACTT CAAGATCAGG ATACCCAAGG ACAGGAAGAG GTTGGCCATA GGTTTCGTAG GTGGGATGCT CGTGGGAATA GGTGTAAGAA TGGCCCTAGG ATGCAACGTT GGATTAATGT GGACAAACTT TGGACAGTTA GGGTATGACG GCTACATTTT CCTAGGTGGA ATGCTCGCGG GGATATACCT CGCAGTCAAG GTGCAGGAGA AACTTTAG
|
Protein sequence | MATPSMILQP VFSESWALIF GLFGLSGFIL GWVAQRGNYC FVNAMTSIFT TKSFERFGAL LILFGLTALG TGLLVAFGLI PPVDQYFNNY FAGWYILVGS FIFGFGAALA GGCNLSMLYR ASSGYVQNWI ELFGMMIGTY IFAVAIWPFQ SYTMQSGILS TSSGGYVEYL PYVLFHSVSN MSVYITTMIV ALPLLFLGIY LQMRTRSKWD KSVSMKGPGL SAVGMKALPG LPGPSSPSIP SGLRLRQEAK DMLLLRKPYG TNLSTVILAL DMIFVFIVGA GYTFNYLVIT SSDGGRFFEY ILMPLGINLF TNTPWFNSSL PIVDPSTLMV VMLCVGAFSA SFLSGDFKIR IPKDRKRLAI GFVGGMLVGI GVRMALGCNV GLMWTNFGQL GYDGYIFLGG MLAGIYLAVK VQEKL
|
| |