Gene Information |
Locus tag | Msed_0174 |
Symbol | |
ID | 5104963 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | + |
Start bp | 140121 |
End bp | 141041 |
Gene Length | 921 bp |
Protein Length | 306 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 640506077 |
Product | NAD-dependent epimerase/dehydratase |
Protein accession | YP_001190275 |
Protein GI | 146302959 |
COG category | [G] Carbohydrate transport and metabolism [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0451] Nucleoside-diphosphate-sugar epimerases |
TIGRFAM ID | |
|
|
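The coordinate and length fields above agree with one another, and this can be checked with simple arithmetic. The following is a minimal sketch, not part of the original record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the unspliced CDS ends with a single stop codon. The function names `gene_length` and `protein_length` are hypothetical helpers introduced only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Msed_0174 record above.
length_bp = gene_length(140121, 141041)
print(length_bp)                  # 921
print(protein_length(length_bp))  # 306
```

Under those assumptions the results match the listed Gene Length (921 bp) and Protein Length (306 aa).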
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.10516 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.125028 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTCTCAG TTGTTACTGG AGGAGCAGGT TACATAGGAG GACATCTAGT GGATGCCTTA CTTCAACAGG GCAATCAAGT CCTAGTGCTG GACGATCTTT CAAGTGGTAA CTACATTAAT TCCATGGCTA AGTTCCAGAG GATAGATCTT CGGTCACAAT CCCCTAAGCT TGAGAGTTGC GACACAATGT ATCACCTAGC AGCTAACCCA GACGTGAGAA CGTCCATGGA GAACATTGAG GAACATTTCG AAAGGGATGT AAAGGCTACC CTTAATGCAC TAGAGTCGGC AAGGAAATCA GACTGCAAGT TCTTCATCTT TTTCTCGTCT TCTACAGTGT ACGGGGAGGC AAAGACCCCC ACCCCAGAGA CTGCGGAGAC AAATCCAATC TCCAACTACG GTCTCTTCAA GTTAATGGGG GAAGAAATGA CGAGGTTCTA CTCTCAGAAT TATGGGATAA CGGCCCTATC CCTAAGGCTA GCAAACATTA CGGGTGGTAG AGTTTCTCAT GGCGTAGTAA TAGATTTCAT TAAAAAGTTA ATGAAAGACC CCACAACCCT CGAAATTCTG GGAAATGGGA AACAGCGCAA GAGCTACCTT CATGTTAGCG ATCTTATTCA GGCGGTTCTT TTCCTTAAGG ATAGACATAG GCGAGGATAC GATTACTTCA ATGTGGGGAA TGAGGATTGG ATCACCGTAG ACGAGATAGC TAGTATCGTG GAAGAGGAAA TGGGACTAAG GCCTGTTCAT GTGTACCGGG ATGCAGATAA CGGAAGGGGC TGGAAGGGAG ACGTGAGATT AATGTTGCTT GACATCTCTA AGATAAAGTC CCTCGGTTGG GCACCGACTC TCTCCTCTAG GGAAGTGATT AGACGGGCCA CGAGGGAAGC ATTAAGGTTG TTAGGCTATG AGAAGGTTTA G
|
Protein sequence | MVSVVTGGAG YIGGHLVDAL LQQGNQVLVL DDLSSGNYIN SMAKFQRIDL RSQSPKLESC DTMYHLAANP DVRTSMENIE EHFERDVKAT LNALESARKS DCKFFIFFSS STVYGEAKTP TPETAETNPI SNYGLFKLMG EEMTRFYSQN YGITALSLRL ANITGGRVSH GVVIDFIKKL MKDPTTLEIL GNGKQRKSYL HVSDLIQAVL FLKDRHRRGY DYFNVGNEDW ITVDEIASIV EEEMGLRPVH VYRDADNGRG WKGDVRLMLL DISKIKSLGW APTLSSREVI RRATREALRL LGYEKV
|
| |
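The sequences above are displayed in space-separated blocks of ten characters, so the spacing has to be stripped before any downstream use. The sketch below shows one way to do that and to compute a GC percentage. The helper names `clean_sequence` and `gc_percent` are hypothetical. The example uses only the first two blocks of the gene sequence, so it does not attempt to reproduce the 46% GC content reported for the full gene.

```python
def clean_sequence(grouped: str) -> str:
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(grouped.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in an already cleaned DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in dna if base in "GC")
    return 100.0 * gc_count / len(dna)


# First two ten-base blocks of the gene sequence shown above.
fragment = clean_sequence("ATGGTCTCAG TTGTTACTGG")
print(fragment)              # ATGGTCTCAGTTGTTACTGG
print(len(fragment))         # 20
print(gc_percent(fragment))  # 45.0
```

Pasting the full Gene sequence field into `clean_sequence` should give a string whose length equals the Gene Length field, if the record is internally consistent.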