Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msed_0434 |
Symbol | |
ID | 5105551 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | - |
Start bp | 384821 |
End bp | 385762 |
Gene Length | 942 bp |
Protein Length | 313 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 640506340 |
Product | NAD-dependent epimerase/dehydratase |
Protein accession | YP_001190535 |
Protein GI | 146303219 |
COG category | [G] Carbohydrate transport and metabolism [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0451] Nucleoside-diphosphate-sugar epimerases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGTATC TTTTAACGGG CTTAGGTTTC ATCTCGACCC ACGTAGCGTT GTATCTGAGT GAAGTGGGCG AGGACGTTAC GGTGACGTAC AAGTCCTTGA ATCCCGTAAA GGAAGAGTAC ATCTCTCTGC TCAAGGGGAA GGTCAAGGTG GTAAATGTTG ATCCACTCTC AGAGGAGGTG ACAAAGTTGA TTTCTCGACA TGACATCGTC GCCAACTTCA TAGGAGAAAT TTCGGGTGGT GAGGATAAAC TGAGATTGGC CAACGTAGAG ATTCCCAGCA GGTTAGCGAG GGATGCCTTT GATCAGAATA AAACATTCAT CCACTTGAGC GGGGCGACCT CCACAGGAGA GACTGGGGTA AACGTGAAGG AGGAGGCAGA GCACTGCAAG GATTCTAAGG CGTCAACGCC CTTTGAGAAG TCCAAGTGTG AAGGTGAGAA AGAGATCATG AGGTTAGCAC TAGAGAAAGA CGGCAATCTT GCCATATTAA GGCCCACCCT GGTGTACGGA AGATATGCAG CTCACGTTCA ATTCGTTACC ATGTATAAGC TGGTCAAGGC TAGAGTTGTT CCTGAGCTGG GTTTAAGGAT GGCCACTGTA AATGCATGGA CTCTTGGTAG AGCAGTTCAC ACGCTTGGAA AAGTTTCTCC CAAGAGGGTT TACCTTTACG CCAGTGAGTG CGGAAGCGTA GCAGTCTCTA GGTTCTTTGA ACTCATGAGT GAAGAGGTGG GTAAAGGTAT AAGGCTACCC ATTCCCACAT GGCTGGCCAA GGCTGTTTTA CCCAAGGATA TTAGAAACCT GCTAAGATAC TCTGGTACCA CATACGACTG CTCAGCATTC AAAGAGGTTG TAGGGGACAT GAAATTTGAC GAGGAAGAGG TTAGATCCAA CGCAAGGTTC TTAAAGTATC TTGAAGAAAA GGATAAACTA ATCCCCACTT GA
|
Protein sequence | MKYLLTGLGF ISTHVALYLS EVGEDVTVTY KSLNPVKEEY ISLLKGKVKV VNVDPLSEEV TKLISRHDIV ANFIGEISGG EDKLRLANVE IPSRLARDAF DQNKTFIHLS GATSTGETGV NVKEEAEHCK DSKASTPFEK SKCEGEKEIM RLALEKDGNL AILRPTLVYG RYAAHVQFVT MYKLVKARVV PELGLRMATV NAWTLGRAVH TLGKVSPKRV YLYASECGSV AVSRFFELMS EEVGKGIRLP IPTWLAKAVL PKDIRNLLRY SGTTYDCSAF KEVVGDMKFD EEEVRSNARF LKYLEEKDKL IPT
|
| |