Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msed_1303 |
Symbol | |
ID | 5104554 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | + |
Start bp | 1282148 |
End bp | 1283329 |
Gene Length | 1182 bp |
Protein Length | 393 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 640507192 |
Product | gluconate dehydratase / galactonate dehydratase |
Protein accession | YP_001191385 |
Protein GI | 146304069 |
COG category | [M] Cell wall/membrane/envelope biogenesis [R] General function prediction only |
COG ID | [COG4948] L-alanine-DL-glutamate epimerase and related enzymes of enolase superfamily |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAATCC AAGAAATTAC GCCGTTTGTC CTCTCCTCCA AGGAGAGGGG ATCTGCAACC TGGGCCTCTA CCATGATCGT GGTGAAAGTG ACCACAAGTG ACGGAATGAT AGGTTATGGT GAGGCCGTGC CTACGTTGAG GGTTAAGCCC GTGTTTAGTG CAATCCAACA GGTTAGCAAG GGCTATCTGG GAAAGGAGGT CGAAAGAGTA GAAAGGAATT ACCACGAATG GTACAAGCAG GACTTCTATC TCAGCAGATC CTTCGAGTCC GCAACGGCAA CCAGCGCGAT AGATATTGCG CTCTGGGACC TTGTGGGAAA GGAGTTGGGA GCACCGATTC ACAGGCTACT GGGCGGAAAG TTTAGGGACC ACGTACCCGT TTACGCGAAC GGGTGGTACA AGGACTGTGT TACGCCAGAG GACTTCGCTA GGGAGGCAAA GAACGTGGTT AAGAGGGGGT ACAGGGCCAT GAAGTTTGAT CCCTTTGGAC CATACTACGA TTGGATTGAT GAACATGGAT TGAGACAGGC AGAGGAAAGG GTGAAGGCAG TCAGGGAAGC TGTTGGGGAA GAGGTCGAGA TCTTGATTGA ACATCACGGG AGGTTCAACG CAAACTCGGC TATTGAGATT GCGAAGAGGC TCGAGAAGTA CAGACCACTG TTCGCTGAGG AGCCAGTTCA CCACGAGGAT CTGGAGGGAT ATAGGAAATA CAAAAGACAC TCCAACCTAA GGGTGGCAAT GGGAGAGAGA CTGGTGAGCT TGAAAGAAAC TCTGGTTTAC TTGAGGGAGG GGCTAGTGGA CATCCTACAA CCTGACTTGA CCAACATCGG TGGAGTTACA GTTGCCAGGA AAGTAGCATC GCTAGCTGAG GCCTTCGACG TTGAAGTTGC CTTCCACAAT GCCTTCGGTT CAATACAGAA CGCGGTCTCA ATCCAGTTGG CGTCGGTTAT TCCTAACCTA CTCCTGCTGG AGAACTTCTA CGACTGGTTC CCACAATGGA AAAGGGATCT CGTGAATAAC GGAACTCCAG TTGAGATGGG AAGAGTGAAG GTACCGGATG GACCTGGCAT AGGAGTTGAG GTTAACGAGA GGATACTGGA GGAGTTGAAA ACCGATCCTG TTGCGCTTGA GGTTGTGGAG GAGCCTGTGT GGGTTGTGGG CGGAACGTGG AAAAACTATT AA
|
Protein sequence | MKIQEITPFV LSSKERGSAT WASTMIVVKV TTSDGMIGYG EAVPTLRVKP VFSAIQQVSK GYLGKEVERV ERNYHEWYKQ DFYLSRSFES ATATSAIDIA LWDLVGKELG APIHRLLGGK FRDHVPVYAN GWYKDCVTPE DFAREAKNVV KRGYRAMKFD PFGPYYDWID EHGLRQAEER VKAVREAVGE EVEILIEHHG RFNANSAIEI AKRLEKYRPL FAEEPVHHED LEGYRKYKRH SNLRVAMGER LVSLKETLVY LREGLVDILQ PDLTNIGGVT VARKVASLAE AFDVEVAFHN AFGSIQNAVS IQLASVIPNL LLLENFYDWF PQWKRDLVNN GTPVEMGRVK VPDGPGIGVE VNERILEELK TDPVALEVVE EPVWVVGGTW KNY
|
| |