Gene Information |
Locus tag | Msed_0711 |
Symbol | |
ID | 5103749 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | - |
Start bp | 649608 |
End bp | 651281 |
Gene Length | 1674 bp |
Protein Length | 557 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 640506615 |
Product | dihydroxy-acid dehydratase |
Protein accession | YP_001190810 |
Protein GI | 146303494 |
COG category | [E] Amino acid transport and metabolism; [G] Carbohydrate transport and metabolism |
COG ID | [COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase |
TIGRFAM ID | [TIGR00110] dihydroxy-acid dehydratase |
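The coordinate and length fields in the table above can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the original record: the variable names are illustrative, and it assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the stop codon while the protein length does not.

```python
# Values copied from the Gene Information table.
start_bp = 649608
end_bp = 651281
protein_length_aa = 557

# Assumption: coordinates are 1-based and inclusive on the replicon.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted in the protein.
codons = gene_length_bp // 3
expected_protein_length = codons - 1

print(gene_length_bp)           # 1674
print(expected_protein_length)  # 557
```

Under those assumptions the computed values agree with the Gene Length (1674 bp) and Protein Length (557 aa) rows.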
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTACGACA AATCAAGATC TAACAAGGTT TACGGTGGTT ACGAAAAGGC ACCCAATAGG GCCTTCCTTA AGGCAATGGG CCTAACGGAC GATGATATTT CTAAACCGCT GGTGGGAGTT GCAGTGGCCT GGAATGAGGC CGGCCCTTGT AATATACATC TCCTAGGCCT GTCTCAGGTA GTGAAGGAGG GCATAAGGGA ACTTGGCGGT ACCCCCAGGA CTTTCACGGC CCCTGTCCTA ATAGATGGAA TAGCCATGGG AAGCGAGAGC ATGAAGTACT CTCTGGTGAG CAGGGAAGTG ATTGCGAACA CTGTGGAGTT AACTGTGAAT GGGCACGGCT ACGACGGGTT CGTGGCACTG GGCGGATGTG ACAAGACCCA ACCAGGCCTC ATGATGTCAA TGGCCAGACT GAATATACCC TCGGTTTACA TGTATGGAGG AACTACCTTG CCAGGGAATT TCAGGGGTAG GGATATAGCG ATTGGAGACG TGTATGAGGC AGTGGGAGCT TTCTCTGCTG GGAAGATAAC CGCGGAAGAT CTTAGGATCA TGGAAGACAA CGCTATTCCC GGGCCTGGAG CCTGTGGAGG GTTATACACA GCTAACACAA TGGCTATGCT ATCTGAGGCC CTCGGACTTT CACTTCCCGG AAGCTCAGCC CCTCCAGCAG TAAGCTCCGA TAGAACCAAA TTCGCCAAGG AGACAGGCAG AACGTTGATG AAGGTTATGG AGATTGGTCT CAAGCCTAGG GACATCCTAA CCTTTGAGGC CTTTGAGAAC GGGATTGCCC TACTCATGGC CAGTGGAGGT TCCACAAACG GAGTTCTCCA CCTTTTGGCC ATTGCCCATG AGGCAGGCGT GTCCCTAACC CTGGACGACT TTGATAGAAT AAGCAAGAAG GTTCCAGAGA TAGTTAACAT GAAGCCTGGA GGGGACTACG TTATGGCTGA CCTCTACAGG GTTGGAGGAA CTCCCGTTAT CCTGAAGAAG CTATTGGATC GCGGACTACT TCACGGTGAC ACTATCACGG TAACTGGAAA GACTATGGCC CAAAACTTGT CCGAGTACAA GATACCTGAG TTTAAACACG ACCATATAGT CAGAGACCTC TCCAATCCCT TCCTTCCTTC AGGCGGAATA AGGATTCTGA AGGGTAGTTT AGCACCAGAA GGTTCTGTGG TGAAACTGTC CGCTTCAAAG ATCAAGTACC ATAGGGGACC GGCCAGGGTG TTCAACTCAG AGGAGGAGGC ATTTGAGACA GTTCTGAAGA AGAAGATAAA CGAGGGAGAT GTCGTGGTAA TAAGGTATGA GGGTCCAAAG GGAGGTCCAG GTATGAGGGA AATGCTTGCA GTCACTAGCG CAATAGTGGG ACAGGGACTA GGAGAGAAGG TTGCCCTGGT CACTGACGGT AGGTTCTCGG GAGCAACCAG GGGTCTCATG GTAGGTCACG TAGCCCCTGA GGCGGCGGTC GGCGGTCCCA TAGCGCTTAT CAGGGATGGC GACACCATTG TGATAGATGG CGAGAAGGGT AGACTTGATG TGGAACTCTC AGACCAGGAA CTTAAGAGTA GGGCCAAGGA TTGGACACCC CCAGAACCTA GGTACAAGAC CGGTCTCTTG GCGCAGTACG CCAAATTAGT TACCTCATCG GCGAGGGGAG CCGTTCTAGT TTAA
|
Protein sequence | MYDKSRSNKV YGGYEKAPNR AFLKAMGLTD DDISKPLVGV AVAWNEAGPC NIHLLGLSQV VKEGIRELGG TPRTFTAPVL IDGIAMGSES MKYSLVSREV IANTVELTVN GHGYDGFVAL GGCDKTQPGL MMSMARLNIP SVYMYGGTTL PGNFRGRDIA IGDVYEAVGA FSAGKITAED LRIMEDNAIP GPGACGGLYT ANTMAMLSEA LGLSLPGSSA PPAVSSDRTK FAKETGRTLM KVMEIGLKPR DILTFEAFEN GIALLMASGG STNGVLHLLA IAHEAGVSLT LDDFDRISKK VPEIVNMKPG GDYVMADLYR VGGTPVILKK LLDRGLLHGD TITVTGKTMA QNLSEYKIPE FKHDHIVRDL SNPFLPSGGI RILKGSLAPE GSVVKLSASK IKYHRGPARV FNSEEEAFET VLKKKINEGD VVVIRYEGPK GGPGMREMLA VTSAIVGQGL GEKVALVTDG RFSGATRGLM VGHVAPEAAV GGPIALIRDG DTIVIDGEKG RLDVELSDQE LKSRAKDWTP PEPRYKTGLL AQYAKLVTSS ARGAVLV
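The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. This is a hedged example rather than the database's own method: `CODON_TABLE` and `translate` are hypothetical names, the gene sequence appears to be listed in coding orientation (it begins with ATG and ends with TAA), and the sketch relies on translation table 11 assigning the same amino acids to codons as the standard code. It does not model the alternative start codons that table 11 permits, which does not matter here because this gene starts with ATG.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in conventional TCAG codon order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, as grouped in the record.
print(translate("ATGTACGACA AATCAAGATC TAACAAGGTT"))  # MYDKSRSNKV
```

The ten residues produced match the first block of the protein sequence listed above (MYDKSRSNKV).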
|
| |