Gene Msed_0711 details


Gene Information

Locus tag: Msed_0711
Symbol:
ID: 5103749
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Metallosphaera sedula DSM 5348
Kingdom: Archaea
Replicon accession: NC_009440
Strand:
Start bp: 649608
End bp: 651281
Gene Length: 1674 bp
Protein Length: 557 aa
Translation table: 11
GC content: 52%
IMG OID: 640506615
Product: dihydroxy-acid dehydratase
Protein accession: YP_001190810
Protein GI: 146303494
COG category: [E] Amino acid transport and metabolism; [G] Carbohydrate transport and metabolism
COG ID: [COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase
TIGRFAM ID: [TIGR00110] dihydroxy-acid dehydratase
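
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends with a stop codon that is not counted in the protein length. The constant and function names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 649608
END_BP = 651281
GENE_LENGTH_BP = 1674
PROTEIN_LENGTH_AA = 557


def span_length(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumption)."""
    return end - start + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon (assumption)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP))   # 1674, matches Gene Length
    print(protein_length(GENE_LENGTH_BP))  # 557, matches Protein Length
```

Under these assumptions both derived values agree with the reported Gene Length and Protein Length.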


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTACGACA AATCAAGATC TAACAAGGTT TACGGTGGTT ACGAAAAGGC ACCCAATAGG 
GCCTTCCTTA AGGCAATGGG CCTAACGGAC GATGATATTT CTAAACCGCT GGTGGGAGTT
GCAGTGGCCT GGAATGAGGC CGGCCCTTGT AATATACATC TCCTAGGCCT GTCTCAGGTA
GTGAAGGAGG GCATAAGGGA ACTTGGCGGT ACCCCCAGGA CTTTCACGGC CCCTGTCCTA
ATAGATGGAA TAGCCATGGG AAGCGAGAGC ATGAAGTACT CTCTGGTGAG CAGGGAAGTG
ATTGCGAACA CTGTGGAGTT AACTGTGAAT GGGCACGGCT ACGACGGGTT CGTGGCACTG
GGCGGATGTG ACAAGACCCA ACCAGGCCTC ATGATGTCAA TGGCCAGACT GAATATACCC
TCGGTTTACA TGTATGGAGG AACTACCTTG CCAGGGAATT TCAGGGGTAG GGATATAGCG
ATTGGAGACG TGTATGAGGC AGTGGGAGCT TTCTCTGCTG GGAAGATAAC CGCGGAAGAT
CTTAGGATCA TGGAAGACAA CGCTATTCCC GGGCCTGGAG CCTGTGGAGG GTTATACACA
GCTAACACAA TGGCTATGCT ATCTGAGGCC CTCGGACTTT CACTTCCCGG AAGCTCAGCC
CCTCCAGCAG TAAGCTCCGA TAGAACCAAA TTCGCCAAGG AGACAGGCAG AACGTTGATG
AAGGTTATGG AGATTGGTCT CAAGCCTAGG GACATCCTAA CCTTTGAGGC CTTTGAGAAC
GGGATTGCCC TACTCATGGC CAGTGGAGGT TCCACAAACG GAGTTCTCCA CCTTTTGGCC
ATTGCCCATG AGGCAGGCGT GTCCCTAACC CTGGACGACT TTGATAGAAT AAGCAAGAAG
GTTCCAGAGA TAGTTAACAT GAAGCCTGGA GGGGACTACG TTATGGCTGA CCTCTACAGG
GTTGGAGGAA CTCCCGTTAT CCTGAAGAAG CTATTGGATC GCGGACTACT TCACGGTGAC
ACTATCACGG TAACTGGAAA GACTATGGCC CAAAACTTGT CCGAGTACAA GATACCTGAG
TTTAAACACG ACCATATAGT CAGAGACCTC TCCAATCCCT TCCTTCCTTC AGGCGGAATA
AGGATTCTGA AGGGTAGTTT AGCACCAGAA GGTTCTGTGG TGAAACTGTC CGCTTCAAAG
ATCAAGTACC ATAGGGGACC GGCCAGGGTG TTCAACTCAG AGGAGGAGGC ATTTGAGACA
GTTCTGAAGA AGAAGATAAA CGAGGGAGAT GTCGTGGTAA TAAGGTATGA GGGTCCAAAG
GGAGGTCCAG GTATGAGGGA AATGCTTGCA GTCACTAGCG CAATAGTGGG ACAGGGACTA
GGAGAGAAGG TTGCCCTGGT CACTGACGGT AGGTTCTCGG GAGCAACCAG GGGTCTCATG
GTAGGTCACG TAGCCCCTGA GGCGGCGGTC GGCGGTCCCA TAGCGCTTAT CAGGGATGGC
GACACCATTG TGATAGATGG CGAGAAGGGT AGACTTGATG TGGAACTCTC AGACCAGGAA
CTTAAGAGTA GGGCCAAGGA TTGGACACCC CCAGAACCTA GGTACAAGAC CGGTCTCTTG
GCGCAGTACG CCAAATTAGT TACCTCATCG GCGAGGGGAG CCGTTCTAGT TTAA
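
The gene sequence is laid out in blocks of 10 bases, 60 per line, so the spacing has to be stripped before any analysis. The sketch below shows one way to do that and to compute a GC percentage. The helper names are hypothetical, and the exact formula and rounding behind the record's reported GC content are not stated here, so treat `gc_percent` as an assumption about how such a figure is typically derived.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence, copied from the record.
FIRST_LINE = "ATGTACGACA AATCAAGATC TAACAAGGTT TACGGTGGTT ACGAAAAGGC ACCCAATAGG"

if __name__ == "__main__":
    seq = clean_sequence(FIRST_LINE)
    print(len(seq))  # 60 bases on a full line
    print(round(gc_percent(seq), 1))
```

To apply this to the whole gene, pass all sequence lines joined together to `clean_sequence` and check that the result is 1674 bases long before computing the percentage.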
 
Protein sequence
MYDKSRSNKV YGGYEKAPNR AFLKAMGLTD DDISKPLVGV AVAWNEAGPC NIHLLGLSQV 
VKEGIRELGG TPRTFTAPVL IDGIAMGSES MKYSLVSREV IANTVELTVN GHGYDGFVAL
GGCDKTQPGL MMSMARLNIP SVYMYGGTTL PGNFRGRDIA IGDVYEAVGA FSAGKITAED
LRIMEDNAIP GPGACGGLYT ANTMAMLSEA LGLSLPGSSA PPAVSSDRTK FAKETGRTLM
KVMEIGLKPR DILTFEAFEN GIALLMASGG STNGVLHLLA IAHEAGVSLT LDDFDRISKK
VPEIVNMKPG GDYVMADLYR VGGTPVILKK LLDRGLLHGD TITVTGKTMA QNLSEYKIPE
FKHDHIVRDL SNPFLPSGGI RILKGSLAPE GSVVKLSASK IKYHRGPARV FNSEEEAFET
VLKKKINEGD VVVIRYEGPK GGPGMREMLA VTSAIVGQGL GEKVALVTDG RFSGATRGLM
VGHVAPEAAV GGPIALIRDG DTIVIDGEKG RLDVELSDQE LKSRAKDWTP PEPRYKTGLL
AQYAKLVTSS ARGAVLV
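
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The following is a minimal pure-Python sketch. It assumes the sequence is on the coding strand and in frame, and it uses the standard amino acid assignments, which translation table 11 shares with the standard code (the two differ only in permitted start codons). The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Amino acids for all 64 codons, with bases ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    # Last line of the gene sequence; it starts in frame at base 1621.
    last_line = "GCGCAGTACG CCAAATTAGT TACCTCATCG GCGAGGGGAG CCGTTCTAGT TTAA"
    print(translate(last_line))  # AQYAKLVTSSARGAVLV
```

The output for the last gene line matches the last line of the protein sequence, and translating the first 30 bases gives MYDKSRSNKV, the first block of the protein.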