Gene Msed_0088 details


Gene Information

Locus tag: Msed_0088
Symbol:
ID: 5104666
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Metallosphaera sedula DSM 5348
Kingdom: Archaea
Replicon accession: NC_009440
Strand:
Start bp: 78253
End bp: 79263
Gene Length: 1011 bp
Protein Length: 336 aa
Translation table: 11
GC content: 46%
IMG OID: 640505987
Product: 3-isopropylmalate dehydrogenase
Protein accession: YP_001190189
Protein GI: 146302873
COG category: [C] Energy production and conversion; [E] Amino acid transport and metabolism
COG ID: [COG0473] Isocitrate/isopropylmalate dehydrogenase
TIGRFAM ID: [TIGR02088] isopropylmalate/isohomocitrate dehydrogenases
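
The listed gene and protein lengths follow from the start and end coordinates by simple arithmetic. The sketch below reproduces that calculation. It assumes the coordinates are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. Both assumptions are consistent with the values in this record but are not stated by it. The function names are illustrative only.

```python
# Values copied from the Gene Information fields above.
START_BP = 78253
END_BP = 79263


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Codon count minus one terminal stop codon (assumes an unspliced CDS)."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1011 336
```

The result matches the Gene Length (1011 bp) and Protein Length (336 aa) fields.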


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.00287551
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00000627763
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGTTTAGGG TTTCGGTAAT TCCGGGCGAT GGAGTAGGAC CAGAAATATT TTTTGCAAGT 
AAGAAAATCT TAGCGAAACT TGTGGAAACG TACTCTCTCG GAATAGAGTT TATTGAGGTG
GAGGCTGGGG ATTCGGCTCA AGCCAAGTAC GGGGAGGCAT TGCCAAAGAA TACTCTCAAG
GTAATAGAAT CGTCTGACAT GATACTTAAG GGCCCAGTAG GCGAGTCAGC CATGGACGTA
GTGGTAAAGT TAAGGCAGAT GTATGATATG TACGCCAATC TGAGGCCTGC TAAATCACTT
CCAGGAGTTC CCAACAAATA CGGAAACGTG GATATCCTAA TAGTTAGGGA GAACACTGAG
GATCTCTATA AGGGGTTCGA GCATGAAATC TCAGAAGGAG TCGCTGTTGG ACTTAAGGTG
ATATCAGCTA TGGCTTCCAC CAGGATAGCC AACGTTGCGC TGGATTATGC AAAGAGGAGA
AGAAACAAGG TTACCTGCGT TCACAAGGCT AACGTGATGA GGATAACAGA TGGGTTGTTT
GCAAGGTCCT GTCGTTCCGT GCTAAAGGGG AAAGTAGAAT ATAACGAGAT GTACGTAGAT
GCGGCAGCGG CTAATCTAGT GAAGGACCCA AACATGTTTG ACGTCATTAT CACCACGAAC
ATGTACGGAG ATATACTGAG CGACGAGGCT TCACAAATAG CTGGAAGCTT AGGCTTAGCT
CCCTCGGCAA ACATTGGGGA AAGGAAGTCG CTATTTGAAC CCGTTCATGG AGCTGCCTTC
GACATAGCTG GGAAGGGAAT TGTGAACCCA ACAGCTTTTC TGCTCTCCGT GAGTATGATG
CTGGAACGCA TGTATCAACT AAGTAAGGAT CAGAGATATT TACAGGCCTC ACAATCACTT
ACTAACTCGA TTTACAAGGT TTATAGCGAG GGTAAAAATC TCACGCCCGA TGTGGGTGGA
AGCTCCAAGT TAAGTGACAT AATTGACGCG ATATATTCGA AGCTAACATA G
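
The gene sequence is printed in blocks of ten bases, so it needs cleaning before any computation such as checking the GC content field. The following is a minimal sketch of that step. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the example runs on the first sequence line only rather than the full gene.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (block separators, line breaks) and upper-case."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence, pasted verbatim with its spaces.
first_line = "ATGTTTAGGG TTTCGGTAAT TCCGGGCGAT GGAGTAGGAC CAGAAATATT TTTTGCAAGT"
seq = clean_sequence(first_line)
print(len(seq))                     # number of bases after cleaning
print(f"{gc_fraction(seq):.0%}")    # GC share of this fragment only
```

Pasting all sequence lines into `clean_sequence` as one string would give the value to compare against the GC content field.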
 
Protein sequence
MFRVSVIPGD GVGPEIFFAS KKILAKLVET YSLGIEFIEV EAGDSAQAKY GEALPKNTLK 
VIESSDMILK GPVGESAMDV VVKLRQMYDM YANLRPAKSL PGVPNKYGNV DILIVRENTE
DLYKGFEHEI SEGVAVGLKV ISAMASTRIA NVALDYAKRR RNKVTCVHKA NVMRITDGLF
ARSCRSVLKG KVEYNEMYVD AAAANLVKDP NMFDVIITTN MYGDILSDEA SQIAGSLGLA
PSANIGERKS LFEPVHGAAF DIAGKGIVNP TAFLLSVSMM LERMYQLSKD QRYLQASQSL
TNSIYKVYSE GKNLTPDVGG SSKLSDIIDA IYSKLT
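
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The sketch below does this with a hand-built codon table. Translation table 11 assigns the same amino acids to codons as the standard code and differs in its permitted start codons, which this sketch does not model. The `translate` helper is hypothetical and drops one trailing stop codon if present.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI codon order (TTT, TTC, TTA, TTG, TCT, ...); "*" is stop.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS, ignoring whitespace and dropping one trailing stop."""
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = [CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3)]
    if protein and protein[-1] == "*":
        protein.pop()
    return "".join(protein)


# First three blocks of the gene sequence in this record.
print(translate("ATGTTTAGGG TTTCGGTAAT TCCGGGCGAT"))  # MFRVSVIPGD
```

The output matches the first block of the protein sequence. Passing the full gene sequence should yield the full 336-residue protein if the record is internally consistent.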