Gene Msed_0830 details

Gene Information

Locus tag: Msed_0830
Symbol:
ID: 5105191
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Metallosphaera sedula DSM 5348
Kingdom: Archaea
Replicon accession: NC_009440
Strand:
Start bp: 759193
End bp: 760440
Gene Length: 1248 bp
Protein Length: 415 aa
Translation table: 11
GC content: 51%
IMG OID: 640506735
Product: 3-isopropylmalate dehydratase large subunit
Protein accession: YP_001190929
Protein GI: 146303613
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0065] 3-isopropylmalate dehydratase large subunit
TIGRFAM ID: [TIGR01343] homoaconitate hydratase family protein
            [TIGR02086] 3-isopropylmalate dehydratase, large subunit
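
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes that the start and end positions are 1-based and inclusive and that the gene length includes the terminal stop codon. Both assumptions agree with the values listed here.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids, assuming the final codon of the CDS is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(759193, 760440)
print(length_bp)                  # 1248, matching "Gene Length"
print(protein_length(length_bp))  # 415, matching "Protein Length"
```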


Plasmid Coverage information

Num covering plasmid clones: 37
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 34
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACAGGTA CATTAACTGA AAAAATACTT TCAAGGGCGT CAGGAAAAAC TGTTTCGCCC 
GGTGACGTCA TAGAGGCAAA GACTGACATA GTGGCCTTCC ACGACCTAAC GGGATATCAC
GTAATTGAGG TAATGGAGAA GGCTAACATG ATGAAGATCT TCGATAAGAC AAAAATAGTT
GTAGCCTTCG ACCACTTGGC ACCGCCACCT GACGTCAGAA GCGCAGAGAT CCAAGGTAAC
ATAAGGAAGT TCGTGAAGGA GATGAGACTA CCTAACTTTC ATGATATTAA CGTGGGCATT
CTTCACGAGC TTCTCATAGA ACAATACGCC CTACCTGGTC AGGTGATTGT GGCTGCCGAC
AGTCACACGA CAACCTCTGG TGCCGTGGGA GCGTTTGCCC AGGGAATGGG AGCAAGCGAC
GTTGCTGCCG CCGTGATCAC GGGTAAAACT TGGCTAGTGG TTCCTCAGCC CTTCAAGGTA
ACCCTCAAGG GAAACCCCGG TAAGTGGATA AATGGAAAGG ATGTAGCCCT AGAGTTGCTG
GGTAAGTTCA AGGCTGATTA CTTTAACGGA ATGTCCATAG AGGTTCACGT CGAGAACCCC
AAGGCTTTCC CCATGGACTA TAGGGCGACG GTCTCCAACA TGGGGATAGA GATGAACGCT
GATGCCCTCA TGTTTGTCCC TGACGTCGAG ACCAAGGATT ACATAAAGAC CATGAGGGGG
AAGGAAGTTG AGCTCGTGAC CCCAGATCCT GGGGCAAAGT ATCTAGATGA GCACACAATT
GAGCTAGACA AACTGGAACC GCTTGTGGCT GCGCCCTACA GCGTAGACAA CGTTAAGACC
GCAAGGGAGG AGTCCAAGGT CCCAGTGGAT CAGGTCTACA TCGGTTCCTG TACCAACGGT
AGGCTATCAG ACTTCAGGAT TGCGTCGGAG ATCCTCAAGG GGAAGAAGGT CAAGACCAGG
TGTATAGCCA TTCCCTCTTC CTACACGATG TTTAAGCAGG CCATGGAAAT GGGTTACATC
GAAGACCTAG TTAATGCTGG ATGTGTGGTG ACCTACGGTA CCTGCGGGCC ATGTCTAGGC
GGTCACTTCG GAGTCGCTGG TCCAGGGGAG GTTATAGTTT CCACGAGCTC CAGGAACTTC
AGGGGTAGGA TGGGGAGCAA CGAGGCTAAG GTCTACCTGT CCGGGCCTTC GGTTGCGGCT
GCCTCAGCAG CTACAGGGTA CATAACTGAT CCGAGGGATG TGCAATGA
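
A GC content figure such as the one reported for this gene is conventionally the share of G and C bases among all bases of the sequence. The following is a minimal sketch under that assumption; `gc_percent` is a hypothetical helper, and how the source database rounds the value or treats ambiguous bases is not stated in this record.

```python
def gc_percent(seq: str) -> float:
    """Return the percentage of G and C among the A/C/G/T characters of seq."""
    # Spaces, line breaks and any non-ACGT symbols are ignored (an assumption of this sketch).
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no A/C/G/T bases found")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


# First 10-base block of the gene sequence above: 4 of its 10 bases are G or C.
print(gc_percent("ATGACAGGTA"))  # 40.0
```

Passing the full gene sequence, with or without the block spacing, gives the gene-wide value.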
 
Protein sequence
MTGTLTEKIL SRASGKTVSP GDVIEAKTDI VAFHDLTGYH VIEVMEKANM MKIFDKTKIV 
VAFDHLAPPP DVRSAEIQGN IRKFVKEMRL PNFHDINVGI LHELLIEQYA LPGQVIVAAD
SHTTTSGAVG AFAQGMGASD VAAAVITGKT WLVVPQPFKV TLKGNPGKWI NGKDVALELL
GKFKADYFNG MSIEVHVENP KAFPMDYRAT VSNMGIEMNA DALMFVPDVE TKDYIKTMRG
KEVELVTPDP GAKYLDEHTI ELDKLEPLVA APYSVDNVKT AREESKVPVD QVYIGSCTNG
RLSDFRIASE ILKGKKVKTR CIAIPSSYTM FKQAMEMGYI EDLVNAGCVV TYGTCGPCLG
GHFGVAGPGE VIVSTSSRNF RGRMGSNEAK VYLSGPSVAA ASAATGYITD PRDVQ
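
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below is an illustration, not the database's own procedure: `translate` is a hypothetical helper, and it relies on the fact that NCBI translation table 11 assigns the same amino acids to codons as the standard code (the tables differ in permitted start codons, which does not matter here because this gene starts with ATG).

```python
from itertools import product

# Codon table in the usual NCBI layout: bases ordered T, C, A, G,
# with the first codon position varying slowest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    dna = "".join(seq.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence give the first 10 residues of the protein.
print(translate("ATGACAGGTA CATTAACTGA AAAAATACTT"))  # MTGTLTEKIL
# Last 18 bases end in the stop codon TGA and give the final 5 residues.
print(translate("CCGAGGGATG TGCAATGA"))               # PRDVQ
```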