Gene Msed_2095 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsed_2095 
Symbol 
ID5104389 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMetallosphaera sedula DSM 5348 
KingdomArchaea 
Replicon accessionNC_009440 
Strand
Start bp2016169 
End bp2017740 
Gene Length1572 bp 
Protein Length523 aa 
Translation table11 
GC content47% 
IMG OID640507985 
Productputative alpha-isopropylmalate/homocitrate synthase family transferase 
Protein accessionYP_001192159 
Protein GI146304843 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0119] Isopropylmalate/homocitrate/citramalate synthases 
TIGRFAM ID[TIGR00977] 2-isopropylmalate synthase/homocitrate synthase family protein 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.729554 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGCCAAGA AATCCATAGA AATACTTGAC ACGACCCTTA GGGATGGTTC TCAAGCAGCA 
ACTATTTCCT TCACCCTAAG GGATAAGATA AAGATAGCCC TTCTCCTAGA TGAGCTAGGA
GTGGACTATA TAGAAGGTGG ATGGCCTGGA TCTAACCCTA AGGATTACGA GTTCTTCAGG
GAGATTAAGA ACTACCACCT TAAGCATTCA AAGATTGCAG CTTTCGGGAG CACTAGAAGG
AAGGATTTAA GGCCAGAGAA CGACCCGAGC CTATCTTCAA TTCTAGAGGC AGACGTCAAG
GTTGCGGTGC TCTTCGGTAA AACATGGACG CTTCACGTCA AGGAAATATT GAGGACCACC
TTGGAAGATA ATCTCTCCAT AATAGCTGAT AGCATCCAAT TCCTGAGGGA TCATGGACTG
GAGGTTATAT TCGATGCTGA ACATTTCTAT CAAGGCTTCA AGGAGGATCC AGATTATGCA
ATAAGGGTAG TTAATACTGC AGGTGAGGCT GGGGCAAGAG TTGTCGCCCT AGCGGATACC
AACGGGGGCA CGCTACCCCA TGAGGTACTA GAAATAACCA GACGAGTGGC CGAAAACACT
AGGGTGAAGC TTGGGGTTCA CATGCACAAT GATTCGGGTA CAGCGGTGGC CAACTCATTG
ATGGGGGTAG TAGGAGGGGC TAGACACGTT CAGGGAACTA TTAACGGAAT TGGTGAGAGA
ACTGGAAATG CTGACCTAAT TCAGATAATA CCCAGCCTCG TTCTAAAAAT GGGGTATAGG
GTACTCAAGA ATGAGGATGG TCTTAAGAAG CTAAAACAGG TGTCCTCCAC ACTCTATGAA
TTGGCAGGAT TACATCCCAA CCCCTTCCAG CCCTATGTGG GAGATAACGC ATTTACCCAT
AAGGCAGGAG TTCACGCCGA TGCGGTAATG AAGAACACCA GAGCATATGA ACATATAGAC
CCGTCCCTAC TGGGCAATCA GAGAAAGGTA GTAATCTCAG AGCTTTCTGG GACGTCAAAC
CTAGTGAATT ACCTTGAGAA ACTGGGAATC AAGGTAGAAA AGAAGGAGGA GAAATTGAAG
AAGGCATTAA AGGCCATCAA GGAGATGGAG GCCCGCGGAT ACAGCTTTGA CCTGGCACCT
GAATCGGCTA TGTTAGTGGC CTTCAAGGAA ATGGGAATAT ATGAAAAGTT CTTCGACGTG
CAGTACTGGA AGGTTATAAA CGAGAACGGA TTAGCCCTTG CCGTAGTCAA GGTTAACTCT
AGAGTTGAAG CTGCTGAGGG TACGGGACCA GTTCACGCAG TGGACGTGGC GCTGAGACGG
TGCCTCTCAA AGGATTTTCC TGAAATAGAG AGAGTTAAAC TCACGGATTA CAGGGTCGTT
CTACCAGGCG AGGTAAAGAA CACGGAGAGC GTAGTACGTG TGACCATCGA GTTTAACGAC
GGAGAAAGGT CTTGGAGGAC TGAAGGTGTC TCAACTAGCG TTATAGAGGC ATCGATAATG
GCACTTGTGG ATGGGCTGGA CTATTATCTT CAACTTAATA AGAAATTAAA GCCTCTAGTT
GCAAAAGAGT AG
 
Protein sequence
MAKKSIEILD TTLRDGSQAA TISFTLRDKI KIALLLDELG VDYIEGGWPG SNPKDYEFFR 
EIKNYHLKHS KIAAFGSTRR KDLRPENDPS LSSILEADVK VAVLFGKTWT LHVKEILRTT
LEDNLSIIAD SIQFLRDHGL EVIFDAEHFY QGFKEDPDYA IRVVNTAGEA GARVVALADT
NGGTLPHEVL EITRRVAENT RVKLGVHMHN DSGTAVANSL MGVVGGARHV QGTINGIGER
TGNADLIQII PSLVLKMGYR VLKNEDGLKK LKQVSSTLYE LAGLHPNPFQ PYVGDNAFTH
KAGVHADAVM KNTRAYEHID PSLLGNQRKV VISELSGTSN LVNYLEKLGI KVEKKEEKLK
KALKAIKEME ARGYSFDLAP ESAMLVAFKE MGIYEKFFDV QYWKVINENG LALAVVKVNS
RVEAAEGTGP VHAVDVALRR CLSKDFPEIE RVKLTDYRVV LPGEVKNTES VVRVTIEFND
GERSWRTEGV STSVIEASIM ALVDGLDYYL QLNKKLKPLV AKE