Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msed_1765 |
Symbol | |
ID | 5104765 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | + |
Start bp | 1700007 |
End bp | 1701389 |
Gene Length | 1383 bp |
Protein Length | 460 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 640507660 |
Product | 2-isopropylmalate synthase |
Protein accession | YP_001191844 |
Protein GI | 146304528 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0119] Isopropylmalate/homocitrate/citramalate synthases |
TIGRFAM ID | [TIGR02146] homocitrate synthase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 32 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.877684 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAGTAG GTATACTAGA TTCGACTTTG AGGGAAGGCG AACAGACTCC TGGAGTTGTG TTCACCACTG AGCAGAGAGT CGAGATAGCC AAGGCCCTAT CCGATCTGGG AGTTTCCATG ATTGAGGCCG GTCACCCGGC AGTTTCACCG GACATTTATG AGGGAATAAA GAGGATCGTG AAGCTCAAGA GGGAGGGAGA GATCACCTCC GAGATCCTGG GTCACAGCAG GGCTGTGAAG AGGGACGTGG AGATTGCCAG CGAACTCGAG GTGGACAGGA TAGCCATCTT CTACGGGGTA AGCGACATTC ACCTCAAGGC CAAGACTAAG ACCACCAGGG AGGAGGCTCT CAACATCATT GCGGACGTGG TTCAGTACGC CAAGGCCCAC GGGGTCAAGG TCAGGTTCAC GGCAGAGGAC GCAACCAGGA CTGACCTGGA CTACCTGGTT AAGGTCGCCA GAACGGCTAG GGATGCAGGA GCTGACAGGA TAAGCATAGC TGATACCGTG GGGATCCTCT ACCCCGTGAA GACCAGGGAA CTCTTCTCCT ATCTAGTAAA GGAAGTCCCC GGGGTCGAGT TCGACATCCA CGCCCACAAC GACCTGGGTA TGGCAGTGGC CAACGCCCTG GCAGCAGTTG AGGGAGGCGC AACCATAATT CACGCTACCG TGAACGGCCT CGGTGAGAGA GTGGGAATTG TTCCCCTGCA GGCCGTGGCA GCAGCCCTCA AGTACCACTT TAACGTCGAC GTGGTTAAGC TTGACAGGCT CTCGAGTGTG GCCTCGCTCG TGGAAAAGTA TAGCGGGATC ACCATGCCCC CCAACTTCCC AATCACGGGA GATTATGCCT TCGTGCATAA GGCTGGAGTC CACGTGGCTG GGATACTCAA CGACCCAAGA ACTTACGAGT TCATGCCACC CGAGGTCTTT GGTAGATCCA GGGATTACGT CATAGACAAG TACACCGGTA AGCACGCGGT CAAGGATAGA TTTGAAAGAC TTGGGGTAAA GCTGGACGAC AGGGAACTGG AGCAGGTACT TGCGAGGATC AAGTCCAGTG AGGGAACCAG GTACTTCAGG GATGTGGACC TCCTGGAGAT AGCGGAGGAG GTCACGGGTA AGGTGCTCAA GCCGAGACCT CCAGAGAGGA TTGAGGCCGT GGTCTCGGTG AAGTGTGGCT CCAACGTTTA CACCACCTCC GTGACCAGGA GGCTGTCCAT AATCCCCGGG GTAAAGGAAG TCATGGAAAT TTCAGGGGAT TACGACATAC TCGTTAAGGT GGAGGCAAGG GACTCAGCGG AGCTTAACAA CATTGTGGAG AGCATCAGGT CAGTGAAGGG AGTCGAGTCA ACCCTGACCT CACTGGTTCT CAAGAAGATG TAA
|
Protein sequence | MKVGILDSTL REGEQTPGVV FTTEQRVEIA KALSDLGVSM IEAGHPAVSP DIYEGIKRIV KLKREGEITS EILGHSRAVK RDVEIASELE VDRIAIFYGV SDIHLKAKTK TTREEALNII ADVVQYAKAH GVKVRFTAED ATRTDLDYLV KVARTARDAG ADRISIADTV GILYPVKTRE LFSYLVKEVP GVEFDIHAHN DLGMAVANAL AAVEGGATII HATVNGLGER VGIVPLQAVA AALKYHFNVD VVKLDRLSSV ASLVEKYSGI TMPPNFPITG DYAFVHKAGV HVAGILNDPR TYEFMPPEVF GRSRDYVIDK YTGKHAVKDR FERLGVKLDD RELEQVLARI KSSEGTRYFR DVDLLEIAEE VTGKVLKPRP PERIEAVVSV KCGSNVYTTS VTRRLSIIPG VKEVMEISGD YDILVKVEAR DSAELNNIVE SIRSVKGVES TLTSLVLKKM
|
| |