Gene Msed_1765 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsed_1765 
Symbol 
ID5104765 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMetallosphaera sedula DSM 5348 
KingdomArchaea 
Replicon accessionNC_009440 
Strand
Start bp1700007 
End bp1701389 
Gene Length1383 bp 
Protein Length460 aa 
Translation table11 
GC content55% 
IMG OID640507660 
Product2-isopropylmalate synthase 
Protein accessionYP_001191844 
Protein GI146304528 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0119] Isopropylmalate/homocitrate/citramalate synthases 
TIGRFAM ID[TIGR02146] homocitrate synthase 


Plasmid Coverage information

Num covering plasmid clones32 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.877684 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAGTAG GTATACTAGA TTCGACTTTG AGGGAAGGCG AACAGACTCC TGGAGTTGTG 
TTCACCACTG AGCAGAGAGT CGAGATAGCC AAGGCCCTAT CCGATCTGGG AGTTTCCATG
ATTGAGGCCG GTCACCCGGC AGTTTCACCG GACATTTATG AGGGAATAAA GAGGATCGTG
AAGCTCAAGA GGGAGGGAGA GATCACCTCC GAGATCCTGG GTCACAGCAG GGCTGTGAAG
AGGGACGTGG AGATTGCCAG CGAACTCGAG GTGGACAGGA TAGCCATCTT CTACGGGGTA
AGCGACATTC ACCTCAAGGC CAAGACTAAG ACCACCAGGG AGGAGGCTCT CAACATCATT
GCGGACGTGG TTCAGTACGC CAAGGCCCAC GGGGTCAAGG TCAGGTTCAC GGCAGAGGAC
GCAACCAGGA CTGACCTGGA CTACCTGGTT AAGGTCGCCA GAACGGCTAG GGATGCAGGA
GCTGACAGGA TAAGCATAGC TGATACCGTG GGGATCCTCT ACCCCGTGAA GACCAGGGAA
CTCTTCTCCT ATCTAGTAAA GGAAGTCCCC GGGGTCGAGT TCGACATCCA CGCCCACAAC
GACCTGGGTA TGGCAGTGGC CAACGCCCTG GCAGCAGTTG AGGGAGGCGC AACCATAATT
CACGCTACCG TGAACGGCCT CGGTGAGAGA GTGGGAATTG TTCCCCTGCA GGCCGTGGCA
GCAGCCCTCA AGTACCACTT TAACGTCGAC GTGGTTAAGC TTGACAGGCT CTCGAGTGTG
GCCTCGCTCG TGGAAAAGTA TAGCGGGATC ACCATGCCCC CCAACTTCCC AATCACGGGA
GATTATGCCT TCGTGCATAA GGCTGGAGTC CACGTGGCTG GGATACTCAA CGACCCAAGA
ACTTACGAGT TCATGCCACC CGAGGTCTTT GGTAGATCCA GGGATTACGT CATAGACAAG
TACACCGGTA AGCACGCGGT CAAGGATAGA TTTGAAAGAC TTGGGGTAAA GCTGGACGAC
AGGGAACTGG AGCAGGTACT TGCGAGGATC AAGTCCAGTG AGGGAACCAG GTACTTCAGG
GATGTGGACC TCCTGGAGAT AGCGGAGGAG GTCACGGGTA AGGTGCTCAA GCCGAGACCT
CCAGAGAGGA TTGAGGCCGT GGTCTCGGTG AAGTGTGGCT CCAACGTTTA CACCACCTCC
GTGACCAGGA GGCTGTCCAT AATCCCCGGG GTAAAGGAAG TCATGGAAAT TTCAGGGGAT
TACGACATAC TCGTTAAGGT GGAGGCAAGG GACTCAGCGG AGCTTAACAA CATTGTGGAG
AGCATCAGGT CAGTGAAGGG AGTCGAGTCA ACCCTGACCT CACTGGTTCT CAAGAAGATG
TAA
 
Protein sequence
MKVGILDSTL REGEQTPGVV FTTEQRVEIA KALSDLGVSM IEAGHPAVSP DIYEGIKRIV 
KLKREGEITS EILGHSRAVK RDVEIASELE VDRIAIFYGV SDIHLKAKTK TTREEALNII
ADVVQYAKAH GVKVRFTAED ATRTDLDYLV KVARTARDAG ADRISIADTV GILYPVKTRE
LFSYLVKEVP GVEFDIHAHN DLGMAVANAL AAVEGGATII HATVNGLGER VGIVPLQAVA
AALKYHFNVD VVKLDRLSSV ASLVEKYSGI TMPPNFPITG DYAFVHKAGV HVAGILNDPR
TYEFMPPEVF GRSRDYVIDK YTGKHAVKDR FERLGVKLDD RELEQVLARI KSSEGTRYFR
DVDLLEIAEE VTGKVLKPRP PERIEAVVSV KCGSNVYTTS VTRRLSIIPG VKEVMEISGD
YDILVKVEAR DSAELNNIVE SIRSVKGVES TLTSLVLKKM