Gene Msed_1037 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsed_1037 
Symbol 
ID5104337 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMetallosphaera sedula DSM 5348 
KingdomArchaea 
Replicon accessionNC_009440 
Strand
Start bp961839 
End bp963494 
Gene Length1656 bp 
Protein Length551 aa 
Translation table11 
GC content54% 
IMG OID640506933 
Productacetolactate synthase, large subunit 
Protein accessionYP_001191126 
Protein GI146303810 
COG category[E] Amino acid transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG0028] Thiamine pyrophosphate-requiring enzymes [acetolactate synthase, pyruvate dehydrogenase (cytochrome), glyoxylate carboligase, phosphonopyruvate decarboxylase] 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.0844069 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGGGTT CAACTCTTCT TCTAGAACTA CTGAAGGATT ACGACGTGGA TAGGGTTTTT 
GGACTTCCTG GAGAGACATC TATCCCATAC TACCCCGAAT TCGCAGAGCT TCAGGTGATA
ACTAGGGATG AGAGGAACGC CGTCTACATG GCTGACGCCT ATGCCAGGGT TAGTTTCAAG
CCGGGAGTGG TTGAGGGACC GAGCGTTGGC TCGCCCTACA TGTTACCAGG TGTGATAGAG
GCATACAAGT CCTCCTCTCC CGTGATAGTC ATCACCACGG ATACTGACCT CTACGGAGAG
AGGATGAACA TGTTGACTTC CCTGGATCAG ACAGCCCTCT TCAAACCCTA CACCAAGGAG
TCCATCACCG TGACGAAGGC AGACGACCTG TCCCACGCCG TGAGGAGGGC CTTCAGGTTA
GCAACCGGAG GGAGACCTGG ACCAGTTCAC CTAAGGATAC CCCATCATGT ACTCGAGGAG
GAGGGATCCA TCTACCTCCC ACCGCAGAGG GAGTTCTCCA GGTATCCAGC TCAAAGGCCC
GTCGCAGACC GGGATGCGGT GAGGCTCGCG GTCTCAGCCC TCCTGGACAG TTCCAACCCG
GTCATTATCT GCGGTCAAGG AGCACTGTAC TCCAGGGCGT GGGATGAGGT CGTGGAGTTG
GCTGAGCTCA TGGGAATTCC AGTGGGTACC ACCATCACGG GGAAGGGATG CATCTCGGAG
CTTCACCCCC TCTCCATAGG GGTAGTGGGA GGAAGAGGAG GGACTAGCTT TTCAAATTCC
TTTCTGGAGG AGGCCGATCT AATCTTCCTT GTGGGATCAA ACACGGACTC AGCCAACACA
GATAGGTGGA GATATCCTCC CAGGACGAAG ACCGTGATCC ATCTAGATGT GAGTGAGGCT
GAAGTGGGGA ACAACTACAA CTCCATAAAC CTGATAGGGG ACGCTAAGGC AACGCTTAGG
GAGATAATCA GGGAGGTAAG ATCCCGGGGA GTGAAGAGAA GGGAAGTGAA AGTGAATAGG
GACGAGTTTG AGGCCAGGGT GAGGGAGATT GCCTCCATGT CCGGGGAAAG GGTCAACCCG
GTCAGGTTTG TGAAGGAGTT GGAGAGGAGG GTTAGGGATC AGGTAATAGT AGCAGACCCT
GGGGTAGGTG CAATTTACGT CTCCGCCCTT TTCAGAACTG GGAAGGCTGG AAGAAACTTC
GTGTTCAACT ACGGCCTTGG GGGACTGGGT TACGCGATAC CTGCCTCAGT TGGGGCCCAA
CTGGGATCTG GTAGACAGGT TCTTGCCATG ACCGGGGATG GTAGCTTTGG GTTCTCTGCG
GGAGAGCTGG AGACAATTGC TAGGTTGAAA AGTGACGTGG TCTTGTTTGT GTTCAATAAC
TCCAGTTTCG GCTGGATAAG GGCAGAAATG AGGATTCAGG GTAGGGATGT GAGGGGGACC
GACTTCTCGT CGCTGGATTA CGTTAAGATA GCTGAGGGAT TCGGGCTCAG GGGTTACAGG
ATCTCCACGG ACCAAGAGAT AGGCGATGTG TTGGACGAGG CCATGGAGAG CACTCCCAGT
CTGGTTGAGG TAGTCGTGGA TCCTGAGGAT AAGTTCTACC CACCTGTGGC ACACTGGGCT
AGGGCACTAC TCCACGACGT GAAACATGTA TATTGA
 
Protein sequence
MKGSTLLLEL LKDYDVDRVF GLPGETSIPY YPEFAELQVI TRDERNAVYM ADAYARVSFK 
PGVVEGPSVG SPYMLPGVIE AYKSSSPVIV ITTDTDLYGE RMNMLTSLDQ TALFKPYTKE
SITVTKADDL SHAVRRAFRL ATGGRPGPVH LRIPHHVLEE EGSIYLPPQR EFSRYPAQRP
VADRDAVRLA VSALLDSSNP VIICGQGALY SRAWDEVVEL AELMGIPVGT TITGKGCISE
LHPLSIGVVG GRGGTSFSNS FLEEADLIFL VGSNTDSANT DRWRYPPRTK TVIHLDVSEA
EVGNNYNSIN LIGDAKATLR EIIREVRSRG VKRREVKVNR DEFEARVREI ASMSGERVNP
VRFVKELERR VRDQVIVADP GVGAIYVSAL FRTGKAGRNF VFNYGLGGLG YAIPASVGAQ
LGSGRQVLAM TGDGSFGFSA GELETIARLK SDVVLFVFNN SSFGWIRAEM RIQGRDVRGT
DFSSLDYVKI AEGFGLRGYR ISTDQEIGDV LDEAMESTPS LVEVVVDPED KFYPPVAHWA
RALLHDVKHV Y