Gene Msed_0201 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsed_0201 
Symbol 
ID5103945 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMetallosphaera sedula DSM 5348 
KingdomArchaea 
Replicon accessionNC_009440 
Strand
Start bp164663 
End bp166321 
Gene Length1659 bp 
Protein Length552 aa 
Translation table11 
GC content50% 
IMG OID640506106 
Productthiamine pyrophosphate enzyme, central region 
Protein accessionYP_001190302 
Protein GI146302986 
COG category[E] Amino acid transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG0028] Thiamine pyrophosphate-requiring enzymes [acetolactate synthase, pyruvate dehydrogenase (cytochrome), glyoxylate carboligase, phosphonopyruvate decarboxylase] 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.582625 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0361256 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGACCAGTT TCTGGTATTA TAGCGATCCA ACTTTGAGCC AGCCTAAGAG GAAAGAGGAG 
ACCGTAGGTA GGGAGATGAC GGGAGATGAG GCATTAGCTT ACGTCCTTAA GGAAATAGGA
GTTAAGCGAG TGTTCACCTC GAACGCAGTC CCAGATTTCC TTAGGGAGAG ATTAGCCCAG
TATGGCCTTG AAATAGATAT TTCCTTGAGT GTAAGGGAGG CTCTGGAACT GGCAGACGCT
TTCGCCAGGG ATTCAGGAGA TGCTGGAGTT GTAATCAGCA CTCCAGGAAG CTCATTACTC
GAGGGTAGCA GTGTAGTGGC TCAGGCATTC TCCGATTCCG TTCCGCTACT CATGATCGGT
ACATTGAGGT CCTATAGGGA TGTGGGAAAG GCTAGAGTTG GCGAACTGAA GTCACCTGAC
GACGTATCAA GTTCCCTCTC ACCCTTCATC AAGTTCAAGG AGAGGGTAAT CAGCATAGAG
GAGATTACAG TCACTGTGGA GAAGGGGTAC AAGGAAGCTC TGAGCAATAG AATGAGGCCC
GCTCTTGTGG AGATAGCAGA GGAGCTTTTC CGGTTAAAGG CATATCCACT CTCTACCGCA
GAGCAGAAGC CTGAGAGGAA GACGCCAGAC AAAAACACAG TGGCCAAGGT GGCTGAGGTA
ATGGGAAACT CCAAGTTGCC TGTAGTGGTT GCAGGGTATG GAGTTAGGGC AAGCAATGCG
TCACCTCAAT TATTGGAACT CGCGGAGTTA CTTGACGCGC CGGTGATCAC AACCTTTAGA
GCCAAGGGAG TTTTCCCGGC CTCACATCCG CTCTACGCAG GCGAGGGATT GGGAGCATTT
TCCACGGAAG TTGCTTCCAA GCTCATGATG GAAGCAGACT CGATTCTAGT ACTTGGGTCT
AGATTGCCTC AACTTAGTAC TGCCGGCTGG TCCATGAGGT ATAAGGGTTT CCTCATGCAC
AACAATGTGG ATGGAGAGGA TATAGGCAAG GTAGTAATGC CACAACTTCC CATTGTTGCA
GACACAGGCC TCTTCCTTAA GGAACTGATA ACAATACTCA AACAGAAACT AAAGGAAAAC
ATCAAGAGGG AGGTGAGAAG CGAGATAGCG TCAAGCAGGA GAGTGTTCAC CATGAAACCC
CACTCGGGAC TATGGCCATA TGACGTTACT AGGCTTCTAC AACAGTTCAA GTTTTCGAGA
TACTTCGTGG ATTTGAGTGC CCCAACTCTC GACCTGGTTA GACTGCCCAT CGAGAGCCCT
GTGTGGAACA CGAGCGAATC AATTCTCGAG AAGGGAATAG GTGTCGCTGG TGTGCTGCAG
TCCAACGATC CAGGTGCCCT CGGGATTACT GACCTAGCTG GTGTACTAAG AAATGTTGGC
CTCATTCAGC AAAGGGCTGA AAAGGCGAAG GGAGTAATCC TTGTGCTCAA TGACGGGGGA
GCCACTTACC TTGACACGTT CAAATCGGAC ATACCGTCTA TAGGAAAATC GGGAACGTTT
GTGGACGTGG ATGAATTCCT AGAGAGATCC ACGGGGGCAG TCACAGTGGA TACCTACGGA
GGGTTGAAGG ACATCCTGGA GCGGAGAGAC CCTAAGCTCA AGGTAATAAA CGTGAAGATA
GATCCGGATT ACGAGTCAAT CGTTCTTCTA AAACCATAA
 
Protein sequence
MTSFWYYSDP TLSQPKRKEE TVGREMTGDE ALAYVLKEIG VKRVFTSNAV PDFLRERLAQ 
YGLEIDISLS VREALELADA FARDSGDAGV VISTPGSSLL EGSSVVAQAF SDSVPLLMIG
TLRSYRDVGK ARVGELKSPD DVSSSLSPFI KFKERVISIE EITVTVEKGY KEALSNRMRP
ALVEIAEELF RLKAYPLSTA EQKPERKTPD KNTVAKVAEV MGNSKLPVVV AGYGVRASNA
SPQLLELAEL LDAPVITTFR AKGVFPASHP LYAGEGLGAF STEVASKLMM EADSILVLGS
RLPQLSTAGW SMRYKGFLMH NNVDGEDIGK VVMPQLPIVA DTGLFLKELI TILKQKLKEN
IKREVRSEIA SSRRVFTMKP HSGLWPYDVT RLLQQFKFSR YFVDLSAPTL DLVRLPIESP
VWNTSESILE KGIGVAGVLQ SNDPGALGIT DLAGVLRNVG LIQQRAEKAK GVILVLNDGG
ATYLDTFKSD IPSIGKSGTF VDVDEFLERS TGAVTVDTYG GLKDILERRD PKLKVINVKI
DPDYESIVLL KP