Gene Msed_1108 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsed_1108 
Symbol 
ID5103582 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMetallosphaera sedula DSM 5348 
KingdomArchaea 
Replicon accessionNC_009440 
Strand
Start bp1035797 
End bp1037266 
Gene Length1470 bp 
Protein Length489 aa 
Translation table11 
GC content47% 
IMG OID640507003 
Productthiamine pyrophosphate binding domain-containing protein 
Protein accessionYP_001191196 
Protein GI146303880 
COG category[E] Amino acid transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG0028] Thiamine pyrophosphate-requiring enzymes [acetolactate synthase, pyruvate dehydrogenase (cytochrome), glyoxylate carboligase, phosphonopyruvate decarboxylase] 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.0811747 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATGTAG CAGAATCAAT ATTCAAAACC CTGTCAAGTT CTACCACAAC CGTGTACGGG 
AACCCAGGAA CCACGGAGAT TTCCTTTCTC AAGTATCTAC CGAGCGAATT TCGATATTTC
CTAGCCCTCC ACGATGGCCC AGCCATAGGT ATGGCCAGTG GTTACTCTCT CATGACAGGT
AAGGTAGGGG TAACCAACAC GCATGCAGCT CCTGGGTTAA TGAATTCCTT GGGTTACGTT
TATTCGGCAA GACTTGACAG AACTCCCCTT CTCATCACGG TGGGGCAACA GTCTTCTACC
CAGTTGTTGG ATGAGCCCAT ACTTTCCGTA GATCTAAGGA CCGTTCCATA TGCAAAGGAT
GTGATAGAGG TGAGAAGGAA GGAGGAAGTT AGTAAAGCCT TGATTAGGGG AATCAAAACG
GCTGTTTCTC TTCCGCCCGG ACCGGTTATC CTCGGGCTAC CATATGACAT CATGGAGGAG
GAGATAGGGA ACACGGAGAG TTACATCTCT GGAAGAGTTG AATGGAACTG TCCCTGCAAT
CTCCTAGACG TAGAGATGGT AGCCGAGGAG ATCAATGCAG TCAATAAAGT TGCAGTAGTT
GCAGGATACG AATTGGACAT AGTGGATGCT CACGAGGAAG TTGTGGAGTT GGCAAGAAAG
GTGGGTTCAC CTATCTTCAC AGAACCCCAC TTCTCCCGGT CTCCAGGTTC AAAGATCGAC
GTTATATTAC CAAGAAGTGC CAGTGGGATA AACAGGATTC TTGGTCAATA CGATCTAGTC
CTCCTCCTCG GAGGTACCCT TCACAACGTG TTGTACATGG ACCAAGAGTT CAGGTTCAAC
ATACTTCAGA TTACCATGGA CCCAGAGGAG AAATCCAAGA GGATTTGGAG AACCGTCCTC
TGTAATCCAA AGGACTTCCT GAGACACCTT CTCCCTAAGG TAAGGGAAAA AGTCGGTTCT
CACGATCTCA AGCCGGATAA CAAAAATAAG GTCACGGAGC TAATGGAGTA CCTGGTTTCA
AAGCTAAACG GACACGCCAT ATTTGAAGAG ACTCCGTCCC ATAAGGAGGT AGTTAAGAAA
GTAATTGGGA TTAGGAAACA TCTCTTCTTC TCCAATAGAT CTGGATTCCT GGGTTGGGCT
CTACCTGCAT CACTGGGCTA CGTTACTGCC GGAGGTAAGG CTGTCACTCT CATAGGAGAT
GGAAGTTTCC ACTTTTCTCC ACAGACACTT TGGACCGCAT CCTACTATGA CCTAGAAATG
AGAATAATGA TACTTAACAA CCATGGGTAT GAATCGTTGA GGGGGAGAGC TGATTATCAA
GCTAACTTCT TCAATCCAAG GACACAACCC CTAAAAGTCG CTGAGGCTTA TGGATTTGAG
ACGTTCGAGA CTGACCATTT AGCAGATGGT GTGGATTGGC TAATGGAAAA GGGAGGGAAG
AGGAGAGTTG TGGAAATTGT ACTAAAATAA
 
Protein sequence
MNVAESIFKT LSSSTTTVYG NPGTTEISFL KYLPSEFRYF LALHDGPAIG MASGYSLMTG 
KVGVTNTHAA PGLMNSLGYV YSARLDRTPL LITVGQQSST QLLDEPILSV DLRTVPYAKD
VIEVRRKEEV SKALIRGIKT AVSLPPGPVI LGLPYDIMEE EIGNTESYIS GRVEWNCPCN
LLDVEMVAEE INAVNKVAVV AGYELDIVDA HEEVVELARK VGSPIFTEPH FSRSPGSKID
VILPRSASGI NRILGQYDLV LLLGGTLHNV LYMDQEFRFN ILQITMDPEE KSKRIWRTVL
CNPKDFLRHL LPKVREKVGS HDLKPDNKNK VTELMEYLVS KLNGHAIFEE TPSHKEVVKK
VIGIRKHLFF SNRSGFLGWA LPASLGYVTA GGKAVTLIGD GSFHFSPQTL WTASYYDLEM
RIMILNNHGY ESLRGRADYQ ANFFNPRTQP LKVAEAYGFE TFETDHLADG VDWLMEKGGK
RRVVEIVLK