Gene Msed_1175 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsed_1175 
Symbol 
ID5104471 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMetallosphaera sedula DSM 5348 
KingdomArchaea 
Replicon accessionNC_009440 
Strand
Start bp1144036 
End bp1145616 
Gene Length1581 bp 
Protein Length526 aa 
Translation table11 
GC content55% 
IMG OID640507067 
Productthiamine pyrophosphate binding domain-containing protein 
Protein accessionYP_001191260 
Protein GI146303944 
COG category[E] Amino acid transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG0028] Thiamine pyrophosphate-requiring enzymes [acetolactate synthase, pyruvate dehydrogenase (cytochrome), glyoxylate carboligase, phosphonopyruvate decarboxylase] 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones34 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.312541 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCAAAA CCACTGCGGA ACTCCTGATG GACGCGATCT CCTCCCAGGT TGAGGACGTT 
TTCGGGATAC CTGGGACTCA CGGCCTCTCC CTCTATGAGG AGATAAGGAA GAGGGTTGAG
AGACAAGAGG TTCGTTACTA CATGCCGAGA CTGGAGTACG GGGGCGCCAT AATGGCCGAC
TACTACGCCA GACTGAAGGG AAACGTGGGG ATCTTCATCT CGGTGAATGG TCCAGGATTC
ACGAACTCCT TGACGGGGCT AGCAGAGGCC TTCTCCGAGG GTTCACCCCT TGTCCTAATC
TCCTTCAACA AGGAGTTCAG GTATAGGCAT AGGAGACAAC TTCACGACAC TGGATACTAC
GACGCACAGA TCGAGGTAGC TAAGCAGATA ACTAAGGCGT CATTCAGGAT CTATTCCCCC
GAGGAGGTAC CCCTAGTCAT GGAAAAGTCC TTCAGGATAG CCCTTCAGGA CAGGATGGGG
CCTGTCTACG TTGAGATCCC TGTTGACGTC CTCGAGGAGA AGGGCGAGGC TGAGGCGGAG
AGAGTGAAGG TTCCAAGGAG TCTGGTCTAC CCTAGTAAGG ACGAGGTGAG GGAGGCCGTG
AACTTTCTGA GTGAGTGCTC CAAGCCCGTC CTACTCCTGG GATATGGAGC GTCCAGATCC
GATCTGGTTC CCTACCTGGA AAAGCTGGGA ATTCCTGTCC TGACCACGGT TAGGGGTAAG
GGAAGCATCC CTGAGAATCA CCCTCTCTAC GCGGGCACAA CCTTCAACCT AGCCGAGATC
CCTGGGGACT GCCTGATCGC CATTGGAACC TCATTCAATG ACCTCGAGAC TAGGAGGTGG
AGCATGAAGC TTCCTAGGAC TCTTCACGTC GATCCGGACC CCTCAGTCTT CAACACCTCC
TTCAGGGCAG ACGTCGTGGT GAGGGCTAGC GCCGAGGCCT TCTTAACCGA GGTGGTGGAG
AGGGTCAAGT TGCCCAGGTG GAGTTTCAGG GTTGAGCGAA GGGAGACCAA CCTGCAGGGT
GAGGGAATAA CTCACGACCT TCTCGCCAAG GTTCTAAACG AGGCGTTAGG GGAGGACAGG
GTGGTGATCG CAGATGCGGG CACAAATCAG GTAATGGCTA TTGACGTACA GGTGTATAGG
CCCAACTCCT ACTTCAACTC CCTGATCTTC AACGCCATGG GTTCAGCTAT CCCAGCGGGT
ATAGGGGCAA AGATTGCAGT CCCAGAGAGG CAGGTCGTGA GCATAATAGG TGACATGGGA
TTTCAGGGGT GTTTTCAGGA GTTGATCACT GCGGTAGAGA ACGGGATCAA CTTCCTGACA
GTCCTCGTGG AGGACGGCGT TCAGCATTTC CTAAGGATGA ACCAGAACAT GAGATATGGA
ACCACCTTCA CGACGCAGGT CTTTCCAATT GACTACACGA AGGTTCTCGA GGGGATTGGG
GTGAAGGTTG TGGAGGCTAG GGACAGGGAA GAGTTGAGGA GGGCAACGGA GGAGGCAGTC
AGCTGGTCAG CCAAGATGCC AACGGTACTC AGGGTTAGGG TGAACCCGAA CAGCGTCCCA
TCTAGGCTAA CGCGAAGATG A
 
Protein sequence
MAKTTAELLM DAISSQVEDV FGIPGTHGLS LYEEIRKRVE RQEVRYYMPR LEYGGAIMAD 
YYARLKGNVG IFISVNGPGF TNSLTGLAEA FSEGSPLVLI SFNKEFRYRH RRQLHDTGYY
DAQIEVAKQI TKASFRIYSP EEVPLVMEKS FRIALQDRMG PVYVEIPVDV LEEKGEAEAE
RVKVPRSLVY PSKDEVREAV NFLSECSKPV LLLGYGASRS DLVPYLEKLG IPVLTTVRGK
GSIPENHPLY AGTTFNLAEI PGDCLIAIGT SFNDLETRRW SMKLPRTLHV DPDPSVFNTS
FRADVVVRAS AEAFLTEVVE RVKLPRWSFR VERRETNLQG EGITHDLLAK VLNEALGEDR
VVIADAGTNQ VMAIDVQVYR PNSYFNSLIF NAMGSAIPAG IGAKIAVPER QVVSIIGDMG
FQGCFQELIT AVENGINFLT VLVEDGVQHF LRMNQNMRYG TTFTTQVFPI DYTKVLEGIG
VKVVEARDRE ELRRATEEAV SWSAKMPTVL RVRVNPNSVP SRLTRR