Gene Msed_1597 details


Gene Information

Locus tag: Msed_1597
Symbol:
ID: 5103961
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Metallosphaera sedula DSM 5348
Kingdom: Archaea
Replicon accession: NC_009440
Strand:
Start bp: 1543558
End bp: 1545381
Gene Length: 1824 bp
Protein Length: 607 aa
Translation table: 11
GC content: 50%
IMG OID: 640507486
Product: thiamine pyrophosphate binding domain-containing protein
Protein accession: YP_001191676
Protein GI: 146304360
COG category: [C] Energy production and conversion
COG ID: [COG4231] Indolepyruvate ferredoxin oxidoreductase, alpha and beta subunits
TIGRFAM ID: [TIGR03336] indolepyruvate ferredoxin oxidoreductase, alpha subunit
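
The length fields in this record follow from the coordinates by simple arithmetic. The sketch below is one way to check that, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
# Coordinates copied from the record (assumed 1-based, inclusive).
start_bp = 1543558
end_bp = 1545381


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature with inclusive 1-based coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


length_bp = gene_length(start_bp, end_bp)
length_aa = protein_length(length_bp)
print(length_bp, "bp;", length_aa, "aa")  # 1824 bp; 607 aa
```

Both values agree with the Gene Length and Protein Length fields listed in the record.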


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.10146
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTAGTTG TTAAGCCAAG TCTAATGTTA GGCAATGAGG CCATAGCTTA CGGTGCTCTA 
GCGTCCGGCG TTGCGGTTGC GGCAGGTTAC CCCGGAACTC CGTCGACCGA GATAATTGAG
ACCCTTATGA AGTTCAAGGA CGTTTACACA GAATGGAGTT CCAATGAGAA GGTGGCCTTT
GAGACAGGAT TCGGGGCTGC AATTATGGGT GCCCGGGCCC TGGTCACCAT GAAACACGTG
GGAATGAACG TTGCCTCAGA TTCGCTTATG TCGTCATCCT ACACGGGAGT ATCAGGGGCA
CTTGTGGTGG TCTCTGCAGG AGATCCAGGC ATGTGGTCTT CGCAGAGCGA ACAGGACACC
AGGTATTATG GGCTTATGGG TATGATTCCA GTCCTTGAGC CCTTCAATCC CCAATCTGCT
CATGACCTCA CAGTTGAGGC ATTTAACCTG AGCTCGGAGG TTGGTCATCC TGTCATTATT
TCCACAAATA CCAGGATTAG CCATGTTAGG TCACAGGTTA ATGTAATTCC CAGGAGGGAA
CCGGTCTATG GGAAATTCCA GAAAAACCCT GGAAGATACT CCCTTGTACC TGAGGTCTCC
AGAAGGGATA GGGAGGAACA ACTTAACAGA TGGGAGAAGA TCAAGTCGCT TACCGCCCAT
CTGGTGGAGT CTCGCGGGGA AGGTAAGGTG GCTGTTGTTG GGGTAGGAAT TTCATATTCT
TACGCCTTAG AGGCCTTGAG GGAACTTAAG GCTGAGCAAG TTAAGGTAAT AGGTGTATCC
TGCTCTGTAC CATTGCCTGA GAAGATTCTA GATTACCTTA CTGACGTCGA GAAAGTCCTT
GTGATTGAGG AGCTGGATCC TGTGGTGGAG AATCAGTTGA AATCCATGAT ACTTGACCAA
GGACTTCACG TTAAGGTGGA TGGAAAGAAA CTTACGGGAT ACGCGGGAGA AATGTCCCTC
GAAAGGGTAT CAAGAGCCAT AGCTAAATTC CTTGGTATTG AGGAGGAACC TCAACTGGAC
CAGATCCTAA AGGCCCCCGT AGATGTACCC AAGAGACCTC CCGCCATGTG TCCAGGTTGC
CCGCATAGGT CAAGCTTCTT CTTCCTCAAG AAGGGGCTAT CCCTGGGCGG GATCTCAAGC
ACCTTCTATT CTGGGGATAT AGGTTGCTAC TCATTGGGAG TACTCCCGCC TTTCAACGAG
CAAGATAGCT TGATATCCAT GGGAAGTAGT TTAGGAATAG CTAATGGAGT TTATAGGTCG
ACGCACACGA TCCCCGTGGC AATCATAGGA GATTCAACGT TCTTCCATAC AGGACTTCCA
GGCCTCGCAA ATGCCGTCTA TAACAAGTTC CCAGTTCTCG TGATCGTGTT AGATAATCGC
TCCACCGCCA TGACAGGCCA ACAGGGTAGC CCATCAACCA GTATTGATAT AGCGAACGTA
GCTAAGGGCC TAGGCGTTGA GTATGTGGAA GTTGGGGATC CCTTTAGTCC TGATTTTGCC
AAGGTTGTAG CTAGGGCATC TGAATGGGTA AAGAGGAATC AGGCACCAGC TGTCGTGGTG
GCGAAAAGGG CCTGTGCCCT CGAGGTCATA GATAGGGTAA AACCCGCACA GGTAGCCGTG
GTGAATTACG ATAAATGTAC AGGCTGTACG ATCTGCTATG ATTACTTTAC GTGTCCTGCA
ATCCTGAAAA GGAGTGACAA GAAGGCGGTA ATTAATCCTC AGGATTGTAT TGGGTGTGGC
GCATGCGTTC CCGTGTGCCC CTTTAACGCT ATCAAACTTG AGGGGGAGAA ACCTATGGGG
TGGGATGAGG CATGGACAAG CTAA
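
The gene sequence is printed in space-separated groups of ten bases, so it has to be joined before any analysis. The sketch below shows one possible way to do that and to compute a GC fraction comparable to the GC content field. The helper names are hypothetical, and only the first line of the sequence is used as sample input.

```python
def clean_sequence(block: str) -> str:
    """Join a grouped sequence block into one uppercase string."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First line of the gene sequence block, copied from the record.
first_line = "ATGTTAGTTG TTAAGCCAAG TCTAATGTTA GGCAATGAGG CCATAGCTTA CGGTGCTCTA"
seq = clean_sequence(first_line)
print(len(seq), round(gc_fraction(seq), 2))
```

Passing the whole sequence block to `clean_sequence` should give a string whose length equals the Gene Length field, which is a quick check that nothing was lost when copying.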
 
Protein sequence
MLVVKPSLML GNEAIAYGAL ASGVAVAAGY PGTPSTEIIE TLMKFKDVYT EWSSNEKVAF 
ETGFGAAIMG ARALVTMKHV GMNVASDSLM SSSYTGVSGA LVVVSAGDPG MWSSQSEQDT
RYYGLMGMIP VLEPFNPQSA HDLTVEAFNL SSEVGHPVII STNTRISHVR SQVNVIPRRE
PVYGKFQKNP GRYSLVPEVS RRDREEQLNR WEKIKSLTAH LVESRGEGKV AVVGVGISYS
YALEALRELK AEQVKVIGVS CSVPLPEKIL DYLTDVEKVL VIEELDPVVE NQLKSMILDQ
GLHVKVDGKK LTGYAGEMSL ERVSRAIAKF LGIEEEPQLD QILKAPVDVP KRPPAMCPGC
PHRSSFFFLK KGLSLGGISS TFYSGDIGCY SLGVLPPFNE QDSLISMGSS LGIANGVYRS
THTIPVAIIG DSTFFHTGLP GLANAVYNKF PVLVIVLDNR STAMTGQQGS PSTSIDIANV
AKGLGVEYVE VGDPFSPDFA KVVARASEWV KRNQAPAVVV AKRACALEVI DRVKPAQVAV
VNYDKCTGCT ICYDYFTCPA ILKRSDKKAV INPQDCIGCG ACVPVCPFNA IKLEGEKPMG
WDEAWTS
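
The protein sequence can be reproduced from the gene sequence by codon translation. The sketch below builds a codon table from the usual TCAG ordering and translates until the first stop codon. It assumes the amino acid assignments of translation table 11, which for sense codons match the standard code; alternative start codons are not handled. The `translate` helper is illustrative only.

```python
from itertools import product

# Codon table in TCAG order; table 11 uses these same assignments.
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 and last 24 bases of the gene sequence in this record.
print(translate("ATGTTAGTTG TTAAGCCAAG TCTAATGTTA"))  # MLVVKPSLML
print(translate("TGGGATGAGG CATGGACAAG CTAA"))        # WDEAWTS
```

The two outputs match the first ten and last seven residues of the protein sequence listed in the record.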