Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msed_1597 |
Symbol | |
ID | 5103961 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | - |
Start bp | 1543558 |
End bp | 1545381 |
Gene Length | 1824 bp |
Protein Length | 607 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 640507486 |
Product | thiamine pyrophosphate binding domain-containing protein |
Protein accession | YP_001191676 |
Protein GI | 146304360 |
COG category | [C] Energy production and conversion |
COG ID | [COG4231] Indolepyruvate ferredoxin oxidoreductase, alpha and beta subunits |
TIGRFAM ID | [TIGR03336] indolepyruvate ferredoxin oxidoreductase, alpha subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.10146 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTAGTTG TTAAGCCAAG TCTAATGTTA GGCAATGAGG CCATAGCTTA CGGTGCTCTA GCGTCCGGCG TTGCGGTTGC GGCAGGTTAC CCCGGAACTC CGTCGACCGA GATAATTGAG ACCCTTATGA AGTTCAAGGA CGTTTACACA GAATGGAGTT CCAATGAGAA GGTGGCCTTT GAGACAGGAT TCGGGGCTGC AATTATGGGT GCCCGGGCCC TGGTCACCAT GAAACACGTG GGAATGAACG TTGCCTCAGA TTCGCTTATG TCGTCATCCT ACACGGGAGT ATCAGGGGCA CTTGTGGTGG TCTCTGCAGG AGATCCAGGC ATGTGGTCTT CGCAGAGCGA ACAGGACACC AGGTATTATG GGCTTATGGG TATGATTCCA GTCCTTGAGC CCTTCAATCC CCAATCTGCT CATGACCTCA CAGTTGAGGC ATTTAACCTG AGCTCGGAGG TTGGTCATCC TGTCATTATT TCCACAAATA CCAGGATTAG CCATGTTAGG TCACAGGTTA ATGTAATTCC CAGGAGGGAA CCGGTCTATG GGAAATTCCA GAAAAACCCT GGAAGATACT CCCTTGTACC TGAGGTCTCC AGAAGGGATA GGGAGGAACA ACTTAACAGA TGGGAGAAGA TCAAGTCGCT TACCGCCCAT CTGGTGGAGT CTCGCGGGGA AGGTAAGGTG GCTGTTGTTG GGGTAGGAAT TTCATATTCT TACGCCTTAG AGGCCTTGAG GGAACTTAAG GCTGAGCAAG TTAAGGTAAT AGGTGTATCC TGCTCTGTAC CATTGCCTGA GAAGATTCTA GATTACCTTA CTGACGTCGA GAAAGTCCTT GTGATTGAGG AGCTGGATCC TGTGGTGGAG AATCAGTTGA AATCCATGAT ACTTGACCAA GGACTTCACG TTAAGGTGGA TGGAAAGAAA CTTACGGGAT ACGCGGGAGA AATGTCCCTC GAAAGGGTAT CAAGAGCCAT AGCTAAATTC CTTGGTATTG AGGAGGAACC TCAACTGGAC CAGATCCTAA AGGCCCCCGT AGATGTACCC AAGAGACCTC CCGCCATGTG TCCAGGTTGC CCGCATAGGT CAAGCTTCTT CTTCCTCAAG AAGGGGCTAT CCCTGGGCGG GATCTCAAGC ACCTTCTATT CTGGGGATAT AGGTTGCTAC TCATTGGGAG TACTCCCGCC TTTCAACGAG CAAGATAGCT TGATATCCAT GGGAAGTAGT TTAGGAATAG CTAATGGAGT TTATAGGTCG ACGCACACGA TCCCCGTGGC AATCATAGGA GATTCAACGT TCTTCCATAC AGGACTTCCA GGCCTCGCAA ATGCCGTCTA TAACAAGTTC CCAGTTCTCG TGATCGTGTT AGATAATCGC TCCACCGCCA TGACAGGCCA ACAGGGTAGC CCATCAACCA GTATTGATAT AGCGAACGTA GCTAAGGGCC TAGGCGTTGA GTATGTGGAA GTTGGGGATC CCTTTAGTCC TGATTTTGCC AAGGTTGTAG CTAGGGCATC TGAATGGGTA AAGAGGAATC AGGCACCAGC TGTCGTGGTG GCGAAAAGGG CCTGTGCCCT CGAGGTCATA GATAGGGTAA AACCCGCACA GGTAGCCGTG GTGAATTACG ATAAATGTAC AGGCTGTACG ATCTGCTATG ATTACTTTAC GTGTCCTGCA ATCCTGAAAA GGAGTGACAA GAAGGCGGTA ATTAATCCTC AGGATTGTAT TGGGTGTGGC GCATGCGTTC CCGTGTGCCC CTTTAACGCT ATCAAACTTG AGGGGGAGAA ACCTATGGGG TGGGATGAGG CATGGACAAG CTAA
|
Protein sequence | MLVVKPSLML GNEAIAYGAL ASGVAVAAGY PGTPSTEIIE TLMKFKDVYT EWSSNEKVAF ETGFGAAIMG ARALVTMKHV GMNVASDSLM SSSYTGVSGA LVVVSAGDPG MWSSQSEQDT RYYGLMGMIP VLEPFNPQSA HDLTVEAFNL SSEVGHPVII STNTRISHVR SQVNVIPRRE PVYGKFQKNP GRYSLVPEVS RRDREEQLNR WEKIKSLTAH LVESRGEGKV AVVGVGISYS YALEALRELK AEQVKVIGVS CSVPLPEKIL DYLTDVEKVL VIEELDPVVE NQLKSMILDQ GLHVKVDGKK LTGYAGEMSL ERVSRAIAKF LGIEEEPQLD QILKAPVDVP KRPPAMCPGC PHRSSFFFLK KGLSLGGISS TFYSGDIGCY SLGVLPPFNE QDSLISMGSS LGIANGVYRS THTIPVAIIG DSTFFHTGLP GLANAVYNKF PVLVIVLDNR STAMTGQQGS PSTSIDIANV AKGLGVEYVE VGDPFSPDFA KVVARASEWV KRNQAPAVVV AKRACALEVI DRVKPAQVAV VNYDKCTGCT ICYDYFTCPA ILKRSDKKAV INPQDCIGCG ACVPVCPFNA IKLEGEKPMG WDEAWTS
|
| |