Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msed_0635 |
Symbol | |
ID | 5103795 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | - |
Start bp | 581437 |
End bp | 582987 |
Gene Length | 1551 bp |
Protein Length | 516 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 640506539 |
Product | FAD dependent oxidoreductase |
Protein accession | YP_001190734 |
Protein GI | 146303418 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG1233] Phytoene dehydrogenase and related proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.470789 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTACGACG TGGCCATTGT GGGTGGTGGT CACAACGGTC TTGTTACGGC GTCTTATCTA GCTATGAATG GACTTAAGGT CGCCGTCTTT GAAAGGAGGA ACATAGTGGG AGGCGCCTCC GTCACGGAGG AGTTGTGGCC AGGGGTAAAG GTTTCCACAG GCGCCTACGT TCTCAGCCTT CTGAGACCCA GGATCATAAG GGATCTGAGA CTGGAGCAAT TTGGACTCGA GGTAATAACC AAGGATCCCG GGCTCTTCGT CCCCTTCGGT AACGGGAGGT CACTTTACAT CTGGAGCGAT CTCAAGAAGA CTCAGAGGGA AATAGAGAAA TTTTCCAAAA GGGACGCCTT GGCGTACGAG AAGTGGCTCA AGTTCTGGGA TCCCTTCTAT GAATTGGCGG ATCTCCTGAT GTTGTCCCCT CCACCCTCCT GGGATGATTT GGATAGTCTC CTCTCCCTAG TCAAGGTTCA GGGTCTAGAT CTCTCGGAAC TGGCCCTTCC CTTGAGGTCA GTGGTTCAGG ATGCGTCCTC CCTTCTGAAC GAGTTCTTTG AATCCGAGGA GGTCAAGGCA GCCCTAGTTG AGGACGCAGT AGTGGGGACA ATGGCCTCTC CCTCCACCCC TGGTACCGCG TACGTTCTGG CTCATCACGT CCTCGGGGAA GTTAACGGTG TTAAGGGTGC GTGGGGCTAC GTCAAGGGTG GGATGGGCGG TGTCACCCAA GCTCTCAGGA GGTCGGCTGA GCACCTTGGG GTGGAAATAT TCACTGGGGC CGAGGTTGAC AGCATTTTAG CGAAGGGAGG GAAAGTGGAA GGCATTAAAC TAGCCAACGG GAAAATCGTT CACTCTAGGA TAGTTGTGTC CAACGCAGAC CCTAAAACCA CCTTCCTGAA GCTCCTTAGG GACGCTGAAC TGGACGAGGA CTTCCTCAGG AGGGTCAGGT CGCTGAAGAG CAGGGGAGTC TCATTCAAGA TCGTGGGTTA CACCGAGGAA CTGCCCAATT TCGGGAACGG GACTACTCTA GGCCCAGAGC ACGTGGCGTC TGAGCTTATT CTGCCCTCCG TCGACTACGT TGAGAGGGCC TTCATTGACG CCAAATCCCT TGGTTACTCC AAGGAACCTT GGCTTTCCAT AAACATACAG TCCTCAGTGG ATCCCACGGT TGCCCCTCCG GGTAAGTTCT CCTTCTCAAT CTTCGGTCAG TATCTTCCCT ACTCCAAGAA CTTGGACGAT ATGAGGGAAC TGGTATATCA GATAACCCTG GAGAAAATAA GGGAGTACGC TCCCAATTTC AAGCCAGTTA AGTACGAAGT CTTAACCCCA CTGGATATAG AGAGAAGGTT TGGAATAACT GAGGGCAACA TCTTTCACCT GGACATGACG CCCGATCAGT TATACGTTTT CAGACCTCTG CCTGGGTACC ACGATTACAC CACACCGGTA CAGGGGCTAT ATCTATGTGG CTCGGGAACA CATCCTGGTG GAGGGGTAAC TGGGGCTCCA GGTTTTAACG CCGCTCAAAG GATATTGTCC GACCTCAGGA GTAATCACTA A
|
Protein sequence | MYDVAIVGGG HNGLVTASYL AMNGLKVAVF ERRNIVGGAS VTEELWPGVK VSTGAYVLSL LRPRIIRDLR LEQFGLEVIT KDPGLFVPFG NGRSLYIWSD LKKTQREIEK FSKRDALAYE KWLKFWDPFY ELADLLMLSP PPSWDDLDSL LSLVKVQGLD LSELALPLRS VVQDASSLLN EFFESEEVKA ALVEDAVVGT MASPSTPGTA YVLAHHVLGE VNGVKGAWGY VKGGMGGVTQ ALRRSAEHLG VEIFTGAEVD SILAKGGKVE GIKLANGKIV HSRIVVSNAD PKTTFLKLLR DAELDEDFLR RVRSLKSRGV SFKIVGYTEE LPNFGNGTTL GPEHVASELI LPSVDYVERA FIDAKSLGYS KEPWLSINIQ SSVDPTVAPP GKFSFSIFGQ YLPYSKNLDD MRELVYQITL EKIREYAPNF KPVKYEVLTP LDIERRFGIT EGNIFHLDMT PDQLYVFRPL PGYHDYTTPV QGLYLCGSGT HPGGGVTGAP GFNAAQRILS DLRSNH
|
| |