Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msed_1323 |
Symbol | |
ID | 5104574 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | + |
Start bp | 1301478 |
End bp | 1302632 |
Gene Length | 1155 bp |
Protein Length | 384 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 640507212 |
Product | FAD-dependent pyridine nucleotide-disulphide oxidoreductase |
Protein accession | YP_001191405 |
Protein GI | 146304089 |
COG category | [C] Energy production and conversion |
COG ID | [COG1252] NADH dehydrogenase, FAD-containing subunit |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.00637621 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAATTCT TAATACTTGG AGCAGGGTAT GCAGGACTTA CCGTGGCTCA TAAGTTGAGG AGGTACCTCA ATGACGAGAT AACCGTTATC TCTGAGTCTA GGGTGGTGAG AGAAAACACA ATTTTCCCGC TTCTCCTCAC GGACGAGGTG AAGGTCGAGG AAACTGAATT TGACGCCAAG GTAGCCATGG AGAGGAAGGG AGTGAACTTC GTGGAGGGTA GGGTGACGCA GATCCTGCCT GAGAGCAGGG AGGCCAAGAC GGACAAGGGA ACCTTTGACT ATGACTATCT CTTCCTAGCC CTCGGTGGAG GATATGAGGA GAACTTCAGG AGGATACCTG GACATGAGAA CGCAGTCATG CATCACACCC TGGATGGATT CCTCAAGCTT AAGGAAATGC TGTGGAACAC TGAGGGGAAC GTGTTCGTAG GTAACGCGCC AGGGAGTCCC ATAGAGGGTC CCTCGTATCA GGTCGCCCTC ATAGCTGAGT ACATTCTTAG GAGGAGAGGG GTGAAGGGGA AGGTTTACCT GGCTACGCAA AGTCCCAAGG GTGTCTTTGG CCCAATTCCC TTGGACTGGG TTCACGAGAA GGCCAACTCA TACTTCGAGA GGAGGGGTAT AAGCGTCCTC AAGGGTAAAG CCGTAAAGGA GATTAAGAGA GGGAAAGTAG TCTTGAGCGA CGGCCAAGAA GTGGAGGCTG ATGTGATCTC CGTGCTTCCC TCACTCTCTG CGCCAAAGGT GGTTAGAGAT GCTGGACTTG CGGGAAATTC AGGTTTCGTT GAGGTGAAGT TACCAAGTTT TAGGAAGGAT GACAGGATAT TTGCCCTAGG AGACCTGGCC CAGACTCCCT TCCCTAGGAC AGCTAGGGCA GCAATGATCT CCGCAGAGAA TGCAGTCTCG TCAGTGTTAA GGGAGGTCAA GGGACTGGAG TTGCCCATGT ACTCCCTAGG TGTTCTATGC GTGATGGAGG GAGGAGATGA CGGTGGAATT CTTAGATTCG ATACGAATGG GAAAGAGGTG AAAACCACGC TAGCCTTTGG AAAGAGTTAC GTCACCATCA AGAAACTCTA CAGTAAACTT CTAGTCAAGA GGGCCTTTGA TGTTCCCTAT CACGGAGCGC TAACAGTTGA AATACGCTAC AACTTCTCTC CTTAA
|
Protein sequence | MKFLILGAGY AGLTVAHKLR RYLNDEITVI SESRVVRENT IFPLLLTDEV KVEETEFDAK VAMERKGVNF VEGRVTQILP ESREAKTDKG TFDYDYLFLA LGGGYEENFR RIPGHENAVM HHTLDGFLKL KEMLWNTEGN VFVGNAPGSP IEGPSYQVAL IAEYILRRRG VKGKVYLATQ SPKGVFGPIP LDWVHEKANS YFERRGISVL KGKAVKEIKR GKVVLSDGQE VEADVISVLP SLSAPKVVRD AGLAGNSGFV EVKLPSFRKD DRIFALGDLA QTPFPRTARA AMISAENAVS SVLREVKGLE LPMYSLGVLC VMEGGDDGGI LRFDTNGKEV KTTLAFGKSY VTIKKLYSKL LVKRAFDVPY HGALTVEIRY NFSP
|
| |