Gene Information |
Locus tag | Msed_2059 |
Symbol | |
ID | 5105039 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | + |
Start bp | 1978589 |
End bp | 1979818 |
Gene Length | 1230 bp |
Protein Length | 409 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 640507949 |
Product | FAD-dependent pyridine nucleotide-disulphide oxidoreductase |
Protein accession | YP_001192123 |
Protein GI | 146304807 |
COG category | [C] Energy production and conversion |
COG ID | [COG1252] NADH dehydrogenase, FAD-containing subunit |
TIGRFAM ID | |
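The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 1978589
end_bp = 1979818

# Assumption: coordinates are 1-based and inclusive, so add 1.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, "bp")     # 1230 bp
print(protein_length, "aa")  # 409 aa
```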
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.761901 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGACAAAGG TCTTAGTTTT AGGTGGAAGA TTCGGCGGAC TTACTGCCGC CTATAACGCA AAGAGACTGT TGGGGAGCAA GGCTGAGGTT AAGCTGATGA ACCAGGACAG GTTCACGTAC TTTAGGCCTG CTCTCCCACA CGTGGCCATA GGCGTGAGAG ACGTAGAGGA GCTCAGGATA GATTTGGCCA GCGCCATGCC AGAGAGGGGA ATATCCTTTG CCCAAGGGAA GGTAGAGAAG ATAGATGCCG AGTCTAGGAT AGTTTACTAC AAGAAACCAG ATGGAGGAAT GGGAGAAGAG GAATATGATT ACCTAATGGT GGGGATAGGC GCACACCTCG GGACTGAACT CATAAAGGGA TGGGATCAGT TCGGTTACAG CGTTTGCGAA CCGGAGTTTG CGGTCAAACT TAGGGATAGA CTGAAGGACT TCAAGGGCGG ACATATTACC ATCGGATCGG GTCCCTTCTA CCAAGGAAAG AATCCTAAAC CCAAGGTTCC AGAGAACTTT GTACCTCAAG CAGACTCGGC CTGTGAAGGG CCTGTCTTCG AGATGTCGCT GATGCTACAC GGGTACTTCA CAAGGAAGGG TATGTGGGAT AAGGTGAAAA TAACGGTCTA CTCTCCCGGC GAATATCTGT CAGATCTTTC TCCCGCATCC AGGAAGGCCG TTGCTGAGAT CTATAAAGGA TTGGGAATAG AGCTAGTACA CAACTTCAGA CTAAAGGAAT TGAGAGAGAA GGAAATAGTG GATGAAAAAG GTAACAAGCT TGAATCGGAT CTGAGCATAT TACTCCCGCC TTACACGGGT AACCCGGCAC TTAAGGCTTC CACAAAGGAC CTAGTGGACG ATGGAGGATT CATCCCCACT GACCTGAACA TGCAATCCAT CAAGTATGAC AACATATATG CAGTTGGCGA TTCTAACGCC CTAACTGTGC CTAAGCTGGG GTACTTGGCA GTTCAGACTG GCAGGATCGC GGCTCAACAT CTGGCGAAGA GATTGGGAGT TAACACGAAG GTGGAATCCT ACTATCCCAC CATCGTATGC GTAGCCGACA ATCCACTTGA GGGATATGCC GTCTCAGTGA AGGACGATAC CTGGTATGGA GGTCAGGTCT CGGTAGCTCA ACCTGCTGCA GTGAATCACT TAAAGAAGGA ACTATTCACC AAGTACTTCA TGTGGACCAA GGGTGATATG GTCCTAGAGA AATTCTTGGG AAGCTGGTGA
Protein sequence | MTKVLVLGGR FGGLTAAYNA KRLLGSKAEV KLMNQDRFTY FRPALPHVAI GVRDVEELRI DLASAMPERG ISFAQGKVEK IDAESRIVYY KKPDGGMGEE EYDYLMVGIG AHLGTELIKG WDQFGYSVCE PEFAVKLRDR LKDFKGGHIT IGSGPFYQGK NPKPKVPENF VPQADSACEG PVFEMSLMLH GYFTRKGMWD KVKITVYSPG EYLSDLSPAS RKAVAEIYKG LGIELVHNFR LKELREKEIV DEKGNKLESD LSILLPPYTG NPALKASTKD LVDDGGFIPT DLNMQSIKYD NIYAVGDSNA LTVPKLGYLA VQTGRIAAQH LAKRLGVNTK VESYYPTIVC VADNPLEGYA VSVKDDTWYG GQVSVAQPAA VNHLKKELFT KYFMWTKGDM VLEKFLGSW
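The relationship between the Gene sequence and Protein sequence rows can be illustrated with a minimal translation sketch. It is not the database's own pipeline. It builds the codon-to-amino-acid assignments of the standard genetic code, which translation table 11 shares (table 11 differs only in which codons may act as start codons), and it assumes the sequence starts in frame at an ATG. The names `CODON_TABLE` and `translate` are hypothetical helpers introduced for this example.

```python
# Standard codon assignments, listed with bases in TCAG order.
# Translation table 11 uses the same amino-acid assignments.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO))


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base blocks used in the record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":  # stop codon: end of the coding region
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence, copied from the record.
print(translate("ATGACAAAGG TCTTAGTTTT AGGTGGAAGA"))  # MTKVLVLGGR
```

Passing the full Gene sequence value to `translate` in the same way gives a string that can be compared with the Protein sequence row once the spaces between 10-residue blocks are removed.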