Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nmul_A1094 |
Symbol | |
ID | 3784709 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosospira multiformis ATCC 25196 |
Kingdom | Bacteria |
Replicon accession | NC_007614 |
Strand | + |
Start bp | 1259407 |
End bp | 1260660 |
Gene Length | 1254 bp |
Protein Length | 417 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 637811179 |
Product | NADH dehydrogenase subunit D |
Protein accession | YP_411789 |
Protein GI | 82702223 |
COG category | [C] Energy production and conversion |
COG ID | [COG0649] NADH:ubiquinone oxidoreductase 49 kD subunit 7 |
TIGRFAM ID | [TIGR01962] NADH dehydrogenase I, D subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.402289 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCTGAGA TACGTAATTA CACGATGAAC TTCGGCCCGC AGCATCCGGC CGCCCACGGT GTCTTGCGAC TGGTGCTGGA GCTCGACGGG GAAGTCATAC AACGTGCCGA TCCGCACATC GGTCTGCTGC ATCGTGCAAC GGAAAAGCTT GCCGAGTACA AGACATATAT CCAGTCAGTC CCTTACATGG ACAGATTGGA CTACGTCTCC ATGATGGCCA ACGAGCACGC GTATGTGATG GCGATCGAGA AGTTGCTTCA ACTCGAAGTG CCCATACGCG CGCAGTATAT CCGGGTGATG TTCGACGAAA TCACGCGGAT ACTCAATCAT CTTCTGTGGC TGGGAGCACA CGCGCTGGAT GTGGGCGCGA TGACGGTATT CCTTTACGCT TTCCGTGACC GGGAAGACTT GATGGATGCC TACGAATCGG TTTCGGGCGC AAGGATGCAT GCCGCTTACT ATCGGCCTGG CGGCGTTTAT CGGGACTTGC CGGATTCAAT GCCGCAGTAT AAGGCGTCAA AAATTCACGA CGAAAAAACA ACCAAGGCGC GCAATGAAAA CCGCCAGGGT TCGCTGCTCG ATTTCATCGA AGACTTCACC AACCGGTTTC CCACGTACGT CGATGAGTAC GAGACCCTGC TTACCGATAA CCGTATCTGG AAACAGAGAC TGGTAGGCAT TGGAACGGTT TCGCCCGAGC GGGCCATGGC TCTGGGATTC ACCGGCCCCA TGCTGCGCGG GTCCGGCGTC GAATGGGATC TGCGGAAGAA GCAGCCCTAT GAAGTTTATG ATCAGCTCGA TTTCGATATA CCTGTCGGCG TCAATGGGGA TTGCTACGAC CGCTATCTGG TCCGGATCGA AGAATTCCGG CAGTCCAATC GCATCATCAG GCAATGTGTC GACTGGCTTC GTAAAAATCC GGGGCCGGTC ATAACGGATA ACCACAAGGT CGCGCCGCCT TCCCGTGTGA ACATGAAGCA GAACATGGAG GAACTGATCC ATCATTTCAA GCTTTTCACT GAAGGGTTTC ACGTGCCGCC CGGTGAAACC TATGCGGCAG TCGAGCACCC GAAAGGGGAA TTCGGCATTT ACCTGATATC GGATGGCGCC AACATGCCTT ACCGCATGAA AATCCGCGCT CCCGGCTTTG CCCATCTGGC AGCGCTGGAC GAGATGTCGC GCGGCCATAT GATTGCCGAT GTGGTTGCCA TCATTGGTAC CCAGGATATT GTGTTTGGTG AAATAGACAG ATGA
|
Protein sequence | MAEIRNYTMN FGPQHPAAHG VLRLVLELDG EVIQRADPHI GLLHRATEKL AEYKTYIQSV PYMDRLDYVS MMANEHAYVM AIEKLLQLEV PIRAQYIRVM FDEITRILNH LLWLGAHALD VGAMTVFLYA FRDREDLMDA YESVSGARMH AAYYRPGGVY RDLPDSMPQY KASKIHDEKT TKARNENRQG SLLDFIEDFT NRFPTYVDEY ETLLTDNRIW KQRLVGIGTV SPERAMALGF TGPMLRGSGV EWDLRKKQPY EVYDQLDFDI PVGVNGDCYD RYLVRIEEFR QSNRIIRQCV DWLRKNPGPV ITDNHKVAPP SRVNMKQNME ELIHHFKLFT EGFHVPPGET YAAVEHPKGE FGIYLISDGA NMPYRMKIRA PGFAHLAALD EMSRGHMIAD VVAIIGTQDI VFGEIDR
|
| |