Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nmul_A1098 |
Symbol | |
ID | 3784713 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosospira multiformis ATCC 25196 |
Kingdom | Bacteria |
Replicon accession | NC_007614 |
Strand | + |
Start bp | 1264954 |
End bp | 1266054 |
Gene Length | 1101 bp |
Protein Length | 366 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 637811183 |
Product | respiratory-chain NADH dehydrogenase, subunit 1 |
Protein accession | YP_411793 |
Protein GI | 82702227 |
COG category | [C] Energy production and conversion |
COG ID | [COG1005] NADH:ubiquinone oxidoreductase subunit 1 (chain H) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAATACG CGCAACAACT GTTCGGGGAT TTTTTCGGTC CTGAGTGGGG ACCTGCTCTT TTCCTGCTGG TGAAGAATGT CCTTCTGATC GTGGCCATCG TGCTGCCACT GATGCTGGCG GTTGCCTATC TCACATTTGC CGAACGCAAG ATCATTGGCT ATATGCAGTT GCGCGTGGGT CCCAATCGGG TAACGTTCTT TGGCATTCCC TGGCTGGGGG GGTGGGCGCA GCCCATTGCC GATGCGGTAA AGGCGGTGAT GAAAGAAATC ATCATCCCGA GCGGAGCGAA CAAAGTCCTG TTCGTGCTTG CGCCCATACT GACGTTCGCG CCGGCACTGG CGGCCTGGGC GGTCATTCCC TTTTCTCCGG ATGTGGTTCT GGCGGACATC AATGCAGGTC TGCTTTATAT TCTGGCCATG ACCTCGATGG GAGTCTATGG CATTATCATT GCGGGGTGGG CCTCCAACTC CAAATACGCA TTCCTGGGAG CAATGCGTTC GGCGGCTCAA GTGGTTTCCT ACGAACTGGC CATGGGTTTT GCGCTGGTGT GCGTGCTCAT GATGTCCCAG AGCCTGAACC TGGGTGACAT TGTCAAGGGC CAGCAAGGGG CCAGCATGCT GAACTGGTAT CTGATACCGC TGTTTCCCAT GTTTCTGGTT TATTTTATTT CCGGCGTCGC GGAAACCAAT CGTGCTCCAT TCGATGTCGC CGAGGGTGAG TCCGAAATCG TGGCAGGTTT TCATGTCGAG TATTCGGGCA TGGCGTTCAC GGTGTTTTTC CTGGCCGAAT ATTCCAACAT GATTCTGGTG GCCATGCTTG CAAGCATCAT ATTCCTGGGT GGCTGGCTGC CTCCTGTCAA CGTTGCGCCG TTTACCCTTG TTCCCGGCTT CATCTGGCTG ATCCTGAAAG CATCATTTCT ATTGTTCTGT TTTCTCTGGT TCCGGGCCAC GTTTCCACGT TATCGTTACG ACCAGATCAT GCGTCTTGGC TGGAAGGTAT TCATTCCGAT CACGCTCGTC TGGATAGTGG TGCTTGGCCT GGTGATGCAG CTTCCGGCAT CGATTCGGGG CGCATTCCCG CTTAACTTGT GGTTTCACTG A
|
Protein sequence | MEYAQQLFGD FFGPEWGPAL FLLVKNVLLI VAIVLPLMLA VAYLTFAERK IIGYMQLRVG PNRVTFFGIP WLGGWAQPIA DAVKAVMKEI IIPSGANKVL FVLAPILTFA PALAAWAVIP FSPDVVLADI NAGLLYILAM TSMGVYGIII AGWASNSKYA FLGAMRSAAQ VVSYELAMGF ALVCVLMMSQ SLNLGDIVKG QQGASMLNWY LIPLFPMFLV YFISGVAETN RAPFDVAEGE SEIVAGFHVE YSGMAFTVFF LAEYSNMILV AMLASIIFLG GWLPPVNVAP FTLVPGFIWL ILKASFLLFC FLWFRATFPR YRYDQIMRLG WKVFIPITLV WIVVLGLVMQ LPASIRGAFP LNLWFH
|
| |