Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPC_2416 |
Symbol | |
ID | 3971500 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB18 |
Kingdom | Bacteria |
Replicon accession | NC_007925 |
Strand | + |
Start bp | 2617686 |
End bp | 2619206 |
Gene Length | 1521 bp |
Protein Length | 506 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 637925525 |
Product | NADH dehydrogenase subunit M |
Protein accession | YP_532287 |
Protein GI | 90423917 |
COG category | [C] Energy production and conversion |
COG ID | [COG1008] NADH:ubiquinone oxidoreductase subunit 4 (chain M) |
TIGRFAM ID | [TIGR01972] proton-translocating NADH-quinone oxidoreductase, chain M |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.185861 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.00201163 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGTCGGCTT CCATCATGAC CTGGCCCATT CTTTCCGTCG TCACCTTCCT GCCGTTGGTC GGCGCGCTGC TGCTGTTGGT GATCCGCGGC TCCGATCCGG CGGCGCAGCG CAACGCGCGC TGGATCGCGC TGTGGACCAC GCTGATCACC TTCGCGGTGT CGCTGATCCT GGTGGCGCGG TTCGATTCCA GCGTGGCGGA TTTCCAGTTC GTCGAAAAGG CGCCCTGGCT CGCCGCCGGC ATCGGCTATC ATATGGGGGT CGACGGCATT TCGCTGCCGT TCGTGATCCT GACCACGGCG CTGATGCCGT TCTGCATCAT CGCCAGCTGG AAATCGATCA CCCGCCGCGT CGGCGAATAC ATGATGGCGT TCCTGGTGCT GGAAACGCTG ATGATCGGCA CCTTCTCGGC GCTCGATTTG GTGCTGTTCT ATCTGTTCTT CGAAGGCGGC CTGATCCCGA TGTTCCTGAT CATCGGGGTG TGGGGCGGCC CGCGCCGGGT CTACGCCTCG TTCAAGTTCT TCCTCTACAC GCTGCTCGGC TCGGTGTTGA TGCTGCTGGC GATCATGGCG CTGTATCTCA ACGCCGGCAC CACCGACATT CCGACGCTGA TGCACACCGC GGTGCCGCGC AGCCTGCAGA CCTGGGCCTG GCTCGCCTTC TTCGCCTCAT TCGCGGTGAA GATGCCGATG TGGCCGGTGC ACACCTGGTT GCCCGACGCC CACGTCGAGG CGCCGACCGC GGGCTCGGTG GTGTTGGCCG CGATCATGCT GAAGATGGGC GGCTACGGCT TCCTGCGGTT CTCGCTGCCG ATGTTCCCCG ACGCCTCGCA CGACTTCGCG CCGCTGATCT TCGCGCTGTC GGTGATCGCC ATCGTCTACA CCTCGCTGGT GGCGCTGATG CAGGAAGACA TCAAGAAGCT GATCGCCTAC TCCTCGGTGG CGCATATGGG CTTCGTCACC ATGGGGATCT TCGCCGGCAC CACCCAGGGC GTCGCCGGCG GCGTATTCCA GATGGTGTCG CACGGCATCG TCTCCGGTGC GCTGTTCCTC TGCGTCGGCA TCGTCTACGA CCGCATGCAC ACCCGCGAGA TTTCGGCCTA TGGCGGCCTC GTGAACCGGA TGCCGATCTA TGCCGTGGTG TTCATGGTGT TCACCATGGC CAATGTGGGT CTGCCTGGCA CCTCGGGCTT CGTCGGCGAA TTCCTGACGC TGCTCGGCAC CTTCAAGGTC AACATCCCGA CCGCGACCGT CGCCACGCTC GGCGTGATCT TGTCTGCGGC CTATGCGCTG TGGCTGTACC GCAAGGTGGT GTTCGGCGCG CTGGTCAAGC CGTCGCTCGC CTCGATCAAG GATCTCACCT GGCGCGAGAG CCTGACGCTG GTGCCGCTGC TCATCCTCAC GCTGCTGTTC GGAGTCTATC CGAAGCCGGT GCTCGACATG TCGGCCGCCT CGGTGCAGCA CCTCGTCACC ACTTACGCCG CCGCAGCCTC TGCCGTGAAG GCAGCTGCGC TCGTGCCATG A
|
Protein sequence | MSASIMTWPI LSVVTFLPLV GALLLLVIRG SDPAAQRNAR WIALWTTLIT FAVSLILVAR FDSSVADFQF VEKAPWLAAG IGYHMGVDGI SLPFVILTTA LMPFCIIASW KSITRRVGEY MMAFLVLETL MIGTFSALDL VLFYLFFEGG LIPMFLIIGV WGGPRRVYAS FKFFLYTLLG SVLMLLAIMA LYLNAGTTDI PTLMHTAVPR SLQTWAWLAF FASFAVKMPM WPVHTWLPDA HVEAPTAGSV VLAAIMLKMG GYGFLRFSLP MFPDASHDFA PLIFALSVIA IVYTSLVALM QEDIKKLIAY SSVAHMGFVT MGIFAGTTQG VAGGVFQMVS HGIVSGALFL CVGIVYDRMH TREISAYGGL VNRMPIYAVV FMVFTMANVG LPGTSGFVGE FLTLLGTFKV NIPTATVATL GVILSAAYAL WLYRKVVFGA LVKPSLASIK DLTWRESLTL VPLLILTLLF GVYPKPVLDM SAASVQHLVT TYAAAASAVK AAALVP
|
| |