Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPC_2404 |
Symbol | |
ID | 3971488 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB18 |
Kingdom | Bacteria |
Replicon accession | NC_007925 |
Strand | + |
Start bp | 2606317 |
End bp | 2607507 |
Gene Length | 1191 bp |
Protein Length | 396 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 637925513 |
Product | NADH dehydrogenase subunit D |
Protein accession | YP_532275 |
Protein GI | 90423905 |
COG category | [C] Energy production and conversion |
COG ID | [COG0649] NADH:ubiquinone oxidoreductase 49 kD subunit 7 |
TIGRFAM ID | [TIGR01962] NADH dehydrogenase I, D subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.00163945 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGCCTGAAG GCGCGCTTCG CAACTTCACC ATCAATTTCG GACCGCAGCA TCCGGCGGCG CATGGCGTGC TGCGGCTGGT GCTGGAGCTC GACGGCGAGA TCGTCGAGCG GGTCGATCCG CATATCGGGC TGTTGCATCG CGGCACCGAG AAGCTGATCG AGGCCAAGAC CTATCTGCAG GCGATCCCGT ATTTCGATCG GCTCGATTAC GTCGCGCCGA TGAATCAGGA GCACGCCTTC TGCCTCGCCG CCGAGAAGCT GTTGGACATC GCGGTGCCGC GCCGCGCCCA ATTGATCCGG GTGCTGTATT GCGAGATCGG CCGCATCCTG TCGCATCTGC TCAACGTCAC CACGCAGGCG ATGGACGTCG GCGCGCTGAC CCCGCCGCTG TGGGGCTTTG AAGAGCGCGA AAAGCTGATG ATGTTTTACG AGCGCGCCTC CGGCAGCCGG ATGCACGCGG CGTATTTCCG CGTCGGCGGC GTGCACCAGG ACCTGCCGCC GAAGCTGGTC GACGACATCG AGGCGTGGTG CGTCGCGTTT CCGCAAGTCA TCGACGATCT CGATCGGCTG CTCACCGGCA ACCGGATCTT CAAGCAGCGC AACGTCGATA TCGGCGTGGT GACGCTGGCG CAGGCCTGGG AGTGGGGCTT TTCCGGCGTC ATGGTGCGCG GCTCCGGCGC CGCCTGGGAT TTGCGCAAGT CGCAGCCCTA TGAGTGCTAC GCCGAGCTGG AATTCGACAT TCCGATCGGC AAGAACGGCG ACTGCTACGA CCGTTATTGC ATCCGCATGG AGGAGATGCG GCAGTCGGTG CGGATCATGC AGCAGTGCAT CGCCAAGCTG CGCGCGCCGG ACGGCGGCGG CCCGGTCGCG GTCCAGGACA ACAAGATTTT CCCGCCGCGT CGCGGCGAGA TGAAGCGCTC GATGGAATCG CTGATCCATC ACTTCAAGCT TTATACCGAG GGCTTCCGCG TGCCCGCCGG CGAAGTCTAC GTCGCGGTCG AGGCGCCGAA AGGCGAATTC GGCGTGTTCC TGGTCTCCGA CGGTAGCAAC AAACCCTATA AGTGCAAGAT CCGCGCGCCG GGCTTCGCGC ATCTGCAGGC GATGGACTTT ATCTCGCGCG GCCATCTGTT GGCCGACGTC TCGGCGATCC TGGGCTCGCT CGACATCGTG TTCGGCGAGG TCGATCGGTG A
|
Protein sequence | MPEGALRNFT INFGPQHPAA HGVLRLVLEL DGEIVERVDP HIGLLHRGTE KLIEAKTYLQ AIPYFDRLDY VAPMNQEHAF CLAAEKLLDI AVPRRAQLIR VLYCEIGRIL SHLLNVTTQA MDVGALTPPL WGFEEREKLM MFYERASGSR MHAAYFRVGG VHQDLPPKLV DDIEAWCVAF PQVIDDLDRL LTGNRIFKQR NVDIGVVTLA QAWEWGFSGV MVRGSGAAWD LRKSQPYECY AELEFDIPIG KNGDCYDRYC IRMEEMRQSV RIMQQCIAKL RAPDGGGPVA VQDNKIFPPR RGEMKRSMES LIHHFKLYTE GFRVPAGEVY VAVEAPKGEF GVFLVSDGSN KPYKCKIRAP GFAHLQAMDF ISRGHLLADV SAILGSLDIV FGEVDR
|
| |