Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_1358 |
Symbol | |
ID | 3908463 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | + |
Start bp | 1547329 |
End bp | 1548774 |
Gene Length | 1446 bp |
Protein Length | 481 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 637883252 |
Product | proton-translocating NADH-quinone oxidoreductase, chain M |
Protein accession | YP_484979 |
Protein GI | 86748483 |
COG category | [C] Energy production and conversion |
COG ID | [COG1008] NADH:ubiquinone oxidoreductase subunit 4 (chain M) |
TIGRFAM ID | [TIGR01972] proton-translocating NADH-quinone oxidoreductase, chain M |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.199626 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.187922 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGCTGA TCGCGCTCAT TCTGCTGCCG GCCATCGGCG GCTTCGTCGC ATTCTTCATC GCGGGGCGCG GTGAGCGCGA GCGCTGGGTC ACGATCGCGA CCTTCGCGCT GATGCTGGTG CTGCTGATGG CGATCGTCGC CGGCGCGCCG GAAGGCCGCT GGTACGCCAA GGTCTCCTAT CCGTGGGTGC CGTCGTTCGG TATCTCGCTC GATTTCGCGA TGGACGGGTT GTCGGCGGCG CTGATCGCGA TATCGGCCGG GCTCGGCATC ATCTCGGTGA TGGCGTCGTG GTCGGAGATC CGCACCCAGT CCGGGCTGTT CCATGCCTGC CTGTGCTGGA CCGTGGCGGC GACGATCGGC GTCTTCCTCT CGTTCGATCT GCTGATCTTC GCGTTCTTCT GGGAAGCGAT GCTGGTGCCG GCGTTCACGC TGATCGCGGT CTGGGGCCAT GGCGATCGGG AAGGCGCGGC GCTGAAATTC CTGATCTTCA ATGCGGTCGC GGGCTTCGGC CTGCTGGCGG CGGCGTTTGC GCTGGCGTCG ATGGCCGATC ACATGACCTT CAGCGCGTTC GAACTCGCGG AGATGAAGCT CAGTACGTCG GTGCAGGTCT GGATGCTGCT CGGCTTCTCG CTGGCGTTTC TGGTCAAGCT GTCGGTGCCG CCGTTCCACG CCTGGCTGCC TGAGGCGCAT ACGCTGGCGC CGACCGCGGG CTCGATCCTG CTCGCCGGCA TCCTGCTGAA GACCGGCGCC TACGGCCTGT TCCGCTTCGC GCCGATGCTG TTTCCGCAGG GGCTCGAAGC GGTCGCGCCC TACGGCATTG CGCTCGGCGC TGCTGGCGCG CTGTATGGCG GCCTGGTCGC CTGCGGCCAG AACGACGCCA AGCGGCTGGT CGCCTATACC TCGATCGCCC ATATGTCGAT CGTGCTGATG GGCATCGCCG CGGGCGTGCA CTACGCGGTC GCCGGCGCCG CCGTGGAGAT GATCGCGCAT TCGTTCTCGG CCTCGGCGCT GTTCCTGCTG ATTGGGGCGC TGTACGAGCG CACCCACACC CGCGATCTGC GCCAGCTCGG TGGCCTGCAG CAGATCGCGC CGCGCTTCGC TGCGGCCTTC GCGCTGTTCT GCTCGGCGCT GCTGGCGCTG CCCGGCACGG CGAATTTCGT CGGCGAGGCG CTGGTGGTCG TCGGCATCTT CCAGGTCAAT TGGGTGTTCG CACTGCTGGC GCTGTCGACG CTGGTGGTCT CGGTGATCTA CGCGACGCGG CTGTTGAAGG GGCTGGTGTT CGGTCAGCCG CGGCTCGGCG CGCCGGTCGC CGACCTGACC TGGCGTGAAT ATGGGCCGGT GCTGGCGATG GGCCTCGCCA CGCTGGTAAT CGGCCTGTAT CCGCAGAGCC TGCTGGCGCT GCTGGAGCCC GCGATCAAGG CCGCTCTGGT GCTGCCGCAG CCATGA
|
Protein sequence | MALIALILLP AIGGFVAFFI AGRGERERWV TIATFALMLV LLMAIVAGAP EGRWYAKVSY PWVPSFGISL DFAMDGLSAA LIAISAGLGI ISVMASWSEI RTQSGLFHAC LCWTVAATIG VFLSFDLLIF AFFWEAMLVP AFTLIAVWGH GDREGAALKF LIFNAVAGFG LLAAAFALAS MADHMTFSAF ELAEMKLSTS VQVWMLLGFS LAFLVKLSVP PFHAWLPEAH TLAPTAGSIL LAGILLKTGA YGLFRFAPML FPQGLEAVAP YGIALGAAGA LYGGLVACGQ NDAKRLVAYT SIAHMSIVLM GIAAGVHYAV AGAAVEMIAH SFSASALFLL IGALYERTHT RDLRQLGGLQ QIAPRFAAAF ALFCSALLAL PGTANFVGEA LVVVGIFQVN WVFALLALST LVVSVIYATR LLKGLVFGQP RLGAPVADLT WREYGPVLAM GLATLVIGLY PQSLLALLEP AIKAALVLPQ P
|
| |