Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_2579 |
Symbol | |
ID | 3910371 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | + |
Start bp | 2959313 |
End bp | 2960638 |
Gene Length | 1326 bp |
Protein Length | 441 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 637884479 |
Product | NADH dehydrogenase I subunit F |
Protein accession | YP_486193 |
Protein GI | 86749697 |
COG category | [C] Energy production and conversion |
COG ID | [COG1894] NADH:ubiquinone oxidoreductase, NADH-binding (51 kD) subunit |
TIGRFAM ID | [TIGR01959] NADH-quinone oxidoreductase, F subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.445028 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.139963 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTGGACG ACAAGGACCG CATCTTCCGG AACCTGTACG GGCTCGACGA TTGGGGCCTC AAGGGCGCGC GCCGTCGCGG GCAGTGGGAA GGCACCAAGG CGATCATCGA CAAGGGTCGC GACTGGATCA TCAGCGAGAT GAAGGCGTCC GGCCTGCGCG GCCGCGGCGG CGCCGGCTTT CCGACCGGCC TGAAATGGTC GTTCATGCCG AAGGACAATG CCGACGGCAG GCCGAGCTAT CTCGTCGTCA ATGCCGACGA GTCCGAACCT GGTACCTGCA AAGACCGCGA GATCATGCGG CACGATCCGC ACACGCTGGT CGAGGGCTGC CTGATTGCCG GCTGCGCGAT GGGTGCGCAT GTCGGCTACA TCTACGTCCG CGGCGAATTC ATCCGCGAGC GCGAGCATCT GCAAGCCGCG ATCGATCAGG CCTACGAGGC CAAGCTGATC GGCAAGGACA ACGTCCACGG TTATCCGTTC GACCTGTATG TCGCACATGG TGCCGGCGCT TATATCTGCG GTGAAGAAAC GGCCCTATTG GAAAGCCTCG AAGGCAAGAA GGGCCAGCCG CGGCTGAAGC CGCCGTTCCC GGCCAATGTC GGCCTGTACG GCTGCCCGAC CACGGTCAAC AACGTCGAGT CGATCGCGGT GGCGCCCGAC ATCCTGCGCC GCGGCGCGTC CTGGTTCGCC GGCATCGGCC GGGCGAACAA TGTCGGCACC AAGCTTTACG GCATCTCCGG CCACGTCAAC ACGCCCTGCG TCGTCGAAGA GGCGATGAGC ATTCCGTTCC GCGAGCTGAT CGAGAAGCAC GGCGGCGGCA TCCGCGGCGG CTGGGACAAT CTGCTGGCGA TCATTCCCGG CGGCGCGTCG TGCCCGCTGA TTCCGGCGGC GGATTGCGAA GAACTGATCA TGGATTTCGA CGGCACCCGC GCGGTGAAGT CGAGCTTCGG CACCGCCGGC GTCATCGTGA TGGACAAGTC CACCGACGTC GTCGCCGCGA TCGCCCGCAT CAGCTACTTC TTCAAACACG AGAGCTGCGG CCAGTGCACG CCGTGCCGCG AAGGCACCGG CTGGATGTGG CGCGTGCTCG ACCGCATGGT GCACGGCCGC GCCCACAAGC GCGAGATCGA CATGCTGCTG GAAGTCACCA AGCAGGTCGA AGGCCACACC ATCTGCGCGC TCGGCGACGC CGCGGCCTGG CCGATCCAGG GCCTGATCCG CGCCTTCCGG CCCGAGATCG AGCGCCGCAT CGACGATTTC TCGCGCAAGG CCACGCTCGA CGATCAGGGC GTGCTCGATC CGGCGCATAT GGTGGCGGCG GAGTAA
|
Protein sequence | MLDDKDRIFR NLYGLDDWGL KGARRRGQWE GTKAIIDKGR DWIISEMKAS GLRGRGGAGF PTGLKWSFMP KDNADGRPSY LVVNADESEP GTCKDREIMR HDPHTLVEGC LIAGCAMGAH VGYIYVRGEF IREREHLQAA IDQAYEAKLI GKDNVHGYPF DLYVAHGAGA YICGEETALL ESLEGKKGQP RLKPPFPANV GLYGCPTTVN NVESIAVAPD ILRRGASWFA GIGRANNVGT KLYGISGHVN TPCVVEEAMS IPFRELIEKH GGGIRGGWDN LLAIIPGGAS CPLIPAADCE ELIMDFDGTR AVKSSFGTAG VIVMDKSTDV VAAIARISYF FKHESCGQCT PCREGTGWMW RVLDRMVHGR AHKREIDMLL EVTKQVEGHT ICALGDAAAW PIQGLIRAFR PEIERRIDDF SRKATLDDQG VLDPAHMVAA E
|
| |