Gene RPC_2404 details


Gene Information

Locus tag: RPC_2404
Symbol:
ID: 3971488
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB18
Kingdom: Bacteria
Replicon accession: NC_007925
Strand:
Start bp: 2606317
End bp: 2607507
Gene Length: 1191 bp
Protein Length: 396 aa
Translation table: 11
GC content: 64%
IMG OID: 637925513
Product: NADH dehydrogenase subunit D
Protein accession: YP_532275
Protein GI: 90423905
COG category: [C] Energy production and conversion
COG ID: [COG0649] NADH:ubiquinone oxidoreductase 49 kD subunit 7
TIGRFAM ID: [TIGR01962] NADH dehydrogenase I, D subunit
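
The length fields in this record are mutually consistent, and the arithmetic can be sketched as follows. This is a minimal illustration, not the database's own method. It assumes the start and end coordinates are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by a CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(2606317, 2607507)
print(length)                     # 1191
print(protein_length_aa(length))  # 396
```

Both results match the Gene Length and Protein Length values listed above.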


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00163945
Fosmid Hitchhiker: No
Fosmid clonability: decreased coverage
 

Sequence

Gene sequence
ATGCCTGAAG GCGCGCTTCG CAACTTCACC ATCAATTTCG GACCGCAGCA TCCGGCGGCG 
CATGGCGTGC TGCGGCTGGT GCTGGAGCTC GACGGCGAGA TCGTCGAGCG GGTCGATCCG
CATATCGGGC TGTTGCATCG CGGCACCGAG AAGCTGATCG AGGCCAAGAC CTATCTGCAG
GCGATCCCGT ATTTCGATCG GCTCGATTAC GTCGCGCCGA TGAATCAGGA GCACGCCTTC
TGCCTCGCCG CCGAGAAGCT GTTGGACATC GCGGTGCCGC GCCGCGCCCA ATTGATCCGG
GTGCTGTATT GCGAGATCGG CCGCATCCTG TCGCATCTGC TCAACGTCAC CACGCAGGCG
ATGGACGTCG GCGCGCTGAC CCCGCCGCTG TGGGGCTTTG AAGAGCGCGA AAAGCTGATG
ATGTTTTACG AGCGCGCCTC CGGCAGCCGG ATGCACGCGG CGTATTTCCG CGTCGGCGGC
GTGCACCAGG ACCTGCCGCC GAAGCTGGTC GACGACATCG AGGCGTGGTG CGTCGCGTTT
CCGCAAGTCA TCGACGATCT CGATCGGCTG CTCACCGGCA ACCGGATCTT CAAGCAGCGC
AACGTCGATA TCGGCGTGGT GACGCTGGCG CAGGCCTGGG AGTGGGGCTT TTCCGGCGTC
ATGGTGCGCG GCTCCGGCGC CGCCTGGGAT TTGCGCAAGT CGCAGCCCTA TGAGTGCTAC
GCCGAGCTGG AATTCGACAT TCCGATCGGC AAGAACGGCG ACTGCTACGA CCGTTATTGC
ATCCGCATGG AGGAGATGCG GCAGTCGGTG CGGATCATGC AGCAGTGCAT CGCCAAGCTG
CGCGCGCCGG ACGGCGGCGG CCCGGTCGCG GTCCAGGACA ACAAGATTTT CCCGCCGCGT
CGCGGCGAGA TGAAGCGCTC GATGGAATCG CTGATCCATC ACTTCAAGCT TTATACCGAG
GGCTTCCGCG TGCCCGCCGG CGAAGTCTAC GTCGCGGTCG AGGCGCCGAA AGGCGAATTC
GGCGTGTTCC TGGTCTCCGA CGGTAGCAAC AAACCCTATA AGTGCAAGAT CCGCGCGCCG
GGCTTCGCGC ATCTGCAGGC GATGGACTTT ATCTCGCGCG GCCATCTGTT GGCCGACGTC
TCGGCGATCC TGGGCTCGCT CGACATCGTG TTCGGCGAGG TCGATCGGTG A
 
Protein sequence
MPEGALRNFT INFGPQHPAA HGVLRLVLEL DGEIVERVDP HIGLLHRGTE KLIEAKTYLQ 
AIPYFDRLDY VAPMNQEHAF CLAAEKLLDI AVPRRAQLIR VLYCEIGRIL SHLLNVTTQA
MDVGALTPPL WGFEEREKLM MFYERASGSR MHAAYFRVGG VHQDLPPKLV DDIEAWCVAF
PQVIDDLDRL LTGNRIFKQR NVDIGVVTLA QAWEWGFSGV MVRGSGAAWD LRKSQPYECY
AELEFDIPIG KNGDCYDRYC IRMEEMRQSV RIMQQCIAKL RAPDGGGPVA VQDNKIFPPR
RGEMKRSMES LIHHFKLYTE GFRVPAGEVY VAVEAPKGEF GVFLVSDGSN KPYKCKIRAP
GFAHLQAMDF ISRGHLLADV SAILGSLDIV FGEVDR
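
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any length or composition check. The sketch below shows one way to do that and to compute a GC fraction. The helper names are hypothetical, and whether the record's GC content value was derived in exactly this way is an assumption. Only the first line of the gene sequence is used here; passing all lines under "Gene sequence" would give the whole-gene figure.

```python
def clean_sequence(block: str) -> str:
    """Join a display-formatted sequence: drop all whitespace, upper-case."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence, copied as displayed in the record.
first_line = "ATGCCTGAAG GCGCGCTTCG CAACTTCACC ATCAATTTCG GACCGCAGCA TCCGGCGGCG "
seq = clean_sequence(first_line)

print(len(seq))                    # 60
print(f"{gc_fraction(seq):.0%}")   # GC percentage of this line only
```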