Gene RPD_1330 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_1330 
Symbol 
ID4021807 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp1496911 
End bp1498197 
Gene Length1287 bp 
Protein Length428 aa 
Translation table11 
GC content69% 
IMG OID637961523 
ProductNADH dehydrogenase (quinone) 
Protein accessionYP_568469 
Protein GI91975810 
COG category[C] Energy production and conversion 
COG ID[COG1894] NADH:ubiquinone oxidoreductase, NADH-binding (51 kD) subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.220778 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGAGA AGCCGCTCAC CGGCCGCGCC AGGCCCGACC GTCAGCCGCA CAGCCTCGCC 
GCATGGCGCG GGCTCGGCGG TGGTGCTGCG TTGGCGCTGG CCTTGAAGCG CCACAGTCCC
GAGGCCGTCA TCGCGATGGT CGAGGCCGCG GGTCTGCGCG GCCGCGGCGG CGCGGGATTC
CCGACCGCGA ACAAATGGCG CTTCATGCGC GCCGGCGCGG ATAAAGCGGG GCCGGGCGCG
CGTTATCTCT GCGTCAATGG CGACGAGACC GAGCCGGGTT CGTTCAAGGA CCGGCTGCTG
ATGGAGGCGT TGCCGCATCA GCTGATCGAG GGCGCGACGA TCGCGGCCTA TGCGACCGGC
GCGACCGAAG TGATCATCCT CGTGCGCGAC GAGTATCGCG CCGCCGCGGC CGCGCTGACC
CGCGCGATCA CCGAGGCCGA ACAGGCCGGT TGGCTCGGAC GGGACATTCT CGGCTCCGGC
TTCGATCTGA CGATGCGCGT CCATACCTCG GCCGGCCGCT ACATCGTCGG CGAAGAGACC
GCGCTGATCG CCGCCCTCGA AGGCGAACGT CCGGTGCCGC GACATCGCCC GCCCTATCCG
GCGGTGAGCG GGCTTTGGGG ACGGCCGACC ACGGTCAACA ATGTCGAGAC GCTGTCGGCG
GTGCCTTCGA TCGTCGAGAA CGGCGCCGAT TGGTATCGTG GCCTGTCGCG GTCGGACGAA
GGCGGCACCA AGCTCTACGG GGTGTCCGGC TGCGTCGCGC GCCCGCAATT GATCGAAGCC
CCGATGGGCA CCACAGGGCG CGAGTTGATC GAGATTTGCG GGGGCATTCG CAACGGCCGC
GCGCTGTTCG CCTTCCAGCC CGGCGGCGGC GCGACTGCGT TTCTGGAGAC GGCCGAACTC
GACGTTCCGC TGGACTTCAC CCATACCAAG AAGGCCGGCA GTTCGCTCGG CACCGGCGCG
CTGATCGTGC TCGACGATCG CGCCTGTCCG GTCGCAGCGA TCGGCCGCCA CATGCGGTTC
TATGCGCGCG AGAGCTGCGG GCTTTGCACG CCGTGCCGCG ATGGCTTGCC GTGGGTCGCG
AAACTGCTCG ATACGCTGGA AGCCGGCCGT GGCGCGAAAG CCGATCTCGA TCTGCTGCAC
AGCCATGTCC AATTGTCCGC TCCCTCGGGG CGCAGCTATT GCGACCTCAA CACCGGGGCG
TTGACGCCGC TGAAGAGCGG ACTCGAACGC TTCGGCGACA TCTTCACGGC CCATCTGCAG
GGCTCCTGTC CGGTGGGCCG CGCATGA
 
Protein sequence
MSEKPLTGRA RPDRQPHSLA AWRGLGGGAA LALALKRHSP EAVIAMVEAA GLRGRGGAGF 
PTANKWRFMR AGADKAGPGA RYLCVNGDET EPGSFKDRLL MEALPHQLIE GATIAAYATG
ATEVIILVRD EYRAAAAALT RAITEAEQAG WLGRDILGSG FDLTMRVHTS AGRYIVGEET
ALIAALEGER PVPRHRPPYP AVSGLWGRPT TVNNVETLSA VPSIVENGAD WYRGLSRSDE
GGTKLYGVSG CVARPQLIEA PMGTTGRELI EICGGIRNGR ALFAFQPGGG ATAFLETAEL
DVPLDFTHTK KAGSSLGTGA LIVLDDRACP VAAIGRHMRF YARESCGLCT PCRDGLPWVA
KLLDTLEAGR GAKADLDLLH SHVQLSAPSG RSYCDLNTGA LTPLKSGLER FGDIFTAHLQ
GSCPVGRA