Gene RPC_4501 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPC_4501 
Symbol 
ID3972416 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB18 
KingdomBacteria 
Replicon accessionNC_007925 
Strand
Start bp5012528 
End bp5013613 
Gene Length1086 bp 
Protein Length361 aa 
Translation table11 
GC content64% 
IMG OID637927612 
ProductNADH dehydrogenase (ubiquinone) 
Protein accessionYP_534343 
Protein GI90425973 
COG category[C] Energy production and conversion 
COG ID[COG3261] Ni,Fe-hydrogenase III large subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.0565202 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAATCCT TCAACATGCC GGTCGGCCCG CTGCACGTCT CGCTCGAAGA GCCGATGTAT 
TTCCGCATCG ACGTCGAGGG CGAGAAGGTC GCGGGGCTTG AAATCACCGC AGGCCACGTG
CATCGCGGTA TCGAATATCT CACCGCCAAG CGCAACATCT ACCAGAACCT GGCGCTGATC
GAGCGGGTCT GCTCGCTGTG CTCGAACAGC CATCCGGAAG CCTATTGCAT GGCGCTGGAG
ACCATCGCCG GCATCGAGGT GCCGGAGCGC GCGCAGTATC TGCGGGTGTT CGCCGACGAG
ATCAAGCGCG TCGCCTCGCA CATGTTCAAC GTCGCGATCC TGGCGCATGT CGTCGGCTTC
GAATCGCTGT TCATGCACGT CATGGAAGCC CGCGAGATCA TGCAGGACAC CAAGGAGACC
GTGTTCGGCA ACCGCATGGA TCTTGCCGCC AACATCATCG GCGGGGTGAA ATACGATATC
GACGCCACGC AGTCGGCCTA CATCATCAGC CAGCTCGACC GGCTGGAGCC GCTGCTGTTG
AACGAGATCA TTCCGGTCTA CGAGACCAAT GCCACGATCC AGTCGCGCAC CCGCGGCATC
GGCCGGATCA GCCGCGAGCA CTGCATCGAA TACGGCCTGA TGGGCCCGGT GGCGCGCGGC
GCCGGGCACG GCTATGACGT ACGCACCGCG GCGCCCTACG CGGTCTATGA CCGGATGGAC
GTCGAAGTGA TCACCTATCC GGACGGCGAC GTCTGGTCGC GCGCCATGGT GCGGCTGAAG
GAGGTGGCGG CCTCGATCCG GCTGCTGCGG CAGTGCCTGC GCGATCTGCC GGATGGTGCG
ACCGACGCCG GCCCGCTGCC GTTCATTCCG GCCGGCGAGG CGGTGACCAA GGTCGAGGCG
CCGCGCGGCG AACTCGTCTA CTACGTCAAC ACCGACGGCA CCGACATTCC GGCGCGGGTG
AAATGGCGGG TGCCGAGCTA CATGAACTGG GACGTGCTGC ATCTGATGAT GGTCGGCGAG
GGGATCTCCG ACATTCCGTT GATCGTCAAC AGCATCGATC CCTGCATTTC ATGCACCGAG
CGTTGA
 
Protein sequence
MKSFNMPVGP LHVSLEEPMY FRIDVEGEKV AGLEITAGHV HRGIEYLTAK RNIYQNLALI 
ERVCSLCSNS HPEAYCMALE TIAGIEVPER AQYLRVFADE IKRVASHMFN VAILAHVVGF
ESLFMHVMEA REIMQDTKET VFGNRMDLAA NIIGGVKYDI DATQSAYIIS QLDRLEPLLL
NEIIPVYETN ATIQSRTRGI GRISREHCIE YGLMGPVARG AGHGYDVRTA APYAVYDRMD
VEVITYPDGD VWSRAMVRLK EVAASIRLLR QCLRDLPDGA TDAGPLPFIP AGEAVTKVEA
PRGELVYYVN TDGTDIPARV KWRVPSYMNW DVLHLMMVGE GISDIPLIVN SIDPCISCTE
R