Gene RPD_3756 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_3756 
Symbol 
ID4024272 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp4193196 
End bp4194194 
Gene Length999 bp 
Protein Length332 aa 
Translation table11 
GC content62% 
IMG OID637963960 
Productchlorophyllide reductase iron protein subunit X 
Protein accessionYP_570878 
Protein GI91978219 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1348] Nitrogenase subunit NifH (ATPase) 
TIGRFAM ID[TIGR02016] chlorophyllide reductase iron protein subunit X 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00499393 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAACGTCG TTCCGACGAT CAACCTGCAA GACGCGCAAC TCCGGGCCGA GGCGTCGATC 
GAGCCCGACG CCCCGGTGAC GACTCCTGTC ACCAAGGAAA CCCAGATTAT CGCGATCTAC
GGCAAGGGTG GCATCGGCAA GAGCTTCACG CTCGCCAACC TGTCCTACAT GATGGCGCAA
CAGGGCAAGA AGGTGTTGCT GATCGGCTGC GATCCGAAGA GCGACACCAC GTCTCTGCTG
TTCGGTGGCA AGGCCTGTCC GACCATTATC GAGACCTCGT CGAAGAAAAA GCTCGCCGGC
GAGGAAGTGA AGATCGGCGA CGTCTGCTTC AAGCGCGACG GCGTGTTCGC GATGGAGCTC
GGCGGCCCTG AAGTCGGTCG CGGCTGCGGC GGTCGCGGCA TCATCCACGG TTTCGAACTG
CTCGAGAAGC TCGGCTTCCA CGAGTGGGGC TTCGACTACG TGCTGCTCGA TTTCCTCGGC
GACGTGGTAT GCGGCGGCTT CGGTCTGCCG ATCGCGCGCG ACATGTGTCA GAAGGTGATC
GTGGTCGCAT CCAACGACTT GCAGTCGTTG TATGTCGCCA ACAACGTCTG CTCCGCGGTC
GAGTATTTCC GCAAGCTCGG CGGCAATGTC GGCGTCGCCG GTATGGTGAT CAACAAGGAC
GACGGCACCG GCGAGGCGCA GGCCTTCGCC ACTGCGGTGG GCATTCCGGT TCTTTCGGCA
ATTCCGGCCG ACGACGACAT CCGCAAGAAG AGCGCCAACT ACGAGATCAT CGGCAAGCCC
GATGGCGAAT GGGGGTCGCT GTTCGAGACC CTGGCGGCGA ATGTCGCGAC CGCGCCGCCA
GTTCGTCCCA ATCCGCTTAC GCAGGACGGT CTGCTCGGTC TGTTCACGAG CGACATCACC
GGGCGTGACG TCGTGCTGCT ACCGGCCACG ATCGAAGACA TGTGCGGAGC CTCGGTGCTG
AACAAGCCGT CGCTCGAAGT CATCTACGAC GCGGTTTGA
 
Protein sequence
MNVVPTINLQ DAQLRAEASI EPDAPVTTPV TKETQIIAIY GKGGIGKSFT LANLSYMMAQ 
QGKKVLLIGC DPKSDTTSLL FGGKACPTII ETSSKKKLAG EEVKIGDVCF KRDGVFAMEL
GGPEVGRGCG GRGIIHGFEL LEKLGFHEWG FDYVLLDFLG DVVCGGFGLP IARDMCQKVI
VVASNDLQSL YVANNVCSAV EYFRKLGGNV GVAGMVINKD DGTGEAQAFA TAVGIPVLSA
IPADDDIRKK SANYEIIGKP DGEWGSLFET LAANVATAPP VRPNPLTQDG LLGLFTSDIT
GRDVVLLPAT IEDMCGASVL NKPSLEVIYD AV