Gene RPB_4001 details

Gene Information

Locus tag: RPB_4001
Symbol:
ID: 3911808
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 4567690
End bp: 4568688
Gene Length: 999 bp
Protein Length: 332 aa
Translation table: 11
GC content: 64%
IMG OID: 637885905
Product: chlorophyllide reductase iron protein subunit X
Protein accession: YP_487605
Protein GI: 86751109
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG1348] Nitrogenase subunit NifH (ATPase)
TIGRFAM ID: [TIGR02016] chlorophyllide reductase iron protein subunit X
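
The coordinate and length fields in this record are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon. The function names are illustrative and not part of any database API.

```python
# Values copied from the record above.
START_BP = 4567690
END_BP = 4568688


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Amino acids encoded by a complete CDS, excluding the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 999
print(protein_length(length_bp))  # 332
```

Both results match the Gene Length and Protein Length fields listed above.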


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.18225
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00239808
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAACGTCG TTCCGCAGAT CAACCTGCAA GACGCGCAAC TCCGGGCCGA GGCGGCAATC 
GAGCCCGACG CGCCGCTGAC GACTCCCGTG ACCAAGGAAA CCCAGATCAT CGCGATCTAC
GGCAAGGGCG GTATCGGCAA GAGCTTCACG CTCTCCAACC TGTCCTACAT GATGGCGCAG
CAGGGCAAGA AAGTGCTGCT GATCGGCTGC GATCCGAAGA GCGATACGAC ATCGCTGCTG
TTCGGCGGCA AGGCCTGTCC GACCATCATC GAGACGTCTT CGAAGAAGAA GCTTGCCGGC
GAGGAAGTGC AGATCGGCGA CGTCTGCTTC AAGCGCGACG GCGTGTTCGC GATGGAGCTC
GGCGGCCCGG AAGTCGGCCG CGGTTGTGGC GGCCGTGGCA TCATTCACGG CTTCGAGACG
CTCGAAAAGC TCGGCTTCCA CGAATGGGGC TTCGACTACG TGCTGCTCGA TTTCCTCGGC
GACGTGGTGT GCGGCGGCTT CGGCCTGCCG ATCGCCCGCG ACATGTGCCA GAAGGTGATC
ATCGTCGGCT CCAACGATCT GCAGTCGCTG TACGTCGCCA ACAACGTCTG CTCCGCGGTT
GAATATTTCC GCAAGCTCGG CGGCAATGTC GGCGTCGCCG GTCTGGTGAT CAACAAAGAT
GACGGCACCG GCGAGGCGCA GGCCTTCGCC GAAGCGGCCG GCATTCCGGT GCTGGCGGCG
ATTCCCGCCG ATGACGACAT CCGCAGGAAG AGCGCCAATT ACGAAATCAT CGGCCTGCCG
GACGGGGAGT GGGGTCCGCT GTTCGCGGAG CTGGCCGCCA ACGTCGCCAC CGCGCCGCCG
GTACGTCCGA AGCCGCTCAC CCAGGACGGG CTGCTCGGCC TGTTCTCCAG TGACGTGACC
GGCCGCGATG TCGTGCTGCT GCCCGCCACC ATGGAAGACA TGTGCGGCGC CGCGGTGCTG
AACAAGCCGT CGCTCGAAGT GATCTACGAC GCGGTTTGA
 
Protein sequence
MNVVPQINLQ DAQLRAEAAI EPDAPLTTPV TKETQIIAIY GKGGIGKSFT LSNLSYMMAQ 
QGKKVLLIGC DPKSDTTSLL FGGKACPTII ETSSKKKLAG EEVQIGDVCF KRDGVFAMEL
GGPEVGRGCG GRGIIHGFET LEKLGFHEWG FDYVLLDFLG DVVCGGFGLP IARDMCQKVI
IVGSNDLQSL YVANNVCSAV EYFRKLGGNV GVAGLVINKD DGTGEAQAFA EAAGIPVLAA
IPADDDIRRK SANYEIIGLP DGEWGPLFAE LAANVATAPP VRPKPLTQDG LLGLFSSDVT
GRDVVLLPAT MEDMCGAAVL NKPSLEVIYD AV
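
The protein sequence follows from the gene sequence by codon-wise translation. The sketch below is a minimal illustration, not the pipeline that produced this record. It uses the standard amino acid assignments, which NCBI translation table 11 shares (the two tables differ in which codons may act as start codons). The helper name `translate` is hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (standard assignments,
# assumed here to apply to translation table 11 as well).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-mers
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(seq), 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases and last 9 bases of the gene sequence above.
print(translate("ATGAACGTCG TTCCGCAGAT CAACCTGCAA"))  # MNVVPQINLQ
print(translate("GCGGTTTGA"))                         # AV
```

The two outputs match the first ten and last two residues of the protein sequence above. Passing the full gene sequence to `translate` would be the natural way to check the remaining residues.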