Gene RPB_2074 details

Gene Information

Locus tag: RPB_2074
Symbol:
ID: 3909889
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 2355890
End bp: 2357686
Gene Length: 1797 bp
Protein Length: 598 aa
Translation table: 11
GC content: 70%
IMG OID: 637883966
Product: radical SAM binding protein
Protein accession: YP_485691
Protein GI: 86749195
COG category: [C] Energy production and conversion
COG ID: [COG1032] Fe-S oxidoreductase
TIGRFAM ID:
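
The start, end, and length fields above are related by simple arithmetic. The following is a minimal sketch of that check, assuming the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon (both assumptions are consistent with the values in this record). The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(2355890, 2357686)
print(length, protein_length(length))  # 1797 598
```

Under these assumptions the computed values agree with the Gene Length (1797 bp) and Protein Length (598 aa) fields.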


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0385557
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCGGCG CCGCGCGCGC GCCGCTGCGG GTGGCGCTGG TCGGGCCGCG CGAACAGCCC 
TTCGCCGGCG ATCCCGATCG CGAGCATCGC GAGACGATGC TGCGCAGCTA CGCGGAGATC
TGCGACAGCG TCGCCACCTT CGGCTCGGAC TTCACGCTGT CGCGCGAATT TCTCGGCATC
GAATATCTCG CGGCGACGCT GCGGCGCGAC GGCCGCATCG TCCGCGTGCT GTCCGCCGCC
AATGAGGGGC TGGACGACGA TGCGCTGCTC GCCGAACTAC TGGCCTTCGC GCCGCGCATC
GTCGGCCTCT CGGTGCTGTA CGATCTGCAA CTCGGCAATG CGCTGGTGCT GGCGCGGCGG
CTGAAGGCGG CGCGGCCCGG TCTTGCCATC GTGTTGGGCG GTCCGCTCGC CACCGCGCTG
TCGCAGGAAC TGCTCGGCAC CTTCGCCTTC GTGGACTACG TCGTCGAAGG CGAGGGCGAG
GCGGCGTTGA GCCGGCTCGC TGATGCGATC GAACGCGGCG AGGCGCCGAG CGACGTGCCG
GCGCTGGCGC ATCGCGGCCC GGGCGGCATC GTCCGCAATC CGCGCGGCGC GCCGCTCGAT
CTCGACCGCC TGCCGCATCC GGCGCGCGAC GGCCTCGCGT CGATCCGCGC CCGCGGCCTG
CCGGCGCCGA GCGCCTATCT CACCACCTCG CGCGGCTGCA AGGCGTTCTG CACCTTCTGC
ACTGTGCCGG GCAGCGTGCG GAGCCTGAAG AGCGGCGTCT ACCGGATGCG CGATCCGGTC
GACGTGGTCG ACGAGATCGA AGAGTTGGTG CGCGATCACG GCGTCAGCCG CTTCTACATG
GCCGACGACA ATTTCCTCGG CTATGGCGAG GACAGCAACG CGCGGATGCA TCGCTTCGCC
GACGAGATCC TGCGCCGCCG GCTCGCGATC CATTTCCACG CCGAATGCCG CGTCGACTCG
CTGATCCCGG AGACTCTGGT CAGACTGCGC GCCGCCGGCT TCGACCAGAT CCTGTTCGGC
CTGGAATCCG GCTCGGCGCG GACGCTGAAG CGCTGGGCCA AAGGCCAGAC GGTGGCGCAG
AACGAGGCCG CGATCGCGCT GGCGCGGCGG TTGCGCATCG AGATGATGCC GAGCCTGATC
CTGCTCGACT GGGAGTCCGA CCTCTCCGAG ATCGAAGAGA CGATCGGCTT CATCGAGCGC
AACCAATTGT GGCGCAGCGG CCAGCCGCTG TGGCTGGTCA ACAAGCTCAA GGTCCATTGC
GGCACCGCCG CCGCGCGCCG CTACGACAGC GTGCACGGCC GGCCGACGCC GCCCGCGGTC
GGCTATTCCG ACGCCGATAT TCATCGTTGG TGCGAGACCG TGACCTATCA GCACGTCGGC
ATCGACGATG TCTATGTCGC GGCGTTCTGG CGCGCGCTCA ACGCCGCCGC CAATCGCTGG
TCGGTGCTGA TCGACGAGGT GCTGCCGCCG TTTCTGAAGA GCCTGCGCAG CGAGGCGCGC
CGCGGCGACC GGACCGATCG CCTCGAACTG GTGCGCCGGC TCGCCGCGTT CCGCCGCTCG
ATCGGGGCGT CGCTCGCCGC GCTGATGCGG CTGCTGATCG ATCAGGCGAT CGCGATGCAG
CAGGCGCGCG CGCCGCAGCC GGATCTGCGC GGGCTCGCGC TGGCTCATGT CGAGGCGCAG
GAGCGCCGCT TCTTTCCGGA GGGTCTGCAT GTGGCCTTGC AGGATACCGG CCGCCGCCGC
GCCGTTGCTG GTCATGCCAT CGGCGCGCGG CTGGGCGAGA TCGTTTCGAC CGCGTGA
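
A GC content figure such as the one reported for this gene is conventionally the fraction of G and C bases in the sequence. The sketch below shows one way to compute it, assuming the sequence is pasted as plain text with the space-separated 10-base blocks used above. The helper name `gc_fraction` is hypothetical, and the demonstration uses only the first block of the gene, so it does not by itself verify the whole-gene value.

```python
def gc_fraction(seq: str) -> float:
    """Fraction of G/C among the A, C, G, T characters in seq.

    Whitespace and any other characters are ignored, so the blocked
    layout used in this record can be pasted in directly.
    """
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotide characters found")
    return sum(b in "GC" for b in bases) / len(bases)


# First 10-base block of the gene sequence above (illustration only).
first_block = "ATGACCGGCG"
print(gc_fraction(first_block))  # 0.7
```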
 
Protein sequence
MTGAARAPLR VALVGPREQP FAGDPDREHR ETMLRSYAEI CDSVATFGSD FTLSREFLGI 
EYLAATLRRD GRIVRVLSAA NEGLDDDALL AELLAFAPRI VGLSVLYDLQ LGNALVLARR
LKAARPGLAI VLGGPLATAL SQELLGTFAF VDYVVEGEGE AALSRLADAI ERGEAPSDVP
ALAHRGPGGI VRNPRGAPLD LDRLPHPARD GLASIRARGL PAPSAYLTTS RGCKAFCTFC
TVPGSVRSLK SGVYRMRDPV DVVDEIEELV RDHGVSRFYM ADDNFLGYGE DSNARMHRFA
DEILRRRLAI HFHAECRVDS LIPETLVRLR AAGFDQILFG LESGSARTLK RWAKGQTVAQ
NEAAIALARR LRIEMMPSLI LLDWESDLSE IEETIGFIER NQLWRSGQPL WLVNKLKVHC
GTAAARRYDS VHGRPTPPAV GYSDADIHRW CETVTYQHVG IDDVYVAAFW RALNAAANRW
SVLIDEVLPP FLKSLRSEAR RGDRTDRLEL VRRLAAFRRS IGASLAALMR LLIDQAIAMQ
QARAPQPDLR GLALAHVEAQ ERRFFPEGLH VALQDTGRRR AVAGHAIGAR LGEIVSTA
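
The protein sequence can be related back to the gene sequence by codon-wise translation. The following is a minimal sketch, assuming the amino acid assignments of translation table 11 are those of the standard genetic code (table 11 differs from the standard table in its permitted start codons, which this sketch does not handle; the gene here begins with ATG). The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codons in TCAG order with the matching one-letter amino acids
# ('*' marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # remove the block spacing
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence above.
print(translate("ATGACCGGCG CCGCGCGCGC GCCGCTGCGG"))  # MTGAARAPLR
```

The output matches the first ten residues of the protein sequence listed above.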