Gene RPB_2195 details


Gene Information

Locus tag: RPB_2195
Symbol:
ID: 3907935
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 2488020
End bp: 2489195
Gene Length: 1176 bp
Protein Length: 391 aa
Translation table: 11
GC content: 49%
IMG OID: 637884088
Product: hypothetical protein
Protein accession: YP_485811
Protein GI: 86749315
COG category:
COG ID:
TIGRFAM ID:
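
The lengths reported above can be cross-checked against the coordinates. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon. The function names are hypothetical and are not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Number of amino acids encoded by a CDS that includes one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # the stop codon encodes no residue


length = gene_length_bp(2488020, 2489195)
print(length)                     # 1176
print(protein_length_aa(length))  # 391
```

Both values agree with the Gene Length and Protein Length fields of this record.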


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.175958
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0733507
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGAATG ATAATGATGA CTGCGAAGAG CAGCTCACTT TTTTTACGAC AGAGAAACCG 
CACCGTACGC AACGAGAGAT CGAAGCCGCA GAGAGCCAAA TTGTCGCCCA GTTGCGAGAC
GTACGATACG TAGTTCGCGA ATATCCGATC GAAGTCGTAG TTCAAATGTA CCTAAGCGGT
AGAAGCGAAG ATCGCAACGA AATATATGTG CCAGACTATC AACGAGACCT AATCTGGTCG
GAGAGACACA AATCTCGCTT CGTCGAGTCC CTTTTGATTG GACTCCCGAT CCCTTTTCTT
TTCGTTGCAG ACGTCGGGGA CGAAGAAGAT CCAGACAAAG CTGGCAGACT AGAAATCGTT
GACGGCGTCC AACGCATTCG AACCCTTGCG GAATTCCTAA CCGGCAGGCT AACACTAAGT
TGTCTTGATC GCCTTGATCG TCTGAACGGT TTTCGTTTCA ATGATCTTCC AATTTCCAGA
CAGCGACGCT TCCGCAGAGC TACACTCAGA TTGATTGAAC TGACAGAGGC CGTTACCGAA
GACGTGCGTC GTGAGATGTT CGACCGGATC AATAGCGGAT CAGTCAACCT AAAAGCGGTC
GAAGTCAGAA GGGGGATGCA ACGCGGCCCA TTTCTTGATC TTGTTACCGA ACTCGCGGCA
GCCCCCCTCC TACACCAACT AGCGCCAATT TCGGACGGAC TTCGAAAGCG ATTTGAGTAT
GAAGAATTAG TCACTCGCTT TTTCGCATTT CTTTACCGTT ACGAAGACTA CGGGAAAGGA
GGAAAGGTCG TCTCCGAATT CTTGCTCAAC TATGTACGCG ACACGAACAA GAAATTAAGC
TCTCCAGAAG GCGACAGAAT TGCTGAAGAA ATGAAGCGCC AATGGCACGA AATGCTAGAG
GTAGTACGAG GCTATTTCCC CGACGGCTTC AAGAAGCGCG GCCCCGGCCG CAAGGTTCCT
CGGGTACGTT TCGAGGCAAT CGCAGTGGGC ATCGGATTAG CGATAAGAGC ACTTAAGGAC
GACGGAGACA GCTTTAAGAT TCTCCACATA GACAACATTG ATGAATGGCT TGAAAGTGAC
GAATTCAAAG AATGGACCAC CAGTGACGCC TCGAACAACA AATCGAATCT TGTAGGACGC
CTCGAATTTG TCCGTAACAA AATGCTAGGC CGATGA
 
Protein sequence
MANDNDDCEE QLTFFTTEKP HRTQREIEAA ESQIVAQLRD VRYVVREYPI EVVVQMYLSG 
RSEDRNEIYV PDYQRDLIWS ERHKSRFVES LLIGLPIPFL FVADVGDEED PDKAGRLEIV
DGVQRIRTLA EFLTGRLTLS CLDRLDRLNG FRFNDLPISR QRRFRRATLR LIELTEAVTE
DVRREMFDRI NSGSVNLKAV EVRRGMQRGP FLDLVTELAA APLLHQLAPI SDGLRKRFEY
EELVTRFFAF LYRYEDYGKG GKVVSEFLLN YVRDTNKKLS SPEGDRIAEE MKRQWHEMLE
VVRGYFPDGF KKRGPGRKVP RVRFEAIAVG IGLAIRALKD DGDSFKILHI DNIDEWLESD
EFKEWTTSDA SNNKSNLVGR LEFVRNKMLG R
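
The sequences above are printed in blocks of ten characters separated by spaces. The following is a minimal sketch of how such blocks might be joined and a GC fraction computed, assuming the input contains only A, C, G and T. The helper names are hypothetical, and the example uses only the first line of the gene sequence rather than the full record.

```python
def join_blocks(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("sequence is empty")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence, copied as printed (six blocks of ten bases).
first_line = "ATGGCGAATG ATAATGATGA CTGCGAAGAG CAGCTCACTT TTTTTACGAC AGAGAAACCG"
seq = join_blocks(first_line)
print(len(seq))  # 60
print(f"GC fraction of this line: {gc_fraction(seq):.2f}")
```

Applying the same two helpers to all lines of the gene sequence would give a value to compare with the GC content field.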