Gene RPB_2133 details


Gene Information

Locus tag: RPB_2133
Symbol:
ID: 3908548
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 2425169
End bp: 2426359
Gene Length: 1191 bp
Protein Length: 396 aa
Translation table: 11
GC content: 71%
IMG OID: 637884027
Product: hypothetical protein
Protein accession: YP_485750
Protein GI: 86749254
COG category:
COG ID:
TIGRFAM ID:
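
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions and that the gene length includes one terminal stop codon; both are assumptions rather than statements from the record, and the variable names are illustrative.

```python
# Values copied from the record above.
start_bp = 2425169
end_bp = 2426359

# Assumption: coordinates are 1-based and inclusive, so the
# span covers (end - start + 1) bases.
gene_length = end_bp - start_bp + 1

# Assumption: the coding sequence ends with a single stop codon,
# which is counted in the gene length but not in the protein.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints: 1191 396
```

Under these assumptions the computed values agree with the reported Gene Length (1191 bp) and Protein Length (396 aa).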


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.695891
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGTCTGA AATTGGGCAT GGCCAGGAGT TATATGGCCA GGACTAATAT GGCCAAGGCG 
GGCATCGCAC TCTGCGCGGT CGCGCTGGTG CTGATGTGGC CGCATGCCCG ACAGAGCGCG
GCGTTTCTCG CCGCACAGGA CGATCCGGCC CGGCTGTCGG ACCTGCAGAT CGGGGCCGCC
TTGCAGCGCG ACCCGGACCT GATCACGCGC CATATCACCG ATGCGCTGGA CGCCCGCGAT
CCGGATCTCG CCGACAGCCT GGTGCAACTT GCCGCTGCAC GAAATATCAC GCTTCCCGTC
GAACTCACCA CGCGCGTCGC CGCGGCGGTG GCCGCTGAGC AATCGGCCGC CGGCATCGCC
ACCCGCTTCG CGACCGGCCT CGTCACCGGC GAGGCGAAGG ACGGCGCCAG CCTGTCCGGC
ACGGTGGCGG GCGATCTGTT CGTGTTCGGC GACATCCGCG ACGTGGTCCG CGAGGGCACC
AATCTGGCGA CGGGCGCCGA CGCCGACCGG GTCGTGCTCG GGCTCGCGGC CGCCGGGATC
GCCATCACCG CGGCGACCTA TGTCACGCTG GGCGGCGCGG CGCCGGTGCG CGCCGGGCTG
ACGCTGGTCA AGGATGCGCG CAAGGTCGGC CGGCTCGGCG GGGGGCTGGC GACATGGACC
AGCCGCTCGG CGCGCGAGGT GGTCGATGCG CCGGCGCTGC AGCGCGCGGT TGCGGGCTCC
TCGTTCAGCC GCCCGGCAGA GACGCTGACT GCGGTCAAGG CGGCATTCCG CGCCGAGAAG
GCCGGCGGGC TGATGCGGCT CGCCAAGAAT GTCGGCCGCA TCGGCGACAA GGCCGGCACC
CGCGGCGCGC TCGATACGTT GAAGATCGCC GAAGGTCCGA AGGATGTCGC CCGCGCGGCG
CGACTCGCCG AGGCCAAAGG CGGCCAGACC CGCGCCTTCC TCAAGGTCCT CGGCCGCGGC
GCGCTGCTGC TCACCACCGG CGCATGGAAT TTCGCCTGGT GGATCTTCGG CGCGCTGATG
ACGCTGTTCG GTCTCGTCAC CTCGCTCAAG GCCGGCGTCG AGCGGATGAC GCAAGGCTGG
ATCGATCGCG GCAAGGCGCG GCGCGCGAAG CGGCTGCTGG CCGAGGCGAA GCGGGCGCAA
CGCGCGCAAG TCAATCCGTC TCCGGTCGCA GCGGCCCTGT CGGTTTCGTA G
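
A GC content figure like the one reported for this gene can be recomputed from the sequence text. The following is a minimal sketch, not the method used by the source database: the function name `gc_content` is hypothetical, and it assumes the sequence contains only A, C, G and T plus the spaces and line breaks used for layout here.

```python
def gc_content(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence."""
    # Remove the spaces and line breaks used to lay out the sequence.
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)

# Toy input, not the full gene: 5 of its 10 bases are G or C.
print(gc_content("ATGCG TCTGA"))  # prints: 50.0
```

To compare against the reported value, pass the complete 1191 bp gene sequence as a single string; the blocks of ten bases can be left as they are because whitespace is stripped.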
 
Protein sequence
MRLKLGMARS YMARTNMAKA GIALCAVALV LMWPHARQSA AFLAAQDDPA RLSDLQIGAA 
LQRDPDLITR HITDALDARD PDLADSLVQL AAARNITLPV ELTTRVAAAV AAEQSAAGIA
TRFATGLVTG EAKDGASLSG TVAGDLFVFG DIRDVVREGT NLATGADADR VVLGLAAAGI
AITAATYVTL GGAAPVRAGL TLVKDARKVG RLGGGLATWT SRSAREVVDA PALQRAVAGS
SFSRPAETLT AVKAAFRAEK AGGLMRLAKN VGRIGDKAGT RGALDTLKIA EGPKDVARAA
RLAEAKGGQT RAFLKVLGRG ALLLTTGAWN FAWWIFGALM TLFGLVTSLK AGVERMTQGW
IDRGKARRAK RLLAEAKRAQ RAQVNPSPVA AALSVS
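
The protein sequence can be related back to the gene sequence by translating it codon by codon. The sketch below is an illustration only: `translate` and `CODON_TABLE` are hypothetical names, and it relies on the fact that translation table 11 assigns the same amino acids to internal codons as the standard genetic code (the tables differ in which codons may act as start codons, which does not matter here because the gene begins with ATG). Ambiguous bases such as N are not handled.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons, listed in TCAG order (TTT, TTC, TTA, ...).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    bases = "".join(seq.split()).upper()  # drop layout whitespace
    protein = []
    # Ignore any trailing bases that do not form a full codon.
    for i in range(0, len(bases) - len(bases) % 3, 3):
        aa = CODON_TABLE[bases[i:i + 3]]
        if aa == "*":  # stop codon
            break
        protein.append(aa)
    return "".join(protein)

# First row of the gene sequence shown above (60 bases, 20 codons).
first_row = "ATGCGTCTGA AATTGGGCAT GGCCAGGAGT TATATGGCCA GGACTAATAT GGCCAAGGCG"
print(translate(first_row))  # prints: MRLKLGMARSYMARTNMAKA
```

The output matches the first 20 residues of the protein sequence listed above. Passing the whole gene sequence to the same function gives a translation that can be compared with the full 396 aa protein.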