Gene RPB_3046 details


Gene Information

Locus tag: RPB_3046
Symbol:
ID: 3910847
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 3470661
End bp: 3472307
Gene Length: 1647 bp
Protein Length: 548 aa
Translation table: 11
GC content: 56%
IMG OID: 637884954
Product: Phage integrase
Protein accession: YP_486659
Protein GI: 86750163
COG category:
COG ID:
TIGRFAM ID:
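
The start, end, gene length, and protein length fields above are related by simple arithmetic. The sketch below checks that relationship, assuming the coordinates are 1-based and inclusive and that the CDS is a single uninterrupted reading frame ending in one stop codon (consistent with "Is gene spliced: No", but an assumption all the same). The variable names are illustrative and not part of the record.

```python
# Coordinates copied from the record; assumed 1-based and inclusive.
start_bp = 3470661
end_bp = 3472307

# An inclusive interval spans (end - start + 1) bases.
gene_length_bp = end_bp - start_bp + 1

# Assumption: one continuous reading frame whose final codon is a stop,
# so the protein has one residue fewer than the number of codons.
codon_count, leftover_bases = divmod(gene_length_bp, 3)
protein_length_aa = codon_count - 1

print(gene_length_bp, leftover_bases, protein_length_aa)
# prints: 1647 0 548
```

The computed values agree with the "Gene Length" and "Protein Length" fields listed in the record.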


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTACCTAT GGAAACGCGG CACCGTTTAT TGGTTTCGCC GATCAATCCC GCAGGATCTG 
AAGGCTCACG TTGGCTCGAA TGACGCCGCC GCGAGTCTCC GGACCGGCGA CCGTAGGAAG
GCCCGGAGAC GGGCAATGAG GCTGGCAGTG ACGATGGATG AATGGTCGGA GGCGGTCCGG
CAAGCACAAT TGAATGATCT GGAACAACCC GACCGCGCCC TTTTGCTGAA AATGTTCGAC
GACGCGATGG GTCTCGTCGA AAGCCAGCAG GCGACCGACA GCACCCGGCA GCAGGCGGAC
GCAATCCAGG CAGCCATCAC CGAGTACAAG GCGACGGCCG GAACTCGGGA GCGTCTTTCG
GGGATCAATC AACGTATCCG TGTAATCGGC GACCGCGTAG CGGCCTTGAC CGCCGCACCG
GCGAAGGCTC CCTCAAGCGA GATTGCCGAC CTTCGTGCCG ACTTGATGAA CGAGATGGCG
AAGGTCCGGA ATGACGTTCA AAACGCGCAC AAGAATCAGT GGTCCGCCGA CCTCCTGTCG
ACCAAGATCG ACGATTACGT TCAGGCCAAG GCCAAAGAGC TTGGCGGCAC GAAGCATGTT
TCGACAATTG GGCCGCGGAT CAGGAATTTT CTGGAAACGA TCGGCGACAA GCCACTCCGC
GACTACACAC GCCAGGATTT TGAACGATAT CGCGATATCT TGGACCGCAC GCCGAAACGA
GCTTTCGACC GGTTCAAAAC CAACAATCTT GCCGATGCCG CCGAAAGGAA CGAAAAGCGC
GCACGGCCCT TTGAAGTTAT CGATGACAAA ACCGTCGACG ACGACTATCT GACACCCGTC
AAAACGTTCT TCATGCATCT TGTTCGGAAT CTCTGGATTC CCGCGACCCC TGCCATCGGC
ATCGTCTCCT CGCGGACACG CGAAAGCCGC AGGACCGACC GACCTGACGA AGGTCGGCGA
GCTTTCAAGC CCGATCCGAT CAATGCCTAT TTCCGGTACA TCGTCCAGAA ACGATCGCGA
GCGACAGAAG ACTTTTGGCT GCCGATCCTG GCTCTATATA CCGGTGCTCG CCTCAATGAA
CTCTGCCAGA TCGAGCCGCG CCGGATTATT CTACACAATG GCCGCTGGCA TATCGATTTG
CTGACGATTT ACGATCGAGA TGAGATCGAG CGCGCGGTGA AAGGAATGAA AAAAGAGGAA
AAGTCTGCTG CGAGATTGAA ACTGAAGACC GCATCGGCGC GCCGACAAAT TCCTATTCAC
GACGACCTGA TCAAGATCGG ATTTATTGAT TTTGTCGATC AGCGGAGAAA TCACCCCAAG
TACACGCGCC TGTTTCCCAA CCTGCGGCCG GATCAATACG GATATTTTTC ATCCGCCGTC
GGGAAACGGC TAAACCGCGA CATCAAGAGC GCTGGCGCGA AAACCGACGA TACCTCGTTC
TACAGTCTCC GCCACAATTT CGCGGCTGCA TTAGAGCGGG CGCTTGTGCC CCATCGCACG
AAAGACAGGA TCATGGGCCA CCTCGTCGTC GGAGCCCAAG GTCACTACAC CGATCCCGAA
CTTGAGGATG TCGAAACCGC AGTCATCGAG CGCGTCAGCT TTCCGGGTGT CGACATCGCT
CCATATCTTT CCCAAAAACG CTTGTAA
 
Protein sequence
MYLWKRGTVY WFRRSIPQDL KAHVGSNDAA ASLRTGDRRK ARRRAMRLAV TMDEWSEAVR 
QAQLNDLEQP DRALLLKMFD DAMGLVESQQ ATDSTRQQAD AIQAAITEYK ATAGTRERLS
GINQRIRVIG DRVAALTAAP AKAPSSEIAD LRADLMNEMA KVRNDVQNAH KNQWSADLLS
TKIDDYVQAK AKELGGTKHV STIGPRIRNF LETIGDKPLR DYTRQDFERY RDILDRTPKR
AFDRFKTNNL ADAAERNEKR ARPFEVIDDK TVDDDYLTPV KTFFMHLVRN LWIPATPAIG
IVSSRTRESR RTDRPDEGRR AFKPDPINAY FRYIVQKRSR ATEDFWLPIL ALYTGARLNE
LCQIEPRRII LHNGRWHIDL LTIYDRDEIE RAVKGMKKEE KSAARLKLKT ASARRQIPIH
DDLIKIGFID FVDQRRNHPK YTRLFPNLRP DQYGYFSSAV GKRLNRDIKS AGAKTDDTSF
YSLRHNFAAA LERALVPHRT KDRIMGHLVV GAQGHYTDPE LEDVETAVIE RVSFPGVDIA
PYLSQKRL
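
The gene and protein sequences above are printed in space-separated blocks of 10, so they have to be stripped of whitespace before any computation. The following is a minimal sketch, not the database's own pipeline, of how the protein sequence can be re-derived from the gene sequence and how a GC fraction can be computed. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical. The codon table is the standard one, whose codon-to-amino-acid assignments translation table 11 shares; table 11 differs in which codons may act as start codons, which does not matter here because this gene begins with ATG.

```python
# Standard codon table in the conventional T, C, A, G ordering.
# Translation table 11 uses these same codon assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used for display; upper-case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate in frame from the first base, stopping at the first stop codon.

    Raises KeyError on codons containing anything other than A, C, G, T.
    """
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases and the last line of the gene sequence, copied from above.
# Each full sequence line holds 60 bases, so the last line starts in frame.
first_bases = "ATGTACCTAT GGAAACGCGG CACCGTTTAT"
last_line = "CCATATCTTT CCCAAAAACG CTTGTAA"

print(translate(first_bases))  # prints: MYLWKRGTVY
print(translate(last_line))    # prints: PYLSQKRL
```

Both fragments reproduce the corresponding ends of the protein sequence listed above. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the "GC content" field.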