Gene RPB_3306 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_3306 
Symbol 
ID3911107 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp3780879 
End bp3782231 
Gene Length1353 bp 
Protein Length450 aa 
Translation table11 
GC content59% 
IMG OID637885208 
ProductPhage integrase 
Protein accessionYP_486913 
Protein GI86750417 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.889733 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.962231 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCATCAC AACCCGCGCC CCTAACGCCA TTTGCGATCG CCAACGCCAA GCCCAAGCCG 
ACCCGGTATG AGATCAGCGA CGGCGGACAC GCCGGCCTGC GCGTTATCGT GCAGCCGTCT
GGCATCAAGT CGTTCGTGTT CCGATACAAA CGAGGGACGG GGGAAGCTGC GAAAAATGTG
CGGATTATGT TGGGTCGCGC AGCCGGTCCC GGCGCGCTCA CCTTGCGGCA GGCACGCGAA
GCGGCCGACT CCCACCGCCG GCTCAAGCTG ACGGGGGCGG ACCCTGCCGA CCAGCGCAGA
GTGGAACGGG CCGCAAATCT TGCGCGAATC AGGGCCGAAG AGATCGAAAA TCGCCGGAAA
GACGACACCG TTGCGCTCGT CTTGGAACGC TACTTCAAGA GTCATGTTAA CGGCCTTCTC
TCTGCTCGGG AGACGAAACG TATTTTGACG CGCGAACTAA GCGGTTGGGC ACGGCGGCGG
ATCGATCATG TTTCGCGTGC CGATGCCGTC AAATTACTTG AAGCAATCCA AGAGCGAGAC
AAACCCATCC TCGCGAACCG GACCCGCGCG CACGCAAGTA AGTTCTTCAA GTGGTGTATT
GAGAAGGGGT TGCTTGAGAT CAATCCATTC GAACACACCA CGCGGGCAGC CAAGGAAATC
GCTCGTGATC GCGTCTTGAG CGATGCCGAG CTGCGCATTT TGTTGCTTGC GAACGACCGC
CTTGAATGGC CATGGCGAGA GTACATTGCG GTTCTGCTCA TGCTCGGCCA GCGCCGCGAG
GAGGTTGCCG GGATGCGCTG GGACGCGCTG GACCTTGATT GTGCCGAACC GGTGTGGCTG
ATGGCCGCGT CGCGATACAA GAACGGCCAA CCTCACGCCG TTCCCCTCCC CGCCGCTGTC
GTCTCGATCC TCCGCAGCAT CGGTCGGATG CACTTCACTG AGATCATTGA CGGTGCGCCG
ACGCTCAAGG AGTCGCCGTT CGTTTTCACG ACGACGGGCC GCACCGCAAT TAGCGGCTTC
TCGAAAGCGA AAGTTCAACT CACCGGAATC ATGCACGAAA TAGCTTGTGG TGAAGCGAAG
GCCCGGGGCG AATCCACCGC TACCATTGAA AAGATCGAAT GGCGTCTGCA CGACCTGCGG
CGCACAATGG CAACGACCAT GGCACGCCTA AAAATTAACG TTGTAACGAT TGAACGCGTC
TTGGGGCACA AGATGCAAGG TGTCATGGCC GTCTATCAGC GGTACGACTA CCTACCCGAA
AAGCTCCACG CACTCACCGT TTGGAATGAC CATATTGCGA GGATAGTAGC TCCTCAGCAA
TCGAATGTCG TTCGCATGAC CGTCGCAGGC TGA
 
Protein sequence
MPSQPAPLTP FAIANAKPKP TRYEISDGGH AGLRVIVQPS GIKSFVFRYK RGTGEAAKNV 
RIMLGRAAGP GALTLRQARE AADSHRRLKL TGADPADQRR VERAANLARI RAEEIENRRK
DDTVALVLER YFKSHVNGLL SARETKRILT RELSGWARRR IDHVSRADAV KLLEAIQERD
KPILANRTRA HASKFFKWCI EKGLLEINPF EHTTRAAKEI ARDRVLSDAE LRILLLANDR
LEWPWREYIA VLLMLGQRRE EVAGMRWDAL DLDCAEPVWL MAASRYKNGQ PHAVPLPAAV
VSILRSIGRM HFTEIIDGAP TLKESPFVFT TTGRTAISGF SKAKVQLTGI MHEIACGEAK
ARGESTATIE KIEWRLHDLR RTMATTMARL KINVVTIERV LGHKMQGVMA VYQRYDYLPE
KLHALTVWND HIARIVAPQQ SNVVRMTVAG