Gene RPB_1123 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_1123 
Symbol 
ID3909208 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp1292024 
End bp1294384 
Gene Length2361 bp 
Protein Length786 aa 
Translation table11 
GC content67% 
IMG OID637883016 
Producthypothetical protein 
Protein accessionYP_484744 
Protein GI86748248 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.428473 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACCTGC ACCGCCCCGT CCACTTCAAC GCTTCGCAGA CTTCAGGCTC AGACATGACG 
ATTCGAGTAA CCAACACCGC CCGCGCCAGC CGCTCCGTGC TGGAAAATTG GATGCCGGAG
GTCACGCCGG TGGCGGCCGC GGATGCCGGT TCGGTCACGA TCGACAACAC CGGCCCGTTC
GTCGCCGGCA GCTATCAGCG CTTCTCCATG ACTTATGCGG CGGGCCGCTA CGGCATCGAC
GATTCCGGCT GCCTGAAGAT CTGCTACCGC TTCGCCTCCG ACATGGGGCG GCCGCAATTC
ACCGATCCGA CCGCGCCGAA CTTCGTCGAA GTCGTCGCTT CGAACGGCGC CACGCTGGAC
GTGCGGTTCG ACTACAAGCA GAACACGCGG CCGTGGGATC GCACCATCTA CATCAAGGTC
GTCGCCGGCT TCATGAAGCA AGGCGACACC ATTCGGATCG ATTTCGGCGC CGACGAACGC
GGGCCGGGCA TCCGGATGCA GACCTTCGTC GATCCGGAAT TCTGTTTCCG CATACTGGTC
GACCCGATCG CGACCTACAC CTTCGTGGAA GTGCCGGGCG TGCCGGTGAT GCCGATCATC
GCCGGTCCGG CGGCGCGCTG GCACGCGGTG CTGCCGACCT GCCGGGCGGT CGGCGATACA
TTCGCGCTTG GCATTCGTGC CGACGACATG TGGGGCAATC CGACCAGCGT CCGTGGCGGC
GCGCGGGTTC AACTCGTCTG CAGCGGCCGG ATCGAGGGGC TGCCGGAGAC GGTGGCGTTC
GATCCGGCGT TGCCAGCGAC CGAAATCGGC GGATTGCGCG CCGTTCAGCC GGGCAGCGTG
TTCGTCGATC TGGTTGCCGA AGGACGCACT CTGACGCGCT CCAATGTACT CGTCGTCGAA
TCCGAGCTGG CGTTGCGCCC GTATTGGGGC GACCTGCACG CGCAATCCGG CGAGACCATC
GGCTCGGGCA CCGCGCAGGA CTACATGACG TACGCGCGTG ATTGCGCCTT CCTCGATGCG
ATCGGCCACC AGGGCAACGA TTTCCAGATC ACCGGGCAGT TCTGGCACCT CCTCAACGGC
CTGATGCGCG ACTGGAACGA GCCCGGGCGC TTCGTCACCA TCCCCGGCTA CGAATGGTCG
GGCAACACCT CGCTGGGCGG CGACCGCAAC GTGTTCTATC GCAGCGAGGA CCGCGTCATT
CACCGCTCGT CGCACGCGCT GGTGCCGGAG CGGAGCGACG CCGACACCGA CTGCTGGGAC
GCGCGCAGTC TGTTCGAGGC GCTGGAGCCG AAGCAGGCCG ACACCGTGGT GTGGGCGCAT
TGCGGCGGCC GCTATGCCGA TATCAAATAC GCCCATCACC ATGGTCTGGA GCGCGCCGTC
GAGGTGCATT CGAGCTGGGG GACGTTCGAA TGGCTGGCCG CTGACGCCTT CGACAGCGGC
TACCGCGTCG GCATCGTCGC CAACAGCGAC GGTCACAAGG GGCGGCCGGG CTACGAGCCG
CCCGGCGCCT CGCTGTTCGG CGCGCTGGGA GGCCTGACCT GTTACTGGCT GCCGGAACTG
ACCCGCGACG CGATGTTCGA CGCGCTGCAG GCGCGGCATC ACTACGCCAC CACCGGATCG
CGCGTGCACA TGACGGTGCA AAGCAACTTC GCGGAGCCTG TGACGGTGTG GACCGACGAC
CCCCGCGTCG CCGGCGCAAG GCATCAAAGC GCGACCTCGG TGATGATGGG CGACATCGTC
ACCGGCGCGC CCGACGAGGT CGCGTTCGAG TTCTGCATCG AGTCGGCGTC GCCGATCCTC
GACGTCGAGA TTCGCCGCGG CACCGAGGTG CTGGAGACCA TACGTCCGCA TGCCGCCGCG
CCGCTCGGAA CGCGCTGCCG CATCGAATGG AGCGGCGCCG AATATCGCGG CCGCGCCCGG
CAGACGGTGT GGGACGGTTC GTTGAAGCTC ACCGGGGCGA CCATCACGGC GTTCGAGCCG
ATCAACTTCT TCAATCCGGA CCGGCCGCTG CGGCAGCTCT CCGAGCATGA ACTGGCGTGG
CAGTCGATCA CCACCGGCAA TTTCGCCGGC GTCGATCTGT CGCTGTCGTC GGCCGATGCG
GTGCTCGAGA TCACCACCCC GCTGGGGACG TTCAAGCATG CGCTCGCCGA GCTGACGCAT
CAGCCGACCG TGCATCGGCT CGGCAAGCTC GATCGCGAGC TGCGACTGTC GCGGGAGCCG
GATCACGACA GCGAACGTTC GCTCCGCCTT CGGCGCTCGA TCGCGATGCG CGCGGGCGAC
AATCCGATCT GGCTCCGCGC GACCTTCGCG GACGGCCACC AGGCGTGGTC GAGCCCGATC
TACATCCTAC GCGACGCGTA G
 
Protein sequence
MNLHRPVHFN ASQTSGSDMT IRVTNTARAS RSVLENWMPE VTPVAAADAG SVTIDNTGPF 
VAGSYQRFSM TYAAGRYGID DSGCLKICYR FASDMGRPQF TDPTAPNFVE VVASNGATLD
VRFDYKQNTR PWDRTIYIKV VAGFMKQGDT IRIDFGADER GPGIRMQTFV DPEFCFRILV
DPIATYTFVE VPGVPVMPII AGPAARWHAV LPTCRAVGDT FALGIRADDM WGNPTSVRGG
ARVQLVCSGR IEGLPETVAF DPALPATEIG GLRAVQPGSV FVDLVAEGRT LTRSNVLVVE
SELALRPYWG DLHAQSGETI GSGTAQDYMT YARDCAFLDA IGHQGNDFQI TGQFWHLLNG
LMRDWNEPGR FVTIPGYEWS GNTSLGGDRN VFYRSEDRVI HRSSHALVPE RSDADTDCWD
ARSLFEALEP KQADTVVWAH CGGRYADIKY AHHHGLERAV EVHSSWGTFE WLAADAFDSG
YRVGIVANSD GHKGRPGYEP PGASLFGALG GLTCYWLPEL TRDAMFDALQ ARHHYATTGS
RVHMTVQSNF AEPVTVWTDD PRVAGARHQS ATSVMMGDIV TGAPDEVAFE FCIESASPIL
DVEIRRGTEV LETIRPHAAA PLGTRCRIEW SGAEYRGRAR QTVWDGSLKL TGATITAFEP
INFFNPDRPL RQLSEHELAW QSITTGNFAG VDLSLSSADA VLEITTPLGT FKHALAELTH
QPTVHRLGKL DRELRLSREP DHDSERSLRL RRSIAMRAGD NPIWLRATFA DGHQAWSSPI
YILRDA