Gene Rpal_5221 details


Gene Information

Locus tag: Rpal_5221
Symbol:
ID: 6412921
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris TIE-1
Kingdom: Bacteria
Replicon accession: NC_011004
Strand:
Start bp: 5632259
End bp: 5634811
Gene Length: 2553 bp
Protein Length: 850 aa
Translation table: 11
GC content: 70%
IMG OID: 642715111
Product: hypothetical protein
Protein accession: YP_001994184
Protein GI: 192293579
COG category:
COG ID:
TIGRFAM ID: [TIGR02302] conserved hypothetical protein TIGR02302
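
The gene and protein lengths listed above follow from the start and end coordinates. The sketch below checks that arithmetic, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and are not part of the source database.

```python
# Values copied from the record above.
START_BP = 5632259
END_BP = 5634811


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon is excluded."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


gene_len = gene_length(START_BP, END_BP)
prot_len = protein_length(gene_len)
print(gene_len, prot_len)  # 2553 850
```

Both results agree with the Gene Length and Protein Length fields of the record.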


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.954731
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGAGCGGCA GCATTCCCGA TCCGTCGCAG GCCCCGCACG ATGAAGAGGC GGTCGCGCGG 
CTGCACCTGG ACAAGGCGAT CGGGCGCGCG ACGCTGGCGA TTGCCTGGGA GCGCGCGTGG
CCGCATCTGG CGCGGGTAAT GAGCGTCGTC GGCCTGTTCC TGGCGCTCTC CTGGGCTGGG
CTGTGGCTGG CCTTGCCGTT TTCGGCGCGG ATCGCCGGCT TGGTACTACT GTCGGCCCTG
GTGCTTGCCG CGCTGATTCC CGCCATCAGG TTCCGCTGGC CGAGCCGCGA CGAGGCGCTG
GCGCGGCTCG ACCGCAACAC CGGCCTCAAG CACCGGCCGG CGACGGCGCT CGGCGACACG
CTGGCCTCCG GCGATCCGGT GGCGCGGGCG CTGTGGCAGG CGCAGCGCGA CCGCACCTTG
GCGACGATCC GCGGCCTCAA GCCCGGCCTG CCGTCGCCGC GGCTGCCGAT CCACGATCCC
TGGGGGCTCC GCGCCCTGGT GGTGATCCTG CTGGTCGCCA CGTTCTTCGC CGCTGGCGAG
GAGCGTACCC CGCGCGTGAT GGCGGCGTTC GACTGGAAGG GCGCGCTGTC GCCCTCGACC
GTCCGGGTCG ATGCCTGGGT GACGCCGCCG GTCTATACCA ACAAGCCGCC GATCATTCTC
ACCGCCGCCT CCAACAAGGA CCTGGGCACC CCAGGCAGCG GTCCGCTGCC GGTGCCGGTC
GGCTCGACCC TGCTGGTACG TTCCAGCGGC GGTGATCTCG ACATCGCGGT CGGCGGCGGC
GTGGTCGAGG TCAAGCCGGA CAGCGACGCG CCGAAGGGCA CCAGCGAACG GCATTTCCGC
ATCACCGGCG ACGGCACCGC GCGGGTCCGG GCGCCGTCGA GCGAAGCGCC GTGGAGCTTC
ACCGCCACCC CGGACAAGCC GCCGGCGATT GCGCTCGCCA AGGAGCCGCA GCGCCAGGCG
CGCGGTTCGC TGCAACTGTC CTACAAGCTC GAAGACGATT ACGGGGTCAC CGAGGCCGAG
GCGCAATTCG TCGCCGCGCC GCCCGCCAAG GCCCCGGGCG CCAAGCCGGG GGATGCGCCG
CGGCCGCTGT TCGAAGCGCC GCAGTTCAAG CTGGTGCTGC CGAATGCCCG CACCCGCGCC
GGCGTCGGTC AGACCGTCAA GGACGTCAGC GAAGATCCCT ATGCGGGCGC CGAAGTGACG
TTGACGCTCA CCGCCAAGGA CGAGGCCGGC AACCAGGGCC ACAGCGAGCC GTTCACGATG
CGGCTGCCGG AGCGGCTGTT CACCAAGCCG CTGGCGCGCG CGCTGATCGA GCAGCGCCGC
ATCCTGGCGC TCGATGCCAA TGCGAATAGC CAGGTGTACG CCGCACTCGA TGCGCTGCTG
ATCGCGCCGG AAGTGTTCAC GCCTGATGCC GGCCAGTATC TCGGGCTCTA TACCATCGCC
GATCAGCTCG AACGCGCCCG TACCGACGAT GCGCTGCGCG AAGTCGTTGG CAATTTGTGG
TCGCTCGCGG TCTCGATCGA AGACGGCGAT GCGTCGGATG CCGAGAAGGC GCTGCGTGCC
GCGCAGGACG CGCTCAAAGA CGCGCTGGAG CGCGGTGCGT CCGACGACGA GATCAAGCAG
CTCACCGACA AGCTGCGCGC TGCGCTCGAT ACCTATATGC GCCAGCTCGC GCAGCAGCTC
CGCAACAATC CGCAGCAGCT CGCCCGTCCG CTCGATCCGA ACACCAAGGT GATGCGGCAG
CAGGATCTGG AAAACATGAT CCAGCGGATG GAGCGGCTGT CGCGCTCCGG CGACAAGGAA
GCCGCCAAGC AGCTGCTCGA TCAGCTCGCG CAGATGCTGG AGAACCTGCA GATGGCGCAG
CCCGGCCAGG GCGGCGACAG CGGCGACATG GAGCAGGCGC TCAACGAGCT CGGCGACATG
ATCCGCAAGC AGCAACAGCT CCGTGACAAG ACCTACAAGC AGGGCCAGGA TCAGCGCCGC
GACCGGATGC GCGGCCAGGA CGGCGAGCAG AACCTCGGCG ACCTGCAGCA GGATCAGCAG
AACCTGCACG ACCGGCTGCG CAAGCTGCAG CAGGAGCTCG CCAAGCGCGG CCTCGGCCAG
AGCCCCGGCG GCGAAAAGGG GCAGCAAGGG CAGCAGGGCC AGCAAGGCGA GGGCGGTCTC
GACCAGGCCG ACTCGGCGAT GGGCGATGCC GAAGGCCGGC TCGGCGACGG CAATGCCGAC
GGTGCGGTGG ATTCGCAGGG CCGCGCGCTG GAAGCGCTAC GCCAGGGCGC CCAGAAACTC
GCCGAAGCGA TGCAGCAGGG CGACGGTGAT GGTCAGGGCG ATGGCCAGGG CAATCGCCCC
GGCCGCCAGC AGAGCGGCGC CAATCAGACC GATCCGCTCG GCCGTCCGCT GCGCGGCCGC
GACCTCGGCG ACGATCTCAC CGTGAAGATC CCCGGCGAAA TCGACGTGCA GCGCGTCCGC
CGCATCCTCG AAGAACTCCG CCGCCGCCTC GGCGACTCCG GCCGCCCGCA GATCGAACTC
GACTACATCG AGCGGCTGCT GAAGGATTAC TAA
 
Protein sequence
MSGSIPDPSQ APHDEEAVAR LHLDKAIGRA TLAIAWERAW PHLARVMSVV GLFLALSWAG 
LWLALPFSAR IAGLVLLSAL VLAALIPAIR FRWPSRDEAL ARLDRNTGLK HRPATALGDT
LASGDPVARA LWQAQRDRTL ATIRGLKPGL PSPRLPIHDP WGLRALVVIL LVATFFAAGE
ERTPRVMAAF DWKGALSPST VRVDAWVTPP VYTNKPPIIL TAASNKDLGT PGSGPLPVPV
GSTLLVRSSG GDLDIAVGGG VVEVKPDSDA PKGTSERHFR ITGDGTARVR APSSEAPWSF
TATPDKPPAI ALAKEPQRQA RGSLQLSYKL EDDYGVTEAE AQFVAAPPAK APGAKPGDAP
RPLFEAPQFK LVLPNARTRA GVGQTVKDVS EDPYAGAEVT LTLTAKDEAG NQGHSEPFTM
RLPERLFTKP LARALIEQRR ILALDANANS QVYAALDALL IAPEVFTPDA GQYLGLYTIA
DQLERARTDD ALREVVGNLW SLAVSIEDGD ASDAEKALRA AQDALKDALE RGASDDEIKQ
LTDKLRAALD TYMRQLAQQL RNNPQQLARP LDPNTKVMRQ QDLENMIQRM ERLSRSGDKE
AAKQLLDQLA QMLENLQMAQ PGQGGDSGDM EQALNELGDM IRKQQQLRDK TYKQGQDQRR
DRMRGQDGEQ NLGDLQQDQQ NLHDRLRKLQ QELAKRGLGQ SPGGEKGQQG QQGQQGEGGL
DQADSAMGDA EGRLGDGNAD GAVDSQGRAL EALRQGAQKL AEAMQQGDGD GQGDGQGNRP
GRQQSGANQT DPLGRPLRGR DLGDDLTVKI PGEIDVQRVR RILEELRRRL GDSGRPQIEL
DYIERLLKDY
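
The gene sequence starts with TTG while the protein sequence starts with M. This is consistent with translation table 11, where TTG is one of the alternative start codons and an initiator codon is read as methionine. The sketch below shows one way to check the correspondence in plain Python. It assumes the input is a complete CDS (initiator codon first, one terminal stop codon last) and relies on table 11 sharing the amino-acid assignments of the standard genetic code. The name `translate_cds` is hypothetical.

```python
# Codon table laid out in the conventional T, C, A, G order.
# Table 11 uses the same amino-acid assignments as the standard code;
# it differs in which codons may act as start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate_cds(dna: str) -> str:
    """Translate a complete CDS; whitespace between 10-base blocks is ignored."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = [CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3)]
    if residues and residues[-1] == "*":
        residues.pop()  # drop the terminal stop codon
    if residues:
        residues[0] = "M"  # assumption: the first codon is the initiator
    return "".join(residues)


# First 30 bases of the gene sequence above.
print(translate_cds("TTGAGCGGCA GCATTCCCGA TCCGTCGCAG"))  # MSGSIPDPSQ
```

The output matches the first ten residues of the protein sequence. Passing the full gene sequence to the same function should allow the remaining residues to be compared in the same way.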