Gene RPD_0036 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_0036 
Symbol 
ID4020490 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp42301 
End bp44961 
Gene Length2661 bp 
Protein Length886 aa 
Translation table11 
GC content71% 
IMG OID637960212 
Producthypothetical protein 
Protein accessionYP_567177 
Protein GI91974518 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCGCAT CCCGCCGAAC GGTAGTCGTT CACACGGTGT TGGCCGGCCG CATGGAGCGG 
ACGGCGGCGG CGCGTTCGAA CGAACTCGGC GTGCAGATCG TGACGATGGG GCAGTTGGCG
GCGCGGCTCG CGGGCGGGCT GCTGCGTCCG GTCGATTCGG AGGCGCTGCA GGAGGCGGTC
TTTGCGGCCC TGCCGGTGGT CGACATGGGC GAACTGGAGC CAATCAAGCT GCTGCCCGGG
ATGCCGGGCG CGGTGACAGC GACCTTCGAC AAGGCTTGGC GGGCCGGCGT CGACCTGTCG
TCCGCGGCGC ATCCCCGGCT GGCGGCTTTG GCGGCGCTCG AGCAGGAGGT GGTGGGGCGT
CTTCCTGCCT CGATGAAGCG GCCGGCTGAA TTGGTGGCGA TGGCGAAGGA GCGCATCGCC
CAAGCGTCGA CGGTAATAGG CTCCCTTGAG ATCCACGGCC ACAGCGAGAT GTCGCCGTGC
TGGCGTCCGC TGCTGGAGGC CCTGGCGAAC GTCGTCCCGT TGACCTGGGT GGCCGGCCCC
CGGCACGTCC CGGCCTGGCT GGAGGGTACG GGCGTTACGG TCGCGAAGAC CGATCCGACG
GCGACGGCGA CGGCCTCCTA TTCCTGCGCG AACCAGCAGC ACGAGGTGAT CGAGGCTTTC
CGTTGGGTGC GGAAGCTGCT GGCGGAGGGT GTCCAGGCCG CCGACATCGC GATCGCGGCG
ACCGGTCCCG CCGACTACGA CGACCACATG TTCGCGCAGG TGCAGGAGAG TAACCTGCCC
GTTCATTTCG TGTCGGGCGT GAAGGCGCTG ACGACCGCCG ACGGCCAGGC GGCCGCCGCG
CTGGCGGAGG TTTTGGCGAA AGGCCTGTCC CAGGAGCGCG TCCGGAGGCT GTTCGCACTG
CTTCGCGGGT CGCCAGCGCT GGAGCGCATC CCGAGGGAAT GGACGCGGCT GCTGCCCCTT
GGCGCGCCGC TCACCGAGAT GGAGCGGTGG GAGCGGGTGT TCGCCCGTGT GGAGGCTTCG
GACTGGCCGG AGGGCCAGGA CCTGTCGGCG CTAGTGCTGG AGGTGCTGCG ACTGCTGGAT
CGCGGTCTGG CGGCGGCCAC CGACGTCGGC GAGGCGCTGC TGACGGGCGT GCCGCTGTCG
CTGTGGCGCC GGTCGCTGCG GGAGGGGCCG CCGGCGGCGC TGCCGGTCAC ACTGGGCCGC
CTGCGGACCC CGGACGCGTT GGAGCCGGCC GCGAGCGTCA TCTGGTGCTC GGCCATCGAT
CTGGCGTCGT CACCACGTCC GCATGTCTGG CTACTCGGGA TGAACGCAGG GCGGTGGCCA
CGGCGGATCT CCGAGGACCG CCTGATCCCT AACCACGTGC TGCCGATCGA GGAGTTGGAC
CCGCTGCCGA TCGCGGACGG CGACAAGCGG GATTTCGGAA CGATCCTGGC GACGGCGACG
GAGGTTGCGA TGTCGCACAG CCGCCGCGAC GCCGGCGGGC GGCTGCGGGG GCGATCGCCC
CTGATCAAGT CCTTCGCTGT GACCTACCTG GATCGCGGCC GGACGCCTGA GCACGCCGCG
AGCGAGGCGG ACCGACTGCT GGCCTGTCCC GCCGAATTCG CGAAGCTGCC GGTCGCGGTG
TCCGGGATCG GTTGCTGGCG CGACTGGTTC AGCGGCGGCC TCACCGCCCA CGACGGCCTG
GTGGCGGCGA ACCACCCGCG GATCACGAAG CTGTTCGAGC GGTCGCTGTC GGCTACCTCG
CTAAAGCTGT TGCTGCGGGA TCCGATCCGG TTCGTGTGGC GGTACGGGTT CAGGTGGACG
GCGCCCGAGG ACGCCGACGA GCCGCTGACG CTGGACGGCC TCGCCTTCGG CAACTTAGTG
CACGCGGTGC TGCAGAACGC GGTCGACGGC CTCTCCGAGG CCGGCGGTCT GGCCGCTGCC
GATCGGGCCG CGATCGACGC CCGGATCGCC GCCGCCATCG GTGGGACCGT GGCGGCGTGG
GAGGCCGAAT TCCCGGTGCC ACCGGCGGTG ATCTGGCGCA GCGCGGTGGA GCGCGCGTCG
ACGACGGCGA CGGCGGCGCT AACCTATCCG ATGGCCGCAT TGCCCGACCA GCGGAGCTGG
ACGGAGATTC CGTTCGGGAA GCCGGACGAG GCCGAGGGTG AGCGCGACCT GCCGTGGCCG
ACCGAGAAGA CGGTCGAGAT CCCGGGCACC GGGCTGCTGA TCGACGGCAA GATCGACCGT
TTGGATCTTG CCGGCGACCG GTCGAAGGCG CGGGTGCTGG ACTACAAGAC CGGCAAGCTC
GACCGGAAGA TGGCCGAGAA AGTGATCGAC GGCGGCCGCG AACTGCAGCG CTGCCTGTAC
GCCTTCGCGG TGAGGACGCT GGTCGAGGCC GGCGTCGGCG TCGAGGCCGC GCTGTTCTAC
CCGAACGCTC CCGAGGGCGA GCAGGCTGTG TTTCCGCTGA CCGACGCAGC CCTGGAGGCG
GCACTGGCGA AGGTGACGCA GGCGGTCTCG CTGTCGCGCG ACGCGCTGCT GGCCGGCGCC
GCCGTGCCGG GTGAGGACGC CGCCGACAAG TACAACGACC TTAGGTTCGC GCTCCCGGCC
AATGCCGGGT ATCTGCCGCG CAAAAAGTTG CCCGCCATCG ACCGGCTGGG ACAGGCGGCC
GCCGTCTGGA GCGAGACATG A
 
Protein sequence
MRASRRTVVV HTVLAGRMER TAAARSNELG VQIVTMGQLA ARLAGGLLRP VDSEALQEAV 
FAALPVVDMG ELEPIKLLPG MPGAVTATFD KAWRAGVDLS SAAHPRLAAL AALEQEVVGR
LPASMKRPAE LVAMAKERIA QASTVIGSLE IHGHSEMSPC WRPLLEALAN VVPLTWVAGP
RHVPAWLEGT GVTVAKTDPT ATATASYSCA NQQHEVIEAF RWVRKLLAEG VQAADIAIAA
TGPADYDDHM FAQVQESNLP VHFVSGVKAL TTADGQAAAA LAEVLAKGLS QERVRRLFAL
LRGSPALERI PREWTRLLPL GAPLTEMERW ERVFARVEAS DWPEGQDLSA LVLEVLRLLD
RGLAAATDVG EALLTGVPLS LWRRSLREGP PAALPVTLGR LRTPDALEPA ASVIWCSAID
LASSPRPHVW LLGMNAGRWP RRISEDRLIP NHVLPIEELD PLPIADGDKR DFGTILATAT
EVAMSHSRRD AGGRLRGRSP LIKSFAVTYL DRGRTPEHAA SEADRLLACP AEFAKLPVAV
SGIGCWRDWF SGGLTAHDGL VAANHPRITK LFERSLSATS LKLLLRDPIR FVWRYGFRWT
APEDADEPLT LDGLAFGNLV HAVLQNAVDG LSEAGGLAAA DRAAIDARIA AAIGGTVAAW
EAEFPVPPAV IWRSAVERAS TTATAALTYP MAALPDQRSW TEIPFGKPDE AEGERDLPWP
TEKTVEIPGT GLLIDGKIDR LDLAGDRSKA RVLDYKTGKL DRKMAEKVID GGRELQRCLY
AFAVRTLVEA GVGVEAALFY PNAPEGEQAV FPLTDAALEA ALAKVTQAVS LSRDALLAGA
AVPGEDAADK YNDLRFALPA NAGYLPRKKL PAIDRLGQAA AVWSET