Gene RPB_3316 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_3316 
Symbol 
ID3911117 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp3792806 
End bp3795811 
Gene Length3006 bp 
Protein Length1001 aa 
Translation table11 
GC content53% 
IMG OID637885218 
Producthypothetical protein 
Protein accessionYP_486923 
Protein GI86750427 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.104809 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.215386 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGAGGCCCC CATTGAAATC CCGAACGCCC TACCGCACAT TCCGCGATAA CTCAAAAGCC 
AAGGCCGCCG TTTACAAGGC AATCAGAACG ATTCGCGACC AACGCTCCCG CAACCTAGTA
CGGGGAGCCA TCGCGAGGAA CACCGATTCG TCGCCCTCGC TTTCGATCAA CACTCTCCGT
CGCCAAGCAG ATCCGGGGGA CCTTTGCCTC GCAACAACTC TCCTCCTAGG TCAAAACGGC
CCATTTCCGT TGCCGCAGCT CGGAGGCACC TACCGTAACC TCATTATCAA GCACACGATG
CTGGCGACAG AACCCGACTC CGAGGTCGCT TTTGTTGTCG GTCACGTGAA CGGCTGGGCG
GACAACGCCA GACGCATCCT CGCCGACATG TCGGAACTAG CGCGATTGCC GCAGTACCAA
CCATCCCTTG CAATGGAAGC GCTCGCCGCG TTCGCGGAGT CGTACGGCTC ATCCTTCTAT
ATCCTGCGGA AGACTGCATA CCTAATGTCC CGCTACGCTG ACGACTCAAC ACTCCAGTCA
GCATTCCAAC GGATTGCAAA AGCGTTCAAC CAATCCGCTT ATCCCGAGCC ATACTTTTCT
GCCCTTCAAT TGATGGAAAC CGACTCCAGC TACTGTTCCA CCGCGACCAC CCGCGTCCGC
CTCTTTCAAA AGTATGTTGA AGACGACTAT CGCCAATATC TCCCCTTGAA CGAACTCGTT
CCGACCCCGC TCTCTCGGAC TGATCTTGCC GCGCTTCTTC GTCGCTCCCA CTCAAGTTCT
CTTGTCGATG AACTCGCCGC ACTCGCTATA ACCGCACATC TCAGATCGTT CTGGCCAGCC
TTATTTGACC AGACCATCGC CCTTCTCGAT CCGTCTATTG CACACCTCTT TCTTGAGTTC
TTGTCGATTC CGTTCGACCC ATCCGCTCTG TATTCGGACG TCGCTCGCAT GGACGCGGAC
GTGGTATATT ACCAGCGCGC AGCCGCCTTT TGCGAGTTCA AAGATTGCGC CATCTTCCGT
CGTTCAGTGG ACGCAATTCT TGTACCACGA TTGATGGAAG ACATCTCACC ACCCGTCGAC
ACAGACACCT CGCCGCACTT CCATACCAAG ATGCCAGACC TCATCAAGGC TACACAAGGC
TTCGTTTCTC CCACCGACTA CGAACGAGTA CAGAGCTGCG GGACATTCCT CCGAACCGTG
CAATTCCTAC AGTTCCTGAA CACTCAACGA AACTACTCAC CCCTGTCCGC CCATGACTTC
CGATTCATCT GCGAACATAC CGTCGCCCTC GACATACTAT TGTGCGACTC AGAGATCGAG
ATGCTCTACG CATCATCGGA CGACGACTCA CGCCCACTGA TAACCGTTCT CGCGCTCGCA
CTACATAAAG CAAAAAGTCG CGACGATGAC ATCGATTTCA AATTTCGCCA CTCGCTCTGC
AATACAGTAA TCACTCAATT CAATGGGAGC CTCGAAGCCT TCATTGCTTG GCTACTTCCG
AATACCCCCT CCATTGCCGA CTACCTCCTA ACAATACTCG ACAGACCAAC ACTCCAGAAA
CTCTACTGGA TAGTCAGATC AGCAGACGAA GCGGATCGCA CGAGACAATC GCTACTGCGC
ACCGTCGGAA AACAACGAAA TCAGATCGCA CATCTGATAG AGGCTGACGC AATCGAAGCG
CGGCGCCAAG TCGCAAAACT TCGAAAGTTC TTCGATGACA GCCGCATCTA TGTCGACGGC
CCAGCGATGA AAGAGTGGCT GGTCGCAAAC CCCAGCACCT ACGCTCAACA ATACGTAAAG
ATGATTGAGC ACGAGTTCAG CTCGTTGACA TTCCTGTCAA CATCAACCAG CGGAAAACTA
GTAATCACGG AGTGGAGCGA TCTCGACTAC GTCTTAGTCG AAGCCGGAAA AGCCGCGTTT
GAGCAGTTCT GCACAAACAA GCAGTTCGGG ATTGAGTCGT ACCTTGGTCG GCGCATTCGC
CACAACACTA TGAGCGGGAT GATGCGAGGC GGCATCGACG ACCTAATCGA GTCACCCACT
TATGATTTGC TAACATATGA TAGCGCTTTT GTTGACGCCA ACAAGCGATG GGTTGCTTCT
TATCACCGAA CGATTGAGCA CTTGCGGAAG GATCTACTGC AGTTTCGGTC CGACGCAAAG
CCATTGGGAA TCTTCAACTC AACTCTGAAG CGCGACGACA ATACCAACCT CGCCATCGCA
TCCCTTCGAA ACATGCTATT GAATGGACGC AATCCGTTAT TGATGAACGA CCTGCTTATA
CGCTTCTGCT GGCAAGAGAT TAACCCTCAG CTTCAAACTG CCTCACGCAT GATTTCCATC
GACCTCGTGA AAGAGGCCAC GAGAGAAATC GAACAGCATT TCTTTCATTT CGATTCCGAC
GATCTCCAGC GACGATACCG ACAACAACTC AGAGCGCTAG TCCACGAACG CTTCATGCGA
CTCGGTAGTT GGTTTCGACA ACCCGAGGAT GGCTTTGTGA GCGCACGCAC TCGTCAACTC
TGCGAACTCG TTCTTGTCGA GGCCACAGAT AGTAACTTAT TTGGCACACC AACGGTCGAG
TGGTCCGGTG ACGCTCTCGA CCTGGAAATC GATGGTCTAT CCGTCCACCG AATGTACGAT
TGCCTCTTCG TTATTCTTCA CAATGCGCTT ACTCACGGAC AAGAGAACGG CCTCATAACA
ATTCAGGTCT CACAGGAGGC CATGGCCTTC GAGAGCGTCG GCCACCTCAA GGCCACCGTT
TCCTCTCGTT TCTCCAGCAC AAGCGACAGG TCAAAACACA TAGCGCGATT GGCAGAGAGC
TTCGGATATG GAGATCCAGA GTCAGCCATG GTAACTGAAG GATATTCAGG AATTAAAAAG
CTACGGTATA TAACACGAAC TGGCGACAAC TATTCGAATG CCGGATACAC TATTGATGCC
GACACCTGCT CTGTCTTCTT CACACTTGCG GTCGAATTAG CTGATCTCGA AAGGCCAGAT
GTATGA
 
Protein sequence
MRPPLKSRTP YRTFRDNSKA KAAVYKAIRT IRDQRSRNLV RGAIARNTDS SPSLSINTLR 
RQADPGDLCL ATTLLLGQNG PFPLPQLGGT YRNLIIKHTM LATEPDSEVA FVVGHVNGWA
DNARRILADM SELARLPQYQ PSLAMEALAA FAESYGSSFY ILRKTAYLMS RYADDSTLQS
AFQRIAKAFN QSAYPEPYFS ALQLMETDSS YCSTATTRVR LFQKYVEDDY RQYLPLNELV
PTPLSRTDLA ALLRRSHSSS LVDELAALAI TAHLRSFWPA LFDQTIALLD PSIAHLFLEF
LSIPFDPSAL YSDVARMDAD VVYYQRAAAF CEFKDCAIFR RSVDAILVPR LMEDISPPVD
TDTSPHFHTK MPDLIKATQG FVSPTDYERV QSCGTFLRTV QFLQFLNTQR NYSPLSAHDF
RFICEHTVAL DILLCDSEIE MLYASSDDDS RPLITVLALA LHKAKSRDDD IDFKFRHSLC
NTVITQFNGS LEAFIAWLLP NTPSIADYLL TILDRPTLQK LYWIVRSADE ADRTRQSLLR
TVGKQRNQIA HLIEADAIEA RRQVAKLRKF FDDSRIYVDG PAMKEWLVAN PSTYAQQYVK
MIEHEFSSLT FLSTSTSGKL VITEWSDLDY VLVEAGKAAF EQFCTNKQFG IESYLGRRIR
HNTMSGMMRG GIDDLIESPT YDLLTYDSAF VDANKRWVAS YHRTIEHLRK DLLQFRSDAK
PLGIFNSTLK RDDNTNLAIA SLRNMLLNGR NPLLMNDLLI RFCWQEINPQ LQTASRMISI
DLVKEATREI EQHFFHFDSD DLQRRYRQQL RALVHERFMR LGSWFRQPED GFVSARTRQL
CELVLVEATD SNLFGTPTVE WSGDALDLEI DGLSVHRMYD CLFVILHNAL THGQENGLIT
IQVSQEAMAF ESVGHLKATV SSRFSSTSDR SKHIARLAES FGYGDPESAM VTEGYSGIKK
LRYITRTGDN YSNAGYTIDA DTCSVFFTLA VELADLERPD V