Gene RPB_3834 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_3834 
Symbol 
ID3911637 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp4376935 
End bp4378965 
Gene Length2031 bp 
Protein Length676 aa 
Translation table11 
GC content71% 
IMG OID637885734 
Producthypothetical protein 
Protein accessionYP_487438 
Protein GI86750942 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0630158 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCACAC TGTCCCTGCT CCGTCTTCCG GCCCGCCTGG TCGCGCTCGC CGCGGCCGTT 
CTGCTGCCGT CCGTCGTCGC GCTCGCCCAG ACCGCGCCTC CGCCGCCGGG CCCCGAACAG
CCCGGCGCCG CCCTACCGAT CATGCCTCCC GTCGCCGCGA TCCCGGTCAC CGACGGCTCG
ATGCAGATCG TCTGGGAGGT CCGCAACCGC TTCCGGCTGT TTCGCGAGGA GCGCGACTTT
CGCGAGCAGG CCGAAGCGCT GCGCGGGCTC ACCGTGCTCG CCTCCGAACA GGCGCTGGGG
CTGCAGAGCG AGGGCCGCGG CTGGGCCCGC AACGTCGTCA ACCGGCTGTG CATCGATCTG
ACCGGCCGGG TCAACGAGCC CTGCACCCGC GACGGCGTCA AGGAAAGCTA CCTGACGCCG
ACCGAACACC CGGTCACGGT GCGGCTGACC GGCGCGATCC CGGTCGGCGC GATCTGCGCC
TGGCTGTTCG ACGACGGCGA CGGCCCGCGG GCGTCGACGC TGGACTGCGC CGAGCCGATC
AATTTCCGCG CCCGCTACGG CAAGCCGACG GTGGCGACCG TGGACGTCAC CAGCGGCGCC
GACGCGCCGC TCCGGGTGAC CACCGAGATC ATGGTGCGCG ATTTCTTCAT CGCGGGGCTG
GGCGACTCGA TCGCATCCGG CGAAGGCAAT CCCGACCGGC CGATCGCGCT GTCCGACGAC
GGCTTCTGCT ATCGCTCCTA TCTCGGCATC GGGTCCGGCG CACGCCCCGG CCAGTATTAT
CGGCCGAGCC GCGCCGGCTA CAAAGGCGGC CGCGCCTGCG AGGCGCCGGA CACGCTGGCC
AACTGGCAGC GTTATTCGGC GACCTGGTTC AACGCCGCCT GCCATCGCTC GCTGTACAGC
TACCAGACCC GCACCGCGCT GGCGCTGGCG GCGCGTCACC CGCACATCGC GGTGACCTAT
CTGCCGCTGG CCTGCACCGG GGCGACCATC GCCGACGGGC TGTTCGGCTC GCAGCGGCCG
CGCGAATGCT ATCGCACCAA GACCGGCGCC AATTGTCCCG GCAACGTCAA CAGCCAGCTC
GCCGAGCTGC GCGAGGCGCT CGCCGCCGCG CGCAAACGCC AGCCGCAACG CGGGCTCGAT
CTGGTGCTGC TGTCGGTCGG CGCCAACGAC ATCAACTTCT CCGGCCTGGT CGCCGACGTC
ATCGTCGATA GCCCGACCGA GCGCGGCATC TTCCGCCGCT CCGGCGTGAT CGGCTCGATC
GAGGAGTCGC GCAGCGCGCT GGCGCGAACC CTGCCGCAGA GCTTTTCGAA GATGCGTGAA
GCGCTCAAGG GGCTGGTCGA CGACATGTCG CGCGTGGTCT ACGTCACCTA CGCCAATCCG
GCGCTGGCGA ACCGAGGCGT CCCCTGCCCC GGCGGCCGCG CCGGCTTCGA CATCCATCCC
TCGTTCGACG CCGACCCCAA CCGGCTGGCC GCGGTGGCCT CCTTCGTCGA CAATGAGTTC
CTGCCGCGCC TGAAGGATCT GGCGCAATGC AGCGGCGGCG TGCTGTGCCG CAATCCGTCC
GCCGACGCCA TGACCTTCGT CGACGCGCAT CAGCGCACCT TCGCCCATCA CGGCTTCTGC
GCCCGTGCCG ACACCGATCC GGAATTCGAC CGCGCCTGCT TTTCGCCGCG CGGCGACAGC
TTCACCAGCG ACATCGTCGC GGCGGCGAAT TCGCCGATGA GCTGCGGTGC CGGCGCCAGC
AATTACCGCG CTTATCTGCC GCGCGCGCGC TGGATCCGCG ACGCCAATGA CAGTTACTTC
GCGGCGATGA CGTTCCCGCA AGGCCTGCCC GCGGCGATCC AGCCCGCCGA CATTCACGAC
GCCACCTGGG GCGTGGTGTC CGCGGTCTAT GGCGGCGCGG TCCACCCCTC CGCCGAAGGC
CACGCCGCGA TGGCCGACGC CGCGGTGCCC GCCGCCGAAG CGGTGCTGCA ACTGGAGTCG
GGACCGAACG TGATCAGCGC ACCGCTGCCG CCGCCGGGAG CAGTGGAGTA G
 
Protein sequence
MITLSLLRLP ARLVALAAAV LLPSVVALAQ TAPPPPGPEQ PGAALPIMPP VAAIPVTDGS 
MQIVWEVRNR FRLFREERDF REQAEALRGL TVLASEQALG LQSEGRGWAR NVVNRLCIDL
TGRVNEPCTR DGVKESYLTP TEHPVTVRLT GAIPVGAICA WLFDDGDGPR ASTLDCAEPI
NFRARYGKPT VATVDVTSGA DAPLRVTTEI MVRDFFIAGL GDSIASGEGN PDRPIALSDD
GFCYRSYLGI GSGARPGQYY RPSRAGYKGG RACEAPDTLA NWQRYSATWF NAACHRSLYS
YQTRTALALA ARHPHIAVTY LPLACTGATI ADGLFGSQRP RECYRTKTGA NCPGNVNSQL
AELREALAAA RKRQPQRGLD LVLLSVGAND INFSGLVADV IVDSPTERGI FRRSGVIGSI
EESRSALART LPQSFSKMRE ALKGLVDDMS RVVYVTYANP ALANRGVPCP GGRAGFDIHP
SFDADPNRLA AVASFVDNEF LPRLKDLAQC SGGVLCRNPS ADAMTFVDAH QRTFAHHGFC
ARADTDPEFD RACFSPRGDS FTSDIVAAAN SPMSCGAGAS NYRAYLPRAR WIRDANDSYF
AAMTFPQGLP AAIQPADIHD ATWGVVSAVY GGAVHPSAEG HAAMADAAVP AAEAVLQLES
GPNVISAPLP PPGAVE