Gene RPB_3836 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_3836 
Symbol 
ID3911639 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp4380201 
End bp4382687 
Gene Length2487 bp 
Protein Length828 aa 
Translation table11 
GC content61% 
IMG OID637885736 
Producthypothetical protein 
Protein accessionYP_487440 
Protein GI86750944 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.298031 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGCTGCTCG ATGAAGCCAC AATGCTGGCG GAAGAAGAAG CGTTTCTATG GCCGACGACG 
AAACCTGACT ACCGTGCTGG CCTCTCTTTG TCGGGAGGCG GCATCCGCGC CGCCACGGTG
GCCCTCGGCG TGCTCGAGGG ATTGGCGTCA CGCGGTCTCT TGCAACGAAT TCACTACCTG
TCCACCGTGT CCGGCGGCGG CTATATCGGA TCGGCATTGT CCTGGTTTTG GTGCGAACGC
CGCGTGGTTG CTGAAGCGGC GCTGCAGAAG TCGAGAGAGC GCACCGTTCA TCGTTTCGGA
GCCGACACCG CCAGCTTCCC GTTTCAGGAG GAGCGGGCGA ACGCGTCCCC GGTTGCGGAG
GCGGCGGCCC TCAATCTCAA ATTTCTTCGT CAGCACGGCT CGTATCTCAC CTCGGGAGAC
GGCATCGGCT TCGCCGGCCT GATCATGGCC GTGCTGCGTA CGGTGTTGCT GAGCCTCGCT
GTCTGGATGC CGCTGCTGAT TGCGATCTTC CTGTCCTTCG AGGTCCTGGA TAGTTTTCTG
TCGGGCGCAA GCTCGGATGG CGAGTTGGCG GCGAAATGCG AGACCGCTGT CGGAACTACC
GTGTTCGCCT GCCGCCCTTC GTTCATTGCT CTCCTGTCAT TGGCCGGCGC GGTCGGCGTC
GCAATCTTTA TTGGAACGAT CCTTTTCGCG TTTCTCGGTC ACCTGGCGTC AATCAGGGCG
TCCGGCAAGC GAGGCCGTTG GATCGCACTC TCAGCTTCGG TCGGCATCGG CTTGGCCGCT
CATGTGATCT GGAAATACAA TAGTTCGGCA ACGTTACAGC CGCTCTTGGG CGCCCAGCTT
CTGCTGGAAT TATTCATGAT CGCGGCCGCG ATCAGTGTGG CAATTTCCCA GATGTCGCTG
CCCGAAAACT GGAGCTATTC GCTGCGACGA CGTTTCGAAA AAGCCTCGAG CAAGGGTCTC
CCGATCGCGA TCACGGCAGT GTCTATCGGC CTTCTACCCC TAATCGTCGC CACCTTGAAG
CTGACAGATC CGTCCAAACT CGGCGCTTTC GAGCCGGTTT GGGGAACCGT CACGCTGCTG
AGTGGAGTCG GCACCGCACT TTACGGCTAC TATCTCAAGG CCAAGAGTCT CTTGCCGGGC
GTCGCCGGCA AATTCTTCGC GATAGCTGGG TCGCTGCTCT TCCTTTCGGG ACTGCTTATT
CTCTCCTTTG CTACCGCCCG GCAATTGTTT CTTCTCAATA CAAACTGGGC TCTGACGGGA
GGCGCGGGTC TCTTCATGCT GTCGATCGCC ATCGGCGTCG CCGGCAGCCT CAATGCCACC
GGCCTCCACC GGTTCTATCG CGATCGGCTG ATGGAAACCT TCATGCCGAT GACCGACGCT
ATCAGCCAGG GCACAGCGCG GCAAAGCGAC GTTGCCGATA CGCTCACCGT CGTCGATGTG
GTGCGCAGCG CCGAGGAGCG CGGCGACCGG CCCTATCACC TGCTCAACGC CCATGCGATC
CTGGTCAACG AGCCGGACGA TCCGAAGCTG GCGCTCCGCG GCGGCGACAA TTTTCTGATA
TCGCCGGCAA TCATCGGCTC CTCGGCTACC GGCTGGATGC GCAGTCGCGA CTACCTTCGG
CTGCAGGGTC CGCTGACACT GGCCTCGGCA ATGGCGGCCT CCGGCGCGGC CACCAATGCG
AATGCCGGCT ATATCGGAAC CGGCGTGACG CGCGACCGTT TCCTCTCGGC GGTGATGTCC
ATCCTCAACA TCAGGCTGGG ACTGTGGGTC GGGAACCCGC GATGGCTCGC CGCTAAATCT
CTGTTCGGCC TGCAAGTCCT GAAGGCTCCG ACCTATTTCC AGCCCGGACT CACCGCCGGC
ATTCTCGGAT TCGGTCACCA CAGAAAGGCG AAATTCCTCG AACTCTCCGA CGGTGGCCAC
TTCGAGAATC TCGGCCTGTA CGAACTGGTG CGGCGGCGGC TCGACCTGAT CATCGTGGTC
GACGCTGAAC AGGACAAGGA CATCAACTTG TCGGCTCTGG TGTCGTCGCA CAATCGCATC
AAGGAAGACT TTGGCGTCGC TCTGAAGTTC GCCCCATCCG ACAAGGGCAA GGGACCGGAA
CTCTTTCTCG GCGAAGAGGC CAAGAACAGA TATCCGCGCG GCCTACCTCT CGCCAAATCG
CCGTTCATGG TCGCGCGAAT CGAATATCCC GCGACGAAGA GTGGCGAGCC CAACAAGACC
GGCGTGCTGA TCTATTTGAA ATCGACCATC GTCGAGGGGC TGGATTTCGC CACCCTCGGC
TATCGCGCGC TCAACGCCGA CTTTCCGCAC CAGACAACCG CAGATCAGTT TTTCGATCCC
GATCAGTTCC AGGCGTACCG CAACCTCGGC CTCAGGAGCT GCGAGATCAT GGCGACCGCG
CTCGACCTCG AAGCAAACTT CGACAAGCCA ACCGAACTGC TGAAGAAGTA CGACGACTGG
AAGCCGGGCG CGTCAGCAGA CAGTTGA
 
Protein sequence
MLLDEATMLA EEEAFLWPTT KPDYRAGLSL SGGGIRAATV ALGVLEGLAS RGLLQRIHYL 
STVSGGGYIG SALSWFWCER RVVAEAALQK SRERTVHRFG ADTASFPFQE ERANASPVAE
AAALNLKFLR QHGSYLTSGD GIGFAGLIMA VLRTVLLSLA VWMPLLIAIF LSFEVLDSFL
SGASSDGELA AKCETAVGTT VFACRPSFIA LLSLAGAVGV AIFIGTILFA FLGHLASIRA
SGKRGRWIAL SASVGIGLAA HVIWKYNSSA TLQPLLGAQL LLELFMIAAA ISVAISQMSL
PENWSYSLRR RFEKASSKGL PIAITAVSIG LLPLIVATLK LTDPSKLGAF EPVWGTVTLL
SGVGTALYGY YLKAKSLLPG VAGKFFAIAG SLLFLSGLLI LSFATARQLF LLNTNWALTG
GAGLFMLSIA IGVAGSLNAT GLHRFYRDRL METFMPMTDA ISQGTARQSD VADTLTVVDV
VRSAEERGDR PYHLLNAHAI LVNEPDDPKL ALRGGDNFLI SPAIIGSSAT GWMRSRDYLR
LQGPLTLASA MAASGAATNA NAGYIGTGVT RDRFLSAVMS ILNIRLGLWV GNPRWLAAKS
LFGLQVLKAP TYFQPGLTAG ILGFGHHRKA KFLELSDGGH FENLGLYELV RRRLDLIIVV
DAEQDKDINL SALVSSHNRI KEDFGVALKF APSDKGKGPE LFLGEEAKNR YPRGLPLAKS
PFMVARIEYP ATKSGEPNKT GVLIYLKSTI VEGLDFATLG YRALNADFPH QTTADQFFDP
DQFQAYRNLG LRSCEIMATA LDLEANFDKP TELLKKYDDW KPGASADS