Gene RPB_3001 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_3001 
Symbol 
ID3910800 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp3415825 
End bp3419559 
Gene Length3735 bp 
Protein Length1244 aa 
Translation table11 
GC content67% 
IMG OID637884907 
Producthypothetical protein 
Protein accessionYP_486614 
Protein GI86750118 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.11101 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGCCAC AGGAGCGTAT CCTGCGCGTC GGCGCAAAAC CCTCCCTCGG CGATCAGGCT 
GCGACAAAGC GCGTGCACCG AGAGGCCATG GCCAGCGACC CATCGCAAAA CGCATCCGGG
CCGCATCCCC ACCACGACGC CGATCATTGG CAGGAGAGCG GCTGGGATCC CGAGCACGAG
GAGGCCGGGC GCCATCGCGC CCGCAAATTG CTGGCGAGAC GGGGGCTCGG CTTTCACCGG
GTGGACGATT CGATGCGGCG CTGGCAGCAC CGATTGTCGG GAGCCCCCTG GTTGAGGCGG
ATGGCGATCG CACTGCTGGT GGTCGGCTTC GTGCTGGCGG CCGGTTTTGG TGGTCTGTGG
CTGCGGCTCG GTGCCGGACC GATCAATCTC GATATGGCGA CGCCCTGGCT GGCCGACGCG
ATCGAAGAGA ACATCGGCAA CGGCAACACC GTCGAGGTCG GCGGAACCCA GATCGAGCGG
GTCGGACGGA TTCGCGTCGC GGTTCGGATC CGCGATATCG TGGTGCGCGA CCGCGACCAT
GTGGTTGTTG CCACTGCGCC GAAAGCGGAA GTCCGGCTGT CGGGCCGCGC CCTGCTGATC
GGCAAATTGC GCGCGGAGAG TCTGCGCCTG GTCGATGCCG AACTGGCGAT CCGGATCACG
CCGGATGGCC AGGTGACGGT GTCGACCGGC GACACCGAGC GGCCGATTGC GACTGGCGTC
GCCTCGACTC GCAAGCCACC GGAATTCACC TTGCCGGGCC AGTCGCCCGC CGCTGCCACG
GCGCCCGGAC AAGACGGCAC AGTGTCGCCG GCGCAGCCGG GCGGTGCGGC GCCGCCCGCT
GCGGCCGACC GGGACGGCGA TGCGATGAAG AGCCTGCTCG CCGGGCTCGA TTGGCTCGAC
AGCCTGAGCC TGACGGGGCT CGATGGCCAG AACCTCAACG AGATCGGCCT GAAGAACGGC
AATCTCGTCG TCGACGACCA GCAGCGCGGC AATCGCTGGA GCTTCGAGAA CATCAGTCTC
AGCCTGCGCC GTCCCAGCCG CGGCGGCGTC GCGCTCAGCT TCGGCGAGGA GGGGGCGAAG
GCGTGGTCGC TGCGGGTTCA GGTCGGTCCG CCTCAGGATG GCGTGCGCAC CGTCGAGTTG
CACGCCAATC AGGTGCCCAC GCGCAACATC CTGCTGGCGC TGCGGCTGAA GAACCTGACT
TACGGCGCCG ACTTCCCGCT CACCGGCGAT CTCAAGGGTG AAATCGGTCG CGACGGTCTG
CCGACCTATT TCCGCGGCAA GCTGGTCGCC GGCGCCGGCA CCGTGATCGA TTACGACACG
CCCGATTATC CGATGGCGAT CGATCAGGCG GAATTCAATT TCGAATGGGA TGCCAACCGC
CGCGTGCTGA TGGCGCCGTT CAAGATCATC GCCGGCGCCA ACCGCGTCAC GCTGCTGGCG
GCGCTCGAGC CGCCGAACGG CAGTACACCG GATTGGCGGC TCGCTTTGTC GGGCGGCACC
ATCGTGCTGC CCGGCGCGGA AGGCGAAGCG CCGCTGATCT TCAACCGCAT CGCCGTTCGA
GTCAGCTTCG ACACCGACCA TCGCCGGGTG CTGCTGACCC AGGCCGACAT CAACAACGGC
GAGATCGGCG TCGCCGGCTC CGGCAGCATC GACTACAGCG GCGAGCCGCG ACTGCAACTC
GGACTGGCCG GTACGCCGAT GTCGGTGTCG GCGCTGAAGC GGATGTGGCC GATCCTGGTG
GTGCCGGAAG TTCGGGAATG GGTCTATGAG CGGATCGACA AGGGATCGAT CCAGAGCATC
GATATCGCCG TCAATTCACC GGTGAAAAAC CTGTCGCGGC GAGGGCCGCC GATTCCGGAC
GAAGGGCTGC TGGTCAATAT CATCGGCACC GGTGCGACCA TCCACCCGGT CGACGGCATG
CCTTGGGTGC GCGACGCAGA TATGCGGGTT CGCGTCACCG GCCGCACCGC TGCGGTCGCG
ATCGCGCAAG CCGGCGCCGA TACGCCCAGC GGCCGCAAGA TCGCGCTATC GGATATCCTG
TTCGAGGTGC CGGACATGGC GCCGAAGCCG TCGCCGTCGC GGATCAAGTT CAAGCTGGAC
GCACCGGTGC CGGCCGTCGC CGAGGTGCTG TCGTCGGGCC GGCTCAGCGA CGTCAGCGAC
GTCCCGATGG ATCCCAACAC CAGCAAGGGC GCCGTCAGCG CGCAGGTCAT GCTCGGCATG
CCGATCCAGC GCGAACTGAC CAAGGCCGAC ACCACTTATT CGATCAATGC CGACGTCACC
GGCTTTTCCG CCGACAAGCT GGTGATGGGG CAGAAGCTCG AGGCGAACAC GCTGAAGGTG
AACGCCAACA ATCAGGGCTA TCAGGTCAAG GGCGACGTCA AGATCAACGG GCAGCCCGCG
ACGCTCGACT ATCGCAAGCC GGCGCAGGGC GAGGCCGATG TCCGGTTGCA GTCGACGCTG
GATGACGCCA GTCGCGCCAA GTTGGGGCTC GATCTCGGCA GCGCGGTTTC CGGCGCCGTG
CCTGTGAAGC TGGTCGGCAA GATCGGCGAC AGCGACCACG AGACCAAGTT CGGTATCGAC
GCGGACCTCA CCGCCCTGAA GCTCGACAAC ATCCTGCCCG GCTGGACCAA GCCGTCGGGC
AAGACGACCC GCGCGACGTT CAACGTCATT CAGAAGCCGC AGGCGATCCG GTTCGAGGAC
ATTCAGATCG AAGGCAACGG CACGCTGATC AAGGGCTCGC TCGAAGTCGA TGGCGACGGC
GACCTGATCA ACGCGAACTT TCCGGTGTAT TCGCCCTCCG AAGGCGACAA GACGACGCTG
AAGGCCGAGC GCGGCCAGGA CGGCGTGCTC AAGGTGGTGA TGCGCGGCGA CGTGTTCGAC
GGCCGCGGTT TCATCAAGTC GGCGCTGTCG GGCACCCAGG CCGAACCGAA GGCCAAGACC
AAGAGCCTCG ATTTCGATCT CGATCTCAAG CTCGGCGCTG TCGCAGGCTA TTTCGGCGAG
GCGCTGCGCA GCCTCGACGT CAAAATGGTC CGCCGCAACG GTGCATTCCG CACCTTCACG
CTGAGCGGCA AGCTCGGCCG CGATACGCCG ATCACCGCCG AATTGCGCGG CAAGAATCGC
GGCCGCGAGG TGATCGCTCT CGAGACCAAC GACGCCGGCG CATTCCTGCG CTTCTCCGAC
ACCTACTCGA AAATGTACGG CGGCCAGCTT GCGCTGGCGG TCGAGCCGCC GACCGCGGAG
CCCCGCCAGA AGGAAGGGCT GATCAACGTC CGCGACTTCA CCGTCAAGGG AGAGGCCGCC
CTCGAGCGCG CCGCCGCCGG CGCGCCCGGC GGCACGTCGA CGGGGGTGGC GTTCTCACGG
CTGCGGGCCG AGTTCACCCG GGACAACGGC CAGCTGTCGG TGCGCGACGG CGTGGTGAAG
GGGCCGACGA TCGGTGCGAC GATCGAGGGC TCGATCGACT ATCCCGCCAA CCAGGTCCGG
ATGAGCGGCA CCTTCGTGCC GATGTATGGT TTGAACAACA TGTTCGGGCA GATCCCGATC
GTCGGCCTGT TTCTCGGCGG CGGCAGCAAC GAGGGCCTGA TCGGCGTGAC CTATGAGGTC
GTCGGGACGC CGGGGCAGCC GGTGCTGCGC GTCAATCCGA TCTCGGCGAT GGCGCCCGGC
GTGCTGCGCA AGATCTTCGA ATTCAACACC GGCCGGCAGA ACAGCGGCGC CGATTTCCCG
GCGCCGCCGA ATTGA
 
Protein sequence
MPPQERILRV GAKPSLGDQA ATKRVHREAM ASDPSQNASG PHPHHDADHW QESGWDPEHE 
EAGRHRARKL LARRGLGFHR VDDSMRRWQH RLSGAPWLRR MAIALLVVGF VLAAGFGGLW
LRLGAGPINL DMATPWLADA IEENIGNGNT VEVGGTQIER VGRIRVAVRI RDIVVRDRDH
VVVATAPKAE VRLSGRALLI GKLRAESLRL VDAELAIRIT PDGQVTVSTG DTERPIATGV
ASTRKPPEFT LPGQSPAAAT APGQDGTVSP AQPGGAAPPA AADRDGDAMK SLLAGLDWLD
SLSLTGLDGQ NLNEIGLKNG NLVVDDQQRG NRWSFENISL SLRRPSRGGV ALSFGEEGAK
AWSLRVQVGP PQDGVRTVEL HANQVPTRNI LLALRLKNLT YGADFPLTGD LKGEIGRDGL
PTYFRGKLVA GAGTVIDYDT PDYPMAIDQA EFNFEWDANR RVLMAPFKII AGANRVTLLA
ALEPPNGSTP DWRLALSGGT IVLPGAEGEA PLIFNRIAVR VSFDTDHRRV LLTQADINNG
EIGVAGSGSI DYSGEPRLQL GLAGTPMSVS ALKRMWPILV VPEVREWVYE RIDKGSIQSI
DIAVNSPVKN LSRRGPPIPD EGLLVNIIGT GATIHPVDGM PWVRDADMRV RVTGRTAAVA
IAQAGADTPS GRKIALSDIL FEVPDMAPKP SPSRIKFKLD APVPAVAEVL SSGRLSDVSD
VPMDPNTSKG AVSAQVMLGM PIQRELTKAD TTYSINADVT GFSADKLVMG QKLEANTLKV
NANNQGYQVK GDVKINGQPA TLDYRKPAQG EADVRLQSTL DDASRAKLGL DLGSAVSGAV
PVKLVGKIGD SDHETKFGID ADLTALKLDN ILPGWTKPSG KTTRATFNVI QKPQAIRFED
IQIEGNGTLI KGSLEVDGDG DLINANFPVY SPSEGDKTTL KAERGQDGVL KVVMRGDVFD
GRGFIKSALS GTQAEPKAKT KSLDFDLDLK LGAVAGYFGE ALRSLDVKMV RRNGAFRTFT
LSGKLGRDTP ITAELRGKNR GREVIALETN DAGAFLRFSD TYSKMYGGQL ALAVEPPTAE
PRQKEGLINV RDFTVKGEAA LERAAAGAPG GTSTGVAFSR LRAEFTRDNG QLSVRDGVVK
GPTIGATIEG SIDYPANQVR MSGTFVPMYG LNNMFGQIPI VGLFLGGGSN EGLIGVTYEV
VGTPGQPVLR VNPISAMAPG VLRKIFEFNT GRQNSGADFP APPN