Gene RPD_0801 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_0801 
Symbol 
ID4021275 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp900997 
End bp904401 
Gene Length3405 bp 
Protein Length1134 aa 
Translation table11 
GC content70% 
IMG OID637960991 
ProductSel1 
Protein accessionYP_567940 
Protein GI91975281 
COG category[R] General function prediction only 
COG ID[COG0790] FOG: TPR repeat, SEL1 subfamily 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.835897 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.855509 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATCGCG TATCGTGGAG CGTAGAAGGT ATCGAGCCGT CGGTGCGCGA GCGGGCGGAA 
GCCGCCGCCA AGCGCGCCGG CATGTCGCTG GCCGATTGGA TCAACGGTCA GCTCGGCGAT
ACCGCGCCCC AGACTTTGGT TCAGTCCCAG CCCCGCTCGG CCGCCGAGGC CGGACATCAG
CCGTTCGGCG CCGCATTGGC GGAAAACAGC GCCACCGAAG TTGCCGAGAT CCATCAGCGG
CTGGATTCTA TTGCACGCCA GATCGACCAG ATGTCGCGGC CGCCGGTGCG TAACGAGCCC
GGCGTGGCGC GGCAGCTCAA CGACGCAATT TCGCGGCTGG ACGCCCGCCT GGCGCGGATC
ACCGAGCCCA AGGCTTCTAC GACGGCTTCT ACGACGGCTT CTACGACAGC TTCTACGGCT
TCCGCGACGG TTCCTGCGGC GGCTCCCCCG GCGGCGGCAA CCGCCCGGAC CGCACCGCTT
CAACACGCCC CTCTCCAGAC CCCGACCGAG CGGGTCGAAC GCGCTGCGGC TCAGGTCTAT
CACGCCTCGC CCCCGCTCGA TCCCAATGCG CTCGATCTGG CGATCGCTGA AATCGCCGCG
CGACAGAACG AACTCGACAG CACGGTGAAC CGGGTCGCGC CGCGCCAGGC CCCCCCGATC
GTCCCGGCGA TGGCGCCGCC GCCGGTCCGA ACCGGGCCGG ACTTCTCCAG CCTGGAGCAG
CAACTCCTCA AGATCACCAG CCAGATCGAC GCGCTGCAGC GGCCCGACGT GATCGAGCAG
TCGATCGCGG CGTTCCGCGC CGATCTCGCC GAAATCCGTC AGACCATCAC CGAGGCGATG
CCCCGCAAGG CGATCGAGTC GCTCGAGAGC GAGATCAAAT CGCTGTCGCG GCGGCTCGAC
GAAACCCGCA GCAACGGCAG CGACGCCAGC GTCATCGCCG GCATCGAACG CGCGCTCGGG
GAGATCCACG ACGCGCTGCG CTCGCTCACC CCGGCCGAGC AACTCGCCGG CTTCGACGAG
GCGATCCGCA ATCTCGGCGG CAAGATCGAC ATGATCGTGC GCAACAGCGA CGATCCCGGC
ACGATGCAGC AGCTCGAGAA AGCCATCGGC GCTTTGCGTA GCATCGTCTC CAACGTCGCC
TCCAACGAGG CGCTGGCGCA GCTCAGCGAC AATGTTCACA CGCTGGCCGA CAAGGTCGAC
CAACTCACCC GCGTCGATCA TCACAGCGAC TCGTTTGCGG CGCTTGAAAA CCGCATCTCG
GCGCTGACCG CGGCGCTCGA AAGCCGCGAG CGCCCGGTCG CCGCCGACTC CTCCGAGCAG
CTCGAAGGCG CGGTGCGCGC GCTGTCGGAG CGACTCGACC AACTGCCGGT CGGCAACGAC
AGCTCGTCGG CCTTCGCGCA TCTCGAGCAG CGCGTCTCCT ATCTCCTGGA ACGGATGGAA
ACCGCCGCGA CGCCGCGCGG CAGCGGCGAT TTCGGCCGGG TCGAGGAAGG GCTGCAGGAC
ATCCTGCGGA TGCTCGAGCG GCAGCAGGAG AATTTTCATC GCTTGGCCGA CATCGAGCGC
GCGCCGCCGC CGCCCGCTTT CGATCCCGGA GTTGTGGAGA CCCTCAAGCG CGAAGTCTCC
GACATGCGCT TCAGCCAGTC GGAAACCGGC CGGCACACCC AGGACTCGCT GGAAGCGGTT
CACAACACCC TCGGCCATGT CGTCGACCGG CTGGCGATGA TCGAAGGCGA CCTGCGCGCG
GCGCGCGCCG CGCCGCAGCC CGCGCCGGAG CCGGCCAGGC CGCTGCCGGT GACCGCCCAG
CCGGCGGCGT CGCCGCCGGT TTCGCTGCCG CCGCGTCCTG AAATGCCGAA TCCCGCCGCG
GCGACTGCAT TCAGCGCTGC GCCACGAGAG TTCGCGCCGG CGCAACCGGC GCCGGCACCC
GCACCTGCAC CGCGGGCGAT CCAGGACATT CTCGATCCGG CTGCGAGCCG GGCCGCGGCC
GGCCCCTCGA CCGAGCCGCA GATTTCGTCG CCACACGCAT CGATCAATCC CGCATTGCCG
CCGGACCACC CGCTGGAGCC AGGCTCCCGC CCGCCGGCCC GGGTCACCTC GCCGTCCGAG
CGGATCGCGG CATCGGAAAG CGCGATCAAC GAACTCGGCG GCGCCAAGCC GGAGCCGGCC
AGCAGCTCGA ACTTCATCGC CGCCGCCCGC CGCGCGGCGC AAGCCGCAGC GTCGGCGACC
GGCCATTCCA CCGACAAGTC CAAAGGCGAC GGCAAAGCCG GCCCGACCCC CGGCAAGGCC
GGACCGGGCT CCACCATCGG CTCCAAGATT CGCTCACTTC TGGTTGGCGC CAGCGTCGTC
GTGATCGTGC TCGGCACCTT CAAGATGGCG ATGAACTTGC TCGACGGTGG CCAGCCGGTC
CCGGCGGCAA GCCTCAGCGA GCCAGCGCCG CAAGGGATGG CGCCGTCCGA CGAGGACGAG
GATGACACGC CACCCGCCGC CAGCGCTCCT TCCGCACCGG CGCCATCGAT GACGTCGCCG
ACCCCGATCA ATCGCCAGTC GCTGTTCGCG CCGCAACAAC CGCCCGCGGC CGCACCGGCG
CCAGCCGCTC CCGCTCCGGC TCCGGCGGTC TCCCCGGCAA CCGCACCCGC AGACATCACC
GGCACCATCC CGGCGCCGCA GGCTGGCGCC GGGATGGGCG CCGCCGGCAA GATTGCGATT
CCGGCCGGCG AAACCCTGCC CGACGCGATC GGCGGACCGG TGCTGCGCAA GGCCGCGCTG
AAAGGCGACG CCGCGGCGGC TTTCGAAATC GGCAACCGCT ACGCCGACGG CAAAGGCATC
GCGGCGAATT TCGAAGAGGC CGCCAAATGG TACGGCCGCG CTGCGCAGGC CGGCATCGTG
CCGGCGATGT TCCGGATGGG CACCCTCAAC GAGAAGGGGC TCGGGCTGAA AAAGGATCTC
GATACGGCGC GGCGCTACTA CGTGCAGGCG GCCGATCGCG GCAACGCCAA GGCGATGCAC
AATCTCGCCG TGCTCGACGC CGACGGCGGC TCGAAGGGCG CGAACTACAA GACCGCGGCG
GAGTGGTTCA GGAAAGCCGC CGAGCGCGGC GTCGCCGACA GCCAGTTCAA CCTCGGAATC
CTGTATGCAC GAGGCATCGG CGTCGAACAG AACCTCGCCG AATCGTTCAA ATGGTTCAGC
CTCGCGGCGG CGCAGGGCGA CTCCGACTCC GCCCGCAAGC GCGACGATGT CGCCAAGCGG
CTCGATCCGC AATCGCTGTC GGCGGCCAAA CTCGCGATCC AGACGTTTGT CGTCGAGCCG
CAGCCCGACG ACGCCGTCAA GGTTGCAGCG CCCGCCGGTG GCTGGGACGC CCAATCCCCG
GCGGCGGCGA TCAGTCCTGC GACCAGCAAG CGCGCGGCAC GCTAA
 
Protein sequence
MNRVSWSVEG IEPSVRERAE AAAKRAGMSL ADWINGQLGD TAPQTLVQSQ PRSAAEAGHQ 
PFGAALAENS ATEVAEIHQR LDSIARQIDQ MSRPPVRNEP GVARQLNDAI SRLDARLARI
TEPKASTTAS TTASTTASTA SATVPAAAPP AAATARTAPL QHAPLQTPTE RVERAAAQVY
HASPPLDPNA LDLAIAEIAA RQNELDSTVN RVAPRQAPPI VPAMAPPPVR TGPDFSSLEQ
QLLKITSQID ALQRPDVIEQ SIAAFRADLA EIRQTITEAM PRKAIESLES EIKSLSRRLD
ETRSNGSDAS VIAGIERALG EIHDALRSLT PAEQLAGFDE AIRNLGGKID MIVRNSDDPG
TMQQLEKAIG ALRSIVSNVA SNEALAQLSD NVHTLADKVD QLTRVDHHSD SFAALENRIS
ALTAALESRE RPVAADSSEQ LEGAVRALSE RLDQLPVGND SSSAFAHLEQ RVSYLLERME
TAATPRGSGD FGRVEEGLQD ILRMLERQQE NFHRLADIER APPPPAFDPG VVETLKREVS
DMRFSQSETG RHTQDSLEAV HNTLGHVVDR LAMIEGDLRA ARAAPQPAPE PARPLPVTAQ
PAASPPVSLP PRPEMPNPAA ATAFSAAPRE FAPAQPAPAP APAPRAIQDI LDPAASRAAA
GPSTEPQISS PHASINPALP PDHPLEPGSR PPARVTSPSE RIAASESAIN ELGGAKPEPA
SSSNFIAAAR RAAQAAASAT GHSTDKSKGD GKAGPTPGKA GPGSTIGSKI RSLLVGASVV
VIVLGTFKMA MNLLDGGQPV PAASLSEPAP QGMAPSDEDE DDTPPAASAP SAPAPSMTSP
TPINRQSLFA PQQPPAAAPA PAAPAPAPAV SPATAPADIT GTIPAPQAGA GMGAAGKIAI
PAGETLPDAI GGPVLRKAAL KGDAAAAFEI GNRYADGKGI AANFEEAAKW YGRAAQAGIV
PAMFRMGTLN EKGLGLKKDL DTARRYYVQA ADRGNAKAMH NLAVLDADGG SKGANYKTAA
EWFRKAAERG VADSQFNLGI LYARGIGVEQ NLAESFKWFS LAAAQGDSDS ARKRDDVAKR
LDPQSLSAAK LAIQTFVVEP QPDDAVKVAA PAGGWDAQSP AAAISPATSK RAAR