Gene RPD_3661 details


Gene Information

Locus tag: RPD_3661
Symbol:
ID: 4024175
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB5
Kingdom: Bacteria
Replicon accession: NC_007958
Strand:
Start bp: 4086497
End bp: 4088977
Gene Length: 2481 bp
Protein Length: 826 aa
Translation table: 11
GC content: 61%
IMG OID: 637963865
Product: hypothetical protein
Protein accession: YP_570785
Protein GI: 91978126
COG category:
COG ID:
TIGRFAM ID:
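
The coordinates and lengths in the table above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (the record does not state the convention) and that the CDS ends in a stop codon that is not counted in the protein length. The function names are illustrative and are not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 4086497
END_BP = 4088977


def gene_length(start_bp, end_bp):
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp):
    """Residues encoded by a CDS whose final codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # prints: 2481 826
```

Under these assumptions the result agrees with the listed gene length (2481 bp) and protein length (826 aa).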


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATATTCG TCGTACCCGT CGAACCGTTT GCGGACGAAA GTCTCGCCGG CCTGATCGTG 
CGGGCGACTG CGCGCAATTA CCACCGAAAC GCCGTGCAGA CGCTTCGCGC GATCGGCGTC
GGGCACGAAC CGCAATCGCT TTGCACGGGC ACGCCGGCCC TTGCCGCCCC GGTCGCGAGA
TTGATCGGTT CTGCCGATGC CAGTTCCGTA GCGCGCTTAT TCCACCCGGC GATCGAGGGT
CGCAGGAACT TGCTTGATTT CTTCGGCGAA CCCTTGCGGC CATTCTATCG GGAATCCGAG
CGTCGGCGTG TCGCTCCCAG CGCGATGAAG AATGGAGCAT ACATTAGGGC CATATGGTCA
CTTCGTCCGT TTTCCTTCGA CCCGGTAACA AAGGAGACCT TGATCGACGC GTGCCCGCAG
TGTCGGCGCG AGCTTCGTTG GCGGCTGACC TACGGCGTCG CGTTCTGCGA TTACTGCTGT
CGCGAAGATC GGTTCATGGA CATGTCATGG GATTATCCGG GTCTTGATCT TCGGGACTTT
CCGCAGCCGA AGGTAGAGGT CGAAGATGAA GAGGCGCTAG ATTTCGCCAC CGGTCTCATC
GACCCGTTGG CGGGCAGAAA GGAGGCTGCG CGTCGTTTGA TACCCGAGAT GTGGGCCGGA
CTCTCGAACG GAGACCTCTT CGAAGTGCTG ATGACGTTCG TCAGCATGAT GAACGTGGAG
CGGTGGGATG AGAAGAAGAC CCTGTCCCGC CGGGTCGGCA GGACCGCTGA CGGTTGGGAT
AAGGTCACCC CGCAACTGAT CGCAGTGGCC GGTAGAGCGA TCATCGGAGG CAAGTCCGGC
TTTGAACGGT TCGGCGACCT GATGCGGATC GAAGGCAAGG ACAAGCCAAG AGACAGACGG
TACGGCAAGC GCCTCCAGAT CGGGCCGCTC TCCCTTATCG ACCCGAACCT CACGGAAAAG
GCGAAGGCCA TACTAAAGGC AGCGACGGAG CGTTATTTCC ACGCGCGCAG GGACCCCGAC
ATGGTGCCTC TGCTCCACCT CTCGCGCAAA TATCGCATCG AACGTACGGC CCTGAGCCGA
CTCGCCGATA GTCGACTCAT TCCGACCGAG ACCTGCGAAG ATCTTAAAAA GGCTCCGGTG
TTGATGTCCG ACAAGGCACT GGCTCCGCTC ATAGCGGAGC GTCGCAATGC ATTATCTGCC
GGCAAGGTGG GCACCCTGAT CGGTATTCAC CGCATACACG TCGCAGACCT TCATGCCCGT
GGTCTGCTGG ACAAGATCGA TGGTCCGGCG CTGAAGCTGT TGAAATCGGA TGCGTACTAC
ACGCGCTCGT CGGTCGATAC TTTGATCCGC GGCACGGCAG GCCGTAAATC CGGAGAAGCG
GCTGTAAAAG CGGTACGCCT GAGGGTCGCG CTCCGAACGC TGAAGGTCAC GCATGTTCCA
TGGGCTGCGA TAGTCGCTGC CATCGTGGAA CGCCGCCTGT CGGTTATTGC CCTGAAGGCC
AAGGGTCCGC TGGGCGATCG GCTCGCCGTG CAAGACCCCG CGGATCTGGC TCGAATTCTC
GATCAGGAAA CCGAAGCGGT CGAGCGTTCG GAATGGCTCG GCAACGCGGT TGTGGCAGAA
ATCCTTGGCA CCAACGAGGC CGCAGTATGG CGCCTGATGC AGACGGGAAT GCTGAAGCAG
CACGCTTGCG CTCCGAGGTA TCAGCCCTTC AAACGGAGCG AAGTGGACAA GCTTGCGCGG
GATGTCGTGT TTACGCCGGA GATCGTCGCG CGGGGCTCGT TCAATACCTA TCGTGCCGTA
GCGTCCTGGC TTCGCGAACG GGAAATCGAG CCGGTGAAGG AGTTGAAGAA GGCGGGCTGG
AGGCTCTACT CCCGTCCGCA GGTCGAGCAG GCTCTGAAGG AAAGGCGTCG GACGCTGCTC
GCCAGGCCCC GTCGGCTCCC GCCGCCAAGG CCAAAAGGCG TGCTGCATGG ACAGAATTCG
CCAGAAGGCA AGCTGGCCGA CGTTCGGGAG AGGGCGGATG CTTCTAGGAT CGGCCATGCC
ACCGCGGCGG CGCTCTTGGG ATGTTCGATC TTTGCGGTCC AGAGACTCGC CACGATCGGA
AGGCTGAATC AGGTGGGAAA GGTGACGCCC TACGCCAGGA GCGAGGTTCA CGCGTTGGCG
CGGGAACTCG TCTTCCTGCC TGAAATAATG AGCCGGACCG GCCTTATTAC AGTGCGTGGC
ACGAATGCAT GGCTCCGGAA GAACGGCATG AAGCCCACGA TGTTGCTCAA AGGTGGCGAC
CGGCTTCCAG TTTTCGACCG TGCTGCCGTG GAGAAGGTGT TGGGAAAGCC CATCTCGATC
GGGAGGAGTT ATTCGGCCGA GACGAAGGCA GGACTTCTGG CCATGGTGGA CGGTGGAAGC
AGCGTGCACG CGGCTTCGAA GCATCTCGGA GTGAAGTATG CTACCGCGAA GGCGTGGGTC
AGGGAGAGGA GGAAGGCCTA G
 
Protein sequence
MIFVVPVEPF ADESLAGLIV RATARNYHRN AVQTLRAIGV GHEPQSLCTG TPALAAPVAR 
LIGSADASSV ARLFHPAIEG RRNLLDFFGE PLRPFYRESE RRRVAPSAMK NGAYIRAIWS
LRPFSFDPVT KETLIDACPQ CRRELRWRLT YGVAFCDYCC REDRFMDMSW DYPGLDLRDF
PQPKVEVEDE EALDFATGLI DPLAGRKEAA RRLIPEMWAG LSNGDLFEVL MTFVSMMNVE
RWDEKKTLSR RVGRTADGWD KVTPQLIAVA GRAIIGGKSG FERFGDLMRI EGKDKPRDRR
YGKRLQIGPL SLIDPNLTEK AKAILKAATE RYFHARRDPD MVPLLHLSRK YRIERTALSR
LADSRLIPTE TCEDLKKAPV LMSDKALAPL IAERRNALSA GKVGTLIGIH RIHVADLHAR
GLLDKIDGPA LKLLKSDAYY TRSSVDTLIR GTAGRKSGEA AVKAVRLRVA LRTLKVTHVP
WAAIVAAIVE RRLSVIALKA KGPLGDRLAV QDPADLARIL DQETEAVERS EWLGNAVVAE
ILGTNEAAVW RLMQTGMLKQ HACAPRYQPF KRSEVDKLAR DVVFTPEIVA RGSFNTYRAV
ASWLREREIE PVKELKKAGW RLYSRPQVEQ ALKERRRTLL ARPRRLPPPR PKGVLHGQNS
PEGKLADVRE RADASRIGHA TAAALLGCSI FAVQRLATIG RLNQVGKVTP YARSEVHALA
RELVFLPEIM SRTGLITVRG TNAWLRKNGM KPTMLLKGGD RLPVFDRAAV EKVLGKPISI
GRSYSAETKA GLLAMVDGGS SVHAASKHLG VKYATAKAWV RERRKA
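
The gene and protein sequences above are laid out in space-separated blocks of ten characters. A minimal sketch of how such blocks could be joined, translated, and checked for GC content follows. It assumes the standard codon assignments, which translation table 11 shares with the standard code; it does not handle alternative start codons. All function names are hypothetical.

```python
BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def clean(seq):
    """Remove the spaces and line breaks used for display and upper-case."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_percent(seq):
    """Percentage of G and C bases in the sequence."""
    dna = clean(seq)
    return 100.0 * (dna.count("G") + dna.count("C")) / len(dna)


# First three blocks of the gene sequence.
print(translate("ATGATATTCG TCGTACCCGT CGAACCGTTT"))  # prints: MIFVVPVEPF
```

The translated prefix matches the first ten residues of the protein sequence listed above. Passing the full gene sequence to `translate` and `gc_percent` would allow the same comparison against the complete protein sequence and the reported GC content.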