Gene RPD_1993 details


Gene Information

Locus tag: RPD_1993
Symbol:
ID: 4022475
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB5
Kingdom: Bacteria
Replicon accession: NC_007958
Strand:
Start bp: 2226929
End bp: 2230780
Gene Length: 3852 bp
Protein Length: 1283 aa
Translation table: 11
GC content: 71%
IMG OID: 637962186
Product: gene transfer agent (GTA) orfg15
Protein accession: YP_569129
Protein GI: 91976470
COG category:
COG ID:
TIGRFAM ID:
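
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the protein length excludes the stop codon; the variable names are illustrative only.

```python
# Values copied from the gene information above.
start_bp = 2226929
end_bp = 2230780

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# A CDS is read in codons of three bases; the final (stop) codon
# is assumed not to contribute an amino acid.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)
```

Under these assumptions the computed values agree with the listed gene length of 3852 bp and protein length of 1283 aa.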


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.263252
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.679187
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTAGCCC TCGTCCTCTC CATCGCCGGC GGCGCGGTTG GCGCGCTATT CGGGCCGGTC 
GGCGCGATTG CCGGGCGGAT CGCCGGTGCG CTCGCCGGCA GTATGCTCGA CCACGCGCTG
CTCGGCGGCG GCGACCGCAA CGTCGAAGGG CCTCGGCTCG CCGATCTCGA CATCATGGCG
TCGACCGAAG GCGCGCCGAT CCCGCGCGTC TATGGCCGGG CGCGTCTGGC AGGCCAGGTG
ATCTGGGCGA CGCAGCTCGA GGAAGTGATT TCGAACGAAC AGAGTTCATC GGGCGGCAAA
GGCTTCGGCG GCCCGACCAC CACGACGACG ACCTACGCGT ATTTCGCCAA TTTCGCGGTC
GGGCTGTGCG AGGGGCCGAT CGGCCGCGTC GCGCGGATCT GGGCCGATGG CAAGCCGCTC
GACCTGTCGG GCCTCAATGT CCGCGTTCAT CGCGGCAGCG AGGACCAGAG CCCGGATGAT
CTGATCGTCG CGAAACAGGG CGCCGGCGAT GCGCCGGCCT ATCGCGGGCT CGCTTATGTG
GTGTTCGAGC GGCTGCCGCT GGCTGCCTAT GGCAACCGGA TTCCGCAACT CTCATTCGAG
ATCGTCCGGC CGATCGGCGG GCTGGAAGCG ATGGTGCGCG CGGTGACGCT GATCCCCAGC
ACCACGGAAT TCGGCTACGA GCCGTCGACG CGGGTGCAGA TCTCCGGCAA AGGCAGCGCG
ACGCCGGAGA ACCGCCACGT CGCGCACGCC GCCTCCGACG TGGTCGACTC GCTCGACGAG
CTGCAGGGCG TATGCCCGCG GCTCGAACGC GTGGCCGTGG TGGTGGCGTG GTTCGGCTCC
GATCTGCGCG CCGGGCAATG CTCGGTGCGG CCCGGCGTCG AGAGCCGCGA TAAATGGGTC
GATCGCGCGA TCTGGTCAGT CGCCGGCGCG AGCCGCGGCG ACGCCTGGCT GGTGTCGCAG
GTCGACGGCC GGCCGGCGTT CGGCGGCACG CCGTCGGACG ACAGCGTCGT GCATCTGATC
GGCGAGCTGA AAGCGCGCGG GCTGAAGGTC ACCTTCTATC CGTTCGTGAT GATGGACGTG
CCGGCCGGCA ACGGCCTGCC GAATCCGTGG ACCGGCGCTG CGCCGCAGCC GCCTTATCCC
TGGCGCGGCC GCATCAGCTG CGATCCGGCG CCCGGGCAGG CGGGATCGCC GCAAGGCACC
GCGGCCGCGG CGGCGCAGGT CGAAAGCTTC TTCGCCGGTG GGACGTGGAA TTATCGCCGG
ATGATCCTGC ACTACGCGCA ACTCTGCGCG GCGGCAGGCG GCGTCGATGC GTTCCTGATC
GGCTCGGAGC TGCGTGCGCT GACGCGGGTG CGCTCGGGCC CGGGCGTCTA TCCGGCGGTG
CAGCATCTCG TCGCGCTGGC CAGCGACGTC AAGGCGATCG TCGGCGGCGG CACGCTCGTC
ACCTATGGCG CGGACTGGAC CGAATACGGC GCCGACGTCG TCACCGCCGA CGCCTCCGAG
GTGCGGTTTC CGCTCGATCC GCTGTGGGCG TCGCCTGCGA TCGACGCGAT CGGGATCGAT
TATTACGCGC CGCTCGCCGA CTGGCGCGAC GACGCCGGCC ATCTCGACGC AACGGCGGCC
GCCTCGAACT ATGACCGCGC CTATCTCGAT CGCAACGTGT TCGGTGGCGA GGCGTTCGAC
TGGTACTACC CGGACGATGC CGCGCGCGCC GCCCAAAGCC GTGCGCCGAT CACCGACGGT
CTCGGCAAGC CCTGGACCTT CCGGGTCAAG GACATCCGGA ATTTCTGGTC GCAACCGCAT
TATGAACGCG TCGGCGGCGC AGAGCTGACA TCGCCGACGG CGTGGGCGCC GATGAGCAAG
CCGGTGTGGC TGACCGAGGT CGGCTGCCCC GCGGTCGACA AGGGCGCCAA CCAGCCGAGC
GTGTTTCCCG ATCCGAAATC CAGCGAGAAC TTCGCGCCTT ATTTCTCCAG CGGCGACCGC
GACGATCTGA TCCAGCGCCG TTATCTCGAA ACCATCATCG CGGCGTTCGA CCCGGTATTC
GGCGCGAGCG AGGCGCGCAA TCCGGTGTCG CCGGTCTATG GCGGCCGCAT GGTCGATCCG
TCCGCGATCC ATCTGTGGAC CTGGGATGCG CGGCCCTATC CGGTGTTTCC GGCCGCGCAG
GAGGTGTGGA GCGACGCGCC GAACTGGCAG ACCGGGCATT GGCTGACCGG ACGGCTCGGC
GGCGCGCCGC TCGATGCGCT GGTGGCGCGG CTATTGGCCG ACAGCGGCGT CGGCGGCGTC
GATTCCAGCG CGCTGCAGGA GTGTTGCGAC GGCTATGTGG TCGACCGGCC GATGACGCCG
CGGGCGATGA TCGAGCCGCT GGCGATGGCC TATGCGTTCG ACGCCACCGC CGCGGACGGT
ACGCTTCGCT TCGTGCAGCG CGGCGGCGCG CCGGTTGTCG AACTCACCGA GGACGATCTG
GTGCTGCCGG ACAGAGCGGC GCTGTCGCGG CTGACGCGCG CGCAGGAGAC CGAACTGCCG
CGCGAGGTCG CGTTCGGCTT CTCGGATGCG ATCGCGGATT ATCGCCGCTC GGCGGCGTCG
TCGCGCCGGC TGGTCGGCGG CGCGGCCCGC ATCGTTCACG CCGATCTCGC GGTGGTGACC
AACGACGCCG CGGCCGCGCG CCGCGCCGAA ATCTTTCTGC AGGACCTGTG GGCCGGCCGC
GAGACCGCGA GCTTCGCGCT CGGGCTGTCG CATCTGTCGC TCGCGCCCGG CGACGTCGTG
GCGCTGACTC TGAACGGCCG GCGGCGGCTG TTCGAGATCG GCGAACTGGT CGACACCATG
TCGCGCGCGG TCAAGGCGCG CAGCATCGAT CCGGAGGTGT TCGCACTGCC GCAGCGGGCG
CCGCGCCTCG GCGCGCCGCA AATCCCCGCG GCGCTCGGGC CAGCGCATGT CGTGGCGCTC
GATCTGCCGG TGATCGACGC CGCCGCGCCG GAGGTGCTGA CGCGGCTCGC GGTGTTCGCC
AATCCCTGGC CCGGCGCGGA GCTGATCTAC GCCTCCGCCG ACGGCGCGAG CTACCAGCCG
CTGGCGAGCG CGACCGCGCG CGCGATCCTC GGCGAGACGC TCGATCCGCT GCCGCGCGGG
CCGCTCGGGT TGTGGGACCG CCGCAATCGC GTGCGGGTGC GGATCTATGG CGGCGCGTTG
TCGTCGCTGT CGGACGCCAG CGTGCTCAAT GGCGGCAATG CGGCGGCGGT GCAGAATCCC
GACGGCGAAT GGGAGATCCT GCAATTCGCC AATGCCGAAC TGGTCGACGG CAACACTTAT
GCGTTGTCGC GGCTGCTGCG TGGCCAGGCC GGCAGCGAGC AGGCGATGCG CGATTCCTTG
CCCGCGGCTG CGCCGTTCGT GCTGCTCGAC AGCCATCTGG TGCAGCTCGC GCGCGGCGTC
GACGCGCTCG GGCGGCCGCT GCAACTCCGC GTCGTCGCCG CGGGCCGCAG CCACGACGAT
CCGAGCGCAA CGGCGCTGGC GGTGACCCCG GGCCCGACCG CGCTGCGGCC GTTCGCGCCG
GTGCATCTGA AGGCCGTGCG CACGACCGCC GGCGTCGCGC TGAGCTGGAT CCGCCGCACC
CGGATCGGCG GCGATGGCTG GAGCGGGGAG GCGCCGCTCG GCGAGGAGTC CGAGGCCTAT
GCGCTCGACA TCCGCGCGCC AAGCGGGGCG GTCGTCCGGT CGATCACGAC CGCTGCGCCG
CAAGCGCTCT ACACGACCGC CGACGAGATC GCCGATTTCG GCGCGCCGCA GACCGCGCTG
CGCATCCGCG TCGCGCAGCT TTCCGCCACC GTCGGTCCGG GCTTCGCCGC CGACGCCACG
CTCGCGCTGT AG
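
The gene sequence above is laid out in blocks of ten bases separated by spaces, so it needs to be cleaned before any analysis. The following is a minimal sketch of removing the whitespace and computing a GC fraction; the function names `clean_sequence` and `gc_fraction` are hypothetical, and the demo uses only the first line of the sequence rather than the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks and upper-case the result."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence above, used only as a short demo.
first_line = "ATGGTAGCCC TCGTCCTCTC CATCGCCGGC GGCGCGGTTG GCGCGCTATT CGGGCCGGTC"
seq = clean_sequence(first_line)
print(len(seq), round(gc_fraction(seq), 2))
```

Applying the same two functions to all lines of the gene sequence would give a value that can be compared with the GC content reported in the gene information.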
 
Protein sequence
MVALVLSIAG GAVGALFGPV GAIAGRIAGA LAGSMLDHAL LGGGDRNVEG PRLADLDIMA 
STEGAPIPRV YGRARLAGQV IWATQLEEVI SNEQSSSGGK GFGGPTTTTT TYAYFANFAV
GLCEGPIGRV ARIWADGKPL DLSGLNVRVH RGSEDQSPDD LIVAKQGAGD APAYRGLAYV
VFERLPLAAY GNRIPQLSFE IVRPIGGLEA MVRAVTLIPS TTEFGYEPST RVQISGKGSA
TPENRHVAHA ASDVVDSLDE LQGVCPRLER VAVVVAWFGS DLRAGQCSVR PGVESRDKWV
DRAIWSVAGA SRGDAWLVSQ VDGRPAFGGT PSDDSVVHLI GELKARGLKV TFYPFVMMDV
PAGNGLPNPW TGAAPQPPYP WRGRISCDPA PGQAGSPQGT AAAAAQVESF FAGGTWNYRR
MILHYAQLCA AAGGVDAFLI GSELRALTRV RSGPGVYPAV QHLVALASDV KAIVGGGTLV
TYGADWTEYG ADVVTADASE VRFPLDPLWA SPAIDAIGID YYAPLADWRD DAGHLDATAA
ASNYDRAYLD RNVFGGEAFD WYYPDDAARA AQSRAPITDG LGKPWTFRVK DIRNFWSQPH
YERVGGAELT SPTAWAPMSK PVWLTEVGCP AVDKGANQPS VFPDPKSSEN FAPYFSSGDR
DDLIQRRYLE TIIAAFDPVF GASEARNPVS PVYGGRMVDP SAIHLWTWDA RPYPVFPAAQ
EVWSDAPNWQ TGHWLTGRLG GAPLDALVAR LLADSGVGGV DSSALQECCD GYVVDRPMTP
RAMIEPLAMA YAFDATAADG TLRFVQRGGA PVVELTEDDL VLPDRAALSR LTRAQETELP
REVAFGFSDA IADYRRSAAS SRRLVGGAAR IVHADLAVVT NDAAAARRAE IFLQDLWAGR
ETASFALGLS HLSLAPGDVV ALTLNGRRRL FEIGELVDTM SRAVKARSID PEVFALPQRA
PRLGAPQIPA ALGPAHVVAL DLPVIDAAAP EVLTRLAVFA NPWPGAELIY ASADGASYQP
LASATARAIL GETLDPLPRG PLGLWDRRNR VRVRIYGGAL SSLSDASVLN GGNAAAVQNP
DGEWEILQFA NAELVDGNTY ALSRLLRGQA GSEQAMRDSL PAAAPFVLLD SHLVQLARGV
DALGRPLQLR VVAAGRSHDD PSATALAVTP GPTALRPFAP VHLKAVRTTA GVALSWIRRT
RIGGDGWSGE APLGEESEAY ALDIRAPSGA VVRSITTAAP QALYTTADEI ADFGAPQTAL
RIRVAQLSAT VGPGFAADAT LAL
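
The protein sequence corresponds to the gene sequence read codon by codon under translation table 11. The sketch below shows one way to check this for a short stretch. It relies on the fact that table 11 assigns the same amino acids to codons as the standard genetic code (the tables differ in permitted start codons, which does not matter here because this gene begins with ATG). The `translate` helper is hypothetical and does not handle ambiguous bases such as N.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (third base varies fastest).
# "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string (whitespace allowed) up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 12 bases of the gene sequence above.
print(translate("ATGGTAGCCC TCGTCCTCTC CATCGCCGGC"))  # MVALVLSIAG
print(translate("CTCGCGCTGT AG"))                     # LAL
```

The two outputs match the first ten and last three residues of the protein sequence; the final TAG codon is a stop and is not translated.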