Gene RPB_3458 details

Gene Information

Locus tag: RPB_3458
Symbol:
ID: 3911260
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 3965132
End bp: 3969025
Gene Length: 3894 bp
Protein Length: 1297 aa
Translation table: 11
GC content: 71%
IMG OID: 637885361
Product: gene transfer agent (GTA) orfg15
Protein accession: YP_487065
Protein GI: 86750569
COG category:
COG ID:
TIGRFAM ID:
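
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the CDS is complete and ends in a stop codon (both are assumptions, not something the record states). The function names are illustrative only.

```python
# Values copied from the Gene Information fields above.
START_BP = 3965132
END_BP = 3969025
REPORTED_GENE_LENGTH = 3894      # bp
REPORTED_PROTEIN_LENGTH = 1297   # aa


def gene_length(start: int, end: int) -> int:
    """Length of a feature whose start and end are both inclusive (assumed)."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residues encoded by a complete CDS: one per codon, minus the stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


computed_gene = gene_length(START_BP, END_BP)
computed_protein = protein_length(computed_gene)

print(computed_gene, computed_protein)  # prints "3894 1297"
```

Under these assumptions the computed values agree with the reported gene length and protein length.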


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.384127
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.795293
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCCGCCG TCCCACGCTC ACACGAAGTA ACAACCATGG CAGCTCTTGT TCTTTCGGTC 
GCCGGTGCGG CGGCTGGTGC GGTGTTCGGG CCGGTCGGCG CGATGGCCGG GCGGATCGCG
GGGGCGCTGA TCGGCAATGC GCTGGATCAG TCGCTGTTCG GCGCCGGGTC GAAATCGGTC
GAGGGGCCGC GGCTGGCGGA TCTCGAGGTG ATGGCCTCGA CCGAAGGCGC GCCGATCCCG
CGGGTCTACG GCCGGGTGCG GCTGGCGGGG CAGGTGATCT GGGCGACGCA ACTCGAGGAG
GTGATCACCT CCGAGGAAAG TTCGTCCAGC GGCGGCAAGG GTCTCGGCGG CGGCGCGACG
ACGACCACCA CGACCTACGC GTATTTCGCC AATTTCGCGG TCGGGCTGTG CGAGGGGCCG
ATCGGCCGCG TCGCGCGGAT CTGGGCCGAC GGCAAGCCGC TCGATCTGTC GGGCCTCGGG
CTTCGCATCC ATCACGGCGG CGAGGACCAG GCACCGGACG GTCTGATCGT CGCCAAGGAG
GGCGCCGCCA ACGCGCCGGC CTATCGCGGG CTGGCCTATG TGGTGTTCGA GCGGCTGCCG
CTGGCCGACT ACGGCAACCG GATTCCGCAA ATGTCGTTCG AGATCGTGCG GCCGATCGGC
GGGCTCGAAC AGATGGTGCG CGCGGTGACG CTGATCCCCG GCACCACCGA ATACGGCTAC
GAGCCGTCGA CATTGGTGCA GATCCTCGAC CGAGGCACCT CGACGCCGGA GAACCGCCAC
GTCGCCCACG CGGTCTCCGA CGTGGTCGCC TCGCTCGACG AATTGCAGGG CGTGTGCCCG
CGGCTGGAGC GCGTCGCGAT CGTGGTGGCG TGGTTCGGCT CCGATCTGCG CGCCGGCCAG
TGCAGCGTGC GGCCGGGCGT CGAGAACCGC GACAAATGGG TCGCCGGCGC GAGCTGGTCG
GTCGACGGCG CGCACCGCGG CAATGCCTGG CTGGTGTCGC AGGTCGACGG CCGCCCCGCC
TTTGGCGGCA CGCCGTCCGA CGACAGCGTG CGACATCTGA TCGGCGAGCT GAAGGCGCGC
GGGCTGAAGG TGACGTTCTA TCCGTTCGTG ATGATGGACG TGCCGGCTGC TAACAGCCTG
CCGAACCCGT GGACCGGCGC CGCGCCGCAG CCGCCTTATC CGTGGCGCGG GCGGATCAGT
TGCGATCCGG CGCCGGGCGT GGCCGGCTCG CCGCAGGGTA GCGCGACGGC GGCGGCGCAG
GTCGACAGTT TCTTCGCCGG CGGCGACTGG AATTATCGCC GGATGATTTT GCACTACGCG
CGGCTTTGCG CGGAGGCCGG CGGCGTCGAT GCGTTCCTGA TCGGCTCGGA GCTGCGCGGG
CTGACGCGGG TGCGCTCCGG CGCGGGCGTG TATCCGGCGG TGCAGCAGCT CGTCGCGCTG
GCGAACGACG TGAAAGCGAT CGTCGGCGGC GGCACGCTCG TCACCTATGC GGCGGACTGG
ACCGAATACG GCGCCGACGT CGTCACGCCG GACGCTTCCG AAGTGCGGTT TCCGCTCGAT
CCGCTGTGGG CAGCGCCCGC GATCGGCGTG ATCGGGATCG ACTACTACGC GCCGCTCGCC
GACTGGCGCG ACGACGCCGG CCATCTCGAC GCGGCGATCG CCGCTTCGAC CTGGGACCGC
CTCTATCTCG ACCGCAACGT CACCGCCGGC GAGGCATTCG ACTGGTACTA TCCGGACGAG
GCCGCGCGCG TGGCGCAAAG CCGCGTGCCG ATCACCGATG GGCTCGGCAA GCCGTGGACC
TTCCGGGTCA AGGACATCAA GAGCTTCTGG TCGCAGCCGC ATCACGAACG CGTCGGCGGC
GTCGAGCTCG CCGCGCCGAC CGCGTGGGCG CCGATGAGCA AGCCGGTGTG GCTGACGGAA
GTCGGCTGCC CCGCGGTCGA CAAGGGCGCC AACCAGCCGA GCGTGTTTCC CGATCCGAAA
TCCAGCGAGA ACTACGCGCC GTATTTCTCC AGCGGCGACC GCGACGACCT GATCCAGCGG
CGCTATCTCG AAGCGATCCT CGCCGCGTTC GATCCGGTGT TCGGCGCCAG CGAGGCGCGC
AATCCGCTGT CGCCGGTGTA TGGCGGCCGG ATGATCGATC CGTCGGCGAT CCATCTGTGG
AGCTGGGACG CGCGGCCCTA TCCGGTGTTT CCGGCCGCCG ACGAGGTCTG GAGCGACGCG
CCGAACTGGC AGAGCGGGCA CTGGCTGACC GGGCGGCTCG GCAATGCGCC GCTGGATGCG
CTGGTGGCGC AGCTCTGCGC CGACAGCGGC GTCGTCGGCG TCGATGCGAG CGCGTTGCGC
GACGCCTGCG ACGGCTATGT GGTCGACCGG CCGATGACGC CGCGGGCGAT GATCGAGCCG
CTGGCGATGG CCTACGCGTT CGACGCCACG GCGGCCGACG GCACGCTGCG CTTCGTGCAG
CGCGGCGGCG CGCCGGTGGC CGAATTGACC GAGGACGATC TGGTGCTGCC GGACAAGGGC
GCCTTGTCGC GGCTGACGCG GGCGCAGGAG ACCGAGCTGC CGCTCGAGGT GGCGTTCGGC
TTCACCGACG CGATCGCCGA CTATCGCCGC GCCGCGGCGG CGTCGCGCCG GCTGGTCGGC
GGCGCGCATC GCATCGTCCA CGCCGATCTC GCGGTGGTGA CCAACGACGC CGCCGCGGCG
CGACGCGCCG AGATCTTCCT GCAGGATCTG TGGGCCGGAC GCGAGACCGC GAGCTTCGCG
CTCGGGCCGC AGCGTCTGGC GCTGGCGCCC GGCGATGTGG TGGCGCTGAC GCTGAACGAT
CGGCGGCGGC TGTTCGAGAT CAGCGAAGTG GTCGACACCC AATCGCGCGC CGTCAAGGCG
CGCAGCATCG ACCCCGAAGT GTTCGCGCTG CCGCAGCGCG CGCCGCGCCG GATCACGCCG
GCAATCCCGG CGGCGCTGGG GCCGGCGCAT GTGGTGGCGC TCGACCTGCC GGTGATCGAC
AACACCGCGC CGGACGTACT GACGCGGCTC GCGGTGTTCG CCAATCCCTG GCCGGGATCG
GAAGTGATCT ACGCGTCCGC CGACGGCGCG AGCTATCAGC CGCTGGCGAG CGCGACGGTG
CGCGCGATCA TCGGCGAGAC GCTCGATCCG CTGCCGCGCG GGCCTGTGGC GGTGTGGGAC
CGCGGCAACC GGCTGCGGGT GCGGATTCTC GGCGGCGCGC TGTCGTCGCT GTCGGAGGCG
CGGGTGCTCG ACGGCGGCAA CGCCGCGGCG GTGCAGAACC CCGACGGCGA GTGGGAGATC
CTCCAATTCG CGCAGGCCGA ACTGATCGAC GGCAACACGT ACTTGCTGTC GCGGCTGCTG
CGCGGCCAGG CCGGCAGCGA GCAGGCGATG CGCGACCCGG TGCCGGCCGG CGTGCCGTTC
GTGCTGCTCG ACCGGCATCT GGTGCCGTTG GCGCGCGGGC TCGACGCGCT CGGGCGGGCG
AAGGCGTTGC GCGTGGTTGC GGCGGGACGC AGCCACGACG ATGCCAGCGC GGTGACGCTG
AGCGTGACGC CGGGGCCGAC GGCGCTGCGG CCGCTGGCGC CGGTGCATCT GAGGGCCAGG
CGCGAGGTCG ATGGCGTGAA TCTGTCGTGG ATCCGCCGCA CCCGGATCGA CGGCGATGGC
TGGGGCCTCG AGGTGCCGCT CGGCGAGGCC AGCGAGGCCT ATGTGATCGA CATCCTCGCA
CCGGACGGCG CATTGGTTCG CACGCTCAGC AGCGCCGCGC CGCAGGCGCT GTATCCGGCG
GCTTCCGAAC TCGCCGATTT CGGCGCGCCG CAACAGGGCC TGCGCGTCCG CGTCGCGCAA
CTCTCCGCCG GCGTCGGCCG CGGCTTCGCC GCCGACGCGA CGCTCGCACT TTAG
 
Protein sequence
MPAVPRSHEV TTMAALVLSV AGAAAGAVFG PVGAMAGRIA GALIGNALDQ SLFGAGSKSV 
EGPRLADLEV MASTEGAPIP RVYGRVRLAG QVIWATQLEE VITSEESSSS GGKGLGGGAT
TTTTTYAYFA NFAVGLCEGP IGRVARIWAD GKPLDLSGLG LRIHHGGEDQ APDGLIVAKE
GAANAPAYRG LAYVVFERLP LADYGNRIPQ MSFEIVRPIG GLEQMVRAVT LIPGTTEYGY
EPSTLVQILD RGTSTPENRH VAHAVSDVVA SLDELQGVCP RLERVAIVVA WFGSDLRAGQ
CSVRPGVENR DKWVAGASWS VDGAHRGNAW LVSQVDGRPA FGGTPSDDSV RHLIGELKAR
GLKVTFYPFV MMDVPAANSL PNPWTGAAPQ PPYPWRGRIS CDPAPGVAGS PQGSATAAAQ
VDSFFAGGDW NYRRMILHYA RLCAEAGGVD AFLIGSELRG LTRVRSGAGV YPAVQQLVAL
ANDVKAIVGG GTLVTYAADW TEYGADVVTP DASEVRFPLD PLWAAPAIGV IGIDYYAPLA
DWRDDAGHLD AAIAASTWDR LYLDRNVTAG EAFDWYYPDE AARVAQSRVP ITDGLGKPWT
FRVKDIKSFW SQPHHERVGG VELAAPTAWA PMSKPVWLTE VGCPAVDKGA NQPSVFPDPK
SSENYAPYFS SGDRDDLIQR RYLEAILAAF DPVFGASEAR NPLSPVYGGR MIDPSAIHLW
SWDARPYPVF PAADEVWSDA PNWQSGHWLT GRLGNAPLDA LVAQLCADSG VVGVDASALR
DACDGYVVDR PMTPRAMIEP LAMAYAFDAT AADGTLRFVQ RGGAPVAELT EDDLVLPDKG
ALSRLTRAQE TELPLEVAFG FTDAIADYRR AAAASRRLVG GAHRIVHADL AVVTNDAAAA
RRAEIFLQDL WAGRETASFA LGPQRLALAP GDVVALTLND RRRLFEISEV VDTQSRAVKA
RSIDPEVFAL PQRAPRRITP AIPAALGPAH VVALDLPVID NTAPDVLTRL AVFANPWPGS
EVIYASADGA SYQPLASATV RAIIGETLDP LPRGPVAVWD RGNRLRVRIL GGALSSLSEA
RVLDGGNAAA VQNPDGEWEI LQFAQAELID GNTYLLSRLL RGQAGSEQAM RDPVPAGVPF
VLLDRHLVPL ARGLDALGRA KALRVVAAGR SHDDASAVTL SVTPGPTALR PLAPVHLRAR
REVDGVNLSW IRRTRIDGDG WGLEVPLGEA SEAYVIDILA PDGALVRTLS SAAPQALYPA
ASELADFGAP QQGLRVRVAQ LSAGVGRGFA ADATLAL
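
The sequences above are displayed in space-separated blocks of 10 characters, so they need to be joined before any downstream use. The following is a minimal sketch of how that might be done, together with a simple GC-fraction calculation. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first displayed line of the gene sequence is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (spaces, newlines) and upper-case the sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(1 for base in seq if base in "GC") / len(seq)


# First displayed line of the gene sequence, copied from the record above.
first_line = (
    "ATGCCCGCCG TCCCACGCTC ACACGAAGTA ACAACCATGG CAGCTCTTGT TCTTTCGGTC"
)

seq = clean_sequence(first_line)
print(len(seq))          # prints 60 (six blocks of 10 bases)
print(seq[:3])           # prints ATG, the start codon
print(gc_fraction(seq))  # GC fraction of this line only, not of the whole gene
```

Applying the same two functions to the full gene sequence would allow a comparison with the reported gene length and GC content.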