Gene RPB_4031 details


Gene Information

Locus tag: RPB_4031
Symbol:
ID: 3911838
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 4600978
End bp: 4602399
Gene Length: 1422 bp
Protein Length: 473 aa
Translation table: 11
GC content: 68%
IMG OID: 637885935
Product: PucC protein, chlorophyll MFS exporter
Protein accession: YP_487635
Protein GI: 86751139
COG category:
COG ID:
TIGRFAM ID:
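
The coordinate and length fields above are related by simple arithmetic, which the following minimal Python sketch checks. It assumes that the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon; the function names are hypothetical and not part of any database API.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of amino acids for an unspliced CDS whose length includes the stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_length_bp // 3 - 1


# Values taken from the record above.
gene_length = span_length(4600978, 4602399)
protein_length = expected_protein_length(gene_length)
print(gene_length, protein_length)  # 1422 473
```

Under these assumptions, the computed values agree with the reported Gene Length (1422 bp) and Protein Length (473 aa).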


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.344544
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACGCAG TCAGCCAAAA AATGATGAGA GTCTGGGCCT CTATGGGGTC TCGCTTTCTG 
CCGTTCGCGG ATGCGGCGAC GCCGGATCTG CCGCTGTCGC GGTTGCTGCG CCTGTCGCTG
TTTCAGGTGG CGGTCGGGAT GTCGCTGGTG CTGCTGGTCG GCACTTTGAA CCGCGTGATG
ATCGTCGAAC TCAACGTGCC GGCCTCGATC GTCGGCGTGA TGGTCTCGCT GCCTTTGCTG
TTCGCGCCGT TCCGCGCGCT GATCGGTTTC AAATCCGACG TCCACAAATC CGTGCTCGGC
TGGCGGCGCG TTCCCTTCCT CTATAAGGGC ACGCTGGTGC AATTTGGCGG CCTGGCGATC
CTGCCGTTCG CGCTCCTGGT GCTGTCCGGC AGCGGCGATG CGGGCAACGC CCCGGTGTGG
ATCGGACAAT TCGGCGCGGC GCTGGCGTTC CTGCTGATCG GCGCCGGCGT TCACACCACC
CAGACCGTGG GCCTTGCGCT CGCCACCGAC CTCGCCTCGC CGGAATCGCG GCCGAAGGTC
GTCGGCCTGA TGTACACCAT GCTGATGTTC GGCATGATCG CCGCGGCGAT CGTGTTCGGC
ATGCTGCTCG CTGATTTTTC GCCCGGCCGG CTGATCCAGG TGATCCAGGG CTCGGCCGTC
GTCACCATCG TTCTCAACGG CATCGCCGTC TGGAAGCAGG AAGCGCGGCG CACCTCCGGC
GCGACGCAGG CGACCGCGCA TCCCGGCGCG CCCTCCGCGA GCTTCCGCGA ATCCTGGGAC
GTCTTCATCC AGGGCAAGGA CGCGATGCGC CGGCTGATCG CGGTCGGCTT CGGCACCATG
GCGTTCAGCA TGGCGGACGT GCTGCTCGAA CCCTATGGCG GCCAGATCCT GTCGATGTCG
GTCGGCGACA CCACCAAGCT CACCGCGGCG CTCGCGGTCG GCGGTCTGCT CGGCTTCGGC
CTCGCCTCGC GCGTGCTGAG CCGCGGCGCA GATCCGTTCC GGATGGCGAG CTTCGGCTCG
CTGGTCGGCA TTCCGGCCTT TCTCGCGGTG ATCTTCGCCG CCGAACTGCA GGGCGCCGCG
TCGGTGCTGA CATTCGGCTG CGGTACCGCG CTGATCGGCT TCGGCGCCGG CCTGTTCGGC
CACGGCACGC TGACCGCGAC GATGAACGCC GCGCCGAAGG ACCAGGCCGG CCTCGCGCTC
GGCGCCTGGG GCGCGGTGCA GGCCTCCGCG GCGGGCGTGG CGATTGCGCT CGGCGGCATC
ATCCGGGATC TCGTGACGGC GTTCGCTCCG CAGTTCGGCC CGGCCGCGGG TTACAACGCC
GTCTACGGCC TCGAACTGCT GCTGTTGCTG GCGACGCTGG CGACGATGGT CCCGCTGATC
AAGCGACGGG ACACATTGTT GATGCAGGGA CAACTGAACT GA
 
Protein sequence
MNAVSQKMMR VWASMGSRFL PFADAATPDL PLSRLLRLSL FQVAVGMSLV LLVGTLNRVM 
IVELNVPASI VGVMVSLPLL FAPFRALIGF KSDVHKSVLG WRRVPFLYKG TLVQFGGLAI
LPFALLVLSG SGDAGNAPVW IGQFGAALAF LLIGAGVHTT QTVGLALATD LASPESRPKV
VGLMYTMLMF GMIAAAIVFG MLLADFSPGR LIQVIQGSAV VTIVLNGIAV WKQEARRTSG
ATQATAHPGA PSASFRESWD VFIQGKDAMR RLIAVGFGTM AFSMADVLLE PYGGQILSMS
VGDTTKLTAA LAVGGLLGFG LASRVLSRGA DPFRMASFGS LVGIPAFLAV IFAAELQGAA
SVLTFGCGTA LIGFGAGLFG HGTLTATMNA APKDQAGLAL GAWGAVQASA AGVAIALGGI
IRDLVTAFAP QFGPAAGYNA VYGLELLLLL ATLATMVPLI KRRDTLLMQG QLN
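
The gene and protein sequences above are linked by translation, and the GC content field is a simple base count. The Python sketch below illustrates both calculations under stated assumptions: the codon-to-amino-acid assignments are those of the standard genetic code, which translation table 11 shares; alternative start codons are not handled (this gene begins with ATG); and the helper names are hypothetical. It is an illustration, not the method the database used to derive its values.

```python
from itertools import product

# Standard codon assignments, listed in TCAG order for each codon position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to display the sequence in blocks of ten."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 30 bases of the gene sequence shown above.
first_block = "ATGAACGCAG TCAGCCAAAA AATGATGAGA"
print(translate(first_block))  # MNAVSQKMMR
```

The ten residues produced from the first 30 bases match the start of the protein sequence listed above. Passing the full gene sequence to `translate` and `gc_percent` would allow a comparison with the reported protein sequence and GC content.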