Gene RPB_3991 details


Gene Information

Locus tag: RPB_3991
Symbol:
ID: 3911798
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 4557829
End bp: 4559172
Gene Length: 1344 bp
Protein Length: 447 aa
Translation table: 11
GC content: 68%
IMG OID: 637885895
Product: light harvesting pigment MFS transporter Bch2
Protein accession: YP_487595
Protein GI: 86751099
COG category:
COG ID:
TIGRFAM ID:
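
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon; neither convention is stated in the record, and the function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 4557829
END_BP = 4559172
GENE_LENGTH_BP = 1344
PROTEIN_LENGTH_AA = 447


def span_length(start_bp, end_bp):
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp):
    """Number of residues for an unspliced CDS, assuming the final codon is a stop."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Both checks pass for this record under the stated assumptions.
assert span_length(START_BP, END_BP) == GENE_LENGTH_BP
assert expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
```

Under these assumptions the 1344 bp span gives 448 codons, of which the last is the stop codon, matching the reported 447 aa.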


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.772479
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.25213
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATGCGAC CATTGTCTTG GCTTGGCATT GTCCGAATGG GTCTGGTGCA AACCGGCCTT 
GGCGCCATCG TCGTGCTCAC CACCTCGACC TTGAACCGCG TGATGGTGGT GGAACTGGCA
CTTCCGGCGA TGCTGCCGGG CGCGCTGGTC GCGATCCACT ATGCGCTGCA GGTGTTCCGC
CCGGCCTGGG GCCACGGCTC CGACCGCGGC GCGCGGCGGA CGCCATGGAT CATCGGCGGC
ATGGCCGTGC TCGCGCTCGG CGGCTTCCTC GCTGCAGTCG CGACGGCATG GATGAGCACG
CAGCCGCTGT TCGGCGTGGC TCTCGCCATC GTCGCGTTTT GTCTGATCGG CGGTGGCGTC
GGCGCGGCCG GAACATCGCT GTTGGTGCTG CTCGCCAAGC GCACCGACGA ACGCCGACGC
GCGGCGGCGG CGACCATAGT GTGGGTGATG ATGATCGCAG GATTTATCGT CACCACCGGC
TTCGCCGGCC ATCTGCTCGA TCCGTTCTCG CCGGCGCGGC TGGTCGCGGT GTCGGGCGGG
GTCTCGGTGA TCGCGATGCT GCTCACTTTC GTCGGCGTCT GGGGCATCGA AGGCAAAGCG
GCCACCGCCG AGGTGGTGGC AAAGCAAGCG GCCGACAAGG GCTCGTTCCG CGCCGCCTTC
AAGGAAGTCT GGGCCGAGCC GCAGGCGCGC CGGTTCGCGA TCTTCGTATT CGTGTCGATG
CTCGCCTACA GCGCTCAGGA CCTGATCCTG GAGCCGTTCG CCGGTGCAGT GTTCGGTTTC
ACGCCGGGCG AGACCACCAA ATTGTCGAGC GTGCAGCATG GCGGCACGCT GATCGGCATG
GCGCTCGTGC CGCTGATCGG CGCGCTGTTT CCTCGATCGC GCGGCAATTT GCAGATCTGG
ACCGTCGGCG GCTGCATCGC CTCGGCGATC GCCTTGCTGG GCCTGTCGAC GGCTGCGATG
GTCGGGCCGT CCTGGCCGCT GCGGGAAACC GTGTTTCTGC TCGGCATCAC CAACGGCGCC
TATGCGGTCG CGGCGATCGG CTCGATGATG GAACTGGTCA CCGCCGGCGG CGAAAAGCGC
GAAGGCGTCC GCATGGGGTT GTGGGGCGCG GCGCAGGCGA TCGCCTTCGG CATCGGCGGC
TTCGTCGGGA CTCTGGCCAG CGACGTCGCG CGCTTCATCC TGTCGTCGCC GGCGCTGTCC
TATGCGTCGG TGTTCGCCGC TGAGGCGGGA CTGTTCATTG CCTCCGCCGC GATGGCCGTC
TGGGTGCATC GCGCCCAGGT CCGTTCTGCC CGAAATCAGA GTCAAGTCGT CGGTCTGTCC
AACGCCGCGG TTGCGGGAGG GTGA
 
Protein sequence
MMRPLSWLGI VRMGLVQTGL GAIVVLTTST LNRVMVVELA LPAMLPGALV AIHYALQVFR 
PAWGHGSDRG ARRTPWIIGG MAVLALGGFL AAVATAWMST QPLFGVALAI VAFCLIGGGV
GAAGTSLLVL LAKRTDERRR AAAATIVWVM MIAGFIVTTG FAGHLLDPFS PARLVAVSGG
VSVIAMLLTF VGVWGIEGKA ATAEVVAKQA ADKGSFRAAF KEVWAEPQAR RFAIFVFVSM
LAYSAQDLIL EPFAGAVFGF TPGETTKLSS VQHGGTLIGM ALVPLIGALF PRSRGNLQIW
TVGGCIASAI ALLGLSTAAM VGPSWPLRET VFLLGITNGA YAVAAIGSMM ELVTAGGEKR
EGVRMGLWGA AQAIAFGIGG FVGTLASDVA RFILSSPALS YASVFAAEAG LFIASAAMAV
WVHRAQVRSA RNQSQVVGLS NAAVAGG
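
The relationship between the gene sequence and the protein sequence can be illustrated with a small translation routine. This is a sketch, not the database's own method: it assumes the sequence is given 5' to 3' on the coding strand, strips the 10-base grouping used above, and uses the amino acid assignments of translation table 11. The names `clean`, `translate`, and `CODON_TABLE` are hypothetical. It does not handle ambiguous bases, and it does not apply the rule that alternative start codons (for example GTG or TTG) are read as M at the initiation position; that rule is not needed here because the gene starts with ATG.

```python
BASES = "TCAG"
# Amino acids for translation table 11, with codons ordered TTT, TTC, TTA, TTG, TCT, ...
TABLE_11_AAS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: TABLE_11_AAS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used for display and upper-case the bases."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate a coding-strand DNA sequence, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence, truncated to its first 30 bases (10 codons).
print(translate("ATGATGCGAC CATTGTCTTG GCTTGGCATT"))  # MMRPLSWLGI
```

The printed residues match the first ten residues of the protein sequence listed above.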