Gene RPB_1643 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_1643 
Symbol 
ID3909920 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp1872997 
End bp1874226 
Gene Length1230 bp 
Protein Length409 aa 
Translation table11 
GC content65% 
IMG OID637883537 
Productmajor facilitator transporter 
Protein accessionYP_485262 
Protein GI86748766 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.916878 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTCCGCA ACTACGTCAT CGTCACGGCC TCCTATTGGG GCTTCACGCT GGTCGACGGC 
GCGTTGCGGA TGCTGGTGCT ATTTCACTTC TTCCGACTGG GCTACACCCC GTTCACGCTG
GCGTTTCTGT TTCTGCTGTA TGAGGCAGCC GGCATCGCGG CGAATCTGGC GGGTGGCTAC
TTCGCCTCTC GATTCGGCAT TCCGCGGATG CTGGCCATTG GTCAGGCGCT GCAGATCGCC
GGCCTGTTGA TGCTGTCGGC GCTCGATCCG GCATGGACCG TGGCGGCCTC GGTGGCCTGG
GTGGTGGCGG CGCAGGGCAT CGCCGGCGTC GCCAAGGACC TGACCAAGAC CGCTTCGAAA
TCCGCCATCA AGGCGACCTC GGCGGAGGGC AGCGGGCAGT TGTTCCGCTG GGTGGCCTGG
TTCACCGGAT CGAAGAACGC GATGAAGGGC ATCGGCTTCT TCCTCGGTGG CCTGTTGCTC
GACCTCGTCG GCTTTCGGCC CGCGCTCTGG CTGATGGCTG CGCTGCTCGG GGTGATCTTT
GTCGCTGGTC TGGCTCTGCT GCCGCGCCAG CTCGGCAAGG CCAAGTCGTC GAAAACGATA
CGCGAACTGT TCGGCAAGTC GCGCGGCGTG AACCTGCTGG CCGCCGCCCG CATCTTCATG
TTCGGGGCGA GGGACGTCTG GTTCGTCGTC GGCCTTCCTG TGTTCCTGTA TGCCAACGGT
TGGCGCTTTC TCGAGGTCGG CGGATTTCTG GCGGCGTGGA CGATCGCTTA TGGCGGCATC
CAGGCAATCG CGCCGAGCCT GGTGACGCGG AGCGACGACG GCCTCAGCCG CGAAATCCCA
GCGGCGCGAC TATGGGCCTT GCTCCTCGCC GCGGTGCCGA TCGTGTTGGC CGTGGCGATG
GTCGCAGTCC CGATGGTGCG CCCGGATCTG GTGCTGGTGA TCGGTCTGGC GCTGTTCGGC
GTGCCGTTCG CGGTGAATTC GTCGCTGCAT TCCTATCTGA TCCTGGCCTA TGCCGGATCG
GAAAAGGCCG CCGAGGATGT CGGCTTCTAC TACGCGGCGA ATGCGGCTGG GCGGCTGCTC
GGGATCATTC TGTCGGGCGC GCTGTACCAG CTCGCGGGCA TCACCGGCTG TCTCATGGGA
TCTGCGGTCA TGCTGCTGCT GTGCTGGCTG ATCACGCTGG TGTTGCCGGT GACGGCTAGT
CCAACTCCGA TCCGACAGCA GCCGATCTGA
 
Protein sequence
MVRNYVIVTA SYWGFTLVDG ALRMLVLFHF FRLGYTPFTL AFLFLLYEAA GIAANLAGGY 
FASRFGIPRM LAIGQALQIA GLLMLSALDP AWTVAASVAW VVAAQGIAGV AKDLTKTASK
SAIKATSAEG SGQLFRWVAW FTGSKNAMKG IGFFLGGLLL DLVGFRPALW LMAALLGVIF
VAGLALLPRQ LGKAKSSKTI RELFGKSRGV NLLAAARIFM FGARDVWFVV GLPVFLYANG
WRFLEVGGFL AAWTIAYGGI QAIAPSLVTR SDDGLSREIP AARLWALLLA AVPIVLAVAM
VAVPMVRPDL VLVIGLALFG VPFAVNSSLH SYLILAYAGS EKAAEDVGFY YAANAAGRLL
GIILSGALYQ LAGITGCLMG SAVMLLLCWL ITLVLPVTAS PTPIRQQPI