Gene RPB_0915 details

Gene Information

Locus tag: RPB_0915
Symbol:
ID: 3909768
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 1055753
End bp: 1057234
Gene Length: 1482 bp
Protein Length: 493 aa
Translation table: 11
GC content: 65%
IMG OID: 637882808
Product: major facilitator transporter
Protein accession: YP_484537
Protein GI: 86748041
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
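
The coordinates and lengths listed above are consistent with one another, and a little arithmetic shows why. The following is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive and that the gene length includes one stop codon. Both are assumptions, although they agree with the numbers in this record. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming its length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1055753, 1057234)
print(length)                  # 1482
print(protein_length(length))  # 493
```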


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.86159
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.650234
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCCGTTT CCACAAGCGC CGCTTGGACG CAACCACCAC CGGCCCCGCG CGTCGGCGGT 
TGCATCATTG AAACCGACAT TCCGGCGCGG CTGGACGGCC TGCTGTGGAG CGGCTTTCAT
ACCCGCGTGG TGGTCGCGCT CGGCGTCACC TGGATTCTGG ACGGTCTCGA AGTGACGCTC
GCCGGCACGC TCTCGGGCGC GCTGAAGCAG CAGCTTCAGT TCTCCAACCT CGATGTCGGC
GTCGCCAACA GCGCCTATCT CGCCGGCGCG GTGCTCGGCG CGCTCGGCTT CGGCTGGCTG
ACCGACCGGA TCGGCCGCAA GAAACTGTTC TTCATCACGC TGGCGCTGTA TCTCGCCGCG
ACCGCGGCGA CGGCGCTGTC CTGGAATGTC TGGAGCTACG CGCTGTTTCG ATTTCTCACC
GGAGCCGGCA TCGGCGGCGA ATACACCGCC ATCAACTCGA CGATCCAGGA GCTGATGCCG
GCGCGCTATC GCGGCTGGAC CGATCTGGTG ATCAACGGCA GCTTCTGGCT CGGCGCGGCG
CTCGGTGCGG TCTGCGCCAT CGTGCTGCTC GACCCGGCGG TGATCGATCC CGAATATGGC
TGGCGGCTCG CTTACTTCAC CGGCGCCGTG CTCGGCATCG TGGTGTTCGT GATGCGGTTG
TGGATTCCGG AAAGCCCGCG CTGGCTCATG ATTCACGGCC GCCCGGAACA GGCCGAGGCG
ATCGTCGCCG AGATCGAGCG GACGGCGCGG ATCGCAACCG AGCCGGAGCA TCGCATCAAG
TCGAAGATCC GCCTGCAGAT GCGCAGCCAC ACGCCGCTGC GCGAGGTCGC GCACACGCTG
CTCACCACAT ACAGGCAACG TTCGTTCGTC GGACTGACGC TGATGGCGGC GCAGGCGTTC
TTCTACAATG CGATCTTCTT CACCTACGCG CTGATCCTGA CCGACTTCTT CGGCATTCCG
TCCAGCCATA TCGGCTGGTA CATCCTGCCG TTCGCCGCCG GCAATTTTCT CGGGCCGTTG
CTGCTCGGTC GATTGTTCGA TACGCTCGGC CGCCGCAAGA TGATCGCGTT CACCTACGGC
ATCTCGGGAC TGCTGCTCGC CGGCTCCGGC TATCTGTTCT CGATCGGTGC GCTGACCGCG
CAGAGCCAGA CGATCGCCTG GATGGTGATC TTCTTCTTCG CATCGCCGGC CGCGAGCGCG
GCCTATCTCA CGGTCAGCGA GACCTTCCCG CTCGAGGTTC GTGCGCTGGC GATCGCGATC
TTCTACGCGA TCGGCACAGG AATCGGCGGC GTCGCCGGCC CGGCGCTGTT CGGCGCGCTG
ATCGATACCG GATCGCGCAC CACCGTGTTC GCCGGCTATC TGCTCGGCGC GGCTCTGATG
ATCGTCGCCG CAATGGTCGG TTGGCGTTAT GGTGTGGCGG CCGAACGCAG GTCGCTGGAA
CAAGTGGCGC GGCCGCTGGC CGCCATAGAG GAAAACCGAT GA
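
The GC content listed under Gene Information can be recomputed from a sequence laid out in 10-base blocks like the one above. This is a minimal sketch; the helper name `gc_content` is hypothetical, and it assumes the sequence contains only unambiguous A, C, G and T bases.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases, ignoring the whitespace used for layout."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# First line of the gene sequence above, with its block spacing left in.
first_line = "ATGGCCGTTT CCACAAGCGC CGCTTGGACG CAACCACCAC CGGCCCCGCG CGTCGGCGGT"
print(round(gc_content(first_line), 1))
```

Passing the full gene sequence instead of the first line gives the value for the whole gene.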
 
Protein sequence
MAVSTSAAWT QPPPAPRVGG CIIETDIPAR LDGLLWSGFH TRVVVALGVT WILDGLEVTL 
AGTLSGALKQ QLQFSNLDVG VANSAYLAGA VLGALGFGWL TDRIGRKKLF FITLALYLAA
TAATALSWNV WSYALFRFLT GAGIGGEYTA INSTIQELMP ARYRGWTDLV INGSFWLGAA
LGAVCAIVLL DPAVIDPEYG WRLAYFTGAV LGIVVFVMRL WIPESPRWLM IHGRPEQAEA
IVAEIERTAR IATEPEHRIK SKIRLQMRSH TPLREVAHTL LTTYRQRSFV GLTLMAAQAF
FYNAIFFTYA LILTDFFGIP SSHIGWYILP FAAGNFLGPL LLGRLFDTLG RRKMIAFTYG
ISGLLLAGSG YLFSIGALTA QSQTIAWMVI FFFASPAASA AYLTVSETFP LEVRALAIAI
FYAIGTGIGG VAGPALFGAL IDTGSRTTVF AGYLLGAALM IVAAMVGWRY GVAAERRSLE
QVARPLAAIE ENR
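
The protein sequence corresponds to the gene sequence translated with translation table 11. This is a minimal sketch of that translation, assuming the reading frame starts at the first base and stops at the first stop codon. The names `CODON_TABLE` and `translate` are illustrative. Table 11 also permits alternative start codons that are read as methionine at initiation; the sketch does not handle that case, which does not arise here because the gene starts with ATG.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (NCBI translation table 11);
# "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate DNA from its first base up to the first stop codon."""
    dna = "".join(seq.split()).upper()  # drop the 10-base block spacing
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence above.
print(translate("ATGGCCGTTT CCACAAGCGC CGCTTGGACG"))  # MAVSTSAAWT
```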