Gene RPB_2783 details


Gene Information

Locus tag: RPB_2783
Symbol:
ID: 3910576
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 3173039
End bp: 3174268
Gene Length: 1230 bp
Protein Length: 409 aa
Translation table: 11
GC content: 67%
IMG OID: 637884683
Product: major facilitator transporter
Protein accession: YP_486396
Protein GI: 86749900
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
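
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the last codon of the CDS is a stop codon that is not counted in the protein length. The function and constant names are illustrative only and are not part of the database.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 3173039
END_BP = 3174268
GENE_LENGTH_BP = 1230
PROTEIN_LENGTH_AA = 409


def span_length(start, end):
    """Length of an inclusive, 1-based coordinate range (assumed convention)."""
    return end - start + 1


def expected_protein_length(gene_length_bp):
    """Number of codons in the CDS minus one terminal stop codon (assumed)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Compare the derived values with the ones reported in the record.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions both comparisons hold: the coordinate span is 1230 bp, and 410 codons minus one stop codon gives 409 amino acids.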


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.35447
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCGCAGG GCGACAGTGC GGTCGTTCGG CCGAGCCTGC CGCCCGCGTT GAAGATCATT 
GCGCTGTCGG GCTTTGCCGC TAGCCTGTCC GCCCGCGCGC TCGATCCGGT GCTGCCGCGG
ATCGCGTCGG AATTCTCGGT CTCGATCGCC ACCGCCGCAG GTCTCGCCGC CGTGACGGCC
TTCACCTTCG CGGTGGTGCA GCCGGCCATC GGTGCGCTCG CCGACATGTT CGGCAAAGCT
CGACTGATGA TCGTCTGCCT CGCATTGCTC GGCTTCGCGA GCCTGCTCGG CGCGGTGTCG
TCGTCATTCG AGGTCCTGTT TCTGTGCCGG ATTCTCGCCG GCATCGGAGC CGGCGGCGTG
TTCCCGGTGG CGCTGGGGCT CACGAGCGAT CTGGTCGAAC CTGGGCGGCG GCAGGTCGCG
ATCGGGCGCG TGCTCGGCGG CTCGATGACC GGCAATCTGC TCGGCGCCTC GGCCTCCGGC
GTGATCGGCG ACGTGCTCGG CTGGCGCGGT GTGCTGGCCG TGCTCGGCGT CCTGGTGATC
GTCGCTGCGT TGGCAGTGTC GTTCGGCTTT CGCAATAAAC CGATGCGTCC TGGCGCACCG
ATGGATCTGG CGGCCTTACG CCGCGGCTAT CGGACCATCT TCAGCAATCC GAATGCTCCG
ATCTGCTTCG CAGCAGTATT GGTCGAGGGG ACCTGCGTGA TGGGGGCGTT CCCCTTCGTC
GCCGCCTTCC TGCACGATCA GGGGCAGGAA TCACTGGCGG TCGCCGGGCT GGTGATCGCG
GGCTTTGCTG TCGGCGGGCT GCTCTACACG CTGACCGTCG CACGACTGCT GCCACGGCTC
GGCGTTCGGG GCATGATGAT CGGTGGCGGC GCGCTGGTCG GCCTGCAGTT GGCGATCATC
GCGTTCGGGC CCCCGTGGCA GGCCCAGGCC GTGGCGTTCG TGGCGATGGG GATGGGCTTC
TACATGCTGC ACGGCTGCGT GCAGGTGTTC GCCAGCGAGC TCAGTGAGAC CGCGCGCGGC
ACCGCGATGT CGCTGCATTC GTTCTTTTTC TTCCTCGGCC AGACGACCGG CCCGATCGCC
TACGGCTTCG GCCTCGCTCA TGCCGGTAAA GTTCCCACGA TGCTGATTGC CGCCATCACG
ATGCTGGCGC TGGGGTTTAT TCTTGCGGTG GTGTTGCGAC CGCGACCGCC GACTGATGCG
GTCGAGAGCC CACGGCCGGA TCCTGCGTGA
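
The gene sequence above is printed in blocks of ten bases separated by spaces, so it needs to be cleaned before any computation. As a minimal sketch, the GC fraction reported in the record could be recomputed as follows. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the sequence is used here as a demonstration.

```python
def clean_sequence(text):
    """Remove spaces and line breaks and upper-case the sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq):
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence, copied from the record.
first_line = "ATGTCGCAGG GCGACAGTGC GGTCGTTCGG CCGAGCCTGC CGCCCGCGTT GAAGATCATT"
seq = clean_sequence(first_line)
print(len(seq), round(gc_fraction(seq), 2))
```

Pasting the full sequence block into `clean_sequence` in place of `first_line` would give the value to compare against the GC content field.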
 
Protein sequence
MSQGDSAVVR PSLPPALKII ALSGFAASLS ARALDPVLPR IASEFSVSIA TAAGLAAVTA 
FTFAVVQPAI GALADMFGKA RLMIVCLALL GFASLLGAVS SSFEVLFLCR ILAGIGAGGV
FPVALGLTSD LVEPGRRQVA IGRVLGGSMT GNLLGASASG VIGDVLGWRG VLAVLGVLVI
VAALAVSFGF RNKPMRPGAP MDLAALRRGY RTIFSNPNAP ICFAAVLVEG TCVMGAFPFV
AAFLHDQGQE SLAVAGLVIA GFAVGGLLYT LTVARLLPRL GVRGMMIGGG ALVGLQLAII
AFGPPWQAQA VAFVAMGMGF YMLHGCVQVF ASELSETARG TAMSLHSFFF FLGQTTGPIA
YGFGLAHAGK VPTMLIAAIT MLALGFILAV VLRPRPPTDA VESPRPDPA
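
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The sketch below is an illustration and not the database's own pipeline. It uses the standard codon assignments, which translation table 11 shares with table 1. It does not handle the alternative start codons that table 11 allows, which is sufficient here because the gene begins with ATG. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence, copied from the record.
first_line = "ATGTCGCAGG GCGACAGTGC GGTCGTTCGG CCGAGCCTGC CGCCCGCGTT GAAGATCATT"
last_line = "GTCGAGAGCC CACGGCCGGA TCCTGCGTGA"
print(translate(first_line))
print(translate(last_line))
```

The two printed fragments can be compared with the beginning and end of the protein sequence listed above.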