Gene RPC_2068 details


Gene Information

Locus tag: RPC_2068
Symbol:
ID: 3974026
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB18
Kingdom: Bacteria
Replicon accession: NC_007925
Strand:
Start bp: 2264188
End bp: 2265375
Gene Length: 1188 bp
Protein Length: 395 aa
Translation table: 11
GC content: 63%
IMG OID: 637925176
Product: major facilitator transporter
Protein accession: YP_531941
Protein GI: 90423571
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID: [TIGR00880] Multidrug resistance protein
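
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon; the record itself does not state either convention. The function names are hypothetical and chosen only for illustration.

```python
# Values copied from the record above.
START_BP = 2264188
END_BP = 2265375


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Residue count, assuming the CDS length includes one stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


if __name__ == "__main__":
    n = gene_length(START_BP, END_BP)
    print(n, protein_length(n))  # prints: 1188 395
```

Under these assumptions the result agrees with the listed gene length (1188 bp) and protein length (395 aa).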


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.731019
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.214462
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATGAACA GGCCGCTTGT CGTTATTTTT GCCACCATCT GCCTGGACGC CGTCGGCATC 
GGCCTCGTCT TCCCTGTTCT GCCGCGGCTT CTTGAGGATG TCACGCACAG CCCGAACATC
GCGCCCTATA TCGGGATCAT GACCGCGCTC TATGCGGTGA TGCAGTTTGT CTTCGCCCCC
GTGCTCGGTG CTTTGAGCGA CAACCTCGGC CGGCGTCCCG TGCTGCTGAT TTCGCTCGCG
GGAGCGGCCA TCAACTACGT CATCATGGCG TTCGCACCGC AGCTTTGGAT GCTGATGCTC
GGCCGCGCAG TTGCCGGCCT GACCAGCGCC AATGTCTCCG TGGCAACGGC CTACATCACC
GACATTTCAT CCGAGGACCA GCGCGCCCGC AGATTCGGCC TGTCCAACGC CATGTTCGGC
ATCGGCTTTA TCATCGGGCC GGTTCTCGGC GGCCTGCTCG GCGACACCTG GCTGCGGCTG
CCTTTCATCG CCGCCGCCGC GCTGAACGCC GGCAATCTCC TGCTGGCGTT GTTCGTGCTG
CCGGAATCCC GCACGGCGGC CCGTGAGAAG ATCGATCTGG TCGCGCTCAA CCCGCTTCGG
CCGCTGCGTT GGGTGTTCGC CATCAAGGGG CTTTTGCCCG TCGTGTTTGT GTACTTCATC
TTGAGCGCAA CCGGGGAGGC CTACGGCGTC TGCTGGGCGT TGTGGGGTTT CGACACGTTT
CAATGGAGCG GCCTGTGGAT CGGGCTTTCG CTCGGCGCTT TCGGTATCTG CCAAACGCTG
GTGCAGGCCT TGCTGCCCGG CCCCGCGACC AAACTGCTTG GCGAGCGCAT GGCCGTTGTG
GTGGGCATCG CCTGCGCCTG TATCGCTCTT GTCGCGCTGG CCTTCGCCAA TCAAGGCTGG
ATGGTATTCG CCATCATGCC GTTGTTCGCT CTCGCCGGAA TCGGCACGCC GGCGTTTCAA
GCCCTGGCCA CCCGGCAGGT TGATCCGGAC CGTCAAGGTC AATTGCAAGG CGTGCTGGCC
TCGGCCGTCA GCCTGGCGTC GATCGCAGCG CCGCTCGCCT TCTCGACGTT CTACTTTGTC
GTTCAAAACG ACTGGCCCGG AGCCATCTGG CTCTTGGTGG TCGTCATTTA TGCAATCGCG
ATTCCGCTGG TTCTTCTCGG CACCCGGACC GCCCGCGCTG CGACGTGA
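
The gene sequence is printed in blocks of ten bases, so it has to be joined before any calculation such as the GC content listed in the record. The following is a minimal sketch of one way to do that; the helper names `join_blocks` and `gc_percent` are hypothetical, and how the record's own 63% figure was computed or rounded is not stated here.

```python
def join_blocks(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in an unambiguous DNA sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


if __name__ == "__main__":
    # Paste the full block-formatted gene sequence here to check the record.
    first_blocks = "ATGATGAACA GGCCGCTTGT"
    seq = join_blocks(first_blocks)
    print(len(seq), round(gc_percent(seq), 1))
```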
 
Protein sequence
MMNRPLVVIF ATICLDAVGI GLVFPVLPRL LEDVTHSPNI APYIGIMTAL YAVMQFVFAP 
VLGALSDNLG RRPVLLISLA GAAINYVIMA FAPQLWMLML GRAVAGLTSA NVSVATAYIT
DISSEDQRAR RFGLSNAMFG IGFIIGPVLG GLLGDTWLRL PFIAAAALNA GNLLLALFVL
PESRTAAREK IDLVALNPLR PLRWVFAIKG LLPVVFVYFI LSATGEAYGV CWALWGFDTF
QWSGLWIGLS LGAFGICQTL VQALLPGPAT KLLGERMAVV VGIACACIAL VALAFANQGW
MVFAIMPLFA LAGIGTPAFQ ALATRQVDPD RQGQLQGVLA SAVSLASIAA PLAFSTFYFV
VQNDWPGAIW LLVVVIYAIA IPLVLLGTRT ARAAT
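
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The following is a minimal sketch, assuming the gene sequence is given in the coding orientation and using the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in permitted start codons). The `translate` helper is hypothetical and raises `KeyError` on ambiguous bases.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI order (first base T, C, A, G; third base varies fastest).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a block-formatted coding sequence up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First three blocks of the gene sequence in this record.
    print(translate("ATGATGAACA GGCCGCTTGT CGTTATTTTT"))  # prints: MMNRPLVVIF
```

The output matches the first ten residues of the protein sequence above; translating the final blocks of the gene sequence (`GCCCGCGCTG CGACGTGA`) yields `ARAAT` followed by the TGA stop codon.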