Gene RPB_2449 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_2449 
Symbol 
ID3910238 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp2804705 
End bp2805979 
Gene Length1275 bp 
Protein Length424 aa 
Translation table11 
GC content68% 
IMG OID637884348 
Productmajor facilitator transporter 
Protein accessionYP_486065 
Protein GI86749569 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID[TIGR00880] Multidrug resistance protein 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0110452 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.393497 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGCGG CCGGCCACGC GAGCGGGACG AGCGCGCCGG CACCCGGCGC GCGCCCCGCC 
GCGGTCGGCT TCATCTTCGT CACCATCCTG CTCGACATGC TGAGCGTCGG CATGATCCTG
CCGATCCTGC CGAAGCTGAT CGAGAGCTTC TCCGACAACG ACACCGCGGC TGCGGCCAAG
ATCTACGGCC TGTTCGGCAC GGCCTGGGCG CTGATGCAGT TCGTCGCGTC GCCGGTGCTG
GGCGCGCTGT CTGATCGTTT CGGCCGCCGC CGGGTGATCC TGCTGTCCAA TCTCGGTCTC
GGGCTCGATT ACATCCTGAT GGCGCTGGCG CCGACGCTGG CCTGGCTGTT CATCGGCCGG
GTGATCTCCG GCATCACCTC GGCGAGTATC TCAACCTCGT TCGCCTATAT CGCCGACGTC
ACACCTGCGG AAAAACGCGC GGCCGTGTTC GGCAAGGTCG GCGCCGCCTT CGGCCTCGGC
TTCATCTTCG GCCCGGCGAT CGGCGGATTG CTCGGCGGCG TCGATCCGCG GCTGCCGTTC
TGGGTGGCGG CCGGCTTGAG CCTGTGCAAC GCGCTGTATG GCTTGTTCGT GCTGCCGGAA
TCGCTGCCGC CGGAGCGGCG GTCGCCGTTC CGCTGGCGGG CCGCCAATCC GATCGGCGCG
GTGCAGTTGC TGTCGTCGAA TGCGATACTC GCCGGTATGG CGATCGTGGC GTTCTGCGCC
GAGGTCGCCC ATGTGGCGCT GTCCGCGACC TTCGTCCTCT ACGCCAGCTA TCGCTACGCG
TGGGATCAGA CCACGGTCGG TCTCGCCTTG GCCTTCGTCG GCTTCTGCAC TACCGTGGTG
CAGGGCTTTC TCGTCGGCCC CGCGGTCAAG CGGCTCGGTG AGCGGCGCGC CCAAGTGATC
GGCTATCTCG GCGGCGCAGC GGGCTTTCTG ATCTATGCGC TGGCGCCGAC CGGCGCGCTG
TTCTGGATCG GCATTCCGGT GATGACGCTG TGGGGCATCG CTAAGCCGGC GACCGCCGGG
GTGATGACGC GCCTGGTGGC GCCGGCGCAG CAAGGACAAT TGCAGGGCGC AACCACCAGC
ATGAACAGCA TCGCGGCGCT GATCGGGCCT TTCCTGTTCA CGGGCATCTT CGCCTATTTC
ATCGAGCCCG ATGCGCCGAT TTGGTTTCCC GGCGCGCCGT TCCTGCTCGC CGGCGCGCTG
CTGATGGTTT CGATGCTACT CGCAGGCATC TCGACGCCGC CGCAAAGCAA ATCAGGCGCC
GCAAGCGGCG CCTGA
 
Protein sequence
MTAAGHASGT SAPAPGARPA AVGFIFVTIL LDMLSVGMIL PILPKLIESF SDNDTAAAAK 
IYGLFGTAWA LMQFVASPVL GALSDRFGRR RVILLSNLGL GLDYILMALA PTLAWLFIGR
VISGITSASI STSFAYIADV TPAEKRAAVF GKVGAAFGLG FIFGPAIGGL LGGVDPRLPF
WVAAGLSLCN ALYGLFVLPE SLPPERRSPF RWRAANPIGA VQLLSSNAIL AGMAIVAFCA
EVAHVALSAT FVLYASYRYA WDQTTVGLAL AFVGFCTTVV QGFLVGPAVK RLGERRAQVI
GYLGGAAGFL IYALAPTGAL FWIGIPVMTL WGIAKPATAG VMTRLVAPAQ QGQLQGATTS
MNSIAALIGP FLFTGIFAYF IEPDAPIWFP GAPFLLAGAL LMVSMLLAGI STPPQSKSGA
ASGA