Gene Information |
Locus tag | RPD_3004 |
Symbol | |
ID | 4023507 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB5 |
Kingdom | Bacteria |
Replicon accession | NC_007958 |
Strand | - |
Start bp | 3348891 |
End bp | 3350156 |
Gene Length | 1266 bp |
Protein Length | 421 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 637963203 |
Product | major facilitator transporter |
Protein accession | YP_570131 |
Protein GI | 91977472 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | [TIGR00880] Multidrug resistance protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.42057 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCGACG AGGAGCAACC TCCCGGGACG ATCGCGCCGC TCCCGGGCGC GCGCCCCGCC GCGGTCGGCT TCATCTTCGT CACCATCCTG CTCGATATGC TGAGCGTCGG CATGATCCTG CCGATCCTGC CGAAGCTGAT CGAGAGTTTT TCCGACAACA ACACCGCGGA CGCGGCGCGA ATCTACGGCG TGTTCGGCAC AGCGTGGGCG CTGATGCAGT TCGTCGCGTC GCCGGTGCTG GGCGGGCTGT CCGATCGTTT CGGCCGCCGC CCGGTGATCC TGCTGTCCAA TCTCGGTCTC GGTCTCGACT ACATCCTGAT GGCGCTGGCG CCGACGCTGA GCTGGCTGTT CATCGGCCGG GTGATCTCCG GCATCACGTC GGCGAGTATT TCGACCTCGT TCGCCTATAT CGCCGACGTC ACGCCGGCGG AGAAGCGCGC GGCCGTGTTC GGCAAGGTCG GCGCCGCGTT CGGTCTCGGC TTCATCTTCG GCCCGGCGAT CGGCGGTTTG CTCGGTGGTA TCGATCCGCG ACTGCCGTTC TGGGTGGCGG CTGGGCTCAG CCTGTGCAAC GCGCTGTACG GTCTGTTCGT GCTGCCGGAA TCGCTGCCGC CGGAGCGGCG CTCGCCGTTT CGCTGGAGGT CCGCCAATCC GGTCGGCGCT GTGCGGCTGC TGGGCTCGAA TGCCCGGCTG GCGGCGATGG CTCTGGTCGA GTTCTGCGCC GAGGTGGCGC ATGTCGCGCT GCCGGCGATC TTCGTGTTGT ACAGCACCTA CCGTTACGGC TGGGACCAGA CCACGGTCGG GCTCGCGCTC GCTTTCGTCG GGGTCTGCAC CGCGATCGTG CAGGGCGGCT TGGTGGGGCC TGCCGTGAAG CGACTCGGCG AACAAAGGGC CCAGATCATC GGCTATGGCG GCGGCGCGCT AGGCTTTCTG ATCTACGCGC TGGCGCCGAC CGGAGCGCTG TTCTGGATCG GCATCCCGGT GATGACGCTG TGGGGCATCG CAGGGCCGGC GACCTCCGGC ATGATGACGC GGCTGGTGTC GCCGGACCAG CAGGGCCAGT TGCAGGGCGC CATCACCAGC CTCAAGAGCA TCGCCGAACT GATCGGGCCG TTCCTGTTCA CGCTGATCTT CGCGTATTTC ATTGGAGGCA ACGCGCCGCT GGCTCTTCCC GGGGCGCCGT TCCTGCTCGC AGGCCTGCTG CTGATGGTCT CGGCGCTGAT CGCCGCGTCC ACCAATGAAG CGACCAAACA GGCCGGCACC GGCTAG
|
Protein sequence | MTDEEQPPGT IAPLPGARPA AVGFIFVTIL LDMLSVGMIL PILPKLIESF SDNNTADAAR IYGVFGTAWA LMQFVASPVL GGLSDRFGRR PVILLSNLGL GLDYILMALA PTLSWLFIGR VISGITSASI STSFAYIADV TPAEKRAAVF GKVGAAFGLG FIFGPAIGGL LGGIDPRLPF WVAAGLSLCN ALYGLFVLPE SLPPERRSPF RWRSANPVGA VRLLGSNARL AAMALVEFCA EVAHVALPAI FVLYSTYRYG WDQTTVGLAL AFVGVCTAIV QGGLVGPAVK RLGEQRAQII GYGGGALGFL IYALAPTGAL FWIGIPVMTL WGIAGPATSG MMTRLVSPDQ QGQLQGAITS LKSIAELIGP FLFTLIFAYF IGGNAPLALP GAPFLLAGLL LMVSALIAAS TNEATKQAGT G
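The coordinate and length fields in this record are mutually consistent, and the arithmetic can be sketched as below. This is a minimal illustration, not the database's own method: it assumes the start and end positions are 1-based and inclusive, and that the final codon of the unspliced CDS is a stop codon that is not counted in the protein length. The function names `gene_length`, `protein_length`, and `gc_percent` are hypothetical helpers introduced here. How the record's GC content figure was computed and rounded is not stated, so `gc_percent` is only one plausible way to derive it.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases; spaces between 10-base groups are ignored."""
    bases = sequence.replace(" ", "").upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# Values taken from the Start bp and End bp fields of this record.
length = gene_length(3348891, 3350156)
print(length, protein_length(length))  # prints "1266 421"
```

Passing the full gene sequence string from the record to `gc_percent` would give a value to compare against the GC content field.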
|
| |