Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPC_1794 |
Symbol | |
ID | 3972059 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB18 |
Kingdom | Bacteria |
Replicon accession | NC_007925 |
Strand | + |
Start bp | 1949785 |
End bp | 1951362 |
Gene Length | 1578 bp |
Protein Length | 525 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 637924907 |
Product | major facilitator transporter |
Protein accession | YP_531672 |
Protein GI | 90423302 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | [TIGR00711] drug resistance transporter, EmrB/QacA subfamily |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.0821089 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCACGT CCGAGAGCCA GGGCCGCGAA TCGGCCCAGC AGCGCGCCTT GCCGGCGCCG ATCGCGCCGC CGGCGGCGCC CGGCGATATC GATGTGGCGA TGATCGCCGC GCCGCCGCTG GCGCCCGCCG CGCCGCTGAC CCGCGCCGAA GTCCGCGGCA TCCTGCTCAG CCTGTTGCTG GCGATGTTCC TGGCCGCGCT CGACCAGACC ATCGTCGCCA CCGCGCTGCC GACCATCGGG CGCCAGTTCC ACGACGTCGA CAATCTGTCC TGGGTGATCA CCGCGTATCT GCTGGCGTCC ACCGCGATGG CCCCGGTGTT CGGCACGCTG AGCGACATCT TCGGCCGCCG CGCCACCATC ATCAGCGCGA TGGGTCTGTT CATCGCCGGC TCGGTGCTGT GCGCGGTGGC GCCGAACATG AGCGTGCTGA TCGTCGCGCG CGTGCTGCAG GGGCTCGGCG GCGGCGGCAT CCTGCCGATC GTGCAGACCG TGATCTCCGA CGTGGTCACC CCGCGCGAAC GCGGCCAGTA CCAGGCGTAT TTCTCCGGAG TCTGGGTCGC CGCCTCGATC GGCGGACCGG TGATCGGCGG GGTGTTCGCC GAACATCTGC ACTGGTCGCT GATTTTCTGG ATCAACCTGC CGCTCGGGGT CGGCGCGCTG GCGCTGCTGC TGCCGAAGAT GGGCAAGATC CCGGTGTTTC ATCGCCGCCG CAAGGTCGAC TGGATGGGCG GGCTGTTGCT GATGGCCGCG GCGGTGGCGG TGATGCTGGT GCTGACCTGG GGCGGCACGC GGCTGGCCTG GGCGTCGCCG ACGATTCTGG CGATGGCCGG CGCCGCGGCG TTGCTGGCGC TGCTGTTCGT TTGGCATGCG TTGCACACCG CAGAGCCGTT TTTGCCGCTG CAACTGATGT CCGGCTCGGT GGTGCCCTTT GCGATGGCCG CCGGCGGCTG CGGCATGGGC GTGCTGATCG GTCTCACCGT GAACATGCCG CTGTATTACG AAGTGGTCTA TGGGCTGAGC GCCAGCGCCT CCGGACTGGC GCTGATTCCC ATCGTCGCGG TGTCGGTGCT CGGCGCCGCC TTCGCCGGCC GCGCGACCAC GCATTGGACG CATTACAAGC GGGTGGCGAT CGCCGGCAAC TCCTTCTCGG CGGCGATGGC CTGCGGTTTG GCGTTGGCGA CGCCGCTGCC GCTGTGGGCG CTGCTGATCG GGCTGGGATT GCTGGCGATC GGGCTCGGCA CGGTGTTCTC GATCAGCCTG GTGTCGATCC AGAACGCGGT GCCGCGGCCG CAGGTCGGCA CCGCCACCGG GGCGATGAAT TTCTTTCGCG CGCTGATGGC GTCGTTCACG GTGGCGGCGT TCACCACGAT CCTGCTCGGT TCGCTCGGCG GCGGAATTTC CTTCGGTGCC GAACACCGCC ACGCGGTGGG CGGGATCCCT GCCGCCGACA TGGTGGCGGC GTTCCGCTAC GTGTTCGCCG CCTCCGCCGC GCTGATGGCC ACCGCGGCGC TGTGCATGGT GGTGATGGAA GAGCTGCCGC TGGCCGGACC AGCCGTCACG CCGGATCTCG CCGAATAG
|
Protein sequence | MSTSESQGRE SAQQRALPAP IAPPAAPGDI DVAMIAAPPL APAAPLTRAE VRGILLSLLL AMFLAALDQT IVATALPTIG RQFHDVDNLS WVITAYLLAS TAMAPVFGTL SDIFGRRATI ISAMGLFIAG SVLCAVAPNM SVLIVARVLQ GLGGGGILPI VQTVISDVVT PRERGQYQAY FSGVWVAASI GGPVIGGVFA EHLHWSLIFW INLPLGVGAL ALLLPKMGKI PVFHRRRKVD WMGGLLLMAA AVAVMLVLTW GGTRLAWASP TILAMAGAAA LLALLFVWHA LHTAEPFLPL QLMSGSVVPF AMAAGGCGMG VLIGLTVNMP LYYEVVYGLS ASASGLALIP IVAVSVLGAA FAGRATTHWT HYKRVAIAGN SFSAAMACGL ALATPLPLWA LLIGLGLLAI GLGTVFSISL VSIQNAVPRP QVGTATGAMN FFRALMASFT VAAFTTILLG SLGGGISFGA EHRHAVGGIP AADMVAAFRY VFAASAALMA TAALCMVVME ELPLAGPAVT PDLAE
|
| |