Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPC_3634 |
Symbol | |
ID | 3970649 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB18 |
Kingdom | Bacteria |
Replicon accession | NC_007925 |
Strand | - |
Start bp | 4040942 |
End bp | 4042639 |
Gene Length | 1698 bp |
Protein Length | 565 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 637926742 |
Product | major facilitator transporter |
Protein accession | YP_533488 |
Protein GI | 90425118 |
COG category | [G] Carbohydrate transport and metabolism [T] Signal transduction mechanisms |
COG ID | [COG0589] Universal stress protein UspA and related nucleotide-binding proteins [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.836817 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.0365069 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCAATG CCCCCGCCGG CGACGTCGGT TCGAGTTTCT TTCGCCAGCC CCGCGCGGTC TGGGCCACCG CCTTCGCCGC GGTGGTCGGC TTCATGGGGA TCGGCCTGGT CGATCCGATC CTGACCTCGA TCGCCGAGGG GCTGCAGGCC ACGCCGAGCC AAGTGTCGCT GCTGTTCACC AGCTATTTCG CGGTGACCTC GGTGATGATG CTGGCGACTG GCTTCGTCTC CAGCCGGATC GGCGGGCGGC GGACGCTGCT GCTCGGCGCG GCGCTGATCG CCTGCTTCGC TGCACTCGCC GGCACGTCGC ATTCGGTCAC CGAACTGGTG CTGTATCGGG CCGGCTGGGG GCTCGGCAAC GCGTTCTTCG TCGCCACCGC GCTGTCGGTG ATCGTGGCGG CGGCGAGCGG CGGCACCGCC ACCGCGATTC TGTTGTATGA GGCGGCGCTC GGGCTCGGCA TTTCGGTCGG CCCGCTGCTC GGCGCAGCCC TTGGTAACCT GTCGTGGCGC TATCCGTTTT TCGGCACCGC GGCGCTGATG ACCATCGGCT TCGTAGCGAT TGCGTTGTTT CTCGAGGTGC AGCCCAAGGC GGCGCGGAAA ACCAGTCTGG TCGATCCGAT CCGCGCGCTC GGCCATCGCG GGCTGTTGTC GGTGGCGGGC AGCGCGTTCT TCTACAATTA CGCGTTCTTC ACCGTGCTGG CGTTCGTGCC CTTCGTGCTG CAGGCCTCGG CGCACACCGT CGGGCTGATC TTCTTCGGCT GGGGACTGGC GCTGGCGGTG TTCTCGGTGC TGGTGGCGCC GCGGCTGCAG ATCCTGTTCA GCGCGCTGAC GCTCGGCGTC GGCAATCTGC TGGCGCTGGC GGTGCTGCTC CTCGGCATGG CCTTCGGCTC GGTGCCGATC ATCGTCGCCG CGGTGGTGCT GTCCGGCGCG GTGATGGGCA TCAACAACAC CGTGTTCACC GAAATGGCGC TGGAGATTTC GCCGTTTCCG CGCCCGGTGG CCTCGGCGGC CTATAATTTC GTGCGCTGGT TCGCCGGCGT GATCGCGCCC TTTGCGGCGC CGAAGATCGC CGAGCATTTC GGCGCCTCGG CGTCGTTCGT GGTCGCGGCG GTGTCGGCGC TGGCCGCCGC AGGCGTGCTG CTGGCGATGC GCGGCAATCT CGGTCGGTTC GCCTCAAAGC ATCCCGCCGC GGCGCCCGCG GAAGCGCCCT CGGCGGCAAC CGGGCCGATC CTGGTCGCGG TCGACGGCAC CGCCAACGAC CGGGCGATCC TTGCCCGTGC GGCCAAGGTA GCGCTGGCGC TGGGCGCGCC GATCGAGGTG CTGCATGTCC GCCCGCTGGA ACTGGTCGAG GGTGAAGCCG CCGAGGCGGA AAGTTCCACC GGCTCCGCCG CCATTCTCGA CCTCGCCTTG GCGCAACTGC GCGACGCCGG CCTGCAGGCC GCCGGTGCGG TGCGGGAAGA GGTTGCCGCA AGAACGCCAC AAGCGATTCT CGACCATGCC GCGGATCTCG ACGCCCGGCT GATTGTGCTC GGCGCCCGTC ACCACGACGA CCCGACCGAC ATCATTCATG GCAGCGTCGC CGATATCATC GGCCGCAGGG CCACTCGCCC AGTGATGTTG GTGCCGGAGC CGGGCGCGAA TGAGCGGCGA TGCGAGTCCG CGTCGCCCGC CGCAGATCGC TCGAAAATCG GTATCTGA
|
Protein sequence | MSNAPAGDVG SSFFRQPRAV WATAFAAVVG FMGIGLVDPI LTSIAEGLQA TPSQVSLLFT SYFAVTSVMM LATGFVSSRI GGRRTLLLGA ALIACFAALA GTSHSVTELV LYRAGWGLGN AFFVATALSV IVAAASGGTA TAILLYEAAL GLGISVGPLL GAALGNLSWR YPFFGTAALM TIGFVAIALF LEVQPKAARK TSLVDPIRAL GHRGLLSVAG SAFFYNYAFF TVLAFVPFVL QASAHTVGLI FFGWGLALAV FSVLVAPRLQ ILFSALTLGV GNLLALAVLL LGMAFGSVPI IVAAVVLSGA VMGINNTVFT EMALEISPFP RPVASAAYNF VRWFAGVIAP FAAPKIAEHF GASASFVVAA VSALAAAGVL LAMRGNLGRF ASKHPAAAPA EAPSAATGPI LVAVDGTAND RAILARAAKV ALALGAPIEV LHVRPLELVE GEAAEAESST GSAAILDLAL AQLRDAGLQA AGAVREEVAA RTPQAILDHA ADLDARLIVL GARHHDDPTD IIHGSVADII GRRATRPVML VPEPGANERR CESASPAADR SKIGI
|
| |