Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPC_3739 |
Symbol | |
ID | 3970334 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB18 |
Kingdom | Bacteria |
Replicon accession | NC_007925 |
Strand | - |
Start bp | 4164063 |
End bp | 4165304 |
Gene Length | 1242 bp |
Protein Length | 413 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 637926849 |
Product | major facilitator transporter |
Protein accession | YP_533593 |
Protein GI | 90425223 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.101854 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGGGCAG CCCAACTCGG CGCATGGCAG CGCTGGTCGA TCCTGGCGGG CGCCGCGATC CTGCTGAGCC TGGCGATGGG GATGCGGCAA AGCTTCGGGC TGTTTCAACC CTCGGTCATT CGCGACATCG GCATCACCTC GGCGGATTTC TCGCTCGCCA CCGCGCTGCA GAACGTGGTC TGGGGCGTCA CCCAACCCTT CGTCGGCATG TTCGCCGATC GCTACGGCAC CCGCTATGTG ATGCTCGGCG GCGTGCTGAT CTATGCCGCG GGTCTGGTGT TGATGATGGT CGCGACCTCG GCGTTGGTGT TCACGCTGGG CGCCGGGTTC TGCGTCGGGC TGGCGCTGTC CTGCACCGCG TCGAGCCTGA CCATGACGGT GACCTCGCGC ACGGTGTCGG CGGCCAAGCG CAGCGTCGCG ATGGGCGCGG TGTCGGCGGT CGGATCGCTC GGGCTGGTGA TCGCCTCGCC ATTGGCGCAG ACGCTGATCT CGACCTCGGG CTGGAAGATG GCGCTGATCG GCTTTCTCGG TCTCGCCGCG GTGATGCTGC CATCGGCATT GTTCGCCGGA CGCTCCGACA AAATCGAGAT CGACAAGGCC GACGACAGCG AGCAATCGCT CGGCTCCGTG ATGCAATCCG CGCTCGGGCA TTCCGGTTTC GTGGTGATGT CGCTGGCGTT CTTCGTCTGC GGGTTGCAAT TGGTGTTCAT TACCACGCAT CTGCCGAACT ATCTGGATAT TTGCGGGCTC GATCCGTCGC TCGGCGCCAC TGCGCTCGCC ATCATCGGGC TGTTCAACGT GATCGGCTCC TATGCCTGCG GCTGGCTCGG CGGCCGCTAT CCGAAACAGC TGCTGCTCGG CGCGATCTAC ATCATCCGCT CGGTGGCGCT CGCCGCCTAT TTCTATTTTC CGGCGTCGGC CGCCTCCACC ATGGTGTTCG CCGCGGTGAT GGGATCGCTG TGGCTCGGCG TGGTGCCGCT GGTCAACGGG TTGGTGGCGC AACTGTTCGG GCTGCGCTTC ATGGCGACGC TGGCCGGCAT CGCCTTCCTC AGCCATCAGG CCGGCTCGTT CCTCGGCGCC TGGGGCGGCG GGATGATCTA CGACCGGCTC GGCAGCTATG ACGCTGCCTG GCAAGCCGCG GTGCTGATCG GATTGATCGC CGGCGCTTTT CAGATGTTGA TGAACGTACG TCCACCGCAG CGTCGCGATG CGCTGGGCGG TGCGGTGGCC AATGCGGCGT GA
|
Protein sequence | MRAAQLGAWQ RWSILAGAAI LLSLAMGMRQ SFGLFQPSVI RDIGITSADF SLATALQNVV WGVTQPFVGM FADRYGTRYV MLGGVLIYAA GLVLMMVATS ALVFTLGAGF CVGLALSCTA SSLTMTVTSR TVSAAKRSVA MGAVSAVGSL GLVIASPLAQ TLISTSGWKM ALIGFLGLAA VMLPSALFAG RSDKIEIDKA DDSEQSLGSV MQSALGHSGF VVMSLAFFVC GLQLVFITTH LPNYLDICGL DPSLGATALA IIGLFNVIGS YACGWLGGRY PKQLLLGAIY IIRSVALAAY FYFPASAAST MVFAAVMGSL WLGVVPLVNG LVAQLFGLRF MATLAGIAFL SHQAGSFLGA WGGGMIYDRL GSYDAAWQAA VLIGLIAGAF QMLMNVRPPQ RRDALGGAVA NAA
|
| |