Gene Information |
Locus tag | RPB_0334 |
Symbol | |
ID | 3908715 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | - |
Start bp | 379097 |
End bp | 380284 |
Gene Length | 1188 bp |
Protein Length | 395 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 637882220 |
Product | major facilitator superfamily MFS_1 |
Protein accession | YP_483956 |
Protein GI | 86747460 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
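The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the final codon of the CDS is a stop codon that is not counted in the protein length. The helper names `gene_length_bp` and `expected_protein_length_aa` are hypothetical and are not part of the record.

```python
# Consistency check for the coordinate fields of this record.
# Assumption: Start bp / End bp are 1-based and inclusive.

def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length_aa(gene_len_bp: int) -> int:
    """Codon count minus one, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record (RPB_0334).
length = gene_length_bp(379097, 380284)
protein = expected_protein_length_aa(length)
print(length, protein)  # 1188 395
```

Both values agree with the Gene Length (1188 bp) and Protein Length (395 aa) fields listed in the record.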
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGACCCTGC TCGACCGCCC TGACGCCCCG CCGGCCCATC CGGCGCGGCT GATCCTGATC CTGTCGCTCG CGCCGACCGT CGGTCTGGGG ATCGGCCGCT TCGCCTATTC GCTGCTGCTG CCCGACATGC GCGACAGCCT GCAATGGTCC TATTCGGCGG CCGGCTTCAT GAACACCGTC AATGCCGCCG GCTATCTCGC GGGCGCGCTG GCGACCTCGC AGCTGGTGCG CCGCTACGGG CTGGCGGCGA TCGTCCGCGC CGGCACGCTG GCCTGCGTGG CGTCGCTGGC GCTGTGCGCA TTGTCGGGGA GTTTCGTGGC GCTGTCGGCG GCGCGGCTGA TCGCCGGGAT CGGCGCGGCG CTGGCCTTCG TCGCCGGCGG GGCGCTGGCG ACCACGATCG CGCAGTCGCA GCCGGCGCGT TCGGCGTTTC TGCTCAGCCT GTTTTACGCC GGCCCGGGCA TCGGCATCCT GTCGTCGGGG CTGATCACGC CGTTTCTGCT CGAGGCCGCC GGTCCTGGCT CGTGGTGGAT CGGCTGGCTG GTGATGGCGG TGCTGTCGGC GGCGATGACG CTGCCGCTGC TGCTGGCACC GCTGGCCAGC GCCGCCAACC TCGGCGGCGG TGCGGCGCGC TTCACGATCC GCCCGGTGTG GATCTATCTG GCCGGCTATT TCATGTTCGG CGCGGGCTAC ATCGCCTACA TGACCTTCAT GATCGCCTAT GTGCGCGATG CCGGCGGCGG GGCTGCGGCG CAGAGCGCGT TCTGGTGCCT GATCGGGGCC AGCGCCTTCG TCACCCCGTG GGTCTGGCGC CGGATCATGG CGATGGACCG CGGCGGCATG TCGACCACGA TCATCCTCGG CGTCAACGCG GTCGGCGCGG TGCTGCCGCT GTTCGGCCTG TCGCCGCTGA TGCTGGCGAT CTCGGCGCTG GTGTTCGGCG TGTCGTTTTT CGCGGTGGTG GCCTCGACCA CCGCCTTCGT CCGCTTCAAC TACAGCCAGG CGCAATGGCC CGGCGCGATC GCCGCGATGA CGATCGCATT CGGCATCGGC CAGACGCTCG GCCCGCTGCT GGTCGGCGCC ATCACCGACG CGATCGGGTC GCTGTCCTCG GCGCTGGCGG TCTCGGCCGC GACGCTGGCG CTCGGCGCGG TGCTGTCGGC ATTCCAGCGG CCACTGCGGC GTGCCTGA
|
Protein sequence | MTLLDRPDAP PAHPARLILI LSLAPTVGLG IGRFAYSLLL PDMRDSLQWS YSAAGFMNTV NAAGYLAGAL ATSQLVRRYG LAAIVRAGTL ACVASLALCA LSGSFVALSA ARLIAGIGAA LAFVAGGALA TTIAQSQPAR SAFLLSLFYA GPGIGILSSG LITPFLLEAA GPGSWWIGWL VMAVLSAAMT LPLLLAPLAS AANLGGGAAR FTIRPVWIYL AGYFMFGAGY IAYMTFMIAY VRDAGGGAAA QSAFWCLIGA SAFVTPWVWR RIMAMDRGGM STTIILGVNA VGAVLPLFGL SPLMLAISAL VFGVSFFAVV ASTTAFVRFN YSQAQWPGAI AAMTIAFGIG QTLGPLLVGA ITDAIGSLSS ALAVSAATLA LGAVLSAFQR PLRRA
|
| |
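The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch for translation table 11 (the bacterial, archaeal and plant plastid code). It assumes that the gene sequence is given 5' to 3' on the coding strand, and that an alternative start codon in the first position (here TTG) is read as methionine, which is consistent with the protein beginning with M. The names `translate_table11`, `CODON_TABLE`, and `START_CODONS` are hypothetical; this is not the pipeline that produced the record.

```python
from itertools import product

# Amino acid assignments in NCBI order (bases T, C, A, G; first base varies slowest).
# Table 11 uses the same assignments as the standard code; it differs in start codons.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Start codons permitted by translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_table11(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the 10-base grouping spaces
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # an initiator codon is read as methionine
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence in the record.
print(translate_table11("TTGACCCTGC TCGACCGCCC TGACGCCCCG"))  # MTLLDRPDAP
```

The output matches the first ten residues of the protein sequence in the record. Passing the full gene sequence to the same function should work the same way, since whitespace is stripped before translation.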