Gene Information |
Locus tag | Rpal_5288 |
Symbol | |
ID | 6412989 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris TIE-1 |
Kingdom | Bacteria |
Replicon accession | NC_011004 |
Strand | - |
Start bp | 5704721 |
End bp | 5705968 |
Gene Length | 1248 bp |
Protein Length | 415 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 642715178 |
Product | major facilitator superfamily MFS_1 |
Protein accession | YP_001994250 |
Protein GI | 192293645 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
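The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length. The variable names are not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 5704721
end_bp = 5705968
gene_length_bp = 1248
protein_length_aa = 415

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: an unspliced CDS whose final codon is a stop codon,
# so the protein has one residue fewer than the number of codons.
codons = span_bp // 3
coding_codons = codons - 1

print(span_bp, span_bp % 3, coding_codons)  # prints: 1248 0 415
```

Under these assumptions the span matches the listed Gene Length, is a whole number of codons, and gives the listed Protein Length.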
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.120613 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTCGATG CGACGGCCAA CGACATCGTC GAAGACGATG CCCGTGCCCG TTCCAACGTG GTGCGGCTGG TCGCGGCGCA GGCGCTGACC GGCGCAAACG CCGCAGTGAT TTTCGCGACC GGTTCGATCA TCGGTGCGCA GCTCGCGCCG GAGATGTCGC TCGCGACTGT GCCGCTGTCG ATGTACGTGC TCGGCCTTGC TGCCGGCACG CTGCCGACTG GGTGGATCGC ACGGGCTTAC GGCCGCCGCG TCTCGTTCAT GATCGGCACC GGCTGCGGCA CGCTCACCGG CGTGCTCGGC GCCGTCGCAA TCCTGTACGG CTCGTTCCTG CTGTTCTGCG TCGCGACGTT CATCGGCGGG CTTTACGCCG CCGTGTCGCA GTCCTACCGG TTCGCCGCCG CCGACGGCGC CAGCGCCTCC TATCGGCCGA AGGCGGTGTC GTGGGTGATG GCAGGCGGCG TGTTCGCCGG CGTGCTCGGT CCGCAGCTCG TGCAGTGGAC CATGGACGTC TGGCAGCCTT ATCTGTTTGC GTTCTCCTAT GTGGTGCAGG CGGCGATTGC GCTGATCGCC ATGGCGGTGC TGTGGGGCGT CGATGCACCG AAGCCGAAGC CGGCGGAGCG GGCTGGCGGT CGGCCGCTGC TGGAGATCGC GCGACAGCCG CGCTTCATCG CCGCGGCGCT GTGCGGCGCG ATCGCCTATC CGATGATGAA TTTGGTGATG ACCTCGGCGC CGCTGGCGAT GCAGATGTGC GGGCTGAGCG TCGGCGATTC CAATTTCGGC CTGCAATGGC ACATCGTGGC GATGTACGCG CCGAGTTTCG TCACCGGCTC GCTGATCGCC AAGTTCGGCG CGCCGCGCGT GGTCGCGGCC GGCCTGGTCC TGGAAGCGCT CGGCGCGTCG ATCGGCCTGC TCGGCGTCAC CGCCCCGCAC TTCTGGGCGA CGCTGTTCGT GATCGGCGTC GGATGGAATC TGGCGTTCGT CGGCGCCTCG GCGCTGGTGC TGGAAACGCA CCAGCCGAAC GAGAAGAACA AGGTCCAGGC GTTCAACGAC TTCATCATCT TCGGCCTGAT GGCGCTGGGG TCGTTCTCGT CGGGCCAGCT GCTGGCGAAC TACGGCTGGA CCACCGTGAA CCTCGCGGTG TTCCCGCCAG TGCTGCTCGG CCTGATCGTG CTGGCGATCA CCGGCTGGTC GAAAGTCCGG AAACGGGTGG CCGAGGCCGC CACCGAACTA TCCGATCGCG GCGTCTGA
|
Protein sequence | MVDATANDIV EDDARARSNV VRLVAAQALT GANAAVIFAT GSIIGAQLAP EMSLATVPLS MYVLGLAAGT LPTGWIARAY GRRVSFMIGT GCGTLTGVLG AVAILYGSFL LFCVATFIGG LYAAVSQSYR FAAADGASAS YRPKAVSWVM AGGVFAGVLG PQLVQWTMDV WQPYLFAFSY VVQAAIALIA MAVLWGVDAP KPKPAERAGG RPLLEIARQP RFIAAALCGA IAYPMMNLVM TSAPLAMQMC GLSVGDSNFG LQWHIVAMYA PSFVTGSLIA KFGAPRVVAA GLVLEALGAS IGLLGVTAPH FWATLFVIGV GWNLAFVGAS ALVLETHQPN EKNKVQAFND FIIFGLMALG SFSSGQLLAN YGWTTVNLAV FPPVLLGLIV LAITGWSKVR KRVAEAATEL SDRGV
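The relationship between the gene sequence and the protein sequence above can be sketched with a small translator for translation table 11. This is a minimal illustration, not the pipeline that produced the record: the function name `translate` and its `starts_at_initiator` parameter are hypothetical, the codon-to-residue mapping is the standard one that table 11 shares with table 1, and the initiator set is the table 11 start-codon list as the author of this sketch understands it. Because the record lists the gene on the minus strand, the sketch also assumes the Gene sequence field is already given in coding orientation.

```python
from itertools import product

# Codon table built in T, C, A, G order (first base varies slowest).
# Translation table 11 uses the same codon-to-residue mapping as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: initiator codons permitted by table 11; any of them is read as M
# when it is the first codon of a complete CDS.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate(dna: str, starts_at_initiator: bool = True) -> str:
    """Translate an in-frame DNA string; a trailing stop codon is dropped."""
    seq = "".join(dna.split()).upper()  # remove the 10-base block spacing
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    codons = [seq[i:i + 3] for i in range(0, len(seq), 3)]
    # Unknown or ambiguous codons are reported as X.
    residues = [CODON_TABLE.get(codon, "X") for codon in codons]
    if starts_at_initiator and codons and codons[0] in START_CODONS:
        residues[0] = "M"
    protein = "".join(residues)
    return protein[:-1] if protein.endswith("*") else protein


# First 30 bases and last 18 bases of the Gene sequence field above.
head = "ATGGTCGATG CGACGGCCAA CGACATCGTC"
tail = "TCCGATCGCG GCGTCTGA"

print(translate(head))                             # MVDATANDIV
print(translate(tail, starts_at_initiator=False))  # SDRGV
```

The two fragments reproduce the first ten and last five residues of the listed protein sequence. The whole Gene sequence field can be passed to the same function.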