Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rpal_0919 |
Symbol | |
ID | 6408573 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris TIE-1 |
Kingdom | Bacteria |
Replicon accession | NC_011004 |
Strand | - |
Start bp | 973736 |
End bp | 975400 |
Gene Length | 1665 bp |
Protein Length | 554 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 642710833 |
Product | General substrate transporter |
Protein accession | YP_001989952 |
Protein GI | 192289347 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.15683 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCTACAC TTGCCGCGAC GAGCAGGCGA TCGGGTGGAA TGACCCGCGA CGAGCGCTTC GTCATTCTTG CATCTTCGCT CGGTACCGTT TTCGAATGGT ACGACTTCTA TCTGTACGGT TCGCTCGCCG CGATTATCGG CGCCCAGTTC TTCAGCGCTT ATCCGCCGGC CACCCGGGAC ATTTTCGCGC TGCTCGCCTT CGCCGCCGGC TTCCTGGTGC GCCCGTTCGG CGCCATCGTG TTCGGCCGGG TCGGCGACAT CGTCGGCCGT AAATACACCT TCCTGGTCAC CATCCTGATC ATGGGCCTGT CGACCTTTAT CGTCGGCCTG CTGCCCAATG CGGCCACCAT CGGCATTGCC GCCCCGATCA TCCTGATCAC GCTGCGCCTG CTGCAGGGCC TCGCGCTCGG CGGCGAATAC GGTGGCGCCG CGACCTACGT GGCCGAGCAT GCTCCGCCCG GCAAGCGCGG CTACTACACC GCATTCATTC AGACCACCGC GACCCTCGGC CTGTTCCTGT CGCTGCTGGT GATCCTTGCC ACCCGCACCA TCGTGGGCGA AGTGGCGTTC GCCGATTGGG GCTGGCGCGT GCCGTTCCTG GTGTCGGTCG CCTTGCTCGG CGTCTCGGTC TGGATCCGGC TGCGGCTCAA CGAGTCGCCG GTGTTCAAGA AGATGAAGGA GGAAGGCAAG AGCTCGAAGG CGCCGCTGAC CGAAGCTTTC GCCAACTGGG GCAACGCGAA GATCGTGCTG ATCGCGCTGC TCGGTGCGGT GATGGGCCAG GGCGTTGTCT GGTACACTGG CCAGTTCTAC GCGCTGTTCT TCCTGCAATC GATCCTGAAG GTCGACGGCT ACACCTCGAA CCTGCTGATC GCGTGGTCGC TGCTGCTCGG CACCGGCTTC TTCATCATCT TCGGCTGGCT GTCGGACAAG ATCGGCCGCA AGCCGATCAT CCTCACCGGC TGTCTGATCG CGGCACTGTC GTTCTTCCCG ATCTTCCGGA TGATCACGAC ATACGCCAAC CCGGCGCTGG AAAAGGCGAT CGAGACCGTG AAGGTGCAAG TCGTCGCCGA TCCGGCGGGC TGCGGCGACC TGTTCAACCC GGTCGGCACA CGCGTCTTCA CCAAGCCGTG CGACACCGCG CGCGACTTCC TGTCGAAGTC CTCGGTGAAG TACTCCACCG TCAACGGCCC CGCCGGCTCC GGCGTCAAGG TGATGGTGAA CGAGAAGGAA GTGCCGTACA CCGACGCCAA GACCTCCAAC CCGCAGGTGC TGGCGGCGGT GCAGGAAGCT GGTTACCCGA AGGCCGGCAA CCCGCAGATC ATCAAGATGG CGCACCCGTT CGACGTGTTC AATTCGAGCA CCGCAGCGGT GATCGGACTG TTGTTCGTCC TGGTGCTGTT CGTCACGATG GTCTACGGCC CGATCGCGGC GCTGCTGGTC GAACTGTTCC CGACTCGGAT TCGCTATACC TCGATGTCGC TGCCGTATCA CATCGGCAAC GGCTGGTTCG GCGGCCTGCT GCCGGCGACT GCGTTCGCGA TCGTCGCCTC GACCGGCGAT ATCTATGCCG GCCTGTGGTA CCCGATCGTC TTCGCGTCGA TCACTGTGGT GATTGGCCTG ATCTTCCTGC CGGAAACCAA GAACGTCGAC ATCAGCAAGA CCTGA
|
Protein sequence | MATLAATSRR SGGMTRDERF VILASSLGTV FEWYDFYLYG SLAAIIGAQF FSAYPPATRD IFALLAFAAG FLVRPFGAIV FGRVGDIVGR KYTFLVTILI MGLSTFIVGL LPNAATIGIA APIILITLRL LQGLALGGEY GGAATYVAEH APPGKRGYYT AFIQTTATLG LFLSLLVILA TRTIVGEVAF ADWGWRVPFL VSVALLGVSV WIRLRLNESP VFKKMKEEGK SSKAPLTEAF ANWGNAKIVL IALLGAVMGQ GVVWYTGQFY ALFFLQSILK VDGYTSNLLI AWSLLLGTGF FIIFGWLSDK IGRKPIILTG CLIAALSFFP IFRMITTYAN PALEKAIETV KVQVVADPAG CGDLFNPVGT RVFTKPCDTA RDFLSKSSVK YSTVNGPAGS GVKVMVNEKE VPYTDAKTSN PQVLAAVQEA GYPKAGNPQI IKMAHPFDVF NSSTAAVIGL LFVLVLFVTM VYGPIAALLV ELFPTRIRYT SMSLPYHIGN GWFGGLLPAT AFAIVASTGD IYAGLWYPIV FASITVVIGL IFLPETKNVD ISKT
|
| |