Gene Information |
Locus tag | RPB_4566 |
Symbol | |
ID | 3912383 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | + |
Start bp | 5165032 |
End bp | 5166696 |
Gene Length | 1665 bp |
Protein Length | 554 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 637886470 |
Product | general substrate transporter |
Protein accession | YP_488160 |
Protein GI | 86751664 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
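The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; both assumptions are consistent with the values listed in this record. The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Protein length in aa for an unspliced CDS.

    Assumes the CDS length is a multiple of 3 and that the last
    codon is a stop codon, which is not translated.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the Gene Information fields of RPB_4566.
length_bp = gene_length(5165032, 5166696)
print(length_bp)                  # 1665
print(protein_length(length_bp))  # 554
```

Both results agree with the Gene Length (1665 bp) and Protein Length (554 aa) fields.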
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCTAGTT ATGCCGCAAC GCAGCGGCGG TCCGGCGGAA TGACCAAGGA CGAACGTTTC GTCATTGTCG CATCCTCGCT CGGCACCGTT TTCGAATGGT ACGATTTCTA TCTGTACGGG TCGCTCGCCG CGATCATCGG CGCGCAGTTC TTCAGCGCCT ACCCGCCCGC CACACGCGAC ATTTTCGCCC TTCTGGCGTT CGCCGCCGGC TTCCTGGTGC GCCCGTTCGG CGCCATCGTG TTCGGCCGCA TCGGCGACAT CGTCGGCCGC AAATACACCT TCCTCGTCAC CATTCTGATC ATGGGCCTGT CGACCTTCAT CGTCGGCCTG CTGCCCAATG CGGCGACCAT CGGCATTGCG GCGCCGATCA TCCTGATCTG TCTGCGTCTG CTGCAGGGCC TCGCGCTCGG CGGCGAATAT GGCGGCGCGG CGACCTATGT GGCCGAGCAT GCTCCACCCG GCAAACGCGG CTACTACACG TCGTTCATCC AGACCACCGC CACGCTCGGC CTGTTCCTGT CGCTGATCGT GATCCTGGTC ACCCGCACGG TGCTGGGCGA ACCGGAATTC GCCGCCTGGG GTTGGCGCAT TCCGTTCCTG GTATCGGTCG CGCTGCTCGG CGTCTCGGTC TGGATCCGGC TGCGGCTGAA CGAATCGCCC GTGTTCCAGA AGATGAAGGA AGAGGGCAAG AGTTCGAAAG CGCCTCTGAC CGAAGCCTTC GCCAATTGGG GCAACGCCAA GATCGTGCTG ATCGCCCTGT TCGGCGGCGT GATGGGCCAG GGCGTGGTCT GGTACACCGG CCAGTTCTAC GCGCTGTTCT TCCTGCAATC GATCCTGAAG GTCGACGGCT ACACGTCGAA CCTGCTGATC GCCTGGTCGC TGCTGCTCGG CACCTTCTTC TTCATCTTCT TCGGCTGGCT GTCGGACAAG ATCGGCCGCA AGCCGATCAT CCTGACCGGC TGCGCGATCG CCGCGCTGTC GTTCTTCCCG ATCTTCAAGG CGATCACGAC CAACGCCAAC CCGGCGCTGG AGCGGGCCAT CGAGACCGTC AAGGTCGAGG TGGTGTCGGA TCCCGCGCTG TGCGGCGACC TGTTCAACCC GGTCGGCACC CGCGTGTTCA CCGCGCCTTG CGACACTGCG CGCGCCTACC TGTCGCAGTC GTCGGTGAAG TACTCGACCG CGAAGGGCCC GGCGGGCTCG GGCGTCAAGG TGCTCGTGAA CGGCGCCGAG GTGCCCTACG TCGACGCCAA GACCTCCAAC CCGGCGGTGC TGGCGGCGAT CCAGGCGGCG GGCTATCCGA AGGCCGGCAA CGCCGAGATC ATCAAGATGG CGCATCCGTT CGACGTCTTC CAGCCGCGCA TCGCGGCGAC GATCGGGCTG CTGTTCGTGC TGGTGCTGTT CGTCACCATG GTCTACGGGC CGATCGCCGC GATGCTGGTC GAACTGTTCC CGACCCGCAT CCGCTACACC TCGATGTCGC TGCCCTATCA CATCGGCAAC GGCTGGTTCG GCGGCCTGCT GCCGGCGACC GCCTTCGCCA TCGTGGCGTC GACCGGCGAT ATCTACGCCG GCCTCTGGTA TCCGATCATC TTCGCGTCGA TCACCGTGGT GATCGGCCTG ATCTTCCTGC CCGAGACCAA GAACGTCGAT ATCAGCAGGA ACTGA
|
Protein sequence | MSSYAATQRR SGGMTKDERF VIVASSLGTV FEWYDFYLYG SLAAIIGAQF FSAYPPATRD IFALLAFAAG FLVRPFGAIV FGRIGDIVGR KYTFLVTILI MGLSTFIVGL LPNAATIGIA APIILICLRL LQGLALGGEY GGAATYVAEH APPGKRGYYT SFIQTTATLG LFLSLIVILV TRTVLGEPEF AAWGWRIPFL VSVALLGVSV WIRLRLNESP VFQKMKEEGK SSKAPLTEAF ANWGNAKIVL IALFGGVMGQ GVVWYTGQFY ALFFLQSILK VDGYTSNLLI AWSLLLGTFF FIFFGWLSDK IGRKPIILTG CAIAALSFFP IFKAITTNAN PALERAIETV KVEVVSDPAL CGDLFNPVGT RVFTAPCDTA RAYLSQSSVK YSTAKGPAGS GVKVLVNGAE VPYVDAKTSN PAVLAAIQAA GYPKAGNAEI IKMAHPFDVF QPRIAATIGL LFVLVLFVTM VYGPIAAMLV ELFPTRIRYT SMSLPYHIGN GWFGGLLPAT AFAIVASTGD IYAGLWYPII FASITVVIGL IFLPETKNVD ISRN
|
| |
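The gene and protein sequences above can be checked against each other by translating the nucleotide sequence. The following is a minimal sketch, not the tool used to produce this record. It assumes the sequence is pasted in the space-separated blocks shown above, and it uses the amino-acid assignments of the standard genetic code, which translation table 11 shares (table 11 differs only in the set of permitted start codons, and this gene starts with ATG). The helper names are hypothetical.

```python
# Codon table built from the NCBI ordering of bases (T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-letter blocks and uppercase."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bp and last 15 bp of the gene sequence in this record.
print(translate("ATGTCTAGTT ATGCCGCAAC GCAGCGGCGG"))  # MSSYAATQRR
print(translate("ATCAGCAGGA ACTGA"))                  # ISRN
```

The two outputs match the first ten and last four residues of the protein sequence listed above. Passing the full gene sequence to `gc_content` gives a value that can be compared with the GC content field.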