Gene Information |
Locus tag | RPD_0835 |
Symbol | |
ID | 4021309 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB5 |
Kingdom | Bacteria |
Replicon accession | NC_007958 |
Strand | - |
Start bp | 935891 |
End bp | 937555 |
Gene Length | 1665 bp |
Protein Length | 554 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 637961025 |
Product | general substrate transporter |
Protein accession | YP_567974 |
Protein GI | 91975315 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
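The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon; the function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


# Values copied from the record for RPD_0835.
length = gene_length_bp(935891, 937555)
print(length)                     # 1665
print(protein_length_aa(length))  # 554
```

Both results match the reported Gene Length (1665 bp) and Protein Length (554 aa).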
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.0740201 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCACCT ATGCCGCGAC GCAGGTGCGG TCCAGCGGAA TGACGAAGGA CGAACGTTTC GTCATTGTCG CATCGTCGCT CGGTACCGTT TTTGAGTGGT ACGATTTCTA TCTGTACGGA TCGCTCGCAG CCATTATCGG CGCGCAATTC TTCAGTGCCT ATCCGCCTGC GACGCGCGAC ATCTTCGCGC TTCTGGCCTT CGCCGCCGGC TTTCTCGTGC GCCCGTTCGG CGCCATCGTG TTCGGTCGTG TCGGCGACAT CGTCGGCCGT AAATACACCT TCCTCGTCAC CATTCTGATC ATGGGCCTGT CGACGTTCAT CGTCGGCCTG CTGCCCAATG CGGCGACGAT CGGCATCGCG GCGCCGATTA TCCTGATCGG TCTGCGCCTG CTGCAGGGCC TCGCGCTCGG CGGTGAATAT GGCGGCGCGG CGACCTATGT GGCCGAGCAC GCACCTCCCG GCAAACGCGG CTACTACACG TCGTTCATCC AGACCACGGC AACACTCGGA CTATTCCTGT CGCTGATGGT GATCCTGTTC ACCCGCACGA TTCTCGGCGA AGCCGCGTTC GCGGATTGGG GCTGGCGTGT TCCGTTCCTG GTGTCGGTCG TGCTGCTCGG CGTTTCGGTC TGGATCCGGC TGCGGCTGAA CGAATCTCCC GTGTTCCAGA AGATGAAGGA CGAGGGCAAG AGCTCGAAGG CGCCGTTGAC CGAAGCCTTT GCGAACTGGG GCAACGCCAA GATCGTGCTG ATCGCACTGT TCGGCGCCGT GATGGGTCAG GGCGTGGTCT GGTACACCGG CCAGTTCTAC GCGCTGTTCT TCCTGCAATC GATCCTGAAG GTCGACGGCT ACACCTCGAA CCTGCTGATC GCCTGGTCGC TGCTGCTCGG CACCTTCTTC TTCATCGTGT TCGGTTGGCT GTCCGACAAG ATCGGTCGCA AGCCGATCAT CCTCACCGGC TGCGCGATCG CTGCGCTGTC GTTCTTCCCG ATCTTCAAGG CGATCACCTC CAACGCCAAC CCGGCGCTGG AAAGGGCCAT CGAGACCGTC AAGGTCGAGG TGGTGTCGGA TCCGGCGCTG TGCGGCGACC TGTTCAACCC GGTCGGCACC CGCGTCTTCA CCGCCCCGTG CGACACCGCC CGCGCCTACC TGTCGCAATC CTCGGTCAAG TACTCGACCA CCAACGGCCC GGCCGGCTCC GGCGTCAAGG TGCTCGTGAA CGGCACGGAA GTGCCCTACA CCGACGCCAA AACGTCCAAT CCGCAGGTGC TGGCGACGAT CCAGGCGGCC GGCTATCCGA AGGCGGGAAA TTCTGAAATC ATCAAGATGT CGAACCCATT CGACATCTTC CGCCCGCAAG TGATGGCGGT GATCGGGCTG CTGTTCGTCC TGGTGCTGTT CGTCACCATG GTTTACGGGC CGATCGCGGC GATGCTGGTC GAACTATTCC CGACCCGCAT CCGCTACACC TCGATGTCGC TGCCCTACCA CATCGGCAAC GGCTGGTTCG GTGGCCTGCT GCCCGCGACC GCCTTCGCGA TCGTGGCCTC GACCGGCGAT ATCTACGCCG GCCTCTGGTA CCCGATCATC TTCGCGTCGA TCACCGTCGT GATCGGCCTG ATCTTCCTGC CAGAGACCAA ACACGTCGAT ATCAGCAAAA CCTGA
|
Protein sequence | MSTYAATQVR SSGMTKDERF VIVASSLGTV FEWYDFYLYG SLAAIIGAQF FSAYPPATRD IFALLAFAAG FLVRPFGAIV FGRVGDIVGR KYTFLVTILI MGLSTFIVGL LPNAATIGIA APIILIGLRL LQGLALGGEY GGAATYVAEH APPGKRGYYT SFIQTTATLG LFLSLMVILF TRTILGEAAF ADWGWRVPFL VSVVLLGVSV WIRLRLNESP VFQKMKDEGK SSKAPLTEAF ANWGNAKIVL IALFGAVMGQ GVVWYTGQFY ALFFLQSILK VDGYTSNLLI AWSLLLGTFF FIVFGWLSDK IGRKPIILTG CAIAALSFFP IFKAITSNAN PALERAIETV KVEVVSDPAL CGDLFNPVGT RVFTAPCDTA RAYLSQSSVK YSTTNGPAGS GVKVLVNGTE VPYTDAKTSN PQVLATIQAA GYPKAGNSEI IKMSNPFDIF RPQVMAVIGL LFVLVLFVTM VYGPIAAMLV ELFPTRIRYT SMSLPYHIGN GWFGGLLPAT AFAIVASTGD IYAGLWYPII FASITVVIGL IFLPETKHVD ISKT
|
| |
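The gene sequence is listed in coding orientation (it begins with ATG even though the gene lies on the minus strand), so it can be translated directly. The following is a minimal sketch using only the Python standard library, not the database's own tooling. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code and differs only in its permitted start codons; the `translate` helper is a hypothetical name introduced for illustration.

```python
from itertools import product

# Codon-to-amino-acid assignments in NCBI order (bases T, C, A, G).
# Table 11 shares these assignments with the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, ignoring whitespace, up to the first stop."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, as formatted in the record.
print(translate("ATGTCCACCT ATGCCGCGAC GCAGGTGCGG"))  # MSTYAATQVR
# Last 15 bases, ending in the TGA stop codon.
print(translate("ATCAGCAAAA CCTGA"))  # ISKT
```

The two fragments reproduce the first ten and last four residues of the listed protein sequence.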