Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_0189 |
Symbol | |
ID | 3907794 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | + |
Start bp | 208007 |
End bp | 209626 |
Gene Length | 1620 bp |
Protein Length | 539 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 637882070 |
Product | hypothetical protein |
Protein accession | YP_483811 |
Protein GI | 86747315 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGAATG TCGAACGCGA TCCCCCGATG ACGGCCGAAA CACGGGCGTC CCCCTGGATC GCGTTCCGCC ACACCGCCTT TACGGTGGTC TGGACCGCAA CCGTCGTCGC CAACGTCGGC ACCTGGATGT ACAACGCGGC ATCCGGCTGG CTGATGACGA GCCTGGAAGC CGATCCACTG ACCGTTTCGC TCGTTCAAGT CGCGTCCAGC CTTCCGATGT TCCTGTTCGC GATCCCGGCC GGCGCGCTGG CGGACATCGT CGACAAGCGG CGCTTCCTGA TCCTGATCGA GATCGTGCTC ACCGTGTTTG CGGCCGCGAG CGCGGTGCTG GTCTGGCTCG GGCTGATGAA CCCGTTCCAG CTGTTGCTGT TCACGTTTCT GCTCGGCGCG GGTGCGGCGT TCGCGGCGCC GGCCTGGCAA TCGATCGTGC CGGATCTCGT GCCCAAGGAG CACCTCGCAT CGGCGGTGGC GAGCAATGGC GTCGGCATCA ATGTCAGCCG GGCGATCGGC CCGGCGCTGG GCGGCGTGGT GATCGGCGTC GCGGGCATCG CGGCGCCGTT CTGGATCAAC GCGCTGAGCA ATTTCGCGGT GATCGGCGCG CTGCTGTGGT GGCGCCCCGC CGCCAAGCGC GCGGCGACGC TGCCTCCGGA ACGGTTGTTC AGCGCGATCG TGATCGGCTT TCGCCACGCG CGATACAATC TCGATCTGCG CGCCACACTG GTGCGCGCGG TCGCGTTCTT CTTTTTCGCC AGCGCGTATT GGGCGCTGCT GCCGCTTGTC GCCCGCTCGC GCATCGCCGG GGGACCGGAA CTGTACGGCA TTTTGCTCGG CGCAATCGGC CTCGGCGCGA TTGTCGGCGC GTTCCTGTTG CAGGGGTTGA AATCGGCGCT GGGGCCGGAT CGCTTGGTCG CGGCCGGCAC GCTCGGGACA GCGGTCAGCC TCGTTCTGCT CGGCAGCGTC CAACGCGTCG AACTCGCGAC CGCGGCCTGT TTCATCGCGG GCGTTTCGTG GATCGCGGTT CTCGCCAACC TCAACGTTTC CGTTCAGGTC GCGCTACCGG ACTGGGTGCG CGGCCGCGGC CTGGCGATGT TCGTCACGGT GTTCTTCGGC GCGATGACGG CCGGCAGCGC GCTCTGGGGT CAGTTGGCGT CGTCGTTCGG ATTGCCGGCA GCGCATTTCG CCGCGGCCGT CGGCGCCGTC GTCGGCATCG CCGTGACGTG GCGCTGGAAA CTGCGCGGCA GCGCCGAGCA CGATCTCGCA CCCTCGATGC ATTGGCCCGC GCCGGTGCTG GCCATCGATG CCGATGCCGA TCAAGGCCCG GTGCTGATTA CGGTCGAGTA CCATGTCGCA GCGGACAGGC GCGACGCCTT CCTGCTGGCT ATGCGAAAAT TGAGCCGACA GCGACGGCGT GACGGCGCAT ATGCGTGGGA CGTGTTCGAA GATACCTCGG AGCGCGGACG ATTCGTCGAG GTGTTCAAAG TTGCGTCATG GCTCGAGCAT CTTCGGCAAC ATGATCGCGT CACCAATGCG GATCGGATCG ATCAGAATGC GATCCGCCAT TTCCACGCGA GCGCAGAGCC TCGCGTGACG CACTTGCTCG CTGCGAAGTT CCCGGCATGA
|
Protein sequence | MTNVERDPPM TAETRASPWI AFRHTAFTVV WTATVVANVG TWMYNAASGW LMTSLEADPL TVSLVQVASS LPMFLFAIPA GALADIVDKR RFLILIEIVL TVFAAASAVL VWLGLMNPFQ LLLFTFLLGA GAAFAAPAWQ SIVPDLVPKE HLASAVASNG VGINVSRAIG PALGGVVIGV AGIAAPFWIN ALSNFAVIGA LLWWRPAAKR AATLPPERLF SAIVIGFRHA RYNLDLRATL VRAVAFFFFA SAYWALLPLV ARSRIAGGPE LYGILLGAIG LGAIVGAFLL QGLKSALGPD RLVAAGTLGT AVSLVLLGSV QRVELATAAC FIAGVSWIAV LANLNVSVQV ALPDWVRGRG LAMFVTVFFG AMTAGSALWG QLASSFGLPA AHFAAAVGAV VGIAVTWRWK LRGSAEHDLA PSMHWPAPVL AIDADADQGP VLITVEYHVA ADRRDAFLLA MRKLSRQRRR DGAYAWDVFE DTSERGRFVE VFKVASWLEH LRQHDRVTNA DRIDQNAIRH FHASAEPRVT HLLAAKFPA
|
| |