Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_4684 |
Symbol | |
ID | 3912502 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | + |
Start bp | 5298431 |
End bp | 5299651 |
Gene Length | 1221 bp |
Protein Length | 406 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 637886589 |
Product | extracellular ligand-binding receptor |
Protein accession | YP_488278 |
Protein GI | 86751782 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCACCC GCAACGCGCT GCTTGCAGCC GGACTATTTG CGCTCGCCGC AACCCAGCCT GCCTTCGCCC AGAAGCAATA CGGCCCCGGC GTCACCGACA CCGAAATCAA GATCGGGCAG ACCATGCCCT ACAGCGGCCC GGCTTCGGCA TATGGCGTGC AGGGCCATGT CCAGAACGCC TACTACGCGA TGATCAACGC GAAGGGCGGC GTCAACGGCC GCAAGATCAA CCTGATCAGC CTCGACGACG CCTATTCGCC GCCGAAGACG GTGGAGCAGA CCCGCAAGCT GGTCGAGCAG GACGAGGTGC TGGCGATCGT CGGCACCGTC GGCACGCCGA CCAATTCCGC GACTCAGAAA TATCTCAACG GCAAAAAGGT GCCGCAGATC TTCATCTCCA CGGGGGCGGC AAAATGGGAT GATCCGAAGA CCTTCCCGTG GACCACGCAG CTCTATCCTC CCTATCAGAT GGAAGGCATG ATTTTCGCGA AGTACCTGCT CAAGAACAAG CCCGACGCCA AGCTCGGCGT GTTCTCGCAG AACGACGACG CCGGCAAGGA CTACGTCAAG GGCCTGAAGG AAGGGCTCGG CGACAAGGCC AAGACGATGA TCGTCAAGGA GGTCACCTAC GAGGTCACCG ATCCGACCGT CGACTCGCAG ATCGTCGCGC TGAAGGCGTC GGGCGCCGAC ACGCTGTTCA CGATGGCGAC GCCGAAATTT GGCGCCCAAG CGATCCGCAA GGTCCACGAA CTGAACTGGA AGCCGCTCAA CTTCGTCGTC AGCGTCGCCA GCTCGATCAA GGGCGTGCTC GAACCCGCCG GCAGCGAAGC CTCGACCGGG TTGCTCACCG CGCTCGCCAT GAAGACGCCG ACCGACCCGC GGTTCGAGAA CGATGCCGAC GTCAAGGAAT TCAAGGAATT CCTGGCCAAG TGGTTTCCGA AAGGCGACAT CGCCGACGGC AGCGTGGTGA TCGGCTACAT CTCGGCCTAT ATGACGGCGA AGACGCTCGA AGCCTGCGGC GACAATCTCA CCCGCGACAA CCTGCTCAAG CAGGCGACCA ACATCAAGCC GACGGTTGCT CCGCTGCTAC TGCCGGGCGT CAAGATCTCG ACGCGGCCGG ACCGCTACGC GCCCTACACC CAGATGCAGA TCGCCCGTTT CGACGGCAAG AGCTGGGTGC CTGAAGGCGA AGTGTTCAAC ACCGACGCGA CCAGCCAGTA A
|
Protein sequence | MSTRNALLAA GLFALAATQP AFAQKQYGPG VTDTEIKIGQ TMPYSGPASA YGVQGHVQNA YYAMINAKGG VNGRKINLIS LDDAYSPPKT VEQTRKLVEQ DEVLAIVGTV GTPTNSATQK YLNGKKVPQI FISTGAAKWD DPKTFPWTTQ LYPPYQMEGM IFAKYLLKNK PDAKLGVFSQ NDDAGKDYVK GLKEGLGDKA KTMIVKEVTY EVTDPTVDSQ IVALKASGAD TLFTMATPKF GAQAIRKVHE LNWKPLNFVV SVASSIKGVL EPAGSEASTG LLTALAMKTP TDPRFENDAD VKEFKEFLAK WFPKGDIADG SVVIGYISAY MTAKTLEACG DNLTRDNLLK QATNIKPTVA PLLLPGVKIS TRPDRYAPYT QMQIARFDGK SWVPEGEVFN TDATSQ
|
| |