Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_0226 |
Symbol | |
ID | 3909468 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | + |
Start bp | 254956 |
End bp | 256557 |
Gene Length | 1602 bp |
Protein Length | 533 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 637882108 |
Product | extracellular solute-binding protein |
Protein accession | YP_483848 |
Protein GI | 86747352 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGCGAT TTGGAAGTGC GTTGGTGCTG GCCGCGATGA TCGCAGCCGC GGGCGTCATC GGGCCCGCGC GTGCGGAATC GGTGGTGCGT TACGGCATCT CTATGGCGGA CATTCCGCTC ACCACCGGCC AGCCCGATCG CGGCGCAGGT GCCTATCAAT TCACCGGCTA TACGATCTAC GATCCGCTGG TGGCCTGGGA GATGAATGTC GCCGATCGGC CCGGCAAGCT GGTGCCCGGC CTTGCGACCG AATGGAAAGT CGACGAGTCC GACAAGACCA AATGGCGCTT TACTCTGCGC AAGGGCGTCA AGTTCCACGA CGGCAGCGAC TTCAATGCCG ACGCAGTGAT CTGGAATCTC GACAAGGTGC TCAACGAAAA GGCACCGCAG TTCGACAAGC GGCAGAGCGC GCAGGTCAAG ACCCGGCTTC CGTCGGTAAA GAGTTACGCC AAGATCGACG ATTCAACCAT CGAGATCACC ACCAAGGCGG TCGACTCCTT CTTCCCCTAT CAGATGCTCT GGTTCCTGGT GTCGAGCCCG GCGCAATACG ACAAGGTCGG CAAGGACTGG GACAAGTTCG CGGCGCAGCC GTCGGGCACG GGTCCCTTCA AGCTCACCAA GCTGGTGCCG CGCGAACTCG CCGAGCTGAC CAGGAACGCC GACTATTGGG ACAAGGCGCG GCTGCCGAAG ACCGACAAGC TCGTGCTGGT GCCGATGCCC GAAGCGCTCA CCCGCACCAA CGCGTTGCTC GCCGGCCAGG TCGATCTGAT CGAGACGCCC GCGCCCGATG CCGTGCCGCA GCTCAAATCG GCCGGCATGA AGCTGGTCGA CAATGTCACG CCGCATGTCT GGAACTATCA CCTCAGCGTG CTGCCCGGCT CGCCGTGGAC CGATGTCCGT CTGCGCAAGG CGCTGAATCT CGCGATCGAC CGCGACGCCG TGGTCGGGCT GATGAACGGC CTCGCCAAGC CGGCGGTCGG CCAGGTCGAT CCGTCGAGCC CGTGGTTCGG CAAGCCGACC TTCAAGATCA AATACGACCT CGCCGAGGCC AAGCGGCTGG TGAAGGAAGC CGGCTACTCG CCGGAGAAGC CGCTGAAGGC CAAGTTCATC ATCGCCACCG GCGGCACCGG CCAGATGCTG TCGCTGCCGA TGAACGAGTT CCTGCAGCAG AGCTTCAAGG AGATCGGCAT CGACGTCGAG TTCAAGGTGG TCGAACTCGA AGTGCTGTAC ACCGCGTGGC GCAAGGGCGC GGCCGACGAG AGCATGGCCG GCATCACCGC CAACAATATC GCCTATGTGA CGTCCGATCC GCTCTACGCG ATCGTGCGGT TCTTCCACTC CGGACAGGTG GCGCCGGTCG GCGTCAATTG GGGCGGCTAC AAGAACCCCA AGGTCGACGC GCTGATCGAC GAGGCGAAGA CCAACTTCGA TCCGGTCAAG CAGGACGAGC TGCTGGCGCA GGCGCATGCC CAGATCGTCG ACGACGCGGC GCTGGTCTGG GTGGTGCACG ACACCAACCC GCACGCGCTG TCGCCGAAGG TGAAGAGCTT CGTGCAGGCC CAGCACTGGT TCCAGGACCT GACCACGATC GGGCTGCAAT AG
|
Protein sequence | MKRFGSALVL AAMIAAAGVI GPARAESVVR YGISMADIPL TTGQPDRGAG AYQFTGYTIY DPLVAWEMNV ADRPGKLVPG LATEWKVDES DKTKWRFTLR KGVKFHDGSD FNADAVIWNL DKVLNEKAPQ FDKRQSAQVK TRLPSVKSYA KIDDSTIEIT TKAVDSFFPY QMLWFLVSSP AQYDKVGKDW DKFAAQPSGT GPFKLTKLVP RELAELTRNA DYWDKARLPK TDKLVLVPMP EALTRTNALL AGQVDLIETP APDAVPQLKS AGMKLVDNVT PHVWNYHLSV LPGSPWTDVR LRKALNLAID RDAVVGLMNG LAKPAVGQVD PSSPWFGKPT FKIKYDLAEA KRLVKEAGYS PEKPLKAKFI IATGGTGQML SLPMNEFLQQ SFKEIGIDVE FKVVELEVLY TAWRKGAADE SMAGITANNI AYVTSDPLYA IVRFFHSGQV APVGVNWGGY KNPKVDALID EAKTNFDPVK QDELLAQAHA QIVDDAALVW VVHDTNPHAL SPKVKSFVQA QHWFQDLTTI GLQ
|
| |