Gene Information |
Locus tag | RPD_4373 |
Symbol | |
ID | 4024898 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB5 |
Kingdom | Bacteria |
Replicon accession | NC_007958 |
Strand | - |
Start bp | 4836584 |
End bp | 4837897 |
Gene Length | 1314 bp |
Protein Length | 437 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 637964583 |
Product | extracellular solute-binding protein |
Protein accession | YP_571491 |
Protein GI | 91978832 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
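
The coordinate and length fields above can be cross-checked against each other. The sketch below assumes the Start bp and End bp values are 1-based and inclusive (an assumption, though it agrees with the lengths listed here) and that the final codon of the CDS is a stop codon, which is not counted in the protein length. The function names are hypothetical and do not belong to any database API.

```python
# Hypothetical helpers for sanity-checking the record's length fields.
# Assumption: Start bp / End bp are 1-based, inclusive coordinates.

def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature spanning start_bp..end_bp inclusive."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


length_bp = gene_length(4836584, 4837897)
print(length_bp)                  # 1314, matching "Gene Length"
print(protein_length(length_bp))  # 437, matching "Protein Length"
```

The strand field does not affect this check, because the length depends only on the span between the two coordinates.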
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.104197 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCAGGA CCCCGCTGAT ATTCGCCCTT GTGTTGCTAG TCGTCGCCGG CGCAGCGCCC GCGCGGGCCG CGACGGAGAT CGCGTGGTGG CACGCGATGT CCGGAGAACT CGGCCGCCGT CTCGAAAAGC TCGCAGCCGA TTTCAATGCG TCGCAATCCG ACTACCGTGT GGTGCCGACC TACAAAGGCA ACTACACCGA GACGGTCACC GCGTCGATCT TCGCGTTCCG CTCGTCGACT CAGCCGGCGA TCGTTCAGGT CAACGAGATC GCCACCGCAA CAATGATGGC CGCCAAGGGC GCGGTTTATC CGGTCTACGA GCTGATGCGC GACGAGAAGG AGGCGTTCTC GCCGTCCGAC TATCTGCCCG CGGTCGCCGG CTACTATGTC GATCTCGCCG GCAATATGCT GTCGTTTCCG TTCAACGCCT CGACGCCGAT TCTGTACTAC AACAAGACGC TGTTCAAAAA GGCGGGGCTC GATCCGGAGA CGCCGCCGGG CACTTGGCCG GACGTCGGCG CCGCGGCGAA GCGGCTGATC GACGCGGGCG TGCCCTGCGG ATTCACCACC TCCTGGCCCT CCTGGGTCAA TGTCGAGAAT TTCTCCGCCT ATCACAATCT TCCGCTCGCG ACTCGGGCCA ACGGCCTCGG CGGGCTGGAC GCGGTGCTGG TGTTCAACAA TCCGCTGGTG ATCAGGCACG TCGCCACGCT CGCGGAATGG CAGAAAACCA AGGTGTTCGA CTATGCCGGC CGCGCCACCG CCGCAGAGCC GCGCTTTCAG CAGGGCGACT GCGGCATCTT CATCGGCTCG TCCGCCACCC GTGCCGATAT CATCGCCAAT TCCAATTTCG AGGTCGGCTA CGGCAGGCTG CCGTATTGGC CGGAGGTTCC CGGGGCGCCG CAGAATACGA TCATCGGCGG GGCGACGCTG TGGGTGCTGC GCGGCCGGCC GGCGACGGAC TATCACGGCG TCGCCAAGTT CTTCACCTAT CTGTCGCGGC CCGAAGTGCA GGCCGCCTGG CACCAGAACA CGGGCTATCT TCCGGTGACA CGGGCCGCCT ATCAGCTGAC CCGTGCGCAG GGCTTTTACG ACCGCAATCC GGGCACCGCG ATCTCGATCG AGCAGATCAT CTCGAAGCCG CCCACCGAAA ACTCCCGCGG GCTCCGGCTC GGTTCTTTCG TTCTGATCCG CGACGTCATC GACGACGAGC TCGAACAGGC ATTCAGGGGC AAGAAACCCG CGCAGGCCGC GATGAATTCC GCGGTCGAGC GCGGCAACAA GTTGCTGCGC CAGTTCGAAC GGACGCAGCC ATGA
|
Protein sequence | MARTPLIFAL VLLVVAGAAP ARAATEIAWW HAMSGELGRR LEKLAADFNA SQSDYRVVPT YKGNYTETVT ASIFAFRSST QPAIVQVNEI ATATMMAAKG AVYPVYELMR DEKEAFSPSD YLPAVAGYYV DLAGNMLSFP FNASTPILYY NKTLFKKAGL DPETPPGTWP DVGAAAKRLI DAGVPCGFTT SWPSWVNVEN FSAYHNLPLA TRANGLGGLD AVLVFNNPLV IRHVATLAEW QKTKVFDYAG RATAAEPRFQ QGDCGIFIGS SATRADIIAN SNFEVGYGRL PYWPEVPGAP QNTIIGGATL WVLRGRPATD YHGVAKFFTY LSRPEVQAAW HQNTGYLPVT RAAYQLTRAQ GFYDRNPGTA ISIEQIISKP PTENSRGLRL GSFVLIRDVI DDELEQAFRG KKPAQAAMNS AVERGNKLLR QFERTQP
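
The gene and protein sequences above are displayed in space-separated blocks of ten characters. As a minimal sketch, the code below strips that grouping, computes the GC fraction, and translates a coding sequence. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may act as starts; since this gene begins with ATG, no special start-codon handling is included. All function names are hypothetical, and only the first 30 bases of the gene sequence are used as input.

```python
# Illustrative helpers; names are hypothetical, not part of any library.

BASES = "TCAG"
# Amino acids for codons in TCAG order (standard code, shared by table 11).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the whitespace used for display grouping and upper-case."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[pos:pos + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        residues.append(aa)
    return "".join(residues)


# First three display blocks of the gene sequence shown in this record.
prefix = "ATGGCCAGGA CCCCGCTGAT ATTCGCCCTT"
print(translate(prefix))  # MARTPLIFAL, the first block of the protein
```

Passing the full gene sequence to `gc_content` and `translate` should allow the GC content and protein sequence fields to be compared with the values listed in this record.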