Gene Information |
Locus tag | RPB_4211 |
Symbol | |
ID | 3912019 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | - |
Start bp | 4783668 |
End bp | 4785464 |
Gene Length | 1797 bp |
Protein Length | 598 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 637886114 |
Product | extracellular solute-binding protein |
Protein accession | YP_487813 |
Protein GI | 86751317 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
|
|
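The coordinate and length fields above are mutually consistent, which the following minimal sketch checks. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon that is not counted in Protein Length. Both are assumptions, not statements from the record, and the function names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 4783668
END_BP = 4785464
GENE_LENGTH_BP = 1797
PROTEIN_LENGTH_AA = 598


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Codon count minus one stop codon, assuming an unspliced CDS."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Compare the derived values with the ones listed in the record.
length_ok = span_length(START_BP, END_BP) == GENE_LENGTH_BP
protein_ok = expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
print("gene length consistent:", length_ok)
print("protein length consistent:", protein_ok)
```

Under these assumptions, 4785464 - 4783668 + 1 gives 1797 bp, and 1797 / 3 - 1 gives 598 aa, matching the listed values.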
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.64235 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCATAGTT TCGGAAATCG GACTCTCTCC AGCCGCTTGC GGCTGATGAC GATGGCGAGC GCCGCGGCAT TGGTCGCCGC GTCGATGACG CTGGCGGCGC CGGCATGGGC CGCCGACGAT GCGGTGCTGA AGAAGTGGAT CGACGAGGAG TTTCAGCCCT CGACGCTGTC GAAGGACGAC CAGCTCAAGG AACTGCAATG GTTCGCCAAG GCGGCCGAGC CGTTCAAGGG CATGGACATC AACGTCGTCT CCGAGACCAT CACCACCCAC GAATACGAGG CGAAGACGCT CGCGAAGGCG TTCTCGGAGA TCACCGGCAT CAAGCTCAAG CATGATCTGA TCCAGGAAGG CGACGTGGTC GAGAAGCTGC AGACCCAGAT GCAGTCCGGC AAGAACGTCT ATGACGGCTG GATCAACGAC AGCGACCTGA TCGGCACGCA TTTCCGCTAC GGCCAGACCA TCGCGCTGTC CGACTACATG ACCGGCGAGG GCAAGGACGT CACCGATCCG ATGCTCGATA TCGATGACTT CATCGGCAAG TCGTTCACCA CCGCGCCCGA CAAGAAGATG TACCAGTTGC CCGACCAGCA GTTCGCCAAT CTGTACTGGT TCCGCTACGA CTGGTTCACC AATCCGGACT ACAAGGCGAA GTTCAAGGCG AAATACGGCT ACGAGCTCGG CGTCCCGGTC AACTGGTCGG CCTACGAGGA TATCGCCGAG TTCTTCACCA ACGACATCAA GGAGATCAAC GGCGTCAAGG TCTATGGTCA CATGGACTAC GGCAAGAAGG ATCCCTCGCT CGGCTGGCGC TTCACCGACG CCTGGCTGTC GATGGCCGGC AACGGCGACA AGGGCTTGCC GAACGGTCTG CCGGTCGACG AATGGGGCAT CCGCATGGAA GGCTGCCGTC CGGTCGGCTC CTCGATGGAG CGCGGCGGCG ACACCAACGG TCCCGCGGCG GTGTACTCCA TCGTGAAGTA TCTCGACTGG ATGAAGAAGT ATGCGCCGCC GCAGGCGCAG GGCATGACCT TCTCGGAGTC GGGGCCGGTG CCGGCGCAGG GCAACGTCGC CCAGCAGATG TTCTGGTACA CCGCCTTCAC CGCCGACATG GTGAAGCCCG GCCTGCCGGT GGTGAACGCC GACGGCACGC CGAAATGGCG GATGGCCCCC TCGCCGAAGG GCGCCTATTG GAAAGACGGC ATGAAGCTCG GCTATCAGGA CGTCGGCTCC GGCACGCTCT TGAAGTCGAC GCCGCCGGAT CGCCGCAAGG CGGCCTGGCT GTATCTGCAG TTCATCACCT CGAAGACGGT GAGCCTGAAG AAGAGCCATG TCGGTCTCAC CTTCATCCGC GAGAGCGATA TCTGGGACAA ATCGTTCACC GAGCGCGCGC CCAAGCTCGG TGGCCTGATC GAGTTCTATC GCTCGCCGGC CCGCGTGCAG TGGTCGCCGA CCGGCAACAA CATCCCGGAC TATCCGAAGC TGGCGCAATT GTGGTGGCAG AACATCGGCG ATGCATCGTC CGGTGCGAAG ACGCCGCAGG CCGCCATGGA TTCGCTGGCG GCCGCGCAGG ACTCGGTGCT CGAGCGGCTC GAGCGGTCGA AGGTGCAGGG TGATTGCGGT CCGAAGCTGA ACAAGAAAGA GACCGCCGAG TACTGGTACG AGAAGTCCGC CAAGGACGGC AACATCGCTC CGCAGCGCAA GCTGGCGAAC GAGAAGCCGA AGGGTGAGAC CGTCGATTAC GACACCCTGA TCAAGTCCTG GCCCGCCTCG CCGCCGAAGC GCGCGGAGGC GAAGTAA
|
Protein sequence | MHSFGNRTLS SRLRLMTMAS AAALVAASMT LAAPAWAADD AVLKKWIDEE FQPSTLSKDD QLKELQWFAK AAEPFKGMDI NVVSETITTH EYEAKTLAKA FSEITGIKLK HDLIQEGDVV EKLQTQMQSG KNVYDGWIND SDLIGTHFRY GQTIALSDYM TGEGKDVTDP MLDIDDFIGK SFTTAPDKKM YQLPDQQFAN LYWFRYDWFT NPDYKAKFKA KYGYELGVPV NWSAYEDIAE FFTNDIKEIN GVKVYGHMDY GKKDPSLGWR FTDAWLSMAG NGDKGLPNGL PVDEWGIRME GCRPVGSSME RGGDTNGPAA VYSIVKYLDW MKKYAPPQAQ GMTFSESGPV PAQGNVAQQM FWYTAFTADM VKPGLPVVNA DGTPKWRMAP SPKGAYWKDG MKLGYQDVGS GTLLKSTPPD RRKAAWLYLQ FITSKTVSLK KSHVGLTFIR ESDIWDKSFT ERAPKLGGLI EFYRSPARVQ WSPTGNNIPD YPKLAQLWWQ NIGDASSGAK TPQAAMDSLA AAQDSVLERL ERSKVQGDCG PKLNKKETAE YWYEKSAKDG NIAPQRKLAN EKPKGETVDY DTLIKSWPAS PPKRAEAK
|
| |
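The gene and protein sequences above can be cross-checked with a short script. The sketch below strips the 10-base spacing, computes GC content, and translates codons with the amino-acid assignments shared by the standard code and translation table 11. Only the first 30 and last 27 bases of the gene sequence are embedded as examples. The function names are hypothetical. The listed sequence starts with ATG and ends with TAA, so it appears to be given in coding orientation even though the gene is on the minus strand. The sketch relies on that observation and does no reverse complementing. Table 11 also permits alternative start codons, which this sketch does not handle.

```python
import itertools

# Codon order T, C, A, G with the third base varying fastest (NCBI layout).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(itertools.product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spacing between 10-character blocks and upper-case."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 and last 27 bases of the gene sequence in this record.
GENE_START = "ATGCATAGTT TCGGAAATCG GACTCTCTCC"
GENE_END = "CCGCCGAAGC GCGCGGAGGC GAAGTAA"

print(translate(GENE_START))
print(translate(GENE_END))
```

Passing the full gene sequence to `translate` and `gc_content` would allow the same comparison against the complete protein sequence and the listed GC content.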