Gene Information |
Locus tag | RPC_1367 |
Symbol | |
ID | 3972463 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB18 |
Kingdom | Bacteria |
Replicon accession | NC_007925 |
Strand | + |
Start bp | 1491378 |
End bp | 1493165 |
Gene Length | 1788 bp |
Protein Length | 595 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 637924482 |
Product | extracellular solute-binding protein |
Protein accession | YP_531248 |
Protein GI | 90422878 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
|
|
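The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming Start bp and End bp are 1-based inclusive positions (an assumption, though it matches the reported 1788 bp) and that the final codon of the CDS is a stop codon. The function names are illustrative only and do not come from the source database.

```python
# Values copied from the Gene Information table above.
START_BP = 1491378
END_BP = 1493165
REPORTED_GENE_LENGTH_BP = 1788
REPORTED_PROTEIN_LENGTH_AA = 595


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)

print(length_bp, length_bp == REPORTED_GENE_LENGTH_BP)    # 1788 True
print(length_aa, length_aa == REPORTED_PROTEIN_LENGTH_AA)  # 595 True
```

Both derived values agree with the Gene Length and Protein Length fields, which is expected for a gene that is neither spliced nor a pseudogene.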
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.15999 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAACACG CAATCGGACA GCGCCGTCGA TCGACGCGCG CGATTTTTCT CGGCATGGCG AGCGCCGCGG CGCTGATCGC GGTCTCGACG GCGCCGGCGC TTGCCGACGA CGCCACCGCG CAGAAATGGA TCGACGAGGA ATTCCAGCCC TCGACGCTGT CGAAGGAAGA TCAGCTCAAG GAGCTGCAAT GGTTCGCCAA GGCGGCCGAG CCGTTCAAGG GCATGGACAT CAACGTCGTC TCCGAAACCA TCACCACCCA CGAATACGAA GCCAAAACGC TGGCCAAGGC GTTTTCGGAA ATCACCGGCA TCAAGCTCAA GCACGATTTG ATCCAGGAAG GCGACGTGGT CGAGAAGCTG CAGACCCAGA TGCAGTCCGG CAAGAACGTC TATGACGGCT GGATCAACGA CAGCGATCTG ATCGGCACGC ATTTCCGCTA CGGCCAGACC ATCGCGCTGT CCGACTACAT GACCGGCGAG GGCAAGGACG TCACCGACCC GATGCTGGAC ATCGACGACT TTATCGGCCG TTCGTTCACC ACCGCGCCCG ACAAGAAAAT GTATCAGCTG CCGGACCAGC AGTTCGCCAA CCTGTATTGG TTCAGGTACG ACTGGTTCAC CAATCCGGAC TACAAATCGA AGTTCAAGGC GAAATACGGC TACGACCTCG GCGTGCCGGT GAATTGGTCG GCCTATGAGG ACATCGCCGA GTTCTTCACC AACGACGTCA AGGAGATCAA CGGCGTCAAG GTCTATGGCC ACATGGATTA TGGCAAGAAG GATCCGTCGC TCGGCTGGCG CTTCACCGAC GCCTGGCTGT CGATGGCCGG CAACGGCGAT CGCGGCATTC CCAACGGCCT GCCGGTCGAC GAATGGGGCG TCCGCATGGA AGGCTGCCGT CCGGTCGGCT CCTCGATCGA GCGCGGCGGC GACACCAACG GCCCTGCGGC GGTGTATTCG ATCGTCAAAT ATCTCGACTG GATGAAGAAA TACGCGCCGC CGCAGGCCCA GGGCATGACC TTCTCGGAAT CGGGTCCGGT GCCGGCGCAG GGCAACGTCG CCCAGCAGAT GTTCTGGTAC ACCGCCTTCA CCGCCGACAT GGTGAAGCCG GGTCTTGCGG TGATGAACGC CGACGGCACG CCGAAGTGGC GGATGGCGCC GTCGCCGCAT GGCGCGTATT GGAAAGACGG CATGAAGCTC GGCTATCAGG ACGTCGGCTC CGGCACGCTG TTGAAGTCGA CGCCGCCGGA TCGCCGCAAG GCGGCGTGGC TGTATCTGCA GTTCATCACC TCGAAGACTG TCAGCTTGAA GAAGAGCCAC GTCGGTCTCA CCTTCATCCG CGAGTCGGAT ATCTGGGATA AGTCCTTTAC GGAACGGGCG CCAAAACTCG GCGGCCTGAT CGAGTTCTAT CGCTCGCCGG CCCGTACGCA ATGGTCGCCG ACCGGCAACA ACATCCCGGA CTATCCGAAG CTGGCGCAAT TGTGGTGGCA GAACATCGGC GACGCGGCCT CGGGCGCCAA GACCGCGCAG GCGGCGATGG ACTCGCTGGC GGCGGCGCAG GATTCGGTGC TGGAGCGGCT CGAGCGCTCC AAGGTGCAGG GTGACTGCGG CCCGAAGCTG AACAAGAAGG AGACCGCCGA GTTCTGGTAC AAGAAGTCGG AAAAGGACGG CAACATCGCG CCGCAACGCA AGCTCGCCAA CGAGAAGCCG AAGGGCGAAA CCATCGACTA CGACACGCTG ATCAAGTCGT GGCCGGCGTC GCCGCCGAAG CGCGCCTCGC TGAACTGA
|
Protein sequence | MQHAIGQRRR STRAIFLGMA SAAALIAVST APALADDATA QKWIDEEFQP STLSKEDQLK ELQWFAKAAE PFKGMDINVV SETITTHEYE AKTLAKAFSE ITGIKLKHDL IQEGDVVEKL QTQMQSGKNV YDGWINDSDL IGTHFRYGQT IALSDYMTGE GKDVTDPMLD IDDFIGRSFT TAPDKKMYQL PDQQFANLYW FRYDWFTNPD YKSKFKAKYG YDLGVPVNWS AYEDIAEFFT NDVKEINGVK VYGHMDYGKK DPSLGWRFTD AWLSMAGNGD RGIPNGLPVD EWGVRMEGCR PVGSSIERGG DTNGPAAVYS IVKYLDWMKK YAPPQAQGMT FSESGPVPAQ GNVAQQMFWY TAFTADMVKP GLAVMNADGT PKWRMAPSPH GAYWKDGMKL GYQDVGSGTL LKSTPPDRRK AAWLYLQFIT SKTVSLKKSH VGLTFIRESD IWDKSFTERA PKLGGLIEFY RSPARTQWSP TGNNIPDYPK LAQLWWQNIG DAASGAKTAQ AAMDSLAAAQ DSVLERLERS KVQGDCGPKL NKKETAEFWY KKSEKDGNIA PQRKLANEKP KGETIDYDTL IKSWPASPPK RASLN
|
| |
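The gene and protein sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The sketch below is one possible way to do that, and to compute a GC fraction and a translation, using only the Python standard library. The helper names (clean, gc_fraction, translate) are hypothetical. The codon assignments are those of the standard genetic code, which translation table 11 shares; the alternative start codons that table 11 permits are not handled, which does not matter here because the gene begins with ATG. Only the first three and last two blocks of the gene sequence are used as input.

```python
from itertools import product

# Standard codon assignments in TCAG order; translation table 11 uses the
# same codon-to-residue mapping ("*" marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing used in the record and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks and last two blocks of the gene sequence above.
head = "ATGCAACACG CAATCGGACA GCGCCGTCGA"
tail = "CGCGCCTCGC TGAACTGA"

print(translate(head))  # MQHAIGQRRR
print(translate(tail))  # RASLN
```

The two outputs match the first ten and last five residues of the protein sequence in the record. Passing the full gene sequence to gc_fraction would allow the reported 62% GC content to be checked in the same way.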