Gene Information |
Locus tag | RPC_0404 |
Symbol | |
ID | 3970856 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB18 |
Kingdom | Bacteria |
Replicon accession | NC_007925 |
Strand | + |
Start bp | 437561 |
End bp | 438877 |
Gene Length | 1317 bp |
Protein Length | 438 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 637923519 |
Product | extracellular solute-binding protein |
Protein accession | YP_530298 |
Protein GI | 90421928 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
|
|
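The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based inclusive coordinates and that the terminal stop codon is not counted in the protein length. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 437561
end_bp = 438877

# Assumption: coordinates are 1-based and inclusive, so add 1.
gene_length = end_bp - start_bp + 1

# A CDS is read in triplets; the final codon is the stop codon,
# which is assumed not to contribute an amino acid.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(f"Gene length: {gene_length} bp")
print(f"Protein length: {protein_length} aa")
```

Under these assumptions the computed values agree with the Gene Length (1317 bp) and Protein Length (438 aa) fields.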
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCACTTC GACATGTCGG CCTGGCGGCG ACGCTTTCGC TTGCGTTGGG ACTCGCTTCG CCTGCGCTTG CTGCGACCGA GATCCAGTGG TGGCACGCCA TGACCGGCGC CAACAACGAC GTCATCGTCA AGCTCGCCGA AGAGTTCAAC GCCGCGCAGA CCGACTACAA GGTGGTGCCG TCCTATAAGG GCAGCTATCC CGACACCATG AACGCCGGCA TCGCGGCGTT CCGCGCCGGC AACGCGCCGC ATATCATCCA GGTGTTCGAG GTCGGCACCG CCACCATGAT GGCAGCGACC GGGGCGGTGA AGCCGGTCTA CAAGTTGATG GCGGAGGCCG GAGAGAAATT CGACTCGCAG GCCTATCTGC CGGCGATCAC CGGCTACTAC TCGACCTCGA AGGGTGAGAT GCTGTCGTTC CCGTTCAACT CGTCCTCGAT GGTGATGTGG ATCAACAAGG ACGCCTTGAA GAAGGCCAAC ATCGCCGAGA TCCCGAAGAC CTGGCCGGAG GTGTTCGAAG ACGCCAAGAA ATTGAAGGCG GCGGGCTACG CCACCTGCGG CTTCTCCACC GCCTGGGTGA CCTGGGCCAA TCTCGAGCAA TTGTCGGCCT GGCACAACGT GCCGCTGGCG AGCCGGGCCA ACGGCCTCGA CGGCTTCGAC ACCAAGCTCG AATTCAACGG CCCGCTGCAG ATCAAGCATC TGGAGACGCT GATCGCGCTG CAGAAGGACA AGACCTACGA TTATTCCGGC CGCACCAACA CTGGAGAAGG CCGTTTCACC TCCGGCGAAT GCCCGATCTT CCTGAGTTCC TCGGGCTTCT TCGGCCAGGT CAAAGGCAAC GCCAAGTTCG ATTGGACCAA CGCGCCGATG CCGTATTATC CGGACGTTCA AGGCGCGCCG CAGAACTCGA TCATCGGCGG CGCCTCGCTG TGGGTGATGG GCGGCAAGTC GCCGGCGGAA TACAAGGGCG TCGCCAAGTT CCTCAGCTTC CTGTCCGACA CCGACCGTCA GGTCGCGATC CACAAGGCCT CTGGCTATCT GCCGATCACC AAGGCGGCCT ACGCCAAGGC CCAGGAGGAA GGCTTTTACG TCAACGCGCC GTATCTGGAG ACGCCGCTCA GGGAATTGAC CAACAAACCG CCGACCGAAA ACTCCCGCGG ACTGCGGCTC GGCAACATGG TGCAGCTGCG CGACATCTGG GCGGAAGAAA TCGAATCCGC GCTGGCCGGC AAGAAGACCG CCAAGGACGC GCTCGACACC GCAGTGACCC GCGGCAACGC CATGCTGCGG CAGTTCGAAC GCACGGTGAG CAAGTAG
|
Protein sequence | MALRHVGLAA TLSLALGLAS PALAATEIQW WHAMTGANND VIVKLAEEFN AAQTDYKVVP SYKGSYPDTM NAGIAAFRAG NAPHIIQVFE VGTATMMAAT GAVKPVYKLM AEAGEKFDSQ AYLPAITGYY STSKGEMLSF PFNSSSMVMW INKDALKKAN IAEIPKTWPE VFEDAKKLKA AGYATCGFST AWVTWANLEQ LSAWHNVPLA SRANGLDGFD TKLEFNGPLQ IKHLETLIAL QKDKTYDYSG RTNTGEGRFT SGECPIFLSS SGFFGQVKGN AKFDWTNAPM PYYPDVQGAP QNSIIGGASL WVMGGKSPAE YKGVAKFLSF LSDTDRQVAI HKASGYLPIT KAAYAKAQEE GFYVNAPYLE TPLRELTNKP PTENSRGLRL GNMVQLRDIW AEEIESALAG KKTAKDALDT AVTRGNAMLR QFERTVSK
|
| |
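The relationship between the Gene sequence and Protein sequence fields can be illustrated by translating the first and last blocks of the gene. This is a sketch only: the helper `translate` and the constant names are hypothetical, and the codon table is built by hand rather than taken from a library. Translation table 11 uses the same codon-to-amino-acid assignments as the standard code (it differs in permitted start codons), so the standard assignments are used here.

```python
# Build the codon table in the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"  # codons starting with T
    "LLLLPPPPHHQQRRRR"  # codons starting with C
    "IIIMTTTTNNKKSSRR"  # codons starting with A
    "VVVVAAAADDEEGGGG"  # codons starting with G
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    # The record prints the sequence in space-separated blocks of 10.
    dna = dna.replace(" ", "").upper()
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":  # stop codon: end of the protein
            break
        protein.append(residue)
    return "".join(protein)


# First 30 and last 27 nucleotides, copied from the Gene sequence field.
GENE_PREFIX = "ATGGCACTTC GACATGTCGG CCTGGCGGCG"
GENE_SUFFIX = "CAGTTCGAAC GCACGGTGAG CAAGTAG"

print(translate(GENE_PREFIX))
print(translate(GENE_SUFFIX))
```

The two printed fragments correspond to the first block (MALRHVGLAA) and the last block (QFERTVSK) of the Protein sequence field. The suffix is in frame because the 1290 nucleotides preceding it are a multiple of three.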