Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPC_3728 |
Symbol | |
ID | 3971473 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB18 |
Kingdom | Bacteria |
Replicon accession | NC_007925 |
Strand | - |
Start bp | 4148081 |
End bp | 4149946 |
Gene Length | 1866 bp |
Protein Length | 621 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 637926838 |
Product | extracellular solute-binding protein |
Protein accession | YP_533582 |
Protein GI | 90425212 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG4166] ABC-type oligopeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTGAGC TCAATCGCCG GCATCTGCTC GGTCTCGGTG TCGGTGCGGT GGCCGCCGCC TCGTTGCGGC CGGCGCTTGC GGCCGAGGGC GGCGAGATCG AGGCGCACGG CATTTCGGCG TTCGGCGATC TGAAATATCC GGCGGATTTC CATCATTTCG ACTACGTCAA TGTCGACGCG CCGAAGGGCG GGGTGTTTTC GACCAATCCG TCCTACCGGT CCTTCAACCA GTCGTTCCTG ACCTTCAACT CGCTCAACGC CTTCATCTTC AAGGGCGACG GCGCGCAAGG CATGGGCCTG ACTTTTGCGC CGCTGATGGC GCGCGCCGGC GACGAGCCCG ACGCGATGTA CGGCCTGGTT GCCAAATCGG TGAAGATCTC CGCCGATGGG CTCGGCTATC GCTTCACGCT GCGGCCGGAG GCGCGGTTTC ATGATGGCTC GAAGCTCACC GCCCACGACG CGGCGTTTTC GCTGACCGTG CTGAAGACCA AGGGGCATCC CTTGATCACC CAGCAGATGC GCGACGTGGT CTCGGCGGAA GCGCTCGACG ACTTAACGCT GCTGGTGAGG TTTTCGGCCA AGCGCGGCCG CGACGTGCCG CTGTTCGTGG CGGGGCTGCC GATCTTTTCG CAGGCCTATT ACGCCAAACA TCCGTTCGAT GAGTCGACGC TGGAGGCGCC GCTCGGCTCT GGCCCCTACA AGGTCGGCAA GTTCGAAGTC GGCCGCTACA TCGAATTCGA GCGGCTGCAG GACTGGTGGG GCGCCGAGCT GCCGGTCAAT CGCGGAGCCA ACAATTTCGA CGTGGTGCGT TACGACTTCT ATCGCGACCG CGACGTCGCC TTCGAGGGCT TCACCGGCCG CAGCTATCTG TATCGCGAGG AGTTCACCTC GCGGGTCTGG AACACGCGCT ATGATTTTCC GGCGATCCTC GACGGCCGGG TGAAGCGCGA AACCCTGCCG GATGAAACGC CCTCCGGGGC GCAGGGCTGG TTTCCCAACA CCCGCCGCGA CAAGTTCAAG GACCCGCGGG TGCGCGAGGC GCTGGGCTGC GCGTTCGATT TCGAATGGAC CAACAAGACC CTGATGTACG GCGCCTATCT CCGCACGGTA TCGCCGTTCC AGAACTCCGA CCTGATGGCC AACGGTCCGC CGTCGCCGGA AGAAGTGGCA TTGCTAGAGC GCTTCCGCGG CCAGGTGCCG GAGGAGGTGT TCGGCGCGCC CTATGTGCCG CCGGTGTCCG ATGGCTCCGG GCAGGACCGC GCGCTGTTGA AGAAGGCGGT GCAACTGCTG CAGGACGCCG GCTGCGTGAT CAAGAACGGC AAGCGGATGA CGCCGCAGGG CGAACCGTTC ACGATCGAGT TTCTGCTCGA CGAGCCGACC TTTCAGCCGC ACCACATGCC GTTCATCAAG AATCTCGCCA CGCTCGGCAT CGAGGCGTCG CTGCGCATGG TCGATGCCGT GCAGCATCGC GCCCGGCGCG ACGATTTCGA TTTCGACCTC ATCATCGAGC GCTTCGGCTT CTCGACGGTG CCGGGCGACT CGCTGCGGCC GTTCTTCTCG TCGCGCGCGG CGGCCACCAA GGGCTCGAGC AATCTCGCCG GGATCGCCGA TCCGGTGGTC GATGCGCTGG TCGAAGACGT CATCGCCGCC GACACCAGGG TCAAGCTGGT GGTCGCCGCG CGCGCGCTCG ACCGCGTGGT CCGCGCCGGC CGCTATTGGG TGCCGCAATG GTATTCGGGC TCGCATCGGG TGGCCTATTG GGACGTGTTC GGCCATCCGG CGAAACTGCC GAAATATCTC GGCGTCGCAG CACCCGATCT GTGGTGGTCG ACCGTGAAGT CCGCAGCGAC CGAACAGGCG AAATAG
|
Protein sequence | MAELNRRHLL GLGVGAVAAA SLRPALAAEG GEIEAHGISA FGDLKYPADF HHFDYVNVDA PKGGVFSTNP SYRSFNQSFL TFNSLNAFIF KGDGAQGMGL TFAPLMARAG DEPDAMYGLV AKSVKISADG LGYRFTLRPE ARFHDGSKLT AHDAAFSLTV LKTKGHPLIT QQMRDVVSAE ALDDLTLLVR FSAKRGRDVP LFVAGLPIFS QAYYAKHPFD ESTLEAPLGS GPYKVGKFEV GRYIEFERLQ DWWGAELPVN RGANNFDVVR YDFYRDRDVA FEGFTGRSYL YREEFTSRVW NTRYDFPAIL DGRVKRETLP DETPSGAQGW FPNTRRDKFK DPRVREALGC AFDFEWTNKT LMYGAYLRTV SPFQNSDLMA NGPPSPEEVA LLERFRGQVP EEVFGAPYVP PVSDGSGQDR ALLKKAVQLL QDAGCVIKNG KRMTPQGEPF TIEFLLDEPT FQPHHMPFIK NLATLGIEAS LRMVDAVQHR ARRDDFDFDL IIERFGFSTV PGDSLRPFFS SRAAATKGSS NLAGIADPVV DALVEDVIAA DTRVKLVVAA RALDRVVRAG RYWVPQWYSG SHRVAYWDVF GHPAKLPKYL GVAAPDLWWS TVKSAATEQA K
|
| |