Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPC_2613 |
Symbol | |
ID | 3973292 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB18 |
Kingdom | Bacteria |
Replicon accession | NC_007925 |
Strand | - |
Start bp | 2846521 |
End bp | 2848335 |
Gene Length | 1815 bp |
Protein Length | 604 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 637925724 |
Product | extracellular solute-binding protein |
Protein accession | YP_532482 |
Protein GI | 90424112 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG4166] ABC-type oligopeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.674629 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.538496 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCCGGC TCGCGGCGGC TGGCACGGCC GTGATCATCG GATCCGTGGC GGCTTCGGCC GCCGATCCTG CCCCTGCAGT TTCTAGCCAT GGCATCGCCA TGCACGGCGC GCCGGCGCTG CCCGCCGATT ACGTCCAGAT GCCCTACGTC AATCCGCAGG CTCCGAAGGG CGGCAGGCTG ATCCTGGGCC TGCTCGGCGC CTTCGACAGT CTCAATCCGC TGATCGTCAA AGGTCTCGCC GTGCAGCAGA TCCGCGGCTA CGTCATCGAA AGCCTGATGG CACGGGGCAA TGACGAGGCC TTCACGCTGT ACGGCCTGTT GGCGCAGAGC GTCGAGACCG ACGACGCCCG CAGCTACGTC ACCTTTCGAA TCGACCCGCG GGCGCGGTTC GCCGACGACC ATCCGCTGCG CGCCGACGAC GTGATCTTCT CCTGGCAATT GCTGCGCGAC AAGGGCCGGC CCAATCACCG GATGTACTAC GCCAAGGTCG CCAAGGCGGA AGCGCTCGAT TCCCTCACCG TGCGATTCGA TTTCGGCGGC AGCAGCGACC GCGAACTGCC GTTGATTCTC GGGCTGATGC CGGTGCTGCC GAAGCACGCG GTCAATGTCG AGACCTTCGA AGAAACCTCG CTCGCCGCCC CGCTCGGCTC CGGTCCCTAT CGCGTCAGCG CGGTTCGACC GGGCGCCAGC GTGACGTTGA CCCGCAATCC GAACTATTGG GGCCGCGATC TGCCGGTCAA TCGCGGGCTG TGGAATTTCG ACGAGGTCCG GCTCGACTAC TATCGCGAGG CCAACGGGCT ATTCGAAGCC TTCAAGCGCG GGCTGTACGA TGCCCGGGTG GAGAACGAGC CGCTGCGCTG GCACGAGGGC TATGACTTCC CGGCGGCGCG CCATGGCGAG GTGATCCGCG ACACGATCAA GACCGGACTG CCGCAGCCTT CGGAATATCT GGTGTTCAAC ACCCGGCGCC CGGTGTTCGC CGATATCAGG GTCCGCGAGG CGCTGACGCT GCTGTTCGAT TTCCAGTGGA TCAATCGCAA CTACTTCTAT GGCCTGTATG CCCGTTCCGC CGGCTACTTC GCCGGCTCCG AACTGTCGGC CTATGCCCGC CCCGCCGGTC CCCGCGAGCG CGAATTGCTT GGGCCGCTGC TTGACCGGCT GCCTGCGGCG ATCATCGACG GCAGCTATCG GCTGCCCGCC GGCGACGGCT CCGGCCGCGA CCGCGAAGCG CTGCGACAAG CGCTGGCGCT GCTGGCGCAA GCCGGCTTTG AGCTCGACGG CACCGCCTTG CGGCAGCGCT CGACCGGCCA GCAACTGAGC TTCGAGATGC TGGTGACGAC CCGCGATCAA GAGCGCATCG CGCTGGCTTT CGTCCGCGAT CTCAAGCGCG CGGGTATTCA AGCCAACGTC CGCGCGGTCG ACGGCGTGCA GTTCGACCAG CGCCGCCTGG CCTTCGATTT CGACATGATC CAGAACCGCT GGGACCAATC GCTGTCGCCG GGTAATGAAC AGTCGTTCTA CTGGGGGAGT GCGGCGGCGG ACAATCAGGG CACCCGCAAT TACATGGGCG CCAAGGACCC GGCGATCGAC GCGATGATCG CGGCCTTGCT GGAAGCGCGC GATCGCCCCG ACTTCGTGTC CGCAGTGCGG GCCTTAGACC GCGTCCTGAT GGCGAGGTTC TACGCAATTC CAGCGTATAA CGTGCAAGAA CAATGGATCG CCCGCTGGAA TCGGATAGAA CGGCCTAGGG CCAACGCCTT GTCCGGCTAT CTGCCCGAAA CCTGGTGGCA CGCGCCACCG ACGTTGCAAA GGTGA
|
Protein sequence | MSRLAAAGTA VIIGSVAASA ADPAPAVSSH GIAMHGAPAL PADYVQMPYV NPQAPKGGRL ILGLLGAFDS LNPLIVKGLA VQQIRGYVIE SLMARGNDEA FTLYGLLAQS VETDDARSYV TFRIDPRARF ADDHPLRADD VIFSWQLLRD KGRPNHRMYY AKVAKAEALD SLTVRFDFGG SSDRELPLIL GLMPVLPKHA VNVETFEETS LAAPLGSGPY RVSAVRPGAS VTLTRNPNYW GRDLPVNRGL WNFDEVRLDY YREANGLFEA FKRGLYDARV ENEPLRWHEG YDFPAARHGE VIRDTIKTGL PQPSEYLVFN TRRPVFADIR VREALTLLFD FQWINRNYFY GLYARSAGYF AGSELSAYAR PAGPRERELL GPLLDRLPAA IIDGSYRLPA GDGSGRDREA LRQALALLAQ AGFELDGTAL RQRSTGQQLS FEMLVTTRDQ ERIALAFVRD LKRAGIQANV RAVDGVQFDQ RRLAFDFDMI QNRWDQSLSP GNEQSFYWGS AAADNQGTRN YMGAKDPAID AMIAALLEAR DRPDFVSAVR ALDRVLMARF YAIPAYNVQE QWIARWNRIE RPRANALSGY LPETWWHAPP TLQR
|
| |