Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPC_1218 |
Symbol | |
ID | 3969095 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB18 |
Kingdom | Bacteria |
Replicon accession | NC_007925 |
Strand | - |
Start bp | 1332450 |
End bp | 1334117 |
Gene Length | 1668 bp |
Protein Length | 555 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 637924329 |
Product | extracellular solute-binding protein |
Protein accession | YP_531100 |
Protein GI | 90422730 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.0270957 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGTATGA TTGGTTTGGT GGCTCGGCGC ACAGCCGCAG CCGTGTTCGT GCTGACGACC GCGAGTGCGA TCGCGCTGCC GCAATCTGCC GGCGCGGAAG CCGTGCTGCG CATCGGCATG ACAGCCGCAG ACGTGCCGCG CACGCTCGGC CAACCCGATC AGGGCTTCGA AGGCAATCGC TTCACTGGCC TCACCATGTA CGACGCGCTG ACGATGTGGG ACCTGTCGTC TGCAACCAAA GCCAGCGTGG TGATCCCGGG GCTTGCCACC GAATGGGCGG TGAATGAGAG CGATAAGACC AAATGGACCT TCAAGCTGCG TCCCGGCGTC AGCTTTCATG ACGGCTCGCC GTTCAACGCC GATGCGGTGG TCTGGAATGT GGAGAAGGTG CTGAAGCAGG ACGCGCCGCA ATTCGACGCC AGCCAGGTCG GCGTCACCGC ATCGCGGATG CCGACATTGG TCTCGGCGAA GAAGATCGAC GACATGACGG TGGAACTGAC CACCAAGGAG CCGGACAGCT TCCTGCCGAT CAACCTCACC AACCTGTTCA TGGTCAGCCC GAGCAAGTGG CAGGCGTTGT ATGAGAAGGC CGAAGGCGCC GACGCCAAGG CGAAGTCGCA GGCCGCCTGG GCGTCGTTCG CCAAGGACGC CTCCGGCACC GGGCCGTGGA AGATGTCGAA GTTCACGCCG CGCGAACGGC TCGAACTGGT GAAGAACGAC AAATATTGGG ACGCCACACG CGTGCCGCAT GTCGACCGCC TAGTGCTGTT GCCGATGCCG GAAGCCAACG CCCGCACCGC GGCGCTGTTG TCCGGCCAGG TCGACTGGAT CGAGGCGCCC GCCCCCGACG CGGTCAAGGA AATCACCGCG CGCGGTTTCA AGATCGAGAA GAACGAGCAG CCGCACGTCT GGCCCTGGCA GTTCTCCCGC GTCGAAGGCT CGCCGTGGAA CGACATCCGG GTGCGCCGCG CCGCCAATCT GTGCATCGAT CGCGAAGGCT TGCGCGACGG CCTGCTCGCA GGATTGATGG TGCCGGCGAC CGGCACCTTC GAGCCCGGCC ATCCGTGGCG CGGCAAGCCG GCATTCCAGA TCAAATACGA TCTGCCGGCG GCACAGAAGC TGATGAAGGA AGCCGGCTAC GGCCCGACCA AGAAGCTCAG CGTCAAGGTG CAGACCTCGG CGTCGGGCTC CGGCCAGATG CTGCCGCTGC CGATGAACGA ATATCTGCAG CAGGCGCTCG CGGAGTGCTA CTTCGACGTC AAGCTCGACG TCATCGAGTG GAACACGCTG TTCACCAATT GGCGCCGCGG CACTAAGGAT CCCTCCGCCA ACGGCGCCAA CGCCACCAAC GTCACCTATG CGGCGATGGA CCCGTTCTTT GCGATGGTGC GCTTCCTGCA GTCGTCGATG GCGCCGCCGG TGTCGAACAA TTGGGGCTTC ATCAACAACC CGAAGTTCGA CGCGCTGGTG ACCAAGGCGC GCACCACCTT CGATGCCTCG CTGCGCGACG AAGCCTTGGC CGAACTGCAC GCCGCCTCGG TCGACGACGC CGCCTTCCTC TACGTCGCCC ACGACGTCGG CCCGCGCGCG CTGAGCCCGA AGGTCAAGGG CTTCGTGCAG CCGAAGAGCT GGTTCGTCGA CTTCTCGCCG GTGACGCTGG CGCCGTAA
|
Protein sequence | MRMIGLVARR TAAAVFVLTT ASAIALPQSA GAEAVLRIGM TAADVPRTLG QPDQGFEGNR FTGLTMYDAL TMWDLSSATK ASVVIPGLAT EWAVNESDKT KWTFKLRPGV SFHDGSPFNA DAVVWNVEKV LKQDAPQFDA SQVGVTASRM PTLVSAKKID DMTVELTTKE PDSFLPINLT NLFMVSPSKW QALYEKAEGA DAKAKSQAAW ASFAKDASGT GPWKMSKFTP RERLELVKND KYWDATRVPH VDRLVLLPMP EANARTAALL SGQVDWIEAP APDAVKEITA RGFKIEKNEQ PHVWPWQFSR VEGSPWNDIR VRRAANLCID REGLRDGLLA GLMVPATGTF EPGHPWRGKP AFQIKYDLPA AQKLMKEAGY GPTKKLSVKV QTSASGSGQM LPLPMNEYLQ QALAECYFDV KLDVIEWNTL FTNWRRGTKD PSANGANATN VTYAAMDPFF AMVRFLQSSM APPVSNNWGF INNPKFDALV TKARTTFDAS LRDEALAELH AASVDDAAFL YVAHDVGPRA LSPKVKGFVQ PKSWFVDFSP VTLAP
|
| |