Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPC_1200 |
Symbol | |
ID | 3969077 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB18 |
Kingdom | Bacteria |
Replicon accession | NC_007925 |
Strand | + |
Start bp | 1314483 |
End bp | 1316090 |
Gene Length | 1608 bp |
Protein Length | 535 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 637924311 |
Product | extracellular solute-binding protein |
Protein accession | YP_531082 |
Protein GI | 90422712 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCTGAAT GTCTGCGCGG CCCGCGTCGC TCGATGTCCT CATGGGGATT ACGCGGCGTT ATCGCATCTG TGGTGTTGGC GTTATCGCTA CCGGCCTCGG CGAAGACGCT GACCGCGGTG ATGCATTCCG ATCTGCGGGT GATCGATCCG GGGCTGACCA CCGCCTACAT CACCCGCGAC CACGGCTATA TGGTGTACGA CACGCTGCTG GCGATGGATG CCAATTTCAA AGTGCAGCCG CAGATGGCGG ACTGGAAGGT GTCCGACGAC AAATTGACCT ACACCTTCAC CTTGCGCGAC GGGTTGAAGT GGCACGACGG TGCGCCGGTC ACCGCGGAGG ATTGTGTCGC ATCGTTGCGG CGCTGGGGCT CCTCCGACGG CATGGGCCAG AAGCTGATGG ACTTCACCGC GTCGCTGGAG GCCACCGACG CCAAGACCAT CACTCTTAAA TTGAAGGAGC CCTACGGGCT GGTGCTGGAA TCGATCGGCA AGCCGTCGTC GCTGGTGCCG TTCATGATGC CGAAGCGCAT CGCCGAGACC CCGGCCGGCA AGGCGATCTC CGAGCAGATC GGCTCCGGGC CGTTCAAATT CGTCGCCGCC GAGTTCCAGC CCGGCGTCAA GGCGGTGTAT GTCAAGAACG CCGACTATGT GCCGCGCGCC GAGCCGCCGA GCTGGACCTC GGGCGGCAAA GTGGTGAAGG TCGATCGCGT CGAATGGCTG ACCATGGCCG ACGCGCAGAC CGCGGTGAAC GCGCTGCAGT CCGGCGACAT CGATTTTCTG GAGAATCCCT CGTTCGACAT CCTGCCGATG CTGCTGGCCG ACAGCGAGCT GACGGTGCAG ACGCTGAGCC CGCTGGGGTT TCAGACTCTG GGGCGGATGA ACTTCCTGTA TCCGCCGTTC GACAACATCA AGGTGCGGCG CGCCGCGTTC CTCGCCATGA GCCAGAAGCC GGTGCTCGAC GCGCTGGTCG GCAATCCGGA CTACTACAAG ATCTGCGGCG CGGTGTTCGG CTGCGACACG CCGCTCTCCA CCGACGTCGG CTCGGAGAGC CTGGTGAAGG GCAACGGCAT GGCGGAGGCC AAGAAGCTGT TGGCCGAGAG CGGCTACGAC GGCACCCCGA TCGCGCTGAT GGCGCCGACC GACGTCAACA CGTTGCGGGC GCAGCCGATC GTGGCGGCGC AATTGCTGCG CGACGCCGGC TTCAAGGTCG ACGTGCAGGC CACCGACTGG CAGACCGTGG TGACGCGGCG CGCCAGCCAG AAGCCGGTGA AGGACGGCGG CTGGAACATC TTCTTCACCA ACTGGGCCGG CCCGGAAATT CTCAACCCGA TCGCCAACGT CTCCACCAGC GGCAAGGGCA AGAGCGGCGG CTGGTTCGGC TGGCCGGACG ATCCCGCCAT GGAAACGCTG CGCGACAAGT TCGCCCGCGC CAAGGCGCTG GATGAGCAGA AGCAGCTCGC CGAAGCGATC CAGGCGCGGG TCTATGAGCA GGTGCTGTAT ATCCCGCTCG GTCAATACAA GGTGCCGAGC GCCTGGCGCA AATCGCTGTC CGGCGTGCTC AGCGGCCCGG CGACCCCGGT GTTCTGGAAT ATCGACAAGA AGGAGTAG
|
Protein sequence | MSECLRGPRR SMSSWGLRGV IASVVLALSL PASAKTLTAV MHSDLRVIDP GLTTAYITRD HGYMVYDTLL AMDANFKVQP QMADWKVSDD KLTYTFTLRD GLKWHDGAPV TAEDCVASLR RWGSSDGMGQ KLMDFTASLE ATDAKTITLK LKEPYGLVLE SIGKPSSLVP FMMPKRIAET PAGKAISEQI GSGPFKFVAA EFQPGVKAVY VKNADYVPRA EPPSWTSGGK VVKVDRVEWL TMADAQTAVN ALQSGDIDFL ENPSFDILPM LLADSELTVQ TLSPLGFQTL GRMNFLYPPF DNIKVRRAAF LAMSQKPVLD ALVGNPDYYK ICGAVFGCDT PLSTDVGSES LVKGNGMAEA KKLLAESGYD GTPIALMAPT DVNTLRAQPI VAAQLLRDAG FKVDVQATDW QTVVTRRASQ KPVKDGGWNI FFTNWAGPEI LNPIANVSTS GKGKSGGWFG WPDDPAMETL RDKFARAKAL DEQKQLAEAI QARVYEQVLY IPLGQYKVPS AWRKSLSGVL SGPATPVFWN IDKKE
|
| |