Gene Information |
Locus tag | RPC_1219 |
Symbol | |
ID | 3969096 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB18 |
Kingdom | Bacteria |
Replicon accession | NC_007925 |
Strand | + |
Start bp | 1334300 |
End bp | 1335895 |
Gene Length | 1596 bp |
Protein Length | 531 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 637924330 |
Product | extracellular solute-binding protein |
Protein accession | YP_531101 |
Protein GI | 90422731 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
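
The coordinate and length fields in this record agree with each other, which can be checked with a little arithmetic. A minimal sketch in Python, assuming that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length (the record states neither convention explicitly); the variable names are illustrative only:

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1334300
end_bp = 1335895
gene_length_bp = 1596
protein_length_aa = 531

# Assumption: coordinates are 1-based and inclusive, so length = end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is excluded from the protein length.
codons = span_bp // 3
expected_protein_aa = codons - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("span is a whole number of codons:", span_bp % 3 == 0)
print("codons minus stop matches protein length:", expected_protein_aa == protein_length_aa)
```

All three checks hold for the values listed here: 1596 bp is 532 codons, and 532 codons minus one stop codon gives 531 aa.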
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.0111682 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCGTTC GAAAATCTCT CGTCGTCGCT GCCATCGTGG CGATCGGCGC AATTAGTTCT GCGCGCGCCG AGTCGGTGGT GCGATACGGC ATATCGATGG CCGATATTCC GCTGACCACC GGCCAGCCGG ATCGTGGCGC CGGCGCTTAC CAGTTCACCG GGTACACCAT CTACGATCCG TTGGTGGCCT GGGAGATGAA TGTGGCGGAC CGCCCGGGAA AGCTGGTGCC GGGGCTCGCG ACATCATGGA AAGTCGATGA TGGCGATAAG AAAAAGTGGC GCTTCACTTT GCGCAAGGGC GTCAAATTTC ACGACGGAAG TGAATTCAAT GCCGATGCGG TGGTCTGGAA TTTAGACAAG GTGCTGAACG ACAAGGCACC GCAGTTCGAC AAGCGGCAAA GCGCGCAGGT CAAGACCCGG CTGCCGTCGG TCGCCAGCTA CACCAAAATC GATGACGACA CCGTGGAAAT TACAACCAAG ACGATCGACT CCTTCTTCCC GTACCAGATG CTCTGGTTCT TGGTGTCGAG CCCGACGCAA TACGCCAAGC TCGGCAATGA CTGGGACAAG TTCGCCGCGC AGCCGTCCGG CACCGGGCCG TTCAAGCTGA CCAAGCTGGT GCCGCGCGAA CTCGCCGAAC TCACCCGCAA TGATGAGTAC TGGGACAAGG CGCGGCTGCC GAAGACCGAC AAGCTGGTGC TGGTGCCGAT GCCGGAAGCC TTAACCCGCA CCAACGCGCT GCTCGCCGGC CAGGTCGATC TGATCGAGAC CCCGGCGCCG GATGCCGTGC CGCAGTTGAA GTCGGCCGGC ATGAAGATCG TCGACAACGT CACCCCGCAC GTCTGGAATT ATCACCTCAG CGTGTTGCCG GGTTCGCCCT GGACCGACGT GCGGTTGCGC AAGGCGCTCA ATCTGGCGAT CGACCGCGAT GCCGTCGTCG GCCTAATGAA TGGCCTCGCC AAACCGGCGA TCGGCCAGGT CGATCCGTCG AGCCCGTGGT TCGGCAAGCC GACCTTCAAC ATCAAATACG ATCTCGCCGA AGCCAAGCGG CTGGTGAAGG AAGCCGGCTA TTCGCCGGAG AAGCCGTTGA AGGCGACCTT CATCATCGCC AATGGCGGCA CCGGCCAGAT GCTGTCGCTG CCGATGAACG AGTTTCTGCA GCAGAGCTTC AAGGAGATCG GCGTCGACGT CGAGTTCAAG GTGGTCGAAC TCGAAGTGCT GTACACCGCC TGGCGCAAGG GCGCGGCCGA CGACAGCATG AAGGGCATCA CCGCCAACAA CATCGCCTAT GTCACCTCCG ATCCGCTCTA CGCCATCGTG CGGTTCTTCC ACTCCGGGCA GATCGCGCCG GTCGGCGTCA ATTGGGGCGG TTACAAAAAT CCGAAGGTTG ACGCCTTGAT CGACGAAGCC AAGACCACCT TCGACCCGAC CAAGCAGGAC GAATTGCTGG CGCAGGCGCA TGGCTTGATC GTCGACGACG CGGTGCTGGT CTGGGTGGTG CACGACACCA ACCCGCACGC TCTGTCGCCT AAGGTGAAGA GCTTCGTGCA GGCGCAGCAC TGGTTCCAAG ACCTCACTAC GATCGGTCTG CAGTAA
|
Protein sequence | MTVRKSLVVA AIVAIGAISS ARAESVVRYG ISMADIPLTT GQPDRGAGAY QFTGYTIYDP LVAWEMNVAD RPGKLVPGLA TSWKVDDGDK KKWRFTLRKG VKFHDGSEFN ADAVVWNLDK VLNDKAPQFD KRQSAQVKTR LPSVASYTKI DDDTVEITTK TIDSFFPYQM LWFLVSSPTQ YAKLGNDWDK FAAQPSGTGP FKLTKLVPRE LAELTRNDEY WDKARLPKTD KLVLVPMPEA LTRTNALLAG QVDLIETPAP DAVPQLKSAG MKIVDNVTPH VWNYHLSVLP GSPWTDVRLR KALNLAIDRD AVVGLMNGLA KPAIGQVDPS SPWFGKPTFN IKYDLAEAKR LVKEAGYSPE KPLKATFIIA NGGTGQMLSL PMNEFLQQSF KEIGVDVEFK VVELEVLYTA WRKGAADDSM KGITANNIAY VTSDPLYAIV RFFHSGQIAP VGVNWGGYKN PKVDALIDEA KTTFDPTKQD ELLAQAHGLI VDDAVLVWVV HDTNPHALSP KVKSFVQAQH WFQDLTTIGL Q
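
The gene and protein sequences above can be cross-checked against each other and against the GC content field. The following is a rough sketch rather than the database's own method: it assumes the 10-character blocks are separated only by whitespace, and it uses the amino acid assignments of NCBI translation table 11, which are the same as those of the standard code (table 11 differs in its permitted start codons, which this sketch does not handle). The helper names `clean`, `gc_percent`, and `translate` are hypothetical.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G). Table 11 shares these
# amino acid assignments with the standard code; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-character blocks and uppercase."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# The first two blocks of the gene sequence from this record.
print(translate("ATGACCGTTC GAAAATCTCT"))  # MTVRKS
```

Pasting the full Gene sequence field into `translate` and comparing the result with the Protein sequence field (after passing it through `clean`) would confirm the whole record; rounding `gc_percent` of the same string can be compared with the listed GC content.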