Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_3402 |
Symbol | |
ID | 3911204 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | + |
Start bp | 3888501 |
End bp | 3889418 |
Gene Length | 918 bp |
Protein Length | 305 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 637885305 |
Product | periplasmic solute binding protein |
Protein accession | YP_487009 |
Protein GI | 86750513 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0803] ABC-type metal ion transport system, periplasmic component/surface adhesin |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.832439 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGCCGA AAGCCGACCA TCGACGCCTG GCCTGGACCG CCTTCGCGGG CGCCCTCGCC TGCCTGCTGC TGGCGGGCCC CTCCCTGGCG GAAACGCGCA AGCCGTTCCG GGTGGTGACG ACCTTCACCA TCATCCAGGA CATCGCGCAG AACGTCGCCG GCGACAGGGC CGTGGTGGAA TCGATCACCA AGCCGGGGGC TGAGATCCAC GACTACCAGC CGACACCGCG CGACATCGTC CGCGCGCAGT CGGCAGACCT CGTGCTGTGG AACGGATTCA ACCTCGAACG CTGGTTCGAG CGGTTCTTCC AGAGCGTCAA ACAGGTGCCG AGCGTGGTCG TCACCGAAGG CATCGCGCCG ATGGGGATCG CCGACGGGCC CTATGCCGGC AAGCCGAACC CGCACGCCTG GATGTCGCCG TCGAACGCGC TGATCTATGT CGAGAACATC CGCAAGGCGC TGGTCACCTA CGATCCGTCC AACAAGGACG CCTACGATCG CAACGCCGCC GACTATTCCG CGCGGATCAG GGCGCTCGAC GAGCCGCTGC GCAAGCGGCT CGCCGAAATT CCGAAGGACA AGCGCTGGCT GGTGTCGAGC GAAGGCGCTT TCAGCTATCT TGCGCGCGAC TACGAGATGA AGGAAGCTTT CCTCTGGCCG ATCAACGCCG ACGAACAGGG CACGCCGCAG CAGGTGCGCA AGGTGATCGA CCTGGTGCGC AACAACGACA TCCCGGTGGT GTTCAGCGAA AGCACGATCT CGGACCGCGC CGCCAAGCAG GTGGCCCGCG AGGCCGGGGC ACGCTACGGT GGGGTGCTCT ATGTCGACTC GCTCAGCGCC GCGGGCGGCC CAGTCCCGAC CTATCTGGAC CTGCTGAAGG TCACAGTGGA GACCATCGCG AAAGGGTTTG GCGCTTGA
|
Protein sequence | MKPKADHRRL AWTAFAGALA CLLLAGPSLA ETRKPFRVVT TFTIIQDIAQ NVAGDRAVVE SITKPGAEIH DYQPTPRDIV RAQSADLVLW NGFNLERWFE RFFQSVKQVP SVVVTEGIAP MGIADGPYAG KPNPHAWMSP SNALIYVENI RKALVTYDPS NKDAYDRNAA DYSARIRALD EPLRKRLAEI PKDKRWLVSS EGAFSYLARD YEMKEAFLWP INADEQGTPQ QVRKVIDLVR NNDIPVVFSE STISDRAAKQ VAREAGARYG GVLYVDSLSA AGGPVPTYLD LLKVTVETIA KGFGA
|
| |