Gene Information |
Locus tag | RPB_0132 |
Symbol | |
ID | 3908103 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | - |
Start bp | 142384 |
End bp | 143982 |
Gene Length | 1599 bp |
Protein Length | 532 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 637882014 |
Product | extracellular solute-binding protein |
Protein accession | YP_483755 |
Protein GI | 86747259 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
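The length fields in this record can be cross-checked against the coordinates. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive (the record does not state this explicitly) and that the final codon of the CDS is a stop codon. Variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 142384
end_bp = 143982
protein_length_aa = 532

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# A complete CDS is a whole number of codons; the last one is the stop codon
# and does not contribute an amino acid.
codons, remainder = divmod(gene_length_bp, 3)
expected_protein_aa = codons - 1

print(gene_length_bp, remainder, expected_protein_aa)
```

Under these assumptions the computed values agree with the Gene Length (1599 bp) and Protein Length (532 aa) fields.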
| |
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGTTGA CGAAGCGGTC CTTTGTTATC GGCTCCCTCG GAGGCCTGGC GATGCTCGGC CTGCCGGCCG ATCTGCGCGC GCAGCAGGCC GGCGGCGGCA CGTTGGTGAT CGGCTCGACG CAGGTGCCGC GGCACTTCAA CGGCGCGGTG CAGTCGGGCA TCGCCACCGC GCTGCCGAGC ACGCAGATTT TCGCCAGCCC GCTGCGCTTC GACGAGAACT GGAATCCGCA GCCGTATCTC GCCAAATCCT GGGAGGTCGC GCCCGACGGT CTGTCGATTA CCCTGAAGCT GGTCGACGAC GCGGTGTTTC ACGACGGCAA GCCGGTGACC TCCGAGGACG TCGCGTTCTC GATCATGACG ATCAAGGCCA ACCACCCGTT CAAGACCATG CTGGCCGCGG TCGACAAGGT GGAGACGCCC GATCCGAAGA CCGCGGTGAT CAAGCTGGCG CATCCGCATC CGGCGCTGCT GCTGGCGATG TCGCCGGCGC TGATGCCGAT CCTGCCGAAG CACGTCTACG GCGACGGCCA GGACGTCAAG GCGCATCCGG CCAACCTCAA GCCGATCGGT TCCGGCCCGT ACAAGCTCGC CGAATACAAG CAGGGCGAGT ACTACACGCT GGAGAAGTTC GACAAATTCT TCATCCCGGG CCGTCCGAAG CTCGACAAGA TCGTGGTGCG GCTGATCTCG GATCCGAACG CGCTGATGGT CTCGGCCGAG CGCGGCGAGG TCCACGCCGT GCCGTTCGTC ACCGGCGTGC GCGACATCGA CCGGCTGGAG AAGTCGAAGA ACCTCAAGGT CGTCGACAAG GGCTTCGCCG GTCTCGGCGC GCTGAACTGG CTCGCCTTCA ACACCAAGAA GAAGCCGCTC GACGACGTCC GCGTCCGCCA GGCGATCGCC TACGCGGCCA ATCGCGACTT CATCGTCAAC AAGCTGATGG GCGGCAAGGC GATGCCGTCG ACTGGGCCGA TCGCGCCGGG CTCGCCGTTC GAGGAGAAGA ACGTCCAGCT CTACAAGTTC GACGTCGCCA AGGCCAAAAA GCTGCTCGAC GAGGCCGGCC TCAAGCCGGA CGGCAACGGC GTCCGCGCCA CGCTGACGAT CGACTACCTC CCGGGCAGCG ACGAGCAGCA GCGCAACGTC GCCGAATACA TGCGCTCGGC GCTGAAGCGT GTCGGCCTGA ACCTCGAAGT CCGCGCCGCG CCCGACTTCC CGACCTGGGC GCAGCGGGTC TCGAACTTCG ACTTCGATCT GACCATGGAC TCGGTCTACA ATTGGGCCGA TCCGGTGATC GGCGTCGACC GGACCTATCT GACCTCGAAT ATCCGCAAGG GCATCATCTG GTCGAACACG CAGCAATATT CCAACCCGAA GGTCGACGAG ATCCTCGGCA AGGCCGCTGT GGAGACCTCG GCGGAGAAGC GCAAGGCGCT TTATTCGGAG TTCCAGAAGA TCGTCGTCGA CGAGGTGCCG GTGTTCTTCA TCAACGCCGT GCCGTTCCAC AACGCCTTCG CCAACGGCCT CGGCGGGCTG CCGACCACGA TCTGGGGCGT CGTCTCGCCG CTCGACGAAG TGCACTGGGT CACGCCGCCG AAGACCTGA
Protein sequence | MKLTKRSFVI GSLGGLAMLG LPADLRAQQA GGGTLVIGST QVPRHFNGAV QSGIATALPS TQIFASPLRF DENWNPQPYL AKSWEVAPDG LSITLKLVDD AVFHDGKPVT SEDVAFSIMT IKANHPFKTM LAAVDKVETP DPKTAVIKLA HPHPALLLAM SPALMPILPK HVYGDGQDVK AHPANLKPIG SGPYKLAEYK QGEYYTLEKF DKFFIPGRPK LDKIVVRLIS DPNALMVSAE RGEVHAVPFV TGVRDIDRLE KSKNLKVVDK GFAGLGALNW LAFNTKKKPL DDVRVRQAIA YAANRDFIVN KLMGGKAMPS TGPIAPGSPF EEKNVQLYKF DVAKAKKLLD EAGLKPDGNG VRATLTIDYL PGSDEQQRNV AEYMRSALKR VGLNLEVRAA PDFPTWAQRV SNFDFDLTMD SVYNWADPVI GVDRTYLTSN IRKGIIWSNT QQYSNPKVDE ILGKAAVETS AEKRKALYSE FQKIVVDEVP VFFINAVPFH NAFANGLGGL PTTIWGVVSP LDEVHWVTPP KT
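The relationship between the two sequences can be spot-checked by translating the ends of the gene sequence. This is a minimal sketch, not the database's own pipeline: it assumes the gene sequence is shown 5' to 3' on the coding strand (it begins with ATG and ends with TGA, although the gene lies on the minus strand of the replicon), and it uses the standard amino acid assignments, which translation table 11 shares with table 1 (the tables differ in permitted start codons). The `translate` helper and the `head`/`tail` names are hypothetical.

```python
import itertools

# Standard codon assignments in NCBI order (bases T, C, A, G); "*" marks a stop.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: aa
    for (a, b, c), aa in zip(itertools.product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the 10-base block spacing
    protein = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 and last 15 bases of the gene sequence above (both in frame).
head = "ATGAAGTTGA CGAAGCGGTC CTTTGTTATC"
tail = "CCGCCG AAGACCTGA"

print(translate(head))  # compare with the start of the protein sequence
print(translate(tail))  # compare with the end of the protein sequence
```

Passing the full gene sequence to `translate` in the same way would allow a complete comparison against the protein sequence field.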