Gene Information |
Locus tag | RPB_2847 |
Symbol | |
ID | 3910640 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | + |
Start bp | 3240518 |
End bp | 3242389 |
Gene Length | 1872 bp |
Protein Length | 623 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 637884747 |
Product | extracellular solute-binding protein |
Protein accession | YP_486460 |
Protein GI | 86749964 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG4166] ABC-type oligopeptide transport system, periplasmic component |
TIGRFAM ID | |
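The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based and inclusive and that the CDS ends in a stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 3240518
end_bp = 3242389
gene_length_bp = 1872
protein_length_aa = 623

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the CDS includes one terminal stop codon that is not translated.
codons = span // 3
expected_protein_length = codons - 1

coordinates_match = span == gene_length_bp
whole_codons = span % 3 == 0
protein_matches = expected_protein_length == protein_length_aa

print(coordinates_match, whole_codons, protein_matches)
```

Under these assumptions all three checks agree with the listed Gene Length and Protein Length.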
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | GTGGACCGAA CCGAATCAGA TCGCCGATTC GCGCCCCGCG CGGGCCTGCT CGCAGGCGCG CTGGCGATCG CCACCGCCGT CGGCCTTGCC ATCCCGGTCA ACCACGCGGC GATGGCCGGA ACGGAGCTCG CGGCGAAGCC GGCCTATGCG TTGGCGATGC ATGGCGATCC CGCCCTGCCC GCCGAGTTCA AGGCAATGCC CTATGCCGAC CCCGACGCGC CCAAAGGCGG CCGGCTGGTG GAGGGCCTGC TCGGCACTTT CGACAGCCTC AATCCGTTCA TCGTCAAGGG CATCGCCGTG CAGCGGATGC GCGGCTATGT GGTCGAAAGC CTGATGGCGC GCGGCAATGA CGAGCCGTTC ACCCTGTACG GCCTGCTGGC GCAATCGATC GAGACCAACG ACGCCCGCAG CTACGTCACC TTCCGCATCG ATCCGCGAGC CCGGTTCTCC GACGGCAAGC CGGTGCAGGC CGAAGACGTG CTGTTCTCCT GGCAATTGCT GCGCGACAAG GGGCGCCCCA ATCATCGGCA GTACTACGCC AAGGTGGCGC GCGCGAAGGC CCTTGATCCC CTCACGATCC GCTTCGATTT CGACGATGTG CAGGATCGCG AACTGCCGCT GATCCTCGGC TTGATGCCGG TCTTTCCGAA GCACGCCGTC AATGCCGACA CGTTCGAGGA GACATCGCTG TCGCCGCCGA TCGGATCAGG TCCCTATCGC GTCACCGAGG TGAAGGCCGG CGCCAGCGTG ACGCTGACGC GCAACCCGGA CTATTGGGGC CGCGATCTCC CGGTCAATCG CGGCTTGTGG AACTTCGACG AAATCCGGAT CGATTATTTC CGGGAGGCAA ATTCGCATTT CGAAGCCTTC AAGCGCGGGC TTTACGACTA TCGCGTCGAG ACCGAGCCGC TACGCTGGCA CGATGGCTAC GACTTTCCGG CCGCGAAGAG CGGCAATGTG ATCCGCGACG CCTTCAGGAC CGGGATGCCG CAGCCGAGCG AGGTGCTGGT CTTCAACACG CGGCGGCCGG TGTTCGCCGA TATCCGCGTC CGCGAGGCGC TGCTGCAACT GTTCGATTTC GCCTGGATCA ACCGCAACTA CTTCTTCGAT TTGTACGCCC GCGCCGGCGG CTTCTTCGCC GGCTCGGAGC TTTCCGCCTA TGGCCGACCG GCAGGCCCCG GCGAATTGAA GCTGCTGGCG CCGTATCTGT CGCGGCTGCG GCCCGAGTTC CTCGATGGCA GCTACCGCCT GCCTGTCAGC GACGCGTCGG GCCGCGACCG CACCACGCTG CGACGCGCCC TGTCGCAACT GGCGGAGGCG GGGTACGAAC TCGACGGCAC GGTGCTGCGC CGACGCGACA ACCACCAGCC GCTGACATTC GAAATTCTGG TGACCACGCG GGATCAGGAG CGCATCGCAC TGGCGTTCGC GCGCGACGTC AAACGGGTCG GTGTTCAGGC CTCGGTCCGC GTGGTCGACG CGGTGCAGTT CGACCAGCGC CGGATCGCGT TCGATTTCGA TATGATCCCG AACCGCTGGG ACCAGTCGCT GTCTCCGGGC AACGAACAGT ATTTTTATTG GGGCGCGGAG GCCGCCGACA CCCAGGGCAC CCGCAACTAC ATGGGCGTCA AGGATCCCGC GGTCGACGCC ATGATCGCGG CGATGATCGG CGCGCGGGAG CATCCGCAAT TCGTCGATTC GGTGCGCGCG CTCGACCGGG TTCTGACCTC GGGTGTCTAC GTGATCCCGC TCTACAACAT TCAGGAACAA TGGATCGCAA GATGGAATCG GATAGAACGG 
CCGAAAGCAA ATGCCCTGAC CGGCTACCTG CCCGAAACCT GGTGGGCGAG GCCATCGACG CAGCAAAGGT GA
Protein sequence | MDRTESDRRF APRAGLLAGA LAIATAVGLA IPVNHAAMAG TELAAKPAYA LAMHGDPALP AEFKAMPYAD PDAPKGGRLV EGLLGTFDSL NPFIVKGIAV QRMRGYVVES LMARGNDEPF TLYGLLAQSI ETNDARSYVT FRIDPRARFS DGKPVQAEDV LFSWQLLRDK GRPNHRQYYA KVARAKALDP LTIRFDFDDV QDRELPLILG LMPVFPKHAV NADTFEETSL SPPIGSGPYR VTEVKAGASV TLTRNPDYWG RDLPVNRGLW NFDEIRIDYF REANSHFEAF KRGLYDYRVE TEPLRWHDGY DFPAAKSGNV IRDAFRTGMP QPSEVLVFNT RRPVFADIRV REALLQLFDF AWINRNYFFD LYARAGGFFA GSELSAYGRP AGPGELKLLA PYLSRLRPEF LDGSYRLPVS DASGRDRTTL RRALSQLAEA GYELDGTVLR RRDNHQPLTF EILVTTRDQE RIALAFARDV KRVGVQASVR VVDAVQFDQR RIAFDFDMIP NRWDQSLSPG NEQYFYWGAE AADTQGTRNY MGVKDPAVDA MIAAMIGARE HPQFVDSVRA LDRVLTSGVY VIPLYNIQEQ WIARWNRIER PKANALTGYL PETWWARPST QQR
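The sequences in this record are displayed in blocks of ten characters, and the gene sequence wraps onto a second line. The following is a minimal sketch of how the displayed text could be joined and inspected. The helpers `normalize_sequence` and `gc_percent` are hypothetical names introduced for illustration, and only short fragments copied from the gene sequence above are used, not the full sequence.

```python
def normalize_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in an already normalized sequence."""
    if not seq:
        return 0.0
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First three blocks of the gene sequence.
head = normalize_sequence("GTGGACCGAA CCGAATCAGA TCGCCGATTC")

# The two blocks on either side of the line wrap in the gene sequence.
wrapped = normalize_sequence("GATAGAACGG \nCCGAAAGCAA")

# Final two blocks of the gene sequence.
tail = normalize_sequence("CAGCAAAGGT GA")

start_codon = head[:3]   # GTG in this record
stop_codon = tail[-3:]   # TGA in this record

print(start_codon, stop_codon, len(wrapped))
```

The gene begins with GTG rather than ATG. Translation table 11 permits GTG as an initiator codon, read as methionine at the start of a protein, which is consistent with the protein sequence beginning with M. Applying `gc_percent` to the full normalized gene sequence would give a value to compare against the listed GC content.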