Gene Information |
Locus tag | RPB_3360 |
Symbol | |
ID | 3911162 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | - |
Start bp | 3842918 |
End bp | 3844033 |
Gene Length | 1116 bp |
Protein Length | 371 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 637885263 |
Product | extracellular solute-binding protein |
Protein accession | YP_486967 |
Protein GI | 86750471 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0687] Spermidine/putrescine-binding periplasmic protein |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.421619 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.0114368 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATGCGGA ACGATCTCCT TCGACGGCTG CGATTCGTCG GCGCGACGCT GGGGCTTTCG GCGAGCCTCG CCGTGCCGGC GCTGGCGCAG GATCGCGTGG TCAATTTCTA CAACTGGTCG AACTACGTCG CGCCCGGCGT ACTCGAGGAG TTCACCCGCG AAACCGGGAT CAAGGTGATC TACGACACCT TCGACGGCAA CGAGACGCTG GAGACCAAGC TGCTCGCCGG CAAATCCGGC TACGACGTCG TCGTGCCGAC CGCGTATTTC CTGCAGCGCC AGATCGGCGC GAAGGTGTTC CAGAAGCTCG ACCCGACGAA GCTGCCCAAC CTGAAGAACG CCTGGGACGT GGTGACGAAG AAGCTGGCGC TGTACGATCC CGGCAATCAC TATGCGGCCA ACTACATGTG GGGGACCACC GGCATCGGCT ACAATGTCGG CGCAGTGAAG CGCATTCTCG GCGAGGGTGC GGTGATCGAC AGCTGGGACA TCGTGTTCAA GCCGGAGAAT CTGGCGAAGT TCAAGGAGTG CGGCGTCCAG ATGCTGGATT CCGCCGACGA CATCCTGCCG GCGGCGCTGA CCCGTCTCGG CCTCGATCCG AACTCGACCA AGCAGCCGGA TCTGGAGAAG GCCGCCGATG CCGTCGCCAA GGTGCGGCCG TCGGTGCGGA AGTTTCACTC GTCGGAATAT CTCAACGCGC TCGCCACCGG CGAAATCTGC CTCGTCGTCG GCTGGTCCGG CGATATCAAG CAGGCGCAGG CGCGGGCGAC CGAAGCCAAC AACGGCGTCG AAATCGGCTA CGCGATCCCG AAAGAAGGAG CGCAGATGTT CTTCGACAAT CTGGCTATCC CGGCGGACGC CAAGAACGTC GCCGAGGCGC ACGAATTGAT CAACTTCCTG TTTCGCCCCG AGATCGCCGC CCGCAATTCC GACTTCCTGT CCTACGCCAA CGGCAACAAA GCCAGCCAGG AATTCGTCAA CCCGCGTATC TTGAACGACA AGACGATCTA TCCGGACGAA GCGATGCAGG CGCGCCTGTT CGTCATCACC GCGCGCGACG CGGCGACGCA GCGCGTGATC AACCGGCTGT GGACGCGGGT GAAGACCGGG CGGTGA
Protein sequence | MMRNDLLRRL RFVGATLGLS ASLAVPALAQ DRVVNFYNWS NYVAPGVLEE FTRETGIKVI YDTFDGNETL ETKLLAGKSG YDVVVPTAYF LQRQIGAKVF QKLDPTKLPN LKNAWDVVTK KLALYDPGNH YAANYMWGTT GIGYNVGAVK RILGEGAVID SWDIVFKPEN LAKFKECGVQ MLDSADDILP AALTRLGLDP NSTKQPDLEK AADAVAKVRP SVRKFHSSEY LNALATGEIC LVVGWSGDIK QAQARATEAN NGVEIGYAIP KEGAQMFFDN LAIPADAKNV AEAHELINFL FRPEIAARNS DFLSYANGNK ASQEFVNPRI LNDKTIYPDE AMQARLFVIT ARDAATQRVI NRLWTRVKTG R
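
The length fields in this record can be cross-checked against the coordinates. The sketch below is a minimal illustration, not part of the source database: it assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS is unspliced and ends in a stop codon (the gene sequence above ends in TGA). The function names are hypothetical helpers introduced only for this example.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


def strip_blocks(seq: str) -> str:
    """Remove the spaces that split the sequence into 10-character blocks."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = strip_blocks(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


if __name__ == "__main__":
    length = gene_length(3842918, 3844033)
    print(length)                  # 1116, matching the Gene Length field
    print(protein_length(length))  # 371, matching the Protein Length field
```

Pasting the Gene sequence field into `gc_fraction` gives a value that can be compared with the GC content field. The strand (`-`) does not affect either length or GC fraction, so no reverse complement is needed for these checks.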