Gene Information |
Locus tag | RPB_2097 |
Symbol | |
ID | 3908511 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | + |
Start bp | 2383749 |
End bp | 2385074 |
Gene Length | 1326 bp |
Protein Length | 441 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 637883990 |
Product | twin-arginine translocation pathway signal |
Protein accession | YP_485714 |
Protein GI | 86749218 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
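The coordinates and lengths in the record above are consistent with each other, and a few lines of arithmetic can confirm this. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon not counted in the protein length. The variable names are illustrative and are not part of the database record.

```python
# Values copied from the Gene Information record.
start_bp = 2383749
end_bp = 2385074

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and contributes no residue.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints: 1326 441
```

Both values match the reported Gene Length (1326 bp) and Protein Length (441 aa).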
| |
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.380982 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.00612544 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGACTGACT TCACATTCGA TCGCCGCTCG TTGCTGAAAG GTGGCGCGCT GACACTGGCC GCGGCGGCGA CCATGTCGGC GGATCAGTTG CTGGGCTACG CCAAGGCCTG GGCGCAGACC TCGCCGTGGA AGCCGGAAGC CGGCGCTAAG ATCAATCTGC TGCGCTGGAA GCGCTTCGTC GAAGCCGAGG ACTTGGCTTT CATGAAGATC GTCGACGCGT TCCAGAAAGC CAACAACGTC ACCATCAACG TCTCCAACGA ATCCTACGAC GACATCCAGC CGAAAGCCTC GGTTGCGGCC AACACCGGGC AGGGGCTCGA CATGGTGTGG GGGCTGTATT CGCTGCCGTT CCTGTTCCCC AGCAAATGCG CCGACGTCAC CGACGTCGCC GATCATCTCG CCAAGAAATG CGGCGGATGG ACGGAGTCGG GCAAGGCCTA CGGCATGTAC AACGGCAAGT GGATCGGCAT TCCGGTCGCC GCCACCGGCG GCCTCGTCAA CTACCGGATC AGCGCCGCCG AGAAGGCCGG CCACAAGGAA TTCCCCAAGG ATCTCGCCGG CTTCTCCGAC CTGATGAAGG CCATGAACAA GAACGGCACG CCGGGCGGCA TGGCGCTCGG CCACGCCTCG GGCGACGCCA ATGGCTGGGT GCACTGGGCG CTGTGGGCGC ATGGCGGCAA GCTGATCGAC AAGGACAACA AGGTCGTCGT CAATTCGCCG GAGACCGCCA AGGCGCTGGA CTACGTCAAG GGCCTGTACG AGAACTTCAT TCCCGGCACC GCGTCGTGGA ACGACGCCTC CAACAACAAG GCGTTCCTCG CCGGCCAGCT CTATCTCACC GTCAACGGCA TCTCGATCTA CGTGGCGGCG AAGAAGGACA ACAAGGAGAT GGCGGCGGAC ATCGGCCACG CGCATCTGCC CGCCGGCGTC AGCGGCAAGA CCCGCGAGCT GCATCTCGGC TTCCCGATCC TGATCTACAA CTTCACCAAG TTCCCGCAGA CCTGCAAGGC GTTCACCGCC TTCATGATGG AGCCGGAACA GTTCAACCCG TGGGTCGAGG CGGCGCAGGG CTATCTGTCG CCGTTCCTGC TCGACTTCGA GAAGAACCCG ATGTGGACCG CGGACCCGAA GAACACGCCG TATCGCGACG TCGGCCGCAC CGCCTCCACG CCCGCCGGCG ACGGCCAGAT GGGCGAGAAC GCGGCGGCGG CGATCGCCGA CTTCGTCGTC GTCGACATGT TCGCCAACTA CTGCACCGGC CGCGAAGACG TGAAGACGGC GATGAGCAGC GCCGAACGCG CGGCGAAGCG CATCTTCCGG GCTTAG
Protein sequence | MTDFTFDRRS LLKGGALTLA AAATMSADQL LGYAKAWAQT SPWKPEAGAK INLLRWKRFV EAEDLAFMKI VDAFQKANNV TINVSNESYD DIQPKASVAA NTGQGLDMVW GLYSLPFLFP SKCADVTDVA DHLAKKCGGW TESGKAYGMY NGKWIGIPVA ATGGLVNYRI SAAEKAGHKE FPKDLAGFSD LMKAMNKNGT PGGMALGHAS GDANGWVHWA LWAHGGKLID KDNKVVVNSP ETAKALDYVK GLYENFIPGT ASWNDASNNK AFLAGQLYLT VNGISIYVAA KKDNKEMAAD IGHAHLPAGV SGKTRELHLG FPILIYNFTK FPQTCKAFTA FMMEPEQFNP WVEAAQGYLS PFLLDFEKNP MWTADPKNTP YRDVGRTAST PAGDGQMGEN AAAAIADFVV VDMFANYCTG REDVKTAMSS AERAAKRIFR A
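The sequences above are displayed in space-separated blocks of ten characters, so the spacing has to be removed before any computation. The sketch below shows one possible way to strip the spacing, compute a GC fraction, and translate a coding sequence using the amino-acid assignments of translation table 11. The helper names `clean`, `gc_fraction`, and `translate` are hypothetical, and only a short prefix of the gene sequence is used for illustration.

```python
from itertools import product

BASES = "TCAG"
# Amino-acid assignments shared by NCBI translation tables 1 and 11,
# listed in TCAG codon order ('*' marks a stop codon).
# Table 11 differs from table 1 only in its permitted start codons;
# this gene starts with ATG, so no special start handling is done here.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the display spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a non-empty sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 18 bases of the gene sequence, copied with its display spacing.
prefix = "ATGACTGACT TCACATTC"
print(translate(prefix))  # prints: MTDFTF
```

The output matches the first six residues of the protein sequence in the record. Passing the full gene sequence to `translate` and `gc_fraction` would allow a comparison against the listed protein sequence and the reported 64% GC content; the result should be checked against the record rather than assumed.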