Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_3133 |
Symbol | |
ID | 3910934 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | - |
Start bp | 3579560 |
End bp | 3581149 |
Gene Length | 1590 bp |
Protein Length | 529 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 637885035 |
Product | twin-arginine translocation pathway signal |
Protein accession | YP_486740 |
Protein GI | 86750244 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.362143 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACAGAC GCGAATTCAC CAAGCTCGGC CTGCTGGCCG GCGCGGCCGG CATCGGCGGC ATTCCGCTCG GCATCACCCG GGCGGTGGGG CAAACCCGCG GCGGCACGCT CAACACCATC ATTCAGCCGG AGCCGCCGAT CCTGGTCACC GCGCTCAACC AGCAGCAGCC GACGCTGACG CTCGGCGGCA AGATCTACGA AAGCCTGCTG CGCTACGATT TCGATCTCAA GCCGCTGCCC GGCCTGGCGC AGTCCTGGGA AGTGTCGCCC GACAAGCTGA CCTATACATT CAAGCTGCAT CCCAACATCA CCTTCCACGA CGGCGCGCCG CTGACGTCCG AAGACGTGGT GTTCTCGATC ATGAAGGTGC TGATCGAGAA CCACGCCCGC GCGCGCAACA CGTTCTCGCG CGTCGAGAAG GCCGAGGCGC CGGATCCGCT GACCGTGGTG TTCAAACTGA AGGCGCCGTT CGCGCCGTTC CTCACCGCGT TCGACTGCAC CACGGCCCCG ATCGTGCCGA AGCATATTTA CGAAGGCACG GACTATCGCA AGAACCCGGC CAATGCGAAG GCGATCGGCT CGGGTCCGTT CAAGCTGAAG GAATGGGTGC GCGGTTCGCA CGTCCATCTG GTCAAGCACG AAGGCTATTA TCGCCCGGGC GAGCCGGTTC TCGACGAGAT TATCTATCGG GTCATCCCGG ACTCGGCGTC GCGCTCGGTG GCGCTGGAGC AGGGGACCGT GCAGCTCACG CAATGGACCG ACGTCGAACT GTTCGAGGTG CCGCGGCTGT CCAAGCTGCC GCATCTGACG ATGACCACCA AGGGCTACGA ATTCTTCGCC CCCCATACGT GGCTCGAGTT CAACACCCGG ATCGCGCCGA TGAACGACAA GCGGTTCCGG CAGGCGGTGA TGTATGCGAT CGACCGCAAG GCGTTGCTGG CCCGCATCTA TTTCGGTCTC GGCAAGGTCG CGACCGGCCC GGTGTCGTCG AAGACCAAAT TCTACGAGAA GAACGTCAAG GCCTACGACT TCTCGCCCGA GAAGGCGAAG GCTCTGCTCG ACGAGATGGG GCTGAAGCCG GGTGCCGACG GCAAGCGCGT GACGATTCCC TATCTCGTGC CGCCCTACGG TGAAACGCAT CAACGGACCT CCGAATTCAT TCGCCAGTCG CTCGCCCGCG TCGGCATCGA CCTGCAACTC CAGGGGATCG ATGTCGCCGG ATGGGCCGAG AAGTTCAGCA ACTGGGACTT CTCGATGACG GCGACCACGG TCTATCAGTT CGGCGATCCG GCGCTCGGCG TGTCACGGAG TTATGTCTCC TCCAACATCC GCAAGGGCAT TCTGTTCTCC AACACCTGCG GCTACTCCAA TCCGGAGGTC GACCGGCTGT TCGAGGAGGC CGCGGTCGCG ACCTCGGACG ACAAGCGCCA GGAGTTCTAC AGCGAGGTTC AGAAGATCAT GGTCGAGGAC GTGCCGGTCG CCTGGCTGCT CGAGATCGAC TATCCGAACT TCATGGACAA ACGTCTGAAG AACGTGGTGA CGACGGCGAT CGGCGTGCAC GACACGTTCG GGACGGTTTC GTTCGGATGA
|
Protein sequence | MNRREFTKLG LLAGAAGIGG IPLGITRAVG QTRGGTLNTI IQPEPPILVT ALNQQQPTLT LGGKIYESLL RYDFDLKPLP GLAQSWEVSP DKLTYTFKLH PNITFHDGAP LTSEDVVFSI MKVLIENHAR ARNTFSRVEK AEAPDPLTVV FKLKAPFAPF LTAFDCTTAP IVPKHIYEGT DYRKNPANAK AIGSGPFKLK EWVRGSHVHL VKHEGYYRPG EPVLDEIIYR VIPDSASRSV ALEQGTVQLT QWTDVELFEV PRLSKLPHLT MTTKGYEFFA PHTWLEFNTR IAPMNDKRFR QAVMYAIDRK ALLARIYFGL GKVATGPVSS KTKFYEKNVK AYDFSPEKAK ALLDEMGLKP GADGKRVTIP YLVPPYGETH QRTSEFIRQS LARVGIDLQL QGIDVAGWAE KFSNWDFSMT ATTVYQFGDP ALGVSRSYVS SNIRKGILFS NTCGYSNPEV DRLFEEAAVA TSDDKRQEFY SEVQKIMVED VPVAWLLEID YPNFMDKRLK NVVTTAIGVH DTFGTVSFG
|
| |