Gene Information |
Locus tag | RPB_0942 |
Symbol | |
ID | 3909796 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | + |
Start bp | 1089488 |
End bp | 1090711 |
Gene Length | 1224 bp |
Protein Length | 407 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 637882835 |
Product | twin-arginine translocation pathway signal |
Protein accession | YP_484563 |
Protein GI | 86748067 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
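The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive (the usual convention for such records, but not stated here) and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative only.

```python
# Values copied from the Gene Information table above.
start_bp = 1089488
end_bp = 1090711
reported_gene_length = 1224      # bp
reported_protein_length = 407    # aa


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residues encoded by a complete CDS: codon count minus the stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


length = gene_length(start_bp, end_bp)
print(length, protein_length(length))  # 1224 407
```

Both computed values match the reported Gene Length and Protein Length fields.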
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCAGCG ACAACAAACA CGCGTTCACC CGCCGCCGTT TCCTCTCCAA CTTCGCCTTC GCCGGCACCG CGATCGCCAC CGGCGTCGGC AGCTGGGTCG TCCCGGCCGG CTGGGCCAAC GCCGCCGCCG GCCCGATCAA GGTCGGCATC GGCACCGACC TCACCGGCCC GATGGGTTAT GCCGGGAACG CCGATGCCAA CGTCGCCAAG ATGGTGATCA AGCAGATCAA CGACGCCGGC GGCCTGCTCG GCAGGCCGAT CGAACTCTTC ATCGAGGACA CCGCGTCCAA CGAGGCCGTC GCGGTCGGCA ACGTCCGCAA GCTGATCCAG CGCGACAAGG TCGACCTCGT GGTCGGCGGC ATCACCTCGT CGATGCGCAA CGCCATCAAG GACGTCATCG TGTCGCGCGG CAAGACACTC TACATCTATC CGCAACTCTA CGAAGGCAAG GAATGCACGC CGAACCTGTT CTGCACCGGC CCGACCCCGG CGCAGCAGTG CGACGAGTTC ATCCCGTGGC TGATCAAGAA CGGCGGCAAG AAATTCGCGC TGCCCTCCGC CAATTATGTC TGGCCGCACA CGCTCAACGT CTATGCCCGC AAGGTGATCG AGGCCAATGG CGGCGAAGTG GTGCTCGAGG AGTACTACCC GCTTGATCAG ATCGACTTCT CGTCGACCGT CAACCGCATC ATCTCCAACA AGGTCGATGT GGTGTTCAAC ACCGTGATCC CGCCCGGTGT CGGCCCGTTC TTCAAGCAGC TTTATGAAGC GGGGTTCCTC AAGAACGGCG GCCGGCTCGC CTGCGTGTAT TATGACGAGA ACACGCTCGG CATCAATCAG CCGGCGGAGA TCGAAGGGCT GGCGAGCTGC CTCGACTACT TCAAGGCGCT CACCAAGGAC GAGCCGTTCT CCGCCAAGCT GCAGGCCGAC TACGAAAAGG CGTTCCCGGG CAACTTCCTG TTCGCGGCCG GCAGCGCCGC CACCGGCACC TATCGGGCCC TCAAGCTGTG GGAAGCCGCG GTGAAGGAAG CCGGCAAGAT CGACCGCGAC GGCGTCGCCG CGGCGCTCGA TCACGCCAAG ATCGCCGAAG GCCCGGGAGG CCCCGCCGAG ATGGTCCCCG GCAAACGCCA CTGCAAGATG AACATGTACA CCGCCGTCGC CAAGAACGGC AGCTACGAGA TCGTCGAGCG CAGCAAGGGG CTGGTGGATC CGAAGGAATG CTGA
|
Protein sequence | MSSDNKHAFT RRRFLSNFAF AGTAIATGVG SWVVPAGWAN AAAGPIKVGI GTDLTGPMGY AGNADANVAK MVIKQINDAG GLLGRPIELF IEDTASNEAV AVGNVRKLIQ RDKVDLVVGG ITSSMRNAIK DVIVSRGKTL YIYPQLYEGK ECTPNLFCTG PTPAQQCDEF IPWLIKNGGK KFALPSANYV WPHTLNVYAR KVIEANGGEV VLEEYYPLDQ IDFSSTVNRI ISNKVDVVFN TVIPPGVGPF FKQLYEAGFL KNGGRLACVY YDENTLGINQ PAEIEGLASC LDYFKALTKD EPFSAKLQAD YEKAFPGNFL FAAGSAATGT YRALKLWEAA VKEAGKIDRD GVAAALDHAK IAEGPGGPAE MVPGKRHCKM NMYTAVAKNG SYEIVERSKG LVDPKEC
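The sequence fields can be related to the GC content and protein fields with a minimal sketch. It assumes the spaces in the sequences are only display grouping (blocks of 10) and uses the standard codon-to-amino-acid assignments, which translation table 11 shares for elongation codons; table 11's alternative start codons are not handled, which does not matter here because this gene begins with ATG. The helper names are hypothetical.

```python
from itertools import product

# Codon table built in the conventional TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the display spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last blocks of the gene sequence listed above.
head = "ATGTCCAGCG ACAACAAACA CGCGTTCACC"
tail = "CTGGTGGATC CGAAGGAATG CTGA"
print(translate(head))  # MSSDNKHAFT
print(translate(tail))  # LVDPKEC
```

The two translated fragments match the start and end of the listed protein sequence. Passing the full gene sequence to `gc_fraction` gives a value to compare with the reported GC content field.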