Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_2054 |
Symbol | |
ID | 3909869 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | + |
Start bp | 2335842 |
End bp | 2337230 |
Gene Length | 1389 bp |
Protein Length | 462 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 637883947 |
Product | twin-arginine translocation pathway signal |
Protein accession | YP_485672 |
Protein GI | 86749176 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.568287 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCACGT TCGACAATCC GTTCGATCCC AACCGTCGCC TCACCGCAGG CGGATGTAGC TGCGGCCGGC ACGTCAACGA AGCCGAACAT GCGGCGGATC CGTCGTCGGC GCTGCAGCCG ACAATGCTGG AGAGCGACGA CAAGAAGTTC GAGGGCGTGG TCGCCTCCGC GGTGATGCGG GCGATGTTTC CGCAAGACGC CTCGCGGCGC GCCTTTCTGA AATCGGTCGG CGCCGCAACC GCTCTGGCGG CGATCTCGCA GTTCTTTCCG CTACAGACCG CCACCGAGGC GTTCGCCTCC GGCGGTCCGC TGGAGAAGAA GGACCTCAAG GTCGGCTTCA TCCCGATCAC CTGCGCCACG CCGATCATCA TGGCCGCCCC GATGGGGTTC TATTCGAAAT ACAGCCTCAA CGTCGAAGTC ATCAAGACCG CGGGTTGGGC GGTGATCCGC GACAAGACCA TCAACAAGGA ATACGACGCC GCGCACATGC TGTCGCCGAT GCCGCTCGCC ATCACCATGG GCGTCGGCTC GAATCCGATC CCCTACACCA TGCCGGCGGT CGAGAACATC AACGGCCAGG CCATCACTTT GGCGATGAAG CACAAGGACA AGCGCAATCC GAAGGATTGG AAGGGATTCA AATTCGCGGT CCCGTTCGAC TATTCGATGC ACAACTATCT GCTGCGCTAT TATCTCGCCG AACACGGCCT CGATCCCGAC GTCGACGTGC AGATCCGCGC GGTGCCGCCG CCGGAAATGG TCGCCAATCT GCGCGCCGAC AATATCGACG GCTATCTCGC GCCCGACCCG ATGAACCAGC GCGCGGTGTA TGACGGCGTC GGCTTTATCC ACATCCTGAC CAAGGAGATC TGGGACGGCC ACCCGTGCTG CGCCTTCGCC GCGTCGAAGG AATTCGTCAC CACGATGCCC AACACCTACG GCGCGCTCTT GAAATCGATC ATCGAGGCCA CCGCCTACGC CCACAAGCCG GAGAACCGCA AGGAGATCGC CGCCGCGATC GCGCCGGCCA ACTACCTGAA CCAGCCCGCG ATCGTGCTGG AGCAGATCCT CACCGGCACC TATGCGGACG GCCTCGGCAA CATCATCAAG CAGCCGAACC GGATCGACTT CGACCCGTTC CCCTGGCAGT CCTTCGCGGT CTGGATCATG ACCCAGATGA AGCGCTGGGG ACAGGTCAAG GGCGACGTCG ACTACAAGGC GATCGCCGAG CAGGTCTATC TGGCGACCGA CACCGCGAAA CTGATGAAGG AAGCGGGCCT CACCCCGCCG ACCACGACCT CGCGGTCGTT CTCGGTGATG GGCAAGTCGT TCGACGGCTC GAATCCGGAA GAATATCTCG CGAGCTTCAA GATCAAGAAG GCCTCGTGA
|
Protein sequence | MSTFDNPFDP NRRLTAGGCS CGRHVNEAEH AADPSSALQP TMLESDDKKF EGVVASAVMR AMFPQDASRR AFLKSVGAAT ALAAISQFFP LQTATEAFAS GGPLEKKDLK VGFIPITCAT PIIMAAPMGF YSKYSLNVEV IKTAGWAVIR DKTINKEYDA AHMLSPMPLA ITMGVGSNPI PYTMPAVENI NGQAITLAMK HKDKRNPKDW KGFKFAVPFD YSMHNYLLRY YLAEHGLDPD VDVQIRAVPP PEMVANLRAD NIDGYLAPDP MNQRAVYDGV GFIHILTKEI WDGHPCCAFA ASKEFVTTMP NTYGALLKSI IEATAYAHKP ENRKEIAAAI APANYLNQPA IVLEQILTGT YADGLGNIIK QPNRIDFDPF PWQSFAVWIM TQMKRWGQVK GDVDYKAIAE QVYLATDTAK LMKEAGLTPP TTTSRSFSVM GKSFDGSNPE EYLASFKIKK AS
|
| |