Gene Information |
Locus tag | RPB_3082 |
Symbol | |
ID | 3910883 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | - |
Start bp | 3513218 |
End bp | 3514825 |
Gene Length | 1608 bp |
Protein Length | 535 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 637884987 |
Product | twin-arginine translocation pathway signal |
Protein accession | YP_486692 |
Protein GI | 86750196 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
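The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the reported protein length excludes the stop codon; neither convention is stated in the record. The function names are illustrative only.

```python
# Values copied from the Gene Information table.
START_BP = 3513218
END_BP = 3514825


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # prints: 1608 535
```

Under these assumptions the computed values agree with the Gene Length (1608 bp) and Protein Length (535 aa) fields.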
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.49552 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACTAGCC ACATCAACAG ACGAGTGTTT CTGGGTACGT CCGCCCTTGC GGCGATCGCC GCCGGAACGT CACTTTGCTT TCCACCTGCT GCACTCGCGC AGGAGAAGCC GAAGAAGGGC GGCGTGCTGG TAGCGACCTG GGGTGGCTTC GAGCCGCAGG CCGTGTTCGT TCCCGGCGGC GGTGGCTCCA GCCCGCTGAT CAGCTCGACC AAGATCCTGG AGCCGCTGCT GCGCCAGGAC AGCCAGGCCG GCTTCCTTCC CGTGCTGGCC ACCGAGGTCA AGCCTTCGGC CGATTTCAAG TCGTACGACA TCGTGCTCCG TAAAGGCGTG ACGTGGCACG ACGGCAAGCC GTTCACCGCA GACGACGTGG TCTTCAGCAT CGAAAAATAC TGGCTGCAGA CGATCGCCAA GGCGGCGCTG AAGAATTTCT CCGGGGCCGA GGTGACCGAA GGCGGGGTGC GCGTGAGCTT CAAAGAGCCG ACGCCGGAGT TCTTCTTCAA GTCGGTGCTG GCGACCTCGC TGGTCATTCC GAAGCACGTC TACGACGGCT CGGAGATCGT TACCAACCCC GCCAATAACG CGCCGATCGG CACCGGTCCG TTCAAGTTCA AGCAGTGGGT GCGCGGCAGC CATATTGAAT ATGCCGCCAA CGACAAATAT TGGGATGCGG GCAAGCCCTA TCTCAACGGC CTGGTGATGC GCTACTGGCG CGACGCCGCA TCGCGCACCG CGGCCTTGGA AGCCGACGAA CTGCAGCTCG GCATCTTCAA CCCGATCCCG ACCCCGGACA TCGACCGGCT GTCCAAGTCC GGCAAGTTCG TCGCATCGAA CGACGGCTAT CTCGGCGCGG CCTGGGCTTC GACCATCGAA TTCAACAGCC GCCGTGATAT CGTCAAGGAC CCGGCCGTGC GCCGCGCACT ACTCACCGCC ATCGACCGGG CGACCATCTC GGACGTTGTG TATTTCGGCC GCGCCAAGCC CGGCACGTCC TTCGTCAGCT CGACAAACCC GAAATTCTAC AATCCGAACC TGCCGCGATA CGAGTTCGAC GCGAAGAAAG CCGCCAAGAT GCTCGACGAT GCCGGCTACC CGAAGAAGGG CAAGTCGCGC TTCAAGGTTC ACTTGCTTGC GGCCGGCTGG TTCGAGGAGA ACGGCAAGGT CGGCCAGTTC GTGAAGCAGA ATCTGGAAGA CATCGGCGTC GAAGTGACGC TCACCGTGCC CGACCGAGCG ACTTCGCTGA AGCAGATCTA CGGCGATTAC GACTACGACA TCGCGCTGTC GAACTACGCG GCGCGGGTCG AGCTGGTTCC CCAGCAGACC GACTACCTCA GCACGAGCGG GATTGTGAAG GGCGCGGCCT TCCGCAACGC CACCGGCTAT TCCAATCCCG AAGTCGACGA CATCGTCGCC AAGATGTCGG TGGAGTCCGA CGAAGCCAAG CGGAAGGACT TGGCCTTCAA ATTGCAAGAG ATTGCAGCGC GAGATCTGCC GATCACCGTT CTGGTCGAAC TGATCCCGAC GACGATGATG TCGAAGAAGG TCAAGGGCGT CGGCAATCGC GCCGATATTT CGGCGGACAG CCTGTCGGAC GTTTGGCTCG ACGTCTGA
|
Protein sequence | MTSHINRRVF LGTSALAAIA AGTSLCFPPA ALAQEKPKKG GVLVATWGGF EPQAVFVPGG GGSSPLISST KILEPLLRQD SQAGFLPVLA TEVKPSADFK SYDIVLRKGV TWHDGKPFTA DDVVFSIEKY WLQTIAKAAL KNFSGAEVTE GGVRVSFKEP TPEFFFKSVL ATSLVIPKHV YDGSEIVTNP ANNAPIGTGP FKFKQWVRGS HIEYAANDKY WDAGKPYLNG LVMRYWRDAA SRTAALEADE LQLGIFNPIP TPDIDRLSKS GKFVASNDGY LGAAWASTIE FNSRRDIVKD PAVRRALLTA IDRATISDVV YFGRAKPGTS FVSSTNPKFY NPNLPRYEFD AKKAAKMLDD AGYPKKGKSR FKVHLLAAGW FEENGKVGQF VKQNLEDIGV EVTLTVPDRA TSLKQIYGDY DYDIALSNYA ARVELVPQQT DYLSTSGIVK GAAFRNATGY SNPEVDDIVA KMSVESDEAK RKDLAFKLQE IAARDLPITV LVELIPTTMM SKKVKGVGNR ADISADSLSD VWLDV
|
| |
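The sequences above are displayed in space-separated blocks of 10 characters, so the spacing has to be stripped before any downstream use. The following is a minimal sketch of that cleanup, with a GC-fraction helper and a basic CDS sanity check. The function names are hypothetical, and the example input is only the first 30 bases of the gene sequence, not the full record.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def looks_like_cds(seq: str) -> bool:
    """Rough check: length is a multiple of 3 and the last codon is a stop.

    The start codon is not checked, because translation table 11
    permits alternative start codons in addition to ATG.
    """
    return len(seq) % 3 == 0 and seq[-3:] in {"TAA", "TAG", "TGA"}


# First three 10-base blocks of the gene sequence shown above.
prefix = clean_sequence("ATGACTAGCC ACATCAACAG ACGAGTGTTT")
print(len(prefix), round(gc_fraction(prefix), 3))
```

Passing the complete Gene sequence field through `clean_sequence` and then `gc_fraction` gives a value that can be compared with the reported GC content, assuming that field refers to the gene rather than the replicon.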