Gene Information |
Locus tag | RPB_3544 |
Symbol | |
ID | 3911346 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | + |
Start bp | 4054687 |
End bp | 4056042 |
Gene Length | 1356 bp |
Protein Length | 451 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 637885446 |
Product | parallel beta-helix repeat-containing protein |
Protein accession | YP_487150 |
Protein GI | 86750654 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
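The coordinate and length fields in the record above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of this unspliced CDS is a stop codon that does not appear in the protein. The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the Start bp and End bp rows of the record.
length_bp = gene_length(4054687, 4056042)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)
```

Under these assumptions the computed values agree with the Gene Length (1356 bp) and Protein Length (451 aa) rows.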
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.502732 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.506953 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACTCTCG ACCGGCGCCG TTTGATCGGA CTGACAGCCG GCGCGCTGGC GCTGTCGGCG GCGCCACTCG CCGCGCAGCC GCTCACCTCG CAACGCGGCC GCGACGCCAC GCAGGCCGGC CTTCGCCCCG ACAGTTCCGA CGACCAGACC GCGGCCCTGC AGCGCGCGAT CGACAGCGCC GCCCGCGCCC GCGTCCCGCT GGCGCTGCCG CCGGGCCATT ATCGCACCGG CCCACTTCGG TTGCCGTCGG GCGCGCAGCT CAGCGGCGTG CGCGGCGCGA CGCGACTCGT CTTCACCGGC GGCGTCTCGC TGTTCGACAG CGCCGGCGCC GAGACGCCGA CGCTCAGCGG CCTCGTCCTC GACGGTGGCG CCATCCCGCT GCCGGCGCGG CGCGGCCTGG TGCATTGCGT CGGCGCGCGC GACCTGCGGA TCACCGATTG CGAGATCACC GCCAGCGGCG GCTGCGGCGT CTGGCTGGAA ACCACCTCGG GCATGATCAG CGACAACACG CTGACCGCGA TCGCGGTCAC CGGCGTGGTG TCGTTCGACG CCAAGGGGCT GAGCGTGACG CGCAACACCA TCATCGGCGC CAACTCAAAC GGCGTCGAGA TCCTGCGCAC CTCGATCGGC GACGACGGCA CGCTGGTCAC CGGCAACCGG ATCGAGAACA TCAAGGCCGG CCCCGGCGGC TCGGGGCAGT ACGGCAACGC CATCAACGCG TTTCGCGCCG GCAACGTCAT CGTCAGCGGC AACCGGATCA GGAACTGCGA TTACTCCGCG GTGCGCGGCA ATTCGGCGTC GAACATCCAC ATCACCGACA ACAGCGTCAG CGACGTGCGC GAGGTCGCGC TGTATTCGGA ATTCGCGTTC GAAGGTGCGG TGATCTCGGG CAACACCGTC GACGGCGCAG CGCTCGGCGT CTCGGTGTGC AACTTCAACG AGGGCGGCAG GCTCTCGGTC GTGCAGGGCA ACATCATCCG CAACCTCAAG CCGAAGCGGC CGATCGGCAC CGCGCCGGAC GACGACGCCG GCATCGGCAT CTATGTCGAG GCCGACACCG CAGTGACCGG CAATGTGATC GAGAACGCAC CGTCGTTCGG CATCGTCGCC GGATGGGGCA AGTACCTGCG CGATGTCGCC ATCACCGGCA ATGTCGTGCG CAGGGCGTTC ATCGGCATCG GCGTCTCGGT GATGGACGGC GCCGGCACGG CCGCGATCAA CGGCAACGTC ATCGCCGAAG CGCCGCGCGG CGCCGTGGTC GGGCTCGACC ACGCCCGCCC GGTGACGCCG GACCTCACCG CGCCCGGCGC AGCCAAATTC GCGCAGATCG CCCTCGGCAG CAACTCGGTG CGGTGA
|
Protein sequence | MTLDRRRLIG LTAGALALSA APLAAQPLTS QRGRDATQAG LRPDSSDDQT AALQRAIDSA ARARVPLALP PGHYRTGPLR LPSGAQLSGV RGATRLVFTG GVSLFDSAGA ETPTLSGLVL DGGAIPLPAR RGLVHCVGAR DLRITDCEIT ASGGCGVWLE TTSGMISDNT LTAIAVTGVV SFDAKGLSVT RNTIIGANSN GVEILRTSIG DDGTLVTGNR IENIKAGPGG SGQYGNAINA FRAGNVIVSG NRIRNCDYSA VRGNSASNIH ITDNSVSDVR EVALYSEFAF EGAVISGNTV DGAALGVSVC NFNEGGRLSV VQGNIIRNLK PKRPIGTAPD DDAGIGIYVE ADTAVTGNVI ENAPSFGIVA GWGKYLRDVA ITGNVVRRAF IGIGVSVMDG AGTAAINGNV IAEAPRGAVV GLDHARPVTP DLTAPGAAKF AQIALGSNSV R
|
| |
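The sequences above are printed in space-separated blocks of ten residues, so they need to be joined before any analysis. The following is a minimal sketch of that cleanup plus a GC-content calculation; `clean_sequence` and `gc_percent` are hypothetical helper names, and the demo uses only the first three blocks of the gene sequence, so its result is not the whole-gene GC content.

```python
def clean_sequence(grouped: str) -> str:
    """Join a sequence that was printed in whitespace-separated blocks."""
    return "".join(grouped.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string (blocks are joined first)."""
    seq = clean_sequence(dna)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First three blocks of the gene sequence, used here as a short demo input.
prefix = "ATGACTCTCG ACCGGCGCCG TTTGATCGGA"
print(len(clean_sequence(prefix)))
print(round(gc_percent(prefix), 1))
```

Passing the full gene sequence string to `gc_percent` would give a figure to compare against the GC content row of the record.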