Gene Information |
Locus tag | RPB_3558 |
Symbol | |
ID | 3911360 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | + |
Start bp | 4071544 |
End bp | 4072962 |
Gene Length | 1419 bp |
Protein Length | 472 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 637885460 |
Product | Type I secretion outer membrane protein, TolC |
Protein accession | YP_487164 |
Protein GI | 86750668 |
COG category | [M] Cell wall/membrane/envelope biogenesis [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG1538] Outer membrane protein |
TIGRFAM ID | [TIGR01844] type I secretion outer membrane protein, TolC family |
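The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes 1-based inclusive coordinates and that the gene length includes the stop codon; the function names are hypothetical and are not part of the database.

```python
def gene_length(start_bp, end_bp):
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp, has_stop=True):
    """Number of residues encoded by an unspliced CDS.

    Assumes the CDS length is a multiple of 3 and, by default,
    that the final codon is a stop codon and encodes no residue.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - (1 if has_stop else 0)


# Values taken from the record for RPB_3558.
length_bp = gene_length(4071544, 4072962)
print(length_bp)                  # 1419
print(protein_length(length_bp))  # 472
```

Both results agree with the Gene Length (1419 bp) and Protein Length (472 aa) fields of the record.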
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.445028 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.560368 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGGCTTGGT CGTCAGGATT GATTAGCAGC GCCGGTACGA CGGCACGTCG TTCGCTTGCG GGCGTTTGCG CCGCCGCGAT CAGCCTTGCG CTGACCTTTC CGGTATCCGC CGAGGGACTC CCCGAAGCGC TCGCGAAAGC CTATCAGACC AATCCGCAAT TGAATGCCGA GCGCGCCCGG CAGCGCGCGA CCGACGAGAA TGTGCCGGCC GCCCTGTCGG GTTATCGGCC GCAGATCATC GCAAGTCTCG GCGTCGGTAT GCAGGCGGTG CGGAACCTGC TGCCCGACAA CACCATCCAG ACCGCAACGC TGAAGCCGTG GACGATCGGC GTGACCGTCA CGCAGAACCT GTTCAACGGT TTCAGGACAG CCAACAGTGT GCGCGTCGCC GAATTTCAGG TGAAGTCCGG CCGCGAAGCG TTGCGCAATG TCGGACAGGG CGTGTTGCTC GACGCGGTCA CCGCCTACAC CAACGTGCTC GCCAACCAGG CCTTGGTTGC GGCCCAGAAG ACCAACGTCG ATTTTCTCAG CCAGACGCTG GACATCACCA ACAAGCGATT GAACGCCGGC GATGTGACGC CGACCGACAC CGCGCAGGCC GAAGCGCGGC TCAGCCGCGG CCGCGCCGAT CTCAACGCCG CCGAAGTGAA CCTCGCCGTC AGCGAGGCCA CCTACGCGCA GGTGATCGGC AATCCGCCGT CGCGGCTCAG CCCAGCCGCG CCGGTCGACC GGCTGCTGCC TCGCAGCCGT GAGGAAGCGA TCGCGCTGGC GCTGCAAGGC AATCCCGCAG TGCTGGCGGC GAGCTACGAC GTCGACGTCG CCACCACGAC GATCAAGGTC GCCGAAGGCA GCCTGCTGCC GAGCGTGACG CTGCAAGGCA ACGCCAGTCG CAGCCGGGAC ACCGATTCGA CGCTCGGCAC CAAAGGCACC GACCAGGCTT CGATCCTCGG CCAGGTCTCC GCGCCGATCT ACGACGGCGG ACTCGCCGCC GCGGAGACCC GGCAGTCCAA GGAGATCGCG GCGCAGAGCC GTCTGGTGCT CGATCAGATC CGCAATCAGT CGCGCACCGC GGCGGTCGGC GCGTGGGTCA GCAATGAGGG CGCCAAGATC GCCGTCAGCG CGTCGGAAGC CGAAGTGCGC GCCGCCGAGA TCGCGCTGAA AGGCGTCGGC CGCGAAGCGC AAGGCGGCCA GCGCACCACG GTCGACGTGC TGAATTCGCA GCAGGACCTC ACGCTCGCCC GCGCGCGCCT GATCGGCGCG CAGCGCGATC GGGTGATCGC GTCCTACACG CTGCTCAGCG CCATCGGCCG TCTCGACGTC AAGACGCTGA AGCTCAATAC GCCGGACTAT CTGCCGGATG TTCACTACCA TCAGGTCCGC GACGCCTGGC ACGGCCTGCG CACGCCGTCG GGTCAGTGA
Protein sequence | MAWSSGLISS AGTTARRSLA GVCAAAISLA LTFPVSAEGL PEALAKAYQT NPQLNAERAR QRATDENVPA ALSGYRPQII ASLGVGMQAV RNLLPDNTIQ TATLKPWTIG VTVTQNLFNG FRTANSVRVA EFQVKSGREA LRNVGQGVLL DAVTAYTNVL ANQALVAAQK TNVDFLSQTL DITNKRLNAG DVTPTDTAQA EARLSRGRAD LNAAEVNLAV SEATYAQVIG NPPSRLSPAA PVDRLLPRSR EEAIALALQG NPAVLAASYD VDVATTTIKV AEGSLLPSVT LQGNASRSRD TDSTLGTKGT DQASILGQVS APIYDGGLAA AETRQSKEIA AQSRLVLDQI RNQSRTAAVG AWVSNEGAKI AVSASEAEVR AAEIALKGVG REAQGGQRTT VDVLNSQQDL TLARARLIGA QRDRVIASYT LLSAIGRLDV KTLKLNTPDY LPDVHYHQVR DAWHGLRTPS GQ
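The gene and protein sequences above can be cross-checked by translating the DNA codon by codon. The following is a minimal sketch, not the database's own pipeline: it uses the standard amino-acid assignments that NCBI translation table 11 shares with the standard code, and the helper names (`clean`, `gc_fraction`, `translate`) are hypothetical. Table 11 also allows alternative start codons, which need no special handling here because this gene starts with ATG.

```python
from itertools import product

# Amino-acid assignments of NCBI translation table 11, listed in the
# conventional T, C, A, G codon order ("*" marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces that split the record's sequence into 10-letter blocks."""
    return "".join(seq.split()).upper()


def gc_fraction(dna):
    """Fraction of G and C bases in a non-empty DNA sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna):
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks and last nine bases of the gene sequence above.
first_blocks = "ATGGCTTGGT CGTCAGGATT GATTAGCAGC"
last_codons = "GGTCAGTGA"

print(translate(first_blocks))  # MAWSSGLISS
print(translate(last_codons))   # GQ
```

The first three blocks translate to MAWSSGLISS, the first block of the protein sequence, and the final codons give GQ followed by the TGA stop, matching the protein's last two residues. Passing the full gene sequence to `gc_fraction` would allow a comparison with the GC content field in the same way.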