Gene Information |
Locus tag | RPD_4229 |
Symbol | |
ID | 4024750 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB5 |
Kingdom | Bacteria |
Replicon accession | NC_007958 |
Strand | + |
Start bp | 4697380 |
End bp | 4698387 |
Gene Length | 1008 bp |
Protein Length | 335 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 637964435 |
Product | twin-arginine translocation pathway signal |
Protein accession | YP_571347 |
Protein GI | 91978688 |
COG category | [S] Function unknown |
COG ID | [COG3181] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.467764 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.364318 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCATCCT CGACCCGTTC GCCGTTCCGC CCGAGCCGTC GCAGCGTGCT GCGCGCCGCC GCCGCGCTCG GAGCCCTGCC GCTGCTGCCG CGCCTCGCCC AGGCTGATAC CTGGCCGAGC CGGCCGGTCA AGTTCGTCGT GCCGTTCGCG GCCGGCGGCA CCACCGACAT TCTGGCGCGG CTGGTGGCGC AGACGTTGTC CGAGGAATAC GGCCAGCAAT TCGTGGTGGA GAACAAGGCC GGCGCGGGCG GCAACATCGC CGCCGATTTC GTCGCCAAGG CGGAGCCCGA CGGCTACACC TTCATCGTCG GCACGCCGGG CACCCACGCG ATCAATCAGT TCGTGTTCAA GTCGCTGAGC TACGATCAGG TCAAGGATAT CACGCCGGTC ATCATCATCG CCAAGGTGCC GAACCTGTGC TCGGTGACCA ATTCGCTGCC GGTCAAAAGC GTCGCCGAAC TGATCGCCTA CGCAAAGGAG AAGCCGGGCG AACTGTTCTA CGGCACGCCG GGGCTCGGCT CGACCGCGCA CGTCTCGACC GAACTATTCA AATCGTTGAC CGGCGTGCAG ATGACCCACG TGCCGTATAA GGGCTCGGCG CCGATGCTGA CCGATCTGAT CGCCGGGCGC GTGCATCTCA CGATCGACAA TCTGCCGGCC TCGCAGCCGC ACGCCGACGC CGGCTCTATC CGTCCGCTCG CGGTCTCCAC CGCGACGCGC TGGCCGCTGC TGCCGGATCT GCCGACCATC GCCGAAGCCG GCGTGCCGGG CTACGACGCC GCCGCCTGGT TCACGATCGG CGCGCCGGCG AAGACCTCTC CGGCGATCAT CGCGAAGCTC AACGCCAGCG TCGACAAATT CATCAAGACC GAAGGCGGGA CCGCGCGGAT GCGCAAGCTC GGCGCCGATC CGGTGGGCGG CTCGCCGGAG GACATGCAGC GCTACGTGCT CGCCGAAATC GAGAAATGGG GCAAGGTCGC GAAGTTCGCC AAGATCGACC CGCAATAG
Protein sequence | MPSSTRSPFR PSRRSVLRAA AALGALPLLP RLAQADTWPS RPVKFVVPFA AGGTTDILAR LVAQTLSEEY GQQFVVENKA GAGGNIAADF VAKAEPDGYT FIVGTPGTHA INQFVFKSLS YDQVKDITPV IIIAKVPNLC SVTNSLPVKS VAELIAYAKE KPGELFYGTP GLGSTAHVST ELFKSLTGVQ MTHVPYKGSA PMLTDLIAGR VHLTIDNLPA SQPHADAGSI RPLAVSTATR WPLLPDLPTI AEAGVPGYDA AAWFTIGAPA KTSPAIIAKL NASVDKFIKT EGGTARMRKL GADPVGGSPE DMQRYVLAEI EKWGKVAKFA KIDPQ
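The Start bp, End bp, Gene Length, and Protein Length fields in this record are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes the coordinates are 1-based and inclusive and that the gene length counts the terminal stop codon (the gene sequence ends in TAG); the record implies both but states neither. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose length includes one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return cds_length_bp // 3 - 1


# Values taken from the record for locus tag RPD_4229.
length = gene_length_bp(4697380, 4698387)
print(length, protein_length_aa(length))
```

With the values from this record the sketch gives 1008 bp and 335 aa, matching the Gene Length and Protein Length fields.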