Gene Information |
Locus tag | RPB_4334 |
Symbol | |
ID | 3912147 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | + |
Start bp | 4919634 |
End bp | 4920641 |
Gene Length | 1008 bp |
Protein Length | 335 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 637886238 |
Product | twin-arginine translocation pathway signal |
Protein accession | YP_487932 |
Protein GI | 86751436 |
COG category | [S] Function unknown |
COG ID | [COG3181] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
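The coordinate and length fields above are consistent with one another, and the check can be sketched in a few lines. This is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive (an assumption, though it matches the reported 1008 bp) and that the final codon of this unspliced CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and not part of the record.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length_bp(4919634, 4920641)
print(length)                     # 1008
print(protein_length_aa(length))  # 335
```

Both results agree with the Gene Length and Protein Length fields of the record.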
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.0464074 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCGTCTT CGACCCGCCC GCCGTTCCGA CCCGACCGCC GGACCATCTT GCGCGGCGCC GCAGCGCTCG GCGCGCTGCC GCTGCTGCCC GGCATCGCCC GCGCCGACAG CTGGCCGAGC CGGCCGGTGA AGTTCATCGT GCCGTTCGCG GCCGGCGGCA CCACCGACAT TCTGGCGCGG CTGGTGGCGC AGAAGATGTC GGAGGAGTAC GGCCAGCAAT TCATCGTCGA AAATAAGGCC GGCGCCGGCG GCAACATCGC CGCCGACTTC GTCGCCAAGG CGGATCCCGA CGGCTACACC TTCATCGTCG GCACGCCGGG CACGCACGCC ATCAATCAGT TCGTGTTCAA GTCGCTGAGC TACGATCAGG CCAAGGACAT CGCGCCGGTG ATCGTCATCG CCAAGGTGCC GAACCTGTGC TCGGTGACCA ATGCGCTGCC GGTGAAGAGC GTCGCCGAAC TGATCGCCTA CGCCAAGTCG AAGCCGGGCG AAATATTCTA CGGCACACCC GGCCTCGGCT CGACCGCGCA TGTCTCGATC GAGCTGTTCA AGTCGATGAC CGGCGCGCCG ATGACGCATG TGCCCTACAA GGGCTCGGCG CCGATGCTGA CCGACCTGAT CGCCGGCCGC GTGCATTTCA CCATCGACAA TCTGCCGGCG TCGCAACCCC ACGCCGACGG CGGAACGATC CGCGCGCTCG CGGTCTCGAC CGCGACGCGA TGGCCGCTGC TGCCGGACCT GCCGACCATC GCCGAGGCCG GCGTGCCCGG CTACGACGCC GCGGCGTGGT TCACGATCGG CGCGCCCGCG AAGATTTCGC CGGACATCGT CGCCAGGCTC AACGCCAGCG TCGACAAATT CATCAAGACC GAGGAGGGCA CCGCGCGGCT GCGCAAGCTC GGCGCCGATC CGGTCGGCGG TTCGCCCGCG GACATGCAGC GCTTCGTGCT CGCCGAAACC GAAAAATGGG GCAAGGTCGC GAAGTTCGCC AAGATCGAGC CGCAGTAA
|
Protein sequence | MPSSTRPPFR PDRRTILRGA AALGALPLLP GIARADSWPS RPVKFIVPFA AGGTTDILAR LVAQKMSEEY GQQFIVENKA GAGGNIAADF VAKADPDGYT FIVGTPGTHA INQFVFKSLS YDQAKDIAPV IVIAKVPNLC SVTNALPVKS VAELIAYAKS KPGEIFYGTP GLGSTAHVSI ELFKSMTGAP MTHVPYKGSA PMLTDLIAGR VHFTIDNLPA SQPHADGGTI RALAVSTATR WPLLPDLPTI AEAGVPGYDA AAWFTIGAPA KISPDIVARL NASVDKFIKT EEGTARLRKL GADPVGGSPA DMQRFVLAET EKWGKVAKFA KIEPQ
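The sequence fields are printed in blocks of ten characters, so the spaces have to be removed before the gene sequence can be compared with the protein sequence or used to recompute GC content. The following is a rough sketch of that workflow, not the method the database itself uses. The helper names (clean, gc_fraction, translate) are hypothetical. The codon table is built from the standard NCBI amino acid string, which translation table 11 shares with the standard code; table 11 differs only in the set of permitted start codons, which does not matter here because this gene begins with ATG.

```python
# Amino acids for all 64 codons in NCBI order (bases T, C, A, G).
# Translation table 11 uses the same assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block-separating whitespace and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First three blocks of the gene sequence from this record.
first_blocks = "ATGCCGTCTT CGACCCGCCC GCCGTTCCGA"
print(translate(first_blocks))  # MPSSTRPPFR
print(gc_fraction(first_blocks))
```

The translated fragment matches the first block of the protein sequence in the record. Passing the complete gene sequence to gc_fraction would give the value to compare against the reported GC content of 68%.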