Gene Information |
Locus tag | RPD_4253 |
Symbol | |
ID | 4024774 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB5 |
Kingdom | Bacteria |
Replicon accession | NC_007958 |
Strand | - |
Start bp | 4721690 |
End bp | 4723387 |
Gene Length | 1698 bp |
Protein Length | 565 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 637964459 |
Product | twin-arginine translocation pathway signal |
Protein accession | YP_571371 |
Protein GI | 91978712 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0737] 5'-nucleotidase/2',3'-cyclic phosphodiesterase and related esterases |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
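The coordinate and length fields above can be cross-checked against each other. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS is complete and ends in a single stop codon (the gene sequence listed in this record ends in TGA). The function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 4721690
end_bp = 4723387
gene_length_bp = 1698
protein_length_aa = 565


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons minus one stop codon.

    Assumes a complete, unspliced CDS whose length is a multiple of 3.
    """
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks hold for this record under the stated assumptions.
coords_match = span_length(start_bp, end_bp) == gene_length_bp
protein_match = expected_protein_length(gene_length_bp) == protein_length_aa
print(coords_match, protein_match)
```

Under these assumptions, the 1698 bp span corresponds to 566 codons, of which 565 encode amino acids and one is the stop codon.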
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.417802 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.174095 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATCTCGA GGCGGGAATT TCTGCAGGTG ACTGCGGCTG CGTCGGCGCT GACGCTCGCC GGCGGCCTCG GTCCGGTTGG ACGCGTTGCG GCACAGCAGC GGCTGACGCA GGCCGACATC CTGAAATTCG ATCCGCTGGG CACGGTGACG CTGCTACACA TCACCGACCT CCACGCCCAA CTGATGCCGC TGCATTTCCG CGAGCCGTCG GTCAATCTCG GAGTCGGCGA GGTCAAGGGC AAGCCGCCGC ATCTCACCGA CGCGGAATTC CGCAACTACT TCCACATCGC TACCGGCTCT CCGGACGCTC TCGCGCTGAC CGCCGATGAT TTCGTCGCGC TCGCCCGCAA CTATGGCCGG ATGGGCGGCA TGGACCGGAT CGCGACGCTG GTCGGCGCGA TCCGCGCGGA GCGCGGCGAC GACAAGGTGC TGCTGCTCGA CGGCGGCGAC GCATGGCAGG GAAGCTGGAC TTCGCTGCAG ACCAAGGGCC AGGACATGAT CGACGTCCTG AGCGCGCTCA AGATCGACGC GATGACCGGC CATTGGGAGT TCACCTACGG CGCCGATCGC GTCAAGCAGG TCGCCGAACA GGCGTCATTC GCCTTTCTCG CGCAGAACGT CCGCGACAAC GAATGGCAGG AACCGGTGTT CGAGGCGCGC AAGATGTATG AGCGCGGCGG CGTGAAGGTC GCCGTGATCG GACAGGCGCT GCCGCGCACC GCGATCGCCA ATCCACGCTG GATGTTCCCG AAATGGGAGT TCGGCATCCG CGAAGAGGAC ATGCAGAAGC AGGTCGACGA CGCGCGCGCC GAAGGCGCCG AGGTCGTGGT TCTGCTGTCG CACAATGGCT TCGACGTCGA CCGCAAGCTC GCCGGGCGCG TGAAGGGCCT CGACGTCATC CTCACCGGTC ACACCCACGA CGCGATGCCG GGCCTGGTCA AGGTCGGCGA CACCGTTCTG GTGGCGTCGG GCTCGCACGG CAAATTCGTG TCGCGGCTCG ATATCGCGGT GAAGGACAAG AAGGTCTCCG ATATCCGCTT CAAGCTGATG CCGGTGTTCG CCGACGCCAT CAAGCCGGAT CCGGCGATGG CGCAACTGGT CGAGAAGCTG CGTGCGCCTT TTGCCAAGGA TCTCGCCCGC GTCGTCGGCA AGACCGACTC GCTGCTGTAT CGCCGCGGCA ATTTCAACGG CACGTTCGAC GATCTGATCT GCGAAGCGAT GTTGAAGCAG CGCGACACCG AGATCGCCCT GTCGCCCGGT TTCCGCTGGG GCGGAACGCT GCTGCCGAAC GATGACATCA CCTGGGAAGC GATCACCAAC GCCACTGCGA TCACCTATCC GAACTGCTAC CGCACCGAGA TGACCGGCGA GCAGCTCAAG ATCGTGCTCG AGGACATTGC CGACAACATC TTCCATCCCG ATCCCTATTT CCAGGGCGGC GGCGACATGG TGCGCACCGG CGGCATGGGC TATGCGATCG ACGTCGGCAA GGAGATCGGC TCGCGGATCT CCAACATGAC GCATCTCAAG ACCGGCAAGC CGATCGAGGC GTCGAAGAAA TACACGGTCT CCGGCTGGGC CAGCATCAAC GAAAACACCG AGGGCCCGCC GATCTGGGAG GTGCTGTCCA AGCACGTCGC GCAGGCCGGT CCGGTGAAGA TCGATCCCAG CAGCGCGGTC AAGGTTTCAG GAGCCTGA
|
Protein sequence | MISRREFLQV TAAASALTLA GGLGPVGRVA AQQRLTQADI LKFDPLGTVT LLHITDLHAQ LMPLHFREPS VNLGVGEVKG KPPHLTDAEF RNYFHIATGS PDALALTADD FVALARNYGR MGGMDRIATL VGAIRAERGD DKVLLLDGGD AWQGSWTSLQ TKGQDMIDVL SALKIDAMTG HWEFTYGADR VKQVAEQASF AFLAQNVRDN EWQEPVFEAR KMYERGGVKV AVIGQALPRT AIANPRWMFP KWEFGIREED MQKQVDDARA EGAEVVVLLS HNGFDVDRKL AGRVKGLDVI LTGHTHDAMP GLVKVGDTVL VASGSHGKFV SRLDIAVKDK KVSDIRFKLM PVFADAIKPD PAMAQLVEKL RAPFAKDLAR VVGKTDSLLY RRGNFNGTFD DLICEAMLKQ RDTEIALSPG FRWGGTLLPN DDITWEAITN ATAITYPNCY RTEMTGEQLK IVLEDIADNI FHPDPYFQGG GDMVRTGGMG YAIDVGKEIG SRISNMTHLK TGKPIEASKK YTVSGWASIN ENTEGPPIWE VLSKHVAQAG PVKIDPSSAV KVSGA
|
| |