Gene Information |
Locus tag | RPB_4369 |
Symbol | |
ID | 3912184 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | - |
Start bp | 4953346 |
End bp | 4955082 |
Gene Length | 1737 bp |
Protein Length | 578 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 637886275 |
Product | twin-arginine translocation pathway signal |
Protein accession | YP_487967 |
Protein GI | 86751471 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0737] 5'-nucleotidase/2',3'-cyclic phosphodiesterase and related esterases |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGCAGGGCC GCGTCTGCGC GGACAAGGGA ACGGAAACGA TGATCTCGCG GCGGGAATTC CTGCAGGCCA CAGCGGCCGC ATCGGCGCTC ACGATCGGCA GCGGCCTCGG TCCGATTGGG CGGGTGGCGG CGCAGCAGCG GCTGACGCAG GCCGACATTC TGAAGTTCGA TCCGCTCGGC ACGCTGACGC TGCTGCACGT CACCGACATC CACGCCCAGC TGATGCCGCT GCATTTCCGC GAGCCGTCGG TCAATCTCGG CGTCGGCGAG GTCAAGGGCA AGCCGCCGCA TCTGACCGAC GCGGAATTCC GCAACTACTT CCACGTCGCC ACCGGATCGC CGGACGCCTT CGCGCTCACC GCCGATGATT TCGTCGCGCT CGCCCGCAAT TACGGCCGGA TGGGCGGCAT GGACCGCATC GCCACCTTGG TGAACGCGGT GCGCGCCGAG CGCGGCGCCG ACAAGGTGCT GCTGCTCGAC GGCGGCGACA CCTGGCAGGG CAGCTGGACC TCGCTGCAGA GCAAGGGCCA GGACATGATC GACGTCATGA CCGCGCTGAA GCTCGACGCG ATGACCGGCC ATTGGGAATT CACCTACGGC GCCGAGCGGG TCAAGCAGGT CGCCGACTCG GCGCCGTTCG CCTTCCTGGC GCAGAACGTC CGCGACAACG AATGGCAGGA GCCGGTGTTC GAGGCGCGCA AGATGTTCGA GCGCGGCGGC GTCAAGATCG CGGTGATCGG GCAGGCGCTG CCGCGCACCG CGGTCGCCAA TCCGCGCTGG ATGTTTCCGA ACTGGGAGTT CGGCATCCGC GAGGAGGACA TCCAGAAGCA GGCCGACGAC GCCCGCGCCG AAGGCGCCGA GGTCGTGGTG CTGCTGTCGC ACAACGGCTT CGACGTCGAC CGCAAGCTCG CCGGCCGGGT CAAGGGCCTC GACATCATCC TCACCGCCCA CACCCACGAC GCGATGCCGG GCCTGATCAA GGTCGGCGAC ACCGTGCTGG TGGCGTCGGG CTCGCACGGC AAATTCGTGT CGCGGCTCGA CATCGCGGTG AAGGGCAAGA AAGTGTCCGA CATCCGCTTC AAACTGATGC CGGTGTTCGC CGACGCCATC GCGCCGGACC CGGCGATGAA GCAACTGGTC GAGAAGCTGC GTGCGCCCTA CGCCAAGGAT CTCGCGCGCG TCGTCGGCAA GACCGATTCG CTGCTGTATC GCCGCGGCAA TTTCAACGGC ACCTTCGACG ACCTGATCTG CGACGCGATG CTGAAGCAGC GCGACACCGA GATCGCGCTG TCGCCGGGCT TCCGTTGGGG CGGCACGCTG CTGCCGAACG AGGACATCAC CTGGGAGGCG ATCACCAACG CCACCGCGAT CACCTATCCG AACTGCTACC GCAGCGAGAT GACCGGCGAG CAGCTCAAGA ACGTGCTCGA GGACATCGCC GACAACATCT TCCACCCAGA CCCGTATTTC CAGGGCGGCG GCGACATGGT CCGCACCGGC GGCATGGGCT ATTCGATCGA TATCGGCAAG GAGATCGGCT CGCGGATCTC CGGCATGGTG CATCTCAAGA CCGGCAAGCC CATCGAGGCG TCGAAGACCT ACACCGTCTC CGGCTGGGCC AGCATCAACC AGAACACCGA GGGCCCGCCG ATCTGGGACG TGCTGGCCAA GCACGTCGCG CAGGCGGGGC CGGTGAAGAT CGATCCCAAC AGCGCCGTCA AGGTGTCGGG CGCCTGA
Protein sequence | MQGRVCADKG TETMISRREF LQATAAASAL TIGSGLGPIG RVAAQQRLTQ ADILKFDPLG TLTLLHVTDI HAQLMPLHFR EPSVNLGVGE VKGKPPHLTD AEFRNYFHVA TGSPDAFALT ADDFVALARN YGRMGGMDRI ATLVNAVRAE RGADKVLLLD GGDTWQGSWT SLQSKGQDMI DVMTALKLDA MTGHWEFTYG AERVKQVADS APFAFLAQNV RDNEWQEPVF EARKMFERGG VKIAVIGQAL PRTAVANPRW MFPNWEFGIR EEDIQKQADD ARAEGAEVVV LLSHNGFDVD RKLAGRVKGL DIILTAHTHD AMPGLIKVGD TVLVASGSHG KFVSRLDIAV KGKKVSDIRF KLMPVFADAI APDPAMKQLV EKLRAPYAKD LARVVGKTDS LLYRRGNFNG TFDDLICDAM LKQRDTEIAL SPGFRWGGTL LPNEDITWEA ITNATAITYP NCYRSEMTGE QLKNVLEDIA DNIFHPDPYF QGGGDMVRTG GMGYSIDIGK EIGSRISGMV HLKTGKPIEA SKTYTVSGWA SINQNTEGPP IWDVLAKHVA QAGPVKIDPN SAVKVSGA
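
The Start bp, End bp, Gene Length, and Protein Length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: the function names `gene_length` and `protein_length` are hypothetical, and it assumes the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length (the gene sequence above does end in TGA).

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


# Values copied from the record (locus tag RPB_4369).
start_bp, end_bp = 4953346, 4955082

length_bp = gene_length(start_bp, end_bp)
print(length_bp)                  # 1737, matches "Gene Length"
print(protein_length(length_bp))  # 578, matches "Protein Length"
```

The strand field does not affect these lengths; it only determines which strand of the replicon the coordinates are read from.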