Gene Information |
Locus tag | RPC_2067 |
Symbol | |
ID | 3974025 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB18 |
Kingdom | Bacteria |
Replicon accession | NC_007925 |
Strand | + |
Start bp | 2262994 |
End bp | 2264082 |
Gene Length | 1089 bp |
Protein Length | 362 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 637925175 |
Product | twin-arginine translocation pathway signal |
Protein accession | YP_531940 |
Protein GI | 90423570 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG4663] TRAP-type mannitol/chloroaromatic compound transport system, periplasmic component |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
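The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only and not part of the record: it assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a stop codon that is not counted in the protein length. The function names `span_length` and `expected_protein_length` are hypothetical.

```python
# Values copied from the Gene Information table.
START_BP = 2262994
END_BP = 2264082
GENE_LENGTH_BP = 1089
PROTEIN_LENGTH_AA = 362


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon (assumption)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(START_BP, END_BP))            # 1089
print(expected_protein_length(GENE_LENGTH_BP))  # 362
```

Under these assumptions both computed values agree with the Gene Length and Protein Length fields.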
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.270219 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.0802358 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAACGTC GTGACTTTTT GAAAGTATCC GCAACCGGCG CTGCGGTCGC CGCGGTGGCG TCGCCGGCGA TTGCGCAGTC CTCGCCGGAA ATCAAGTGGC GGCTGACCTC GAGTTTCCCG AAGTCGCTCG ACACCATCTA TGGCGGCGCG GAATATCTCG CCAAGCAAGT CGCCGAGATG ACCGACAACA AGTTCCAGAT CCAGGTGTTC GCGGCCGGCG AAATCGTGCC CGGCCTGCAG GCGCTCGACG CCACTTCGAA CAACACCGTC GAGATGAGCC ACACCGTGTC CTATTACTAC GTCGGCAAGG ACCCGACCTT CGCGGTCTAT GCGTCGGTGC CGTTCGGGCT GAACGCGCGG CAGCAGAATT CCTGGTGGTA TCAGGGCGGC GGCATGGAGC TCGGCAACGA GTTCTACAAG AAATACGGCG TGATCGGATT TCCCTGCGGC AACACCGGGA CCCAGATGGG CGGCTGGTTC CGCAAGGAGA TCAAGACCGT GGCCGATCTG TCCGGGCTGA AATTCCGCAT CGGCGGCATC GCCGGCCAGG TGCTGCAGAA GCTCGGCGTG GTGCCGCAGC AGATCGCCGG CGGCGACATC TATCCGGCGC TGGAAAAGGG CACCATCGAC GCGGCCGAGT GGGTCGGCCC CTATGACGAC GAGAAGCTCG GCTTCCAGAA GGTCGCGAAG TACTACTACT ATCCGGGCTT CTGGGAAGGC GGTCCGACCG TGCACGCCTT CGCCAATCTG GAAAAGTACA ACTCGCTGCC GAAGAGCTAT CAGTCGATCC TGGCCAACGC CGCGCAGGCC ACCAACAGCT GGATGGCGGC GCGCTACGAC ATGCAGAATC CGCCGGCGCT GAAGCGCCTG GTGGCAGGCG GCACGCAGCT GCGGCCGTTC TCCAACGAGG TGCTGGACGC CTGCCTGAAG GCGACCAACG AATTGTGGGG CGAAATCTCC GCCAAGAACG CCGACTTCAA GAAGGGCATC GACGCGATGC AGGCCTACCG CTCCGATCAG TATCTGTGGT GGCAGGTCGC CGAATACACC TTCGACAGCT TCATGATCCG CTCGCGCACC CGCGGCTGA
|
Protein sequence | MKRRDFLKVS ATGAAVAAVA SPAIAQSSPE IKWRLTSSFP KSLDTIYGGA EYLAKQVAEM TDNKFQIQVF AAGEIVPGLQ ALDATSNNTV EMSHTVSYYY VGKDPTFAVY ASVPFGLNAR QQNSWWYQGG GMELGNEFYK KYGVIGFPCG NTGTQMGGWF RKEIKTVADL SGLKFRIGGI AGQVLQKLGV VPQQIAGGDI YPALEKGTID AAEWVGPYDD EKLGFQKVAK YYYYPGFWEG GPTVHAFANL EKYNSLPKSY QSILANAAQA TNSWMAARYD MQNPPALKRL VAGGTQLRPF SNEVLDACLK ATNELWGEIS AKNADFKKGI DAMQAYRSDQ YLWWQVAEYT FDSFMIRSRT RG
|
| |
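The sequences above are printed in space-separated blocks of ten characters. A minimal sketch of how such a field might be cleaned, its GC fraction computed, and its codons translated is given below. It is not the method used to produce this record. The helper names (`clean_sequence`, `gc_fraction`, `translate`) are hypothetical, and the codon table is built from the standard genetic code, whose codon-to-amino-acid assignments translation table 11 shares (the two tables differ only in their permitted start codons).

```python
from itertools import product

# Standard codon assignments, listed in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
_AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product("TCAG", repeat=3), _AMINO_ACIDS)
}


def clean_sequence(raw: str) -> str:
    """Remove the whitespace between ten-character blocks and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence field, as printed in the record.
prefix = clean_sequence("ATGAAACGTC GTGACTTTTT GAAAGTATCC")
print(translate(prefix))  # MKRRDFLKVS
```

The translated prefix matches the first block of the protein sequence field. Applying `gc_fraction` to the full cleaned gene sequence would give a value to compare against the GC content field.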