Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_0356 |
Symbol | |
ID | 3908622 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | + |
Start bp | 398221 |
End bp | 399282 |
Gene Length | 1062 bp |
Protein Length | 353 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 637882242 |
Product | twin-arginine translocation pathway signal |
Protein accession | YP_483978 |
Protein GI | 86747482 |
COG category | [R] General function prediction only |
COG ID | [COG2220] Predicted Zn-dependent hydrolases of the beta-lactamase fold |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACCCGA TCTCGCGCCG AACGTTGCTG GCGAGCCTCG CCGCCCTCGC CGCCACGGCC GGGCTTTCCT CGTTCTGGGT TTCCCGCATG ACCTCCTACA AAGGCCCGAT CACCGATCAT TTCGACGGCG AGCGCTTCTT CGATCGCGAC GGCGCGGCGC CGAAGGGGTG GCTCGACGTG CTGCGCTGGC GCTTCACCAC CAAGCCGGCC AAATGGCCGG ACTGGGCGCC GAGCCCGTTC GCCGACACCC CGCCGCCGCG CGTCGAAGGC GCCAGGGCGC GGCTGAGTTT CGTCGGTCAT GCGAGCTGGC TGATTCAGAC CGGCGGGCTC AATATCCTGG TCGATCCGGT GTGGTCGGAG CGGGTGTCGC CGGTCAGCTT CGCCGGCCCC AAGCGGCACA ACGATCCCGG CATCGCGTTC GACAAGCTGC CCAAGATCGA CATCGTGCTG GTGTCGCACG GCCACTACGA TCACCTCGAC CTGGCGACGC TGTCGCGGCT CGCCGCGCAG CATGCACCGC GGGTGATCAC GCCGCTCGGC AACGATCTGA CGATGGCTTC GCACGACAGC GCGATCCGCG CCGAGGCCTA TGACTGGCGC GACCGCGTCG AGCTCGGGCC CGGCGTCGCC GTGACGCTGG TGCCGACCCG GCACTGGACC GCGCGCGGCC CGTTCGACCG CAATCGCGCG CTGTGGGCGT CGTTCGTCCT GGAGACGCCG GCCGGCAGGA TCTACGTCGT CTGCGATTCC GGCTATGGCG ACGGCCGGCA CTTCCGTAAC GTCCGCGAGG CGCACGCGCC GCTGCGGCTG GCGATCCTGC CGATCGGCGC CTATGCGCCG CGCTGGTTCA TGAAGGACCA GCACATGAAC CCCGCCGACG CCGTGATGGC GCTGGCGGAT TGCGGCGCCC GGCAGGCGCT GGCGAACCAT CACGGCACCT TCCAGCTCAC CGACGAGGCG ATCGATGCGC CGGAGCTGGA ACTGTATGCG GCGCTCGACG CCGCTGCGGT GCCGCGCGAG CGCTTTCCGG TGCTGAAGCC GGGGCAGGTT TTCGAAATCT GA
|
Protein sequence | MNPISRRTLL ASLAALAATA GLSSFWVSRM TSYKGPITDH FDGERFFDRD GAAPKGWLDV LRWRFTTKPA KWPDWAPSPF ADTPPPRVEG ARARLSFVGH ASWLIQTGGL NILVDPVWSE RVSPVSFAGP KRHNDPGIAF DKLPKIDIVL VSHGHYDHLD LATLSRLAAQ HAPRVITPLG NDLTMASHDS AIRAEAYDWR DRVELGPGVA VTLVPTRHWT ARGPFDRNRA LWASFVLETP AGRIYVVCDS GYGDGRHFRN VREAHAPLRL AILPIGAYAP RWFMKDQHMN PADAVMALAD CGARQALANH HGTFQLTDEA IDAPELELYA ALDAAAVPRE RFPVLKPGQV FEI
|
| |