Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPD_2423 |
Symbol | |
ID | 4022914 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB5 |
Kingdom | Bacteria |
Replicon accession | NC_007958 |
Strand | - |
Start bp | 2701740 |
End bp | 2702906 |
Gene Length | 1167 bp |
Protein Length | 388 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 637962616 |
Product | peptidase S1 and S6, chymotrypsin/Hap |
Protein accession | YP_569554 |
Protein GI | 91976895 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.00407497 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGGGAGCTC TGATCGCCGT CCTCTCACTT GCATTGATGC CGGCGGCTCG CTCCGAAGAC GGTCTGAACA TCGACAAATC ATTCGGATCG GTCTCGGGTT GGGACGTTGG GTTCAGCAAG AACGTCGGTG GCTGTCTGGC CGCAGCGACC TATCGCGACC GGACGACGGT GTGGTTCGGC TTTGCCGGAG ACAAGCCGAG CGCCTACATC GCCTTCACCA ACCCGCGCTG GGCGTCCGTC GAGGTCGATG GCCAGTACGA CCTGCAACTG GTCATGCGCC GCACGCGGTG GAACGGTCAG TTCGTCGGCT TCACGCGCGG TAACGAGAGG GGCGTTTTCT CTGCCGGTCT CAAGACCGAG TTCATGGTCG AACTGGCCGA GTCCGGCGGT GTGGGTGTGT TCCTGAATCG AAATCGCATC GCGGCGCTTT CCCTCGACGG CTCCCGGCGC GCCCTCGAAG CGGTGCTGTC TTGCCAGAAG GCATTCATGA CGGCCCAGAG CGACACACGC GACGAAGGTT CGACCGGGGC CAAGCCAAAG CGCAACGCCA GGAGTTCAGG CACGGGATTC TACGTCTCCG GGAACGGGCA CATCGTGACC AACAACCACG TCATCGCCGA ATGCTCGGCG ATCAATGTGA TTCCCCCCGG CGGGGCGCCG TTGCGCGCGA CTCTCGTGGC GAAGGACAAG ACCAACGATC TCGCGATTCT GAAGACGTCG TCGTCGCCGC CCGCTGTTCC TGGACTCAGA ACCCAGATGC GGTTGGGCGA AGCCGTGTAC GTGTTCGGCT TTCCTCTGAC TGGCATCCTG TCGACATCAG GAAACTTCAC GGCCGGCGCG ATCACCGCGA CCACCGGCAT GGAAGACGAC ACCCGCCTCG CCCAGATCTC CGCTCCGGTT CAACCGGGCA ACAGCGGCGG TCCGTTGCTC GACAAATACG GCAACGTTGT CGGCGTGATC GTATCGAAGC TCAACGCCTT GAACATCGCC GCCGCGACCA AAGACATTCC GCAGAACGTC AATTTCGCGA TCAAATCCGG CATCGCGACG AACTTCCTCG ACAGCAGCGG CGTGCTCCCC AGCGGCACGG TGAGCACGCG CGAACTCCCG CCGGAGGCGA TCGCCGATCT GGCCAAATCC TTCACGGTCC AGGTGCTCTG TAATTAG
|
Protein sequence | MGALIAVLSL ALMPAARSED GLNIDKSFGS VSGWDVGFSK NVGGCLAAAT YRDRTTVWFG FAGDKPSAYI AFTNPRWASV EVDGQYDLQL VMRRTRWNGQ FVGFTRGNER GVFSAGLKTE FMVELAESGG VGVFLNRNRI AALSLDGSRR ALEAVLSCQK AFMTAQSDTR DEGSTGAKPK RNARSSGTGF YVSGNGHIVT NNHVIAECSA INVIPPGGAP LRATLVAKDK TNDLAILKTS SSPPAVPGLR TQMRLGEAVY VFGFPLTGIL STSGNFTAGA ITATTGMEDD TRLAQISAPV QPGNSGGPLL DKYGNVVGVI VSKLNALNIA AATKDIPQNV NFAIKSGIAT NFLDSSGVLP SGTVSTRELP PEAIADLAKS FTVQVLCN
|
| |