Gene Information |
Locus tag | Rpal_2140 |
Symbol | |
ID | 6409800 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris TIE-1 |
Kingdom | Bacteria |
Replicon accession | NC_011004 |
Strand | + |
Start bp | 2309908 |
End bp | 2311479 |
Gene Length | 1572 bp |
Protein Length | 523 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 642712024 |
Product | protease Do |
Protein accession | YP_001991136 |
Protein GI | 192290531 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | [TIGR02037] periplasmic serine protease, Do/DegQ family |
|
|
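The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the reported gene length includes the stop codon; the function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information table above.
start_bp = 2309908
end_bp = 2311479
reported_gene_length = 1572      # bp
reported_protein_length = 523    # aa


def gene_length(start, end):
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end - start + 1


def protein_length(cds_length):
    """Amino acids encoded by a CDS whose final codon is a stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1   # subtract the stop codon


# Both checks pass for the values in this record.
assert gene_length(start_bp, end_bp) == reported_gene_length
assert protein_length(reported_gene_length) == reported_protein_length
```

Under these assumptions, 2311479 - 2309908 + 1 gives 1572 bp, and 1572 / 3 - 1 gives 523 aa.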
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.750781 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCACGATC GTCGCCCCGT CCTGTCCACG CAGTCTTCGC ACCGGCCCCA GTCGAGGTCG TTGCTGTCGG CGCGCAAGTT CGCGCTGATG GCCTCGGTCG TCGCCGGTCT CGGCGCGGGG GCTTTCGGGC TCGGCAACGG TTCGTTCGAT CTGATCGCCA CTCCGGCGCA TGCGCAGCAG GTCGGCGCCA ACGTTCAGCC GGCGCAGCAG CCGGTCGGTT TCGCCGACAT CGTCGACAAG GTGAAGCCGT CGGTGATCTC GGTGAAGGTC AACATCGCCG ACAAGATGGC CAAGAACGAA GACCGCGAGG ACTTCTCGTT CCCGCCCGGC TCGCCGATGG AGCGATTCTT CCGCCGGTTC GGCGGCGAGA TGCCTCCCGG CCTGCGCGGC CATCGCGGCG GCGGCATGAT CACCGGCCAG GGCTCGGGCT TCTTCATCTC GGCCGACGGC TATGCGGTGA CCAACAATCA CGTGGTCGAA GGCGCCGACA AGGTCGAAGT CACCACCGAC GACGGCAAGA CCTACAAGGC CAAGGTGATC GGCAATGATC CGCGCACCGA CTTGGCGCTG ATCAAAGTCG AAGGCGGTTC GAACTTCCCC TACGCCAAGC TGTCGGAAGG CAAGCCGCGG ATCGGTGACT GGGTGCTGGC GGTCGGCAAT CCGTTCGGCC TCGGCGGCAC CGTGACGGCC GGCATCGTCT CGGCGATGGG CCGCGACATC GGCAACGGTC CGTACGACGA CTTCATCCAG ATCGACGCGC CGGTGAACAA GGGCAACTCC GGTGGTCCGG CGTTTAACAC CGCGGGCGAA GTGGTCGGCG TCAACACCGC GATCTATTCG CCGTCGGGCG GCAGCATCGG CATCGCATTC TCGATCCCGG CCAACACCGT CAAGGCGGTG GTCGAGCAGC TCAAGGATCG CGGCTCGGTG AGCCGTGGCT GGATCGGCGT GCAGGTGCAG CCGGTGACGC CGGAGATCGC CGACAGCCTC GGCTTGAAGA AGGCGGAAGG CGCGCTGGTC GCAGAGCCGC AGTCGAACGG TCCGGCCGCC AAGGCCGGCA TCGAATCCGG CGACGTGATC GTCGCGGTCG ATGGCACGTC GGTGAAGGAC GCTCGCGAAC TCGCCCGCAC CATCGGTGCG TTCGCGCCGG GTCATGCGGT CAAGCTCACC GTGTTCCACA AGGGCAAGGA GCGTGAGCTG ACGCTGACGC TCGGCGAGCT GCCGAACAAG ATCGAAGCCA GCAACAACAC CGACCGCGGT GATCGCGGCG GAGCCAACCA GGGCCTCGAC CTGCCCAAGC TCGGCCTGAC GCTGGCTCCC GCCAGCTCGG TCGCCGGTGC CGGCAAGGAT GGCGTGGTGG TCACCGACGT CGATCCGAAG GGCGCCGCTG CAGACCGCGG CTTCAAGGAA GGCGATGTGA TCCTCGAGGT CGCCGGCAAG AACGTGTCGA GCCCGGCGGA CGTCCGCGAC GTGCTCGCTA CGGCGAAGAC CGAAAACAAG AACAGCGTGC TGGTCCGGGT ACGCAGCGGC GGCGCCTCGC GCTTCGTCGC CCTCCCGATC GCCAAGGGCT GA
|
Protein sequence | MHDRRPVLST QSSHRPQSRS LLSARKFALM ASVVAGLGAG AFGLGNGSFD LIATPAHAQQ VGANVQPAQQ PVGFADIVDK VKPSVISVKV NIADKMAKNE DREDFSFPPG SPMERFFRRF GGEMPPGLRG HRGGGMITGQ GSGFFISADG YAVTNNHVVE GADKVEVTTD DGKTYKAKVI GNDPRTDLAL IKVEGGSNFP YAKLSEGKPR IGDWVLAVGN PFGLGGTVTA GIVSAMGRDI GNGPYDDFIQ IDAPVNKGNS GGPAFNTAGE VVGVNTAIYS PSGGSIGIAF SIPANTVKAV VEQLKDRGSV SRGWIGVQVQ PVTPEIADSL GLKKAEGALV AEPQSNGPAA KAGIESGDVI VAVDGTSVKD ARELARTIGA FAPGHAVKLT VFHKGKEREL TLTLGELPNK IEASNNTDRG DRGGANQGLD LPKLGLTLAP ASSVAGAGKD GVVVTDVDPK GAAADRGFKE GDVILEVAGK NVSSPADVRD VLATAKTENK NSVLVRVRSG GASRFVALPI AKG
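The protein sequence follows from the gene sequence by codon-by-codon translation. The sketch below uses only the Python standard library and is an illustration, not the pipeline that produced this record. It assumes the sequences are pasted as plain strings with the space-separated 10-character blocks shown above. Translation table 11 assigns the same amino acids to codons as the standard table and differs only in the permitted start codons; since this gene begins with ATG, no special start-codon handling is included. The helper names (`clean`, `translate`, `gc_content`) are hypothetical.

```python
from itertools import product

# Codon table in the conventional TCAG ordering; the amino acid
# assignments are shared by translation tables 1 and 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the block-separating whitespace and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_content(dna):
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three blocks of the gene sequence above.
first_blocks = "ATGCACGATC GTCGCCCCGT CCTGTCCACG"
print(translate(first_blocks))  # MHDRRPVLST
```

The ten codons in the first three blocks translate to MHDRRPVLST, matching the first block of the protein sequence. Passing the full gene sequence to `translate` and `gc_content` should, under the same assumptions, allow the complete protein sequence and the reported GC content to be checked.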
|
| |