Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Paes_1565 |
Symbol | |
ID | 6458405 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prosthecochloris aestuarii DSM 271 |
Kingdom | Bacteria |
Replicon accession | NC_011059 |
Strand | - |
Start bp | 1703133 |
End bp | 1704641 |
Gene Length | 1509 bp |
Protein Length | 502 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 642725553 |
Product | protease Do |
Protein accession | YP_002016230 |
Protein GI | 194334370 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | [TIGR02037] periplasmic serine protease, Do/DeqQ family |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.00000000901531 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | 4 |
Fosmid unclonability p-value | 0.00330598 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAAAAGAA AATATACCTA CTTGTTTCTT GTGCTCGCCG GTGTGCTGGT TGGCGCGCTT GCTTTTTCTC ATATTGATGT CACTGTACCG GTTGCCGAGA AAAAGATCGC GGTCACGAGT TTTTCGAATG ATGCTGCTGC GTCGCCGGGG GTCGAGCATC AGCCTATCCG TTCGCTGAAG GACCTGAACG AAGCATTTGT CCAGTTGGCT GAGTCTGCGA CACCTTCCGT TGTGACGATT TTTACCGAAA AGACGGTCAA CAGAAAAACC ATTTCTCCGT TTGATCTTTT CGGCAGTCCC TTTGACGATT TTTTCAATGT TCCACGTGAC AGGGGAGGGC AGAATGGATC CAAGGAGGTG CTCAGAGGAC TCGGCTCAGG GGTGGTCGTC AGTGCGGATG GTTATATCCT TACCAACAAC CATGTTGTCG ATAATGCCGA TGTGATCTAC ATTCGTACGT ATGAAAATAA TAAAGTGGCG GCCAAAGTGG TCGGCAAGGA CCCGAAGACA GATCTGGCGG TTATCAAGGC TGACGTCAAA GGATTGAAGC CTATCGCTAT CGGCGACAGT GACGCACTGA GGGTTGGTGA GTGGGTGATC GCTATTGGCA GTCCGCTCGG AGAGAACCTT GCCCGCACAG TAACGCAGGG TATTGTCAGC GCTAAAGGCC GCGCCAATGT CGGACTTGCT GATTATGAAG ATTTTATTCA GACAGACGCT GCCATCAATC CCGGCAATTC CGGTGGTCCC CTGGTCAATA TCAACGGTGA ACTTGTCGGG ATCAACACTG CTATTGCAAG TCGGACAGGT GGTTTTGAAG GGATCGGCTT TGCTGTTCCT TCGAATATGG CAAAAAAAGT GATGCAGTCG CTCATCAGCA ACGGCAAGGT TACAAGAGGA TGGCTGGGCG TCACCATTCA GGATGTCGAT GAAAATATCG CCAAGGGGCT GCAGCTTGAT CCGCCAGAGG GTGTTCTTGT CGGAACGGTT GTTGACGATG GTCCTGCTGC ATCGGCAGGG GTGAAGACGG GAGATGTGAT CATTGCCATC GATGGTAAAA AAGTGACCGA TACCATTGAA CTTCGTAACG GCATTGCAGA GACACCTCCG GGGACGACTG TAAAACTCAG GGTGTGGCGA AACGGGCAGG TAAAGCTCTT GAGCGTGCGG TTGAGCGAGC TTCCCGGAAA GGAGGAGGTT GCTGTCGAAG AGAAGGCTGA AATCACTGAT CTCCTGGGTT TCAGTGTTTC GGAACTGACC TCCGAGCTGG CTTCACGATA CAGGCTGAGT CGGGATAAGG GTGCTGTTCT TGTGACCGGT ATCGATCCAT CGAGCAAGGC GTATCGAGCT GGTTTGCGAC AGGGAGATCT CATTCTTTCC GTCAACAAGA AAAACGTAAG CACCTACAAG GAGTTTCTTG CGGTGGCCGG CTCAGTGAAA AAAGGCGATC TCCTCTTTCT CTTGATCGAA CGTCAGGGCA GCAAAGTGTA TTTTGCCTTT AACATTTAA
|
Protein sequence | MKRKYTYLFL VLAGVLVGAL AFSHIDVTVP VAEKKIAVTS FSNDAAASPG VEHQPIRSLK DLNEAFVQLA ESATPSVVTI FTEKTVNRKT ISPFDLFGSP FDDFFNVPRD RGGQNGSKEV LRGLGSGVVV SADGYILTNN HVVDNADVIY IRTYENNKVA AKVVGKDPKT DLAVIKADVK GLKPIAIGDS DALRVGEWVI AIGSPLGENL ARTVTQGIVS AKGRANVGLA DYEDFIQTDA AINPGNSGGP LVNINGELVG INTAIASRTG GFEGIGFAVP SNMAKKVMQS LISNGKVTRG WLGVTIQDVD ENIAKGLQLD PPEGVLVGTV VDDGPAASAG VKTGDVIIAI DGKKVTDTIE LRNGIAETPP GTTVKLRVWR NGQVKLLSVR LSELPGKEEV AVEEKAEITD LLGFSVSELT SELASRYRLS RDKGAVLVTG IDPSSKAYRA GLRQGDLILS VNKKNVSTYK EFLAVAGSVK KGDLLFLLIE RQGSKVYFAF NI
|
| |