Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Pden_3148 |
Symbol | |
ID | 4581712 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Paracoccus denitrificans PD1222 |
Kingdom | Bacteria |
Replicon accession | NC_008687 |
Strand | + |
Start bp | 325116 |
End bp | 326492 |
Gene Length | 1377 bp |
Protein Length | 458 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 639770472 |
Product | protease Do |
Protein accession | YP_916925 |
Protein GI | 119385870 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | [TIGR02037] periplasmic serine protease, Do/DeqQ family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.398511 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCGATGA AAATCCCCCG ACTGCGCGCG GCCGTCATCA CGCTGTGCAC GATCCTGGCG CTGGTCGCGC CGCCCCAAGC GGCTGTCGGG CAGCCCATGC CCGAGGCTGG GGGCGGTATT CCCACCATTG CCCCGATGCT GTCGCGCGTG ACCCCGGCGG TGGTCAATAT CGCGGTGGTC TCGGAAACCC CGGTAGTATC GAACCCGCTT TACAACGATC CCTATTTCCG CCGCTTCTTC GATCTTCGCG ACCCGCCGAT GCGGCGGCAG ATGAGCGCCG GCTCCGGCGT GATCGTCGAT GCCGAAAACG GCTATGTGCT GACCAACCAT CACGTCGTCG CCGAGGCCAG CGCGATCAGC GTCACCCTGA AGGACGGGCG CCAGTTCCGG GCCGAGCTCG TCGGCGCCGA CCAGGCCACG GAAATCGCCC TGCTGCGGAT CGAGGCCGAG GATCTGGTCG CGCTGGAGAT CGGCGATTCC GACCGCCTTC AGGTCGGCGA TTTCGTCGCC GCCATCGGCA ACCCCTTCGG CCTGGGCCAG ACGGTGACCT CGGGCATCGT CAGCGCGCTG GGGCGCAGCG GCATCAGCCG CGAGGGCTAT GAGGATTTCA TCCAGACCGA CGCCTCGATC AACCCGGGCA ATTCGGGCGG CGCGCTGGTG ACGCTGGACG GCCGGCTGGT CGGCATCAAC ACCGCCATCC TGACCCCGGC AGGGGGCAAT ATCGGCATCG GCTTCGCGGT GCCCAGCAAC ATGGCCGTCG CGGTGATGCG GCAGCTGATC GAGCACGGCG AGGTGCGGCG CGGCAGGCTG GGCATCGGCA TGCACGATCT GACGCCCGAC CTGGCCGAGG CGCTGGAGCT GGGCGGCATC CACGGCGCGG TGATCGCGAA TGTCGAGCCC GGCTCTCCCG CCGAGAAGGC GGGGCTGAAG GCGGGGGACG TGGTGACCGC TGTCGACGGC GCCCCGGTGC AGGGCGCCAC GCATCTGCGA AACCGGCTGG GCCTGACGCC CGTCGGCAGC ACCATCCGCC TGACGGTCAG GCGCGAGGGG GATGCGCGGG AGGTCGACCT GACCATCACC GCCGATGCGA CCTCCTCCGG TGATTTCGCC GGCACGCCGC TGGATGGCGC CCGGCTGCGC AACGCCTCGG CGGAGGAGGC GCGCCGGGCG GGGGGTGCCG GGATCATGGT CGACAGCGTC GAGCCGGACA GCCTTGCGGC CTGGATCGGC CTGCGCCGCG GCGACATGAT CGTCGCCGTC AACCGGACGC CGGTTTCCTC GGTCGCGGAT TTGCGTGACA TGCTTGCCGG CCACCGAGCC GTCGCGGCGC TGGAACTGAT CCGCGACGGC AGCCGTCTTT TCATCGTCGC AAGGTGA
|
Protein sequence | MSMKIPRLRA AVITLCTILA LVAPPQAAVG QPMPEAGGGI PTIAPMLSRV TPAVVNIAVV SETPVVSNPL YNDPYFRRFF DLRDPPMRRQ MSAGSGVIVD AENGYVLTNH HVVAEASAIS VTLKDGRQFR AELVGADQAT EIALLRIEAE DLVALEIGDS DRLQVGDFVA AIGNPFGLGQ TVTSGIVSAL GRSGISREGY EDFIQTDASI NPGNSGGALV TLDGRLVGIN TAILTPAGGN IGIGFAVPSN MAVAVMRQLI EHGEVRRGRL GIGMHDLTPD LAEALELGGI HGAVIANVEP GSPAEKAGLK AGDVVTAVDG APVQGATHLR NRLGLTPVGS TIRLTVRREG DAREVDLTIT ADATSSGDFA GTPLDGARLR NASAEEARRA GGAGIMVDSV EPDSLAAWIG LRRGDMIVAV NRTPVSSVAD LRDMLAGHRA VAALELIRDG SRLFIVAR
|
| |