Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Pnuc_0403 |
Symbol | |
ID | 5052180 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Polynucleobacter necessarius subsp. asymbioticus QLW-P1DMWA-1 |
Kingdom | Bacteria |
Replicon accession | NC_009379 |
Strand | + |
Start bp | 393768 |
End bp | 395219 |
Gene Length | 1452 bp |
Protein Length | 483 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 640470554 |
Product | protease Do |
Protein accession | YP_001155187 |
Protein GI | 145588590 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | [TIGR02037] periplasmic serine protease, Do/DeqQ family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.149111 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATGAAAA AGCATTTAAT TGCGCTCTTG GCTATTTTGA GTCTTGGGCA ATCCCCATTC ATTTCCCAGT CAGTCGCCGA GACGCCTCGA GTGACAATTC CTGATTTTGC GGATTTAGTC GAGCGCGCCA GTCCAGCGGT AGTGAATATT CGCACGACAG AAAAGGTGAA TGTTCAACAA ACTCAGGGTG GTATTCCTGG GATGCCAGAA GATCAAGCAG AATTTTTCCG CCGTTTCTTC GGCGTTCCTA TTCCCGGAAT TCCAAACAGC CCTAAGCAAG TACAGCCAGC TCCCGGTAAA CCTCAAGAGG CTGATCGCGG TGTAGGTTCA GGTTTTATTA TTGATTCCAA CGGCATGATC TTGACGAACG CGCACGTAGT TGAAGGTGCA ACAACGATTT ATGTCACCTT GACTGATAAG CGCGAATTCA AAGCGAAGTT GTTGGGTATG GATAAGCGCA CCGACGTAGC CGTAGTCAAA ATTGATGCGC GTGATCTCCC TAGATTACCT TTGGGTGACT CTTCTAAGGT GCGTGTGGGT GAATGGGTGC TCGCCATTGG CTCCCCGTTC GGCCTGGAGA ATACGGTCAC TGCAGGTATC GTTTCTGCAA AGAGTCGAGA TACTGGCGAC TATCTGCCAT TCATCCAAAC AGACGTTGCT GTGAATCCTG GTAACTCTGG CGGTCCACTC TTAAACACAG CGGGACAAGT GATTGGCATT AATTCGCAAA TATTTAGTCG CTCTGGCGGA TACATGGGAA TTTCTTTTGC CATTCCGATT GATGAAGCAA TACGCGTTGC CGATCAATTG CGTACCAATG GAAAAATGAC GCGTGGCCGT ATCGGTGTCG CTTTAGGTGA TATGACTAAA GAAGTGGCTG AGAGTTTAGG TTTGGGTAAG CCACGTGGGG CATATGTTCG CAATGTCGAG CCAGGCGGTC CTGCAGCTGG AGGTGGCATT GAATCAGGCG ATGTAATTTT GAGTTTTAAT GGACACGAAA TCAATAAGTC AACGGATTTA CCAAGAGTAG TTGGAGAAAC TAAGCCTGGC ACTTCTGTAG TGGTGCAAGT TTGGCGTAAA GGCACTACTC GCGACCTCAC GGTTACTGTG ACGGATGCGG AGTCAAACCA GGCTGCCATT AAAAAGCAGG AGGTACCGGC TGCCAGTGGT AACGGTGGAA ATGTTCTGGG TATCCAGGTA AATGACTTAA GTGATGCCAA GAAGAAGGAC TGGAATATCA AAGGGGGCGT TGAAGTGACT GGCCTTGGCG ACGGTCCTTT AGCTAGGGCG GGGGTTCGTC CTGGGGATGT GATTATCCGC ATTGCTGATA CGGATATTAC GGGGGTGAAG CAGTTTGAGG CCCTGGTTAA AGGTCTGGAT AGTAATAAGG CCGTTCCGGT CTTTATTCGC CGTGCTGACA GCACTTTAGT GATCCCAGTA AGGTCAAAAT AA
|
Protein sequence | MMKKHLIALL AILSLGQSPF ISQSVAETPR VTIPDFADLV ERASPAVVNI RTTEKVNVQQ TQGGIPGMPE DQAEFFRRFF GVPIPGIPNS PKQVQPAPGK PQEADRGVGS GFIIDSNGMI LTNAHVVEGA TTIYVTLTDK REFKAKLLGM DKRTDVAVVK IDARDLPRLP LGDSSKVRVG EWVLAIGSPF GLENTVTAGI VSAKSRDTGD YLPFIQTDVA VNPGNSGGPL LNTAGQVIGI NSQIFSRSGG YMGISFAIPI DEAIRVADQL RTNGKMTRGR IGVALGDMTK EVAESLGLGK PRGAYVRNVE PGGPAAGGGI ESGDVILSFN GHEINKSTDL PRVVGETKPG TSVVVQVWRK GTTRDLTVTV TDAESNQAAI KKQEVPAASG NGGNVLGIQV NDLSDAKKKD WNIKGGVEVT GLGDGPLARA GVRPGDVIIR IADTDITGVK QFEALVKGLD SNKAVPVFIR RADSTLVIPV RSK
|
| |