Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Swoo_4225 |
Symbol | |
ID | 6118580 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella woodyi ATCC 51908 |
Kingdom | Bacteria |
Replicon accession | NC_010506 |
Strand | + |
Start bp | 5128933 |
End bp | 5130285 |
Gene Length | 1353 bp |
Protein Length | 450 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641635791 |
Product | protease Do |
Protein accession | YP_001762576 |
Protein GI | 170728550 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | [TIGR02037] periplasmic serine protease, Do/DeqQ family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.000000611786 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.000372275 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAAAACGA AATTAAGCTT ACTCTCTGCC GCATTACTAA CGGCATCCCT AACCTTAACG CCAGCGATAA GCCAAGCCGC TATCCCTATG GCTGTTAATG GCGAGTCAAT TCCAAGCCTC GCGCCTATGC TTGAGCGTAC AACTCCTGCG GTCGTCGCCG TTGCTGTTGA AGGCACCCAT GTATCTAAGC AGAAACTTCC CGATGCTTTT CGCTATTTCT TCGGCCCTAA TGCACCACAA GAGCAGGTGC AAGAGCGCCC ATTTAGAGGC TTAGGCTCAG GCGTGATAAT CGATGCTAAT AAGGGCTATA TCGTCACGAA TAACCATGTG ATTGATGGCG CGGATGAGAT CTTAATCGGC CTGCATGACG GTCGTGAGGT TGAAGCAAAA CTGATTGGTG CCGACGCTGA ATCTGATATT GCACTGCTGC AGATAAAGGC TAAAAACTTA GTGGCGGTGA AGCGTGCCGA TTCTGATGAG CTTAAAGTAG GTGACTTTGC TGTCGCTATC GGTAACCCCT TTGGCTTAGG TCAGACGGTC ACATCAGGTA TTGTCAGTGC AATGGGCCGC AGTGGTCTGG GCATAGAGAT GCTTGAAAAC TTTATTCAAA CCGACGCAGC TATCAATAGT GGTAACTCTG GTGGCGCACT CGTCAACCTT AATGGCGACC TTATCGGTAT CAACACAGCC ATTGTTGCGC CTGGAGGCGG TAACGTAGGT ATTGGATTTG CTATCCCAGC TAACATGGTC AACAACTTAG TCGATCAGAT CATTGAACAT GGCGAAGTAC GCCGCGGCGT ATTAGGCGTA TCGGGCAGAG ATCTCACCAG CGAACTTGCT CAAGCCTTCG GTCTCGATAC CCAACATGGC GGGTTTGTCG ATCAGGTGAT GGAAGACAGC GCCGCTGAAG ATGCAGGTAT CAAAGCCGGC GACATTATCG TTAGTGTAAA CGGACGCAAG ATCAAAAGCT TCCAGGAGCT TCGAGCAAAA GTCGCCACGA TGGGCGCTGG TGCTAAAGTC AAATTTGGCT TAATCCGCGA TGGTGACTCC AAAACAGTAT CGGCCACATT AGGTGAAGCG AGCCAAACCA CAGAAGCTTC GGCAGGCGCT GTTCATCCTA TGCTCGCAGG AGCTGCACTG GAAAATGGAG ATGATGGTGT CGAAATCACC GATATTGCCC AAAATTCACC AGCCGCAGCT AGCGGGCTGC GTAAAGGAGA CGTCATTGTC GGCGTAAACC GCAGCTCAAT AGATGATCTC AATTCACTTA AAGCTAAGCT AAAGGAGCAG CAAGGTACGG TGGCGCTTAA GATACAAAGG GGCCACAGTA GCCTTTTCTT AGTACTCAGA TAA
|
Protein sequence | MKTKLSLLSA ALLTASLTLT PAISQAAIPM AVNGESIPSL APMLERTTPA VVAVAVEGTH VSKQKLPDAF RYFFGPNAPQ EQVQERPFRG LGSGVIIDAN KGYIVTNNHV IDGADEILIG LHDGREVEAK LIGADAESDI ALLQIKAKNL VAVKRADSDE LKVGDFAVAI GNPFGLGQTV TSGIVSAMGR SGLGIEMLEN FIQTDAAINS GNSGGALVNL NGDLIGINTA IVAPGGGNVG IGFAIPANMV NNLVDQIIEH GEVRRGVLGV SGRDLTSELA QAFGLDTQHG GFVDQVMEDS AAEDAGIKAG DIIVSVNGRK IKSFQELRAK VATMGAGAKV KFGLIRDGDS KTVSATLGEA SQTTEASAGA VHPMLAGAAL ENGDDGVEIT DIAQNSPAAA SGLRKGDVIV GVNRSSIDDL NSLKAKLKEQ QGTVALKIQR GHSSLFLVLR
|
| |