Gene Information |
Locus tag | Oter_2338 |
Symbol | |
ID | 6207278 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Opitutus terrae PB90-1 |
Kingdom | Bacteria |
Replicon accession | NC_010571 |
Strand | + |
Start bp | 3029540 |
End bp | 3030955 |
Gene Length | 1416 bp |
Protein Length | 471 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 641691993 |
Product | protease Do |
Protein accession | YP_001819220 |
Protein GI | 182414154 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | [TIGR02037] periplasmic serine protease, Do/DegQ family; [TIGR02038] periplasmic serine peptidase DegS |
|
|
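The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive and that the CDS ends in a stop codon that is not counted in the protein length; the function names are hypothetical and are not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, excluding the terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Oter_2338 record.
length = gene_length_bp(3029540, 3030955)
print(length)                     # 1416
print(protein_length_aa(length))  # 471
```

Both results agree with the Gene Length (1416 bp) and Protein Length (471 aa) fields of the record.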
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.206121 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 4 |
Fosmid unclonability p-value | 0.0104866 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAATTCC GCTCTCTCGC CAGCTTGCTC TCGATCGCGA TCGCGTCAGC CGTGCCTTTC GTCGCGCACG GCAAGGAAGC CAAACCCGAT CGGAAGCCGC CTACGCTGGC GATCGATCCG AGCCCCGTCA CCGCCGGCAA GTCAGCCCTG GTGACGAGCT ATGCGGATAT TCTCGAGCCG GCGCAGAAAG CGGTGGTCTC GGTTTACTCG ATGAAGATCG TGCGCGAACG CATGGCGCTG AATCCGTTCC TGCGGCAGTT TTTCGGTAAC GAGATTCCGG ATCAGGAACG CGAGCGCAAG GAAGAGGGGC TCGGCTCGGG GGTGATCGTT TCGCCCGACG GCTACATTCT CACAAACAAC CACGTCGTTG AAGGAGCGGA TGAGTTGAAG GTGCTGCTCG CCGACGATCG CGAGTTCATT GCCAAAGTCA TCGGCGCGGA TCCGAAGACC GACATCGCGG TAATCAAGAT CGAAGGCGAA CGACTGCCGG TCGTGACGCT GGCCGACAGT GACAACATCC GCGTTGGCGA CGTCGTGTTC GCGGTCGGGA ATCCGCTCGC GGTCGGGCAG ACGGTCACGA TGGGCATCGT CTCGGCCAAG GGCCGCAGCG TGGGGATCCT CGACGAGGTC GCCGGCTATG AATCGTTCAT CCAAACCGAC GCGGCCATCA ATATGGGGAA CTCCGGGGGC GCCTTGGTGG ATGCCAAAGG CCGGCTGGTG GGAATCAACA GCGCGATCCT GTCGCCCTCG CGCGGCAACA TCGGCATCGG GTTCGCCGTT CCGGTGAATC TAGCCGCGAC GGTCATGCAC AGCCTCATCG AGACCGGAAC GGTTTCGCGC GGCTATCTCG GCGTGCAATC GCAGACGCTC GCCGCCGACG AGGCGGAGGC GTTTGGATTG CCGCGGGACA CGAAGGGTGT GACCATTACC GACGTCACGC CGGACAGCGC GGCGGACAAG GGCGGGCTGA AGGTGGGTGA CGTCGTCCTC AGCGTGAACG ACAAACCCGT CGCGGCGTTG CGCGACCTGC GCATCTACAT CGCCCAGACC GCTCCCGGCT CCAAGGTGAA GCTGAAGATT TCTCGCGACG GCAAACCGCA GGTGCTCGAC ATCGTATTGG GCAAACTGGA CGAGAAACCC AACGAGTTGC TGACCGGTGT GGAGGTGTCT GCGTTGACGC CGGAGGCTCG ACGCCGGCTG CGAATTCCGC CGCGGTTCGA CGGGCTGTTG GTGACATCGG TCGATCCTGA ATCGCCGTAT GCGGACCGAC TGGCGCCTGA CGTCCTGATT TTGCAGGTCG ACCGCGAGGA CGTGAGCGAC ATTGAAGCTG CGCGGCGTGC GCTGACGCCT GGCCGGCACA TCCTGATTGT TTACTATCGC GGCTCGGCGC GCGTGATCGG GCTGACAGTG GAATAG
|
Protein sequence | MKFRSLASLL SIAIASAVPF VAHGKEAKPD RKPPTLAIDP SPVTAGKSAL VTSYADILEP AQKAVVSVYS MKIVRERMAL NPFLRQFFGN EIPDQERERK EEGLGSGVIV SPDGYILTNN HVVEGADELK VLLADDREFI AKVIGADPKT DIAVIKIEGE RLPVVTLADS DNIRVGDVVF AVGNPLAVGQ TVTMGIVSAK GRSVGILDEV AGYESFIQTD AAINMGNSGG ALVDAKGRLV GINSAILSPS RGNIGIGFAV PVNLAATVMH SLIETGTVSR GYLGVQSQTL AADEAEAFGL PRDTKGVTIT DVTPDSAADK GGLKVGDVVL SVNDKPVAAL RDLRIYIAQT APGSKVKLKI SRDGKPQVLD IVLGKLDEKP NELLTGVEVS ALTPEARRRL RIPPRFDGLL VTSVDPESPY ADRLAPDVLI LQVDREDVSD IEAARRALTP GRHILIVYYR GSARVIGLTV E
|
| |