Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Gdia_2013 |
Symbol | |
ID | 6975439 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Gluconacetobacter diazotrophicus PAl 5 |
Kingdom | Bacteria |
Replicon accession | NC_011365 |
Strand | + |
Start bp | 2233636 |
End bp | 2235210 |
Gene Length | 1575 bp |
Protein Length | 524 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 643391542 |
Product | protease Do |
Protein accession | YP_002276388 |
Protein GI | 209544159 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | [TIGR02037] periplasmic serine protease, Do/DeqQ family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.316391 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 51 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCTGACG AGCTTCAGCC GATGTCTTCT CTCCGCGCCC GCCGGATGCG GGGCGGAATC CTGGCCGCGC TGGTTGCGGG AACCATGCTG GGGGGCGTGG CCGCCGACGG GCTGGTCCCC GTGGCGCGGG CCGACGATAC CGGGGTGATC CGTCCCGATA CGCAGGTCCA GACCCTGCCG AACTTCGTCA ACCTGGTGAA GCAGGTCCGG CCCGCCGTGG TGTCGATCAC CTCCAGCATC CGTGCCGAGG ACCTGGGGGA CGAAGGCGGC GGCGGCGCCG AGGGGCAGCA GCAGATGCCC TTCCCGTTCC CCTTCCCGTT CCAGATGATG CCGCAGCAGC AGCGCCGCAC GGTCGAGGCG CGGGGATCGG GCTTCATCAT CTCGGCCGAC GGCTATGTCG TGACCAACAA CCATGTGGTC AAGGGCGCGA CCAAGGTCAC GGTGACGCTG GATGACGGCA CCACCCTGCC GGCCAAGATC GTCGGCCGCG ATTCCAAGAC CGACCTCGCG CTGCTGAAGG TCACGTCGCA GGGCAAGCTG CGCTTCATCG AACTGGGCGA TTCCGACAAG GTCGAGCCCG GGGAATGGGT GGTCGCGGTC GGCAATCCCT ACGGGCTGGG CGGCACGGTC ACCGCCGGCA TCGTCTCGGC GCGCGGGCGT GACATCGGCG ACGGGCCGTA CGATTCGTTC ATCCAGGTCG ATGCCCCGAT CAATCGCGGC AATTCCGGCG GCCCGCTCTT CACCCAGGAC GGCAAGGTCG TGGGCGTCAA TACCGCCATC CTCTCGCCCT CGGGCGGGTC GATCGGCATC GGCTTCGCCA TCCCGTCCGA CGTGGTGAAG AACGTCGTCT ACCAGTTGCA GAAGACCGGG CACGTCACCC GGGGTTACCT CGGCGTGGTC GCGCAGGTGA TCACGCCCGC GATGGCCACG GCGCTGGGCC TGAAGCCCGC GGCGCCCGGC GCGCCGCCCA GCGGCGCCCT GGTCGCCAGT GTCAGCAACG GCAGCCCGGC CGAAAAGGCG GGGATCAAGG CCGGGGACGT GATCACCACC CTGAACGGGC AGAAGATCGA CAGCCCGCAT GATCTGGCGG TCAAGGTGGC CTCGATCGTG CCGGGCAGCA AGGCGGCGGT GAACTATATG CGCGGCACGG CCGCGCAGAG CACGACGGTC ACGATCGCCA ACCTCTCCGG CGCTCCGTCG CCCGACGGCG CGGTCGGGGA CAGGAACGAC GGCGGTCCGC GCCTGGGCGT CTCGCTGTCG CCCCTGACGT CGGACCTGCG CCAGCAACTG GGCCTGGACG GGTCGGTGCG CGGCGTCGTC GTCAGCGACG TCCAGTCGGG TTCGGCGGCG GAACAGGCCG GAATCCACGC GGGCGACGTG ATCCAGGCGG TGGGCAACAA GCCGGTGGAA AACCCCGGCG CTACCGTCAC CGCCGTCCGC GCGGCGCTGA AATCCAACCA GTCGGTCCTG CTGCGCATCC TGCGCAACGG GCAGAACATC TTCGTCGCCG TCACGCCGGG CTCGGATGAG GGCGACAGCG GCAATGGCAA CAGCGACCCC GACGGCAACG ACTGA
|
Protein sequence | MSDELQPMSS LRARRMRGGI LAALVAGTML GGVAADGLVP VARADDTGVI RPDTQVQTLP NFVNLVKQVR PAVVSITSSI RAEDLGDEGG GGAEGQQQMP FPFPFPFQMM PQQQRRTVEA RGSGFIISAD GYVVTNNHVV KGATKVTVTL DDGTTLPAKI VGRDSKTDLA LLKVTSQGKL RFIELGDSDK VEPGEWVVAV GNPYGLGGTV TAGIVSARGR DIGDGPYDSF IQVDAPINRG NSGGPLFTQD GKVVGVNTAI LSPSGGSIGI GFAIPSDVVK NVVYQLQKTG HVTRGYLGVV AQVITPAMAT ALGLKPAAPG APPSGALVAS VSNGSPAEKA GIKAGDVITT LNGQKIDSPH DLAVKVASIV PGSKAAVNYM RGTAAQSTTV TIANLSGAPS PDGAVGDRND GGPRLGVSLS PLTSDLRQQL GLDGSVRGVV VSDVQSGSAA EQAGIHAGDV IQAVGNKPVE NPGATVTAVR AALKSNQSVL LRILRNGQNI FVAVTPGSDE GDSGNGNSDP DGND
|
| |