Gene Information |
Locus tag | Gdia_1135 |
Symbol | |
ID | 6974539 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Gluconacetobacter diazotrophicus PAl 5 |
Kingdom | Bacteria |
Replicon accession | NC_011365 |
Strand | - |
Start bp | 1272557 |
End bp | 1274164 |
Gene Length | 1608 bp |
Protein Length | 535 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 643390664 |
Product | protease Do |
Protein accession | YP_002275533 |
Protein GI | 209543304 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | [TIGR02037] periplasmic serine protease, Do/DegQ family |
|
|
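The length fields above can be cross-checked against the coordinates. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive (an assumption, although it is consistent with the listed 1608 bp) and that the protein length excludes a single terminal stop codon. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, excluding one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above (Gdia_1135).
length = gene_length_bp(1272557, 1274164)
print(length, protein_length_aa(length))  # 1608 535
```

Both results agree with the Gene Length and Protein Length fields in the table.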
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 52 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCCTGA CCGTTCCGGT TTTCTTCCGT CCCTGTGCCG CCGTCGGGCT GGCGGCGGCC CTGTCCCTGA TCGCGGGGGG GGGCCTCCGG GCCGAACCGG CGGTGGTGCC GCATCCGGTC GCCCCAGCCG CGCCCGCCGC CCCTGCGGTC CCGCCCGTTA CCGTGCCGCC GGTCCACCTG CCCGCGCCGG GGCGCGGCGC GCCCGACAGC TTCGCGGACC TCGCGGCGCG GCTGCTGCCC GCCGTGGTCA ACGTCTCGAC CACCGAGACG GTCAAGCCCG GGGACCAGGA CGGCGATTCC GACGGTGATG GTGGGGGCGA GGATACGCCC CAGGTCCCCA ATTTCCCCGA GGGATCGCCG TTCGAGAAAT TCTTCCACGA TTTCATGAAC CGGCAGACCG GGCCCGAGGC CGCGCCGCGC AAGATGCAGG CGCTGGGGTC GGGCTTCATC ATCGACCCGT CGGGGATCGT CGTCACCAAC AACCACGTCG TCCGCCACGC GGACCAGATC ACGGTGACGC TGCAGGACAA TACGGTGCTC AAGGCGCATT TGCTGGGCCA TGACGACCGG ACCGACCTCG CGGTGCTGAA GGTCGATGCG CCCCATCCGC TGCCCGCCGT GCCCTTCGGC GACAGCGACC ATGCGCGTGT GGGCGACTGG GTGCTGGCGA TCGGCAATCC GTTCGGCCTG TCGGGCACGG TGACGGCCGG CATCATCTCG TCCCGCGGCC GCAATATCGA ACAGGGTCCG TACGACGATT TCATCCAGAC CGACGCCCCG ATCAACAAGG GCAATTCCGG CGGTCCGCTG TTCGACATGC AGGGGCAGGT GATCGGCATC AACACTGCGA TCTATTCGCC CTCCGGCGGG TCGATCGGCA TCGGCTTCTC CATTCCCTCG GCCGAGGCGC GGGGCATCAT CGACCAGCTC CGCCGCACCG GCAAGGTGTC GCGTGGCTGG ATCGGCGTGC GGATCCAGGA CGTGACCCAG GACATCGCCG ACGGGCTGGG CCTTAAGGTC GCGCGCGGCG CCCTGATCGC GGGGATCGAG CCCAAGGGCC CGGCGGCGGC GGCGAAGCTG CAGACCGGCG ACGTGATCCA GACCCTGGAC GGCAAGGAGA TCGACGGCCG CGCCCTGCCG CGCCTGATGG CCGACGAATC CCCCGGCCGG GTGGTCAGCC TGGGGGTGTG GCGCCATGGT CATGTCCTGA CTGTTCCCGT CACGGTCGGC GCCCTGCCCG AGGAAGCGGC CGAGGCCCCG GCGCCCAAGC CCGCCGCCAC GGCCCAGGGC AACGTTCAGT TGCAGGGCAT GGGCTTCACC GTGGGGGCGA TCGACGACGT CGCCCGGCAG AAATACAGCC TGGCCGAGGG GCAGAAGGGC GTGGTCGTCA CCGCGGTGGC CGCCGACGGC CCGGCCGCCG ATCGCGGCCT GCGCCCCGGC GACGTGATCA CCGAGGTCCA GCAGTCCGAG GTCGCGTCCC CCGCCGACCT GCGCCGCCTG GTCGAGGCGG CGCGCGCGCA GCATCGGCGT TCGGTCCTGT TCCTGGTGCA GAACGGCGAC GGGCTGCGCT GGGTGCCGCT CCCGCTGGTC GGAGGGGACG GGAAATAG
|
Protein sequence | MPLTVPVFFR PCAAVGLAAA LSLIAGGGLR AEPAVVPHPV APAAPAAPAV PPVTVPPVHL PAPGRGAPDS FADLAARLLP AVVNVSTTET VKPGDQDGDS DGDGGGEDTP QVPNFPEGSP FEKFFHDFMN RQTGPEAAPR KMQALGSGFI IDPSGIVVTN NHVVRHADQI TVTLQDNTVL KAHLLGHDDR TDLAVLKVDA PHPLPAVPFG DSDHARVGDW VLAIGNPFGL SGTVTAGIIS SRGRNIEQGP YDDFIQTDAP INKGNSGGPL FDMQGQVIGI NTAIYSPSGG SIGIGFSIPS AEARGIIDQL RRTGKVSRGW IGVRIQDVTQ DIADGLGLKV ARGALIAGIE PKGPAAAAKL QTGDVIQTLD GKEIDGRALP RLMADESPGR VVSLGVWRHG HVLTVPVTVG ALPEEAAEAP APKPAATAQG NVQLQGMGFT VGAIDDVARQ KYSLAEGQKG VVVTAVAADG PAADRGLRPG DVITEVQQSE VASPADLRRL VEAARAQHRR SVLFLVQNGD GLRWVPLPLV GGDGK
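The sequences above are printed in blocks of ten characters, so the spaces need to be removed before any processing. The sketch below illustrates this on the first three blocks of the gene sequence and translates them. It assumes the gene sequence is listed in coding orientation (it starts with ATG and ends with TAG, even though the gene lies on the minus strand). The codon table is deliberately partial and covers only the codons in the excerpt; a full translation would need all 64 codons of translation table 11. The names `CODONS`, `clean` and `translate` are hypothetical.

```python
# Partial codon table: only the codons that occur in the excerpt below.
# These assignments are the same in the standard code and in table 11.
CODONS = {
    "ATG": "M", "CCC": "P", "CTG": "L", "ACC": "T",
    "GTT": "V", "CCG": "P", "TTC": "F", "CGT": "R",
}


def clean(seq: str) -> str:
    """Remove the block-separating whitespace and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate complete codons in frame 1; raises KeyError for codons not in CODONS."""
    dna = clean(seq)
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    return "".join(CODONS[dna[i:i + 3]] for i in range(0, usable, 3))


# First 30 bases of the gene sequence, copied from the record above.
excerpt = "ATGCCCCTGA CCGTTCCGGT TTTCTTCCGT"
print(translate(excerpt))  # MPLTVPVFFR
```

The output matches the first block of the protein sequence listed above.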
|
| |