Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Gdia_1647 |
Symbol | |
ID | 6975063 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Gluconacetobacter diazotrophicus PAl 5 |
Kingdom | Bacteria |
Replicon accession | NC_011365 |
Strand | + |
Start bp | 1835909 |
End bp | 1837699 |
Gene Length | 1791 bp |
Protein Length | 596 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 643391182 |
Product | peptidase S41 |
Protein accession | YP_002276039 |
Protein GI | 209543810 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0793] Periplasmic protease |
TIGRFAM ID | [TIGR00225] C-terminal peptidase (prc) |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 44 |
Fosmid unclonability p-value | 0.754893 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCGCGC CATCCGTTCC CGTTCCGGCG GGCCCCGTTT CGGCGGGCTG GATCCCCGCC CCGCTCCGTG GGGCCGGGCG ACGGGTCACG TCCGTGATCC TGCCGACGCG CACGATCGCC ATCATGCTGA TCGTCATCCA CCTGATGACG CCCGTCCTGG CCCCCGTGGC GGCGCGCGCC GCCGGCACGC CCGCGCCACC CCCGCCCCCC CGGCCCGCGC CGACCCCCGG ACAGGATACG AGGAGCCAGG ATACGAGCCA GATTCTGGTC CAGGCGGACC CGGTGCCCCA GGCGGGCCAG TTGGACGCGG ACATGACGAT ATCGGTCGTG AACGCGGCCC TGACCTTCCT GCTGCCCCGA ACGCTGGAGA GCCACACGCC GCGCGATTTC TGCCTGTGGG GCCTGAACGG GCTGAGCGCG ATCGACCCGT CCCTGACCGT CGTCGAGCAG AAGGGGCCCG AACAGAAAGG GGCGAACCAG AACGGGATCA TCCAGCTTTC GCTGGGGCAG GAAATCGTGC TGCGCCTGCC CGCCCCCCCC GATTCCGACC AGGCGGGATG GACGGACCTG ACCGTGCGGT TGATGCAGGC CGCCTGGGCC CGGTCGGGCA CGGTGCGCGG CGCCGGCGCG GACGGTCTGA TGCAAAGCTT CTTCGACGAA CTGTTCAACC ACATGGACCC GTATTCGCGT TACGTCGCGC CCAGCCCGGC CACGACCGAC CGCGACACGC GCACCGGCGG CGAGGCCGGG ACCGGGCTGA CATTGGGCCG CGACGCACGC TCGATCCTGA TCACCGGGGT CAATGCCAAC GGGCCGGCCT GGCCGGCGGG TCTGGCGACG GGCCAGCGGC TGTACGCGGT CAATGGCCGT TCGACCCGTG ACCAGGCGCC CGGAACGGTC GCCCAGTGGC TGCTGGGTGC GCCCGGCAGC ACGGTGACGG TGACGGTTGG CGACGGGCGC GCCCGGCGCA CCGTGACGCT GCGCCGGGCC TCGGTCCCGC CGGAGACGGT CTTCGCCTAT GCCGCGGAAC ATATGGTGGT GATCCGCGTC ACCGCGTTTT CGGCCGACAC CGCGCAGGAA ATGAGCCAGT ACCTGGACCA GGCATCCGAC GACCAGCACC TGCGCGGACT GGTGCTGGAC CTGCGCGGCA ATCGCGGCGG GGTGCTGCAG CAGGCGGTGA CGGCCAGCGC GCTGGTGCTG GACCAGGGCG TGGCGGCGAT CACCCACGGG CGGGACCCGG AGGCCAACCA TGTCTGGGCG GTGCAGGGCG GCGACATGAC CGGGGGCGTG CCGATCGTGG TGCTGGTGGA CGGGCGGACC GCCAGCGCGG CCGAGATCCT GGCCGCCTCG CTGGCCGACC ACCGGCGCGC CGTGGTGGTG GGCAGCGCCA CGCTGGGCAA GGGACTGGTG CAGACGATCG GCCAGATGCC CGACGGCGGT GAATTGTTCG TGACCTGGAG CCGCGTCCTG GCGCCGCTGG GCTGGCCGCT GCAGGGGCTG GGCGTCATGC CGCAGGTCTG CACCAGCCGG GGCGAAAGCG ACCTGGAACG GCAGTTGCAG GACCTGGCGG CCGGCCAGGT GGACATGCGC GACGCCGTCC AGGCGACACG CGCCACGCGC TATCCCGTGC CGGTGTCGCG CATCCTGGAC CTGCGCCGCG CCTGCCCGGC GGCGATCGGC ACCGATTCCG ACCTGGATGC CGCGCGGTCG CTGATCGACA ATCCCGCCGA ATATCGCGCC GCCCTGTCCG CCATCCCCGA GGAGAGCCCC TATGCGCCGC AGGCTGAATA A
|
Protein sequence | MRAPSVPVPA GPVSAGWIPA PLRGAGRRVT SVILPTRTIA IMLIVIHLMT PVLAPVAARA AGTPAPPPPP RPAPTPGQDT RSQDTSQILV QADPVPQAGQ LDADMTISVV NAALTFLLPR TLESHTPRDF CLWGLNGLSA IDPSLTVVEQ KGPEQKGANQ NGIIQLSLGQ EIVLRLPAPP DSDQAGWTDL TVRLMQAAWA RSGTVRGAGA DGLMQSFFDE LFNHMDPYSR YVAPSPATTD RDTRTGGEAG TGLTLGRDAR SILITGVNAN GPAWPAGLAT GQRLYAVNGR STRDQAPGTV AQWLLGAPGS TVTVTVGDGR ARRTVTLRRA SVPPETVFAY AAEHMVVIRV TAFSADTAQE MSQYLDQASD DQHLRGLVLD LRGNRGGVLQ QAVTASALVL DQGVAAITHG RDPEANHVWA VQGGDMTGGV PIVVLVDGRT ASAAEILAAS LADHRRAVVV GSATLGKGLV QTIGQMPDGG ELFVTWSRVL APLGWPLQGL GVMPQVCTSR GESDLERQLQ DLAAGQVDMR DAVQATRATR YPVPVSRILD LRRACPAAIG TDSDLDAARS LIDNPAEYRA ALSAIPEESP YAPQAE
|
| |