Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Gdia_0779 |
Symbol | |
ID | 6974176 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Gluconacetobacter diazotrophicus PAl 5 |
Kingdom | Bacteria |
Replicon accession | NC_011365 |
Strand | - |
Start bp | 887557 |
End bp | 888567 |
Gene Length | 1011 bp |
Protein Length | 336 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 643390308 |
Product | Cellulase |
Protein accession | YP_002275184 |
Protein GI | 209542955 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2730] Endoglucanase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 48 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCATTA TACAATTGCC CATGCGATCG ACCGCCCTGG CGCTGCTGCT GCTCGCGGGG GCGATGCCGG CCCGGGCCCA GGACTACAAG GGTGTGAACC TGGCCGGGGC GGCCTATTCG TCGAACAAGA TCCCGGGTCG CTACGGCTAC GACTATCTGT TTCCCAAGCC GGGCGAGGTC ACGTACTTCC ATGACCGGGG GATGAATATC TTTCGCCTGT CCGTTCTGTG GGAACGGCTG CAGCCCGCGC CGGGGGTGGC GCTGGATTCC GGTTATATGG GCCGCATCAA CGGGTTCGTG GACCAGGTCC ACGCGGTGGG CGGCAAGGTC ATCCTCGATA TCCACGATTA TGGCCGCTAT CGCGGCACGC TGATCGGCGA CGGCCAGGTG ACCGCCGCCG ATTTTCGCGA CCTGTGGACC CGCCTGGGGA CCGCGTTCCG CGACCGGCCG GATGTCTGGT TCGGCCTGAT GAACGAACCG CAGCAGAAAT CGGCCGAGGC GTGGCGCGAT ATCGAACAGC AGGCGATCCT GGGCATTCGC GCGGCGGGGG CCCGGAACCC GATCCTGGTA TCGGGCGTTG GCTGGGACGC CGCCCATGGT TTCGCAACCG TAAATGCTCC GGCCCTGGCC ACGCTGAAGG ACCCCGACCA TTCCCTGGTG TTCGAAGTCC ATGAATATTT CGATCCCGAC AGTTCGGGCC GCAGCCCGGA CTGCATCCCG ACCGATCAGG CGGTCGCGCG GCTGGCCTCC TTCACCGCAT GGCTGCACCA GACGGGAAAC AAGGGCTTCC TGGGCGAATT CGGCGTCGGC CGCAGCGCGG CCTGCCTGGA TGTCCTGGAC AAGGTCGCAT CCTATCTGGC GGCCAACCAG GCGGTCTGGC TCGGCTGGAC CTACTGGGCC GCCGGGCCGC TGTGGGGCGA ATACATGTAT ACGCTGGAAC CGACCAAGAC CGGACAGGAC CGGCCGCAGA TGCTCGTCCT GGACCGTTAC CTGACCCACC GGGGGAGATA A
|
Protein sequence | MRIIQLPMRS TALALLLLAG AMPARAQDYK GVNLAGAAYS SNKIPGRYGY DYLFPKPGEV TYFHDRGMNI FRLSVLWERL QPAPGVALDS GYMGRINGFV DQVHAVGGKV ILDIHDYGRY RGTLIGDGQV TAADFRDLWT RLGTAFRDRP DVWFGLMNEP QQKSAEAWRD IEQQAILGIR AAGARNPILV SGVGWDAAHG FATVNAPALA TLKDPDHSLV FEVHEYFDPD SSGRSPDCIP TDQAVARLAS FTAWLHQTGN KGFLGEFGVG RSAACLDVLD KVASYLAANQ AVWLGWTYWA AGPLWGEYMY TLEPTKTGQD RPQMLVLDRY LTHRGR
|
| |