Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avin_28010 |
Symbol | glgE |
ID | 7761706 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | - |
Start bp | 2886656 |
End bp | 2888659 |
Gene Length | 2004 bp |
Protein Length | 667 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 643805680 |
Product | family 13 glycosyl hydrolase |
Protein accession | YP_002799948 |
Protein GI | 226944875 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0366] Glycosidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCCGACA CCAGCCAGCC CGGCGTGCAT TGCGAGAACC GTCGTCCGAC CCTGGAGGAA GCGCTGCGTC TACCGCGGAT CGCCATCGAG AACACCGAAC CGGTTGTCGA CAACGGACGC TTCGCCGTCA AGGCGATCGC CGGCCTGCCG GTGACGGTGG CCAGCACCGT CTTCACCGAC GGCCACGACC AACTCGGCGC CGCCCTCTAC TGGCGCGCCG ACGGCGAGAG CGGCTGGCAC CGCCTGCGCA TGAAGTTCGT CGGCAACGAC CGCTGGCAGG TCGAGTTCAC GCCGCTGCGC GTCGCCCGGC ATCTGTTCGT GGTCGAGGCC TGGTGGGATC TCTACGAAAC CTACCGGCAC GAGTTGCAGA AGAAGCATGC CGCCGGGGTG CCGGTCTCCC TGGAGCTGCA GGAGGGCCGC CTGCATCTGC AGCGGGCCGC CGAACACGCC CGGGGCGAGG TGCGCGCCAC GCTGGAGGAT CTCCTGCGGC GTCTGGAGCA GGCCCCGCAC ACCGAGGCCG TGGAGTTGCT GCTGAGCGGC GAGGCCTCCG CCGCCATGAG CGCCGCCGAC CCGTATCCGC ATCGCACCTA CAGCAATGCC TTCCCCCTGG ACGTGGAGCG CGAACGGGCC CTGTTCGCTA GTTGGTACGA ACTCTTCCCC CGCTCGCAGA CCGACAGCCC GCATCGCCAC GGCACCTTCA AGGATGTCAT CGCCCGCCTG CCGGCGATCC ACGACATGGG TTTCGATGTC CTCTACTTCC CGCCGATCCA TCCGATCGGC CGGCGCTTTC GCAAGGGCCG CAACAACAGC CTGGAGGCCG GGCCGGACGA TCCCGGCAGC CCCTATGCCA TCGGCGGCGA GGAGGGCGGC CACGAGGCCA TCCATCCGCA ACTGGGCTCG CGCGAGGATT TCCGCGAACT GGTCGCCGCC GCCGGCGAGT ACGATCTGGA GATCGCCCTG GACTTCGCCG TGCAGTGCTC CCAGGACCAC CCCTGGCTGA AGCAGCATCC CGGCTGGTTC TCCTGGCGCC CGGACGGCAG CATCCGCTAC GCCGAGAATC CGCCGAAGAA ATACCAGGAC ATCGTCAACG TCGACTTCTA TGCCAAGGAC GCCATCCCCG ACCTCTGGAT GGCGCTGCGC GACGTGGTGC TCGGCTGGGT CGCCGAGGGG GTGAAGATCT TCCGCGTCGA CAACCCACAC ACCAAGCCGT TGCCGTTCTG GGAGTGGCTG ATCGGCGACG TGCGCGGCCG GCACCCGGAA GTGATCTTTC TCGCCGAGGC CTTCACCCGC CCGGCGATGA TGCTGCGCCT GGGCAAGCTC GGCTTCTCGC AGAGCTACAC CTATTTCACC TGGCGCAATG CCAAGGAAGA ACTGACCGCC TACTTCGCCG AGCTGAACGA GGCGCCGGCC AGACACTGCG TGCGACCGAA CTTCTTCGTC AACACGCCGG ACATCAACCC CTTCTTCCTG CAGCACTCCG GGCGCGCCGG CTTCCTGATC CGCGCCGCGC TGGCGACCAT GGGCTCCGGG CTGTGGGGCA TGTACTCGGG GTTCGAGCTG TGCGAGGCGG CGGCGCTGCC CGGCAGGGAG GAATACCTCG ACTCGGAGAA GTACGAGATC CGCCCGCGCG ACTACCGGGC GCCCGGCAAC ATCGTCGCCG AGATCGCCCA GCTCAACCGC ATCCGCCGCT ACAACCCGGC GCTGCAGACC CATCTCGGCT TCGAGCCGTA CAACATCTGG AACGACAACA CCCTGCTGTT CGGCAAGCGT ACGCCGGACC TCTCCAACTT CGTCCTGGTC GCCATCAACC TCGATCCCTG GCACGCCCAG GAGGCCCATT TCGAGCTGCC GCTCTGGGAG TTCGGCCTGC CCGACCACGC TGACATGCAC GGCGAGGATC TGATGAACGG CCATCGCTGG ACCTGGCACG GTAAGAAGCA GTGGACGCGC CTCGACCCCG ACTATCTGCC CTTCGGTATC TGGAAGCTGA CCCCACCGGA GTAA
|
Protein sequence | MSDTSQPGVH CENRRPTLEE ALRLPRIAIE NTEPVVDNGR FAVKAIAGLP VTVASTVFTD GHDQLGAALY WRADGESGWH RLRMKFVGND RWQVEFTPLR VARHLFVVEA WWDLYETYRH ELQKKHAAGV PVSLELQEGR LHLQRAAEHA RGEVRATLED LLRRLEQAPH TEAVELLLSG EASAAMSAAD PYPHRTYSNA FPLDVERERA LFASWYELFP RSQTDSPHRH GTFKDVIARL PAIHDMGFDV LYFPPIHPIG RRFRKGRNNS LEAGPDDPGS PYAIGGEEGG HEAIHPQLGS REDFRELVAA AGEYDLEIAL DFAVQCSQDH PWLKQHPGWF SWRPDGSIRY AENPPKKYQD IVNVDFYAKD AIPDLWMALR DVVLGWVAEG VKIFRVDNPH TKPLPFWEWL IGDVRGRHPE VIFLAEAFTR PAMMLRLGKL GFSQSYTYFT WRNAKEELTA YFAELNEAPA RHCVRPNFFV NTPDINPFFL QHSGRAGFLI RAALATMGSG LWGMYSGFEL CEAAALPGRE EYLDSEKYEI RPRDYRAPGN IVAEIAQLNR IRRYNPALQT HLGFEPYNIW NDNTLLFGKR TPDLSNFVLV AINLDPWHAQ EAHFELPLWE FGLPDHADMH GEDLMNGHRW TWHGKKQWTR LDPDYLPFGI WKLTPPE
|
| |