Gene Information |
Locus tag | Avin_26980 |
Symbol | |
ID | 7761605 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | + |
Start bp | 2767609 |
End bp | 2770074 |
Gene Length | 2466 bp |
Protein Length | 821 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 643805575 |
Product | glucan 1,4-alpha-glucosidase |
Protein accession | YP_002799848 |
Protein GI | 226944775 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3387] Glucoamylase and related glycosyl hydrolases |
TIGRFAM ID | [TIGR01535] glucan 1,4-alpha-glucosidase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAACAAC CAGCTAGATT CCGCAAAACA CTGCTGAGCG TCCTGATCGG CGCACTGGGA ATCGCCCAGG CCGCCGGGGC GGCCAACATG GCCCCCGGCG CGCCGGGCGC GGCGCCGTTC TGGTCCTATT CGGGCAAGAC CGGCATCGGT ACTTCCTACG AGCAGTACGA GGACGGCCAG TACTCGCCGC AAGCGGCCAC CGGCGAAGTC TCGAAGGTCT GGTTCTCCCT GGCCCGGGGG GTCGTCACCG AAACCATGTT CGGGCTGATC CACGAAGCCC AGCTCCGCGA AATGCAACTG GTGATCCAGG GCCCCGACTT CCTGGACCTG GAAGCGCAGG ACACCGACAG CGAGATCCAC TATCTGTCGG TCGACGATCA GGGCCGCCCC ACCTCACTGG CCTACAAGAT CGTCAACCGC GACAGGCAGG GCCGCTACGA GATCGAGAAG CACATCTTCA CCGACCCGGA CGCCGATTCG CTGGTGATAC GGGTCATCTT CCGCAGCCAG GAGCCGGGCA TCCAGGCCTA TGTGCACATC GATCCGGCCA TCGGCAACGA CGGCAGTGGC GACAGCGCCT TCGTCGACGG GCAGGCGCTG TACGCCCGGG AGGGCGAGAC CACCCTGGTG GTCAAGGCCA GCCGCCCTCT GGAACAACCC ACCGTCGGCT TCGCCGGAGT TTCCGACGGC CTGACCGACC TCAAGGACGG CAAGCTCGAA AACCGCTACG ACAGCACCGG GGATGCTGCT GGTAACGTCA CCATGCTGGC CCGCCTGCCC TTGGCGGGCA GCGAAACCAC CCTCGACCTG GTGGTCGGCT TCGGCAAGAA CCGCCAGCAG GCCGACCGGG CGGCAGATGC GACGCTGGCG CGCGGCTACG CCGAGGTGCT GGCCCACTAC AACGGCGAGG GCGAGGCGAT CGGCTGGCGG GACTACCTGG CCTCCCTTTC CGGCCTGCCG GCCATGTACG CGCAGAGTGC CGACAACGGC AAGCTGCTCA ACGTCAGCGC CATGGTGCTC AAGGCACAGG AAGACAAGAG CAACGCCGGA GCGCTGATCG CCTCCCTGTC CAACCCCTGG GGCGAGACGG TGCCGGCGGA AAGCGGCACC ACCGGCTACA AGGCGGTCTG GCCGCGCGAT TTCTACCAGT GCGCGATGGC GCTGCTGGCC CTCGGCGACC GGCAGACACC GCAGGTCGCC TTCGAATACC TGAAGAAGGT CCAGGTCAGC GAAGCCACCT CTCTCACGCA GGGCGACACT TCGGCCTACA TCCACGAGCA GAAGCAGGAA GGCAAGACCG AAGACGCCGG CCCCGGCGCC ACCGGCTGGT TCCTGCAGAA GACCCACGTC GATGGCGAGA TCGAATGGGT CGGGGTCCAG CTCGACCAGA CCGCCATGCC CATCATGCTG GGATACAAGC TGTGGAAGGC CGGGATACTG GACGACGCCC GGATCAAGCA CTGGTACCAG AACATGCTCA AGCCGGCGGC GGAATTCCTG ACCAGGGGCG GCCAGGTGAA GGTCGGCTGG AACGACTGGA AGGTCGTCCC GCCGCAGACC CAGCAGGAAC GCTGGGAAGA GCAATGGGGC TACTCGCCCT CGACCACGGC GGCGGTCATC GCCGGCCTGG TGGCGGCCTC CGAAGTCGCC GGCCATGCCG GGGATGGCGA GGCGGCGCAG CGTTTCCTGG ACGCCGCGAA GGGCTATTCC GACAAGCTGG AGGCGAGCAT GGTCACCCGG AAAGGCAGCC TGGGTGACGG GCACTACTAT CTGCGCATCA CCCAGAACGA CGACCCCGAC 
GACGGCGCCC CCCTGCTCGA CAACAACAGC CGCCCCGGAT TGCCGGAACA TCAGGTGCTC GATGCCGGCT TCCTGGAGCT GGTGCGCTAC GGCGTACGCC CCGCCACCGA CCCGAACATC GTCGGCAGCC TGGACGAACT CGATAGCCAG GAGCTGCCGG AAAACCTGAA GGTCAAATAC CTGTTCAGCT ATCCGGGCGT CGAGGGCGAA TTCCCCGGCT GGCGCCGCTA CGGCAACGAC GGCTACGGCG AGAGCGAAAT TTCCGCCATC AACTTCCCGG CCACCGAGGC CCCGCAGATG AACTCCGGGC TGCGCGGCCG CGTCTGGCCG ATCTTCACCG GCGAGCGCGG CCATTACGAG TTGGCCCGCA GCAAGGCCCG CTTGCCGGCA GGAATCGAGG CACTGAGCGC CGAGCAACTG GCGCAGTTGA GGAACACTTA CGTCAAGGCC ATGGAGCTGT TCGCCAACGA GGGCCTGATG CTGCCGGAGC AGGTCTGGGA CGGCGTAGGC GACAACAGCC GCCACGGCTA CGTGAAGGGC GAAGGCACGG ACTCGGCCAC GCCCCTGGCC TGGAGCCATG CCGAATACGT CAAGCTGGTC CGCTCGCTGA CCGACCAGAA GGTCTGGGAT CACTACCCGG TCGTCCCGGA AAAACTCGCC AGATGA
|
Protein sequence | MQQPARFRKT LLSVLIGALG IAQAAGAANM APGAPGAAPF WSYSGKTGIG TSYEQYEDGQ YSPQAATGEV SKVWFSLARG VVTETMFGLI HEAQLREMQL VIQGPDFLDL EAQDTDSEIH YLSVDDQGRP TSLAYKIVNR DRQGRYEIEK HIFTDPDADS LVIRVIFRSQ EPGIQAYVHI DPAIGNDGSG DSAFVDGQAL YAREGETTLV VKASRPLEQP TVGFAGVSDG LTDLKDGKLE NRYDSTGDAA GNVTMLARLP LAGSETTLDL VVGFGKNRQQ ADRAADATLA RGYAEVLAHY NGEGEAIGWR DYLASLSGLP AMYAQSADNG KLLNVSAMVL KAQEDKSNAG ALIASLSNPW GETVPAESGT TGYKAVWPRD FYQCAMALLA LGDRQTPQVA FEYLKKVQVS EATSLTQGDT SAYIHEQKQE GKTEDAGPGA TGWFLQKTHV DGEIEWVGVQ LDQTAMPIML GYKLWKAGIL DDARIKHWYQ NMLKPAAEFL TRGGQVKVGW NDWKVVPPQT QQERWEEQWG YSPSTTAAVI AGLVAASEVA GHAGDGEAAQ RFLDAAKGYS DKLEASMVTR KGSLGDGHYY LRITQNDDPD DGAPLLDNNS RPGLPEHQVL DAGFLELVRY GVRPATDPNI VGSLDELDSQ ELPENLKVKY LFSYPGVEGE FPGWRRYGND GYGESEISAI NFPATEAPQM NSGLRGRVWP IFTGERGHYE LARSKARLPA GIEALSAEQL AQLRNTYVKA MELFANEGLM LPEQVWDGVG DNSRHGYVKG EGTDSATPLA WSHAEYVKLV RSLTDQKVWD HYPVVPEKLA R
|
| |
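The coordinate and length fields in the record above can be cross-checked with a little arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the gene sequence includes its terminal stop codon; the record's own numbers are consistent with both assumptions. The function names (`gene_length`, `expected_protein_length`, `gc_fraction`) are hypothetical helpers written for illustration and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


def gc_fraction(seq: str) -> float:
    """Fraction of G/C bases; whitespace from 10-base grouping is ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


# Values taken from the record for Avin_26980.
length = gene_length(2767609, 2770074)
print(length)                           # 2466
print(expected_protein_length(length))  # 821
```

Passing the full gene sequence string to `gc_fraction` would give a value to compare against the listed GC content; the function accepts the space-grouped form as shown in the record.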