Gene Information |
Locus tag | Ava_4198 |
Symbol | |
ID | 3681001 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anabaena variabilis ATCC 29413 |
Kingdom | Bacteria |
Replicon accession | NC_007413 |
Strand | + |
Start bp | 5257970 |
End bp | 5260552 |
Gene Length | 2583 bp |
Protein Length | 860 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 637719545 |
Product | glucan 1,4-alpha-glucosidase |
Protein accession | YP_324692 |
Protein GI | 75910396 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3387] Glucoamylase and related glycosyl hydrolases |
TIGRFAM ID | [TIGR01535] glucan 1,4-alpha-glucosidase |
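
The coordinate and length fields above can be cross-checked against each other. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS is complete, unspliced, and ends in a single stop codon; the record does not state these conventions, but they are consistent with its numbers. The function names are illustrative only.

```python
# Values copied from the Gene Information record.
START_BP = 5257970
END_BP = 5260552
GENE_LENGTH_BP = 2583
PROTEIN_LENGTH_AA = 860


def span_length(start: int, end: int) -> int:
    """Length of a coordinate interval, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus one stop codon.

    Assumes an unspliced, complete CDS whose length is a multiple of 3.
    """
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons should hold if the assumptions above are right.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, End bp minus Start bp plus one gives the gene length, and the gene length divided by three, less the stop codon, gives the protein length.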
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.116621 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGATGATCT TCACTCTCAA AATTGCATCA CTTACTTTCT GGAAGTCTAT TCATCTCCGT CTCAATTTTA CTCACTCAAA AAAACAATTC ATTCATATGA AACGTTGGTT ATTAGCTAAC TTAATTGCTC TTTGCTCGTT TTTGTTCTTA GGAACAAACG CTGCTTGGGC AGTCGGTGGT GGCATTGCGC CGGGCGCACC GGGTGTTTCC TCTGTTTGGT CTTATGCTGG TAAGCAAGGC ATTGGAACAT CCTATGAACA GTATGTGGAT AATAAATATA GCGATCGCGC TCCTACAAAT GCCATCTCCA AGGTTTGGTT TTCAATTGCT CAAGGTATTG TGACAGAAAC CGCCTATGGA GAAATCGATC GCGCCCAGAT CAATGACTTA CAATTCTTAG TCACAGGCAA CGGTTTTTTT GATGAAGAGA AGGTTTCTAC CACCAGTAAG GTGGACTATC TAGATAAAGA TAAGGATGGA AGGCCTCTTT CTCCTGCTTA TCGTGTGATC AATAAAGATA AAGATGGTAA ATACACCATT GAAAAGCACA TCTTTACCGA TCCTGATCAC CAAACTTTGT TTACAAAAGT GATCTTTACT GCCAAAGAGG ATAACGTTAC CCCCTACATT CTGATTAATC CTCACATGAA TAATACTGGC AAAGAAGATG TAGCTTTTGT TAAAAGCGAT AGTCTCAATG CTAGGGAAGG AGAAATTGTT TATCTAAGTT TGAAGAGTTC ATTGCCTTTT GTGAAAACCT CAGCCGGATA CGTAGCTCGT AGTGATGGCT ATCAAGATTT AAAAGACAAT GGGGTGATGG ACTGGACTTA TGACTACACT GATCAGAGCA AGCCGGGGAG TGTAGCTATG ATAGGTCAAC TGCCTACCCT CAATAAAGGA CAAACTTCCA CCTTTAACAT TGCTGTAGGC TTTGGCAGCA CCTATCAAGA AGCAACCGAA CAAGTCGATG CTTCTTTAAA AGAAGGGTAC GAAAGTCTGT TAGCAAAATA CAACGGTAAA GGCAGCGCCG TAGGCTGGGA AGATTATCTA GCCAGTCTTA AAAATCTACC TGCCATGATT GCCAACACCG GTGACAAGGG TAAACAGCTT TATGCCAGTG CTTTCACCCT CAAGAGTATG GAAGATAAAG AAAATCCTGG GGCTTTAATT GCTTCTTTAT CCGTTCCTTG GGGAGATACT GTGAACGCAG ACACATTTGC CACAGGCTAT CGAGCAGTTT GGCCGAGAGA CTTTTATCAA GCAGCTATGG CTTTGTTAGC TTTAGGAGAT AAAGAAACTC CCCTAACGGC TTTTAAATAC CTTCCACAAG TGCAAGTGCA ATCAGATACA CCAGGTAACG CTGGGGCTAC AGGCTGGTTT TTACAAAAAA CTCACGTTGA TGGAACCTTA GAATGGCTGG GAGTGCAGTT AGATCAAACT GCCATGCCGA TTATGTTGGG CTGGAAACTT TGGAAAGCCG GAGTGTTGTC GGATGGTGAA ATCACTGAGC AATATCGCAC GATGTTGAAA CCTGCCGCAG AATTTCTGGC TAATGGCGGT AATGTCAATA TACATACCTC TGATCAAGCA AACAACCGCA AGATAAACCC ACCAAGCACT CATCAAGAAC GATGGGAAGA ACAATCAGGA TATTCCCCCT CTACAACGGC TGCGATTATT ACTGGGTTAG TTGCGGCGGG TGATATTGCT GAAAATGCAG CTGATGATCC TCTTGCAGCA AAATATTATT TCAGTAAAGC AGATGAATAT CAAAAGAATG TGGATAAATT TATGTTTACC 
ACAACTGGAG ATATTAAGAA CTGTAATAGC TCCAGTGAAT ATCTCCTGAG AGTTACCCCT AATGCAGATC CTAACGATGG AAGTCGCATC CACGACAACA ATAGTTTGCT GGAAGCGGAT GAACGCCAGA TTTTAGATGG TGGTTTCTTG GAATTGGTAC GTTATGGAGT CAGAAAGGGA GATGATGCTC ACATTGCTGC CAGTGTTTGC GCCTTGGATA ATATCAGCCT CTCAGAAGAT TTAAGGGTGA GGTACGACAT ACCTTTTGAT GGGAAAAAGT ATCCAGGATT CCGACGCTAT GGCAATGATG GCTATGGTGA ACAAATCAAC GATGGTAGTA ACTTTCGGGA TATTGACGAT GGCCAGAAAA AACTCCGCAG AGGTAGGATT TGGCCGTTCT TCACTGGTGA GCGTGGTCAT TATGAACTTG AGTTAGCTAA GGCCAAAAAT GGAGGCACTA TTAGCGATCA AGATGTTGCT AAACTACGTG ATACTTATGT ACGAGCAATG GAATATTTTG CTAACGAAAG TCTCATGCTT CCTGAACAAG TATGGGATGG TGTTGGCGAG AATAAGGCTC ATAATTACAT CACTGGAGAA GGAACGAATA GCGCCACACC TTTAGCTTGG GCGCACGCCG AATATATTAA GTTAGTCAAA TCATTGACTG ATAAGAATGT TTGGGATTCC TACCCCATTG TTCAGGCAAG ATACCAATCT TCGGAATCTC TTACTGCCTC AAGTTCTCAT CCGAACGGCA CTAAGATCAG CGCAAATCCT TAA
|
Protein sequence | MMIFTLKIAS LTFWKSIHLR LNFTHSKKQF IHMKRWLLAN LIALCSFLFL GTNAAWAVGG GIAPGAPGVS SVWSYAGKQG IGTSYEQYVD NKYSDRAPTN AISKVWFSIA QGIVTETAYG EIDRAQINDL QFLVTGNGFF DEEKVSTTSK VDYLDKDKDG RPLSPAYRVI NKDKDGKYTI EKHIFTDPDH QTLFTKVIFT AKEDNVTPYI LINPHMNNTG KEDVAFVKSD SLNAREGEIV YLSLKSSLPF VKTSAGYVAR SDGYQDLKDN GVMDWTYDYT DQSKPGSVAM IGQLPTLNKG QTSTFNIAVG FGSTYQEATE QVDASLKEGY ESLLAKYNGK GSAVGWEDYL ASLKNLPAMI ANTGDKGKQL YASAFTLKSM EDKENPGALI ASLSVPWGDT VNADTFATGY RAVWPRDFYQ AAMALLALGD KETPLTAFKY LPQVQVQSDT PGNAGATGWF LQKTHVDGTL EWLGVQLDQT AMPIMLGWKL WKAGVLSDGE ITEQYRTMLK PAAEFLANGG NVNIHTSDQA NNRKINPPST HQERWEEQSG YSPSTTAAII TGLVAAGDIA ENAADDPLAA KYYFSKADEY QKNVDKFMFT TTGDIKNCNS SSEYLLRVTP NADPNDGSRI HDNNSLLEAD ERQILDGGFL ELVRYGVRKG DDAHIAASVC ALDNISLSED LRVRYDIPFD GKKYPGFRRY GNDGYGEQIN DGSNFRDIDD GQKKLRRGRI WPFFTGERGH YELELAKAKN GGTISDQDVA KLRDTYVRAM EYFANESLML PEQVWDGVGE NKAHNYITGE GTNSATPLAW AHAEYIKLVK SLTDKNVWDS YPIVQARYQS SESLTASSSH PNGTKISANP
|
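
A GC content figure such as the one in the Gene Information record can be recomputed from a nucleotide sequence written in the space-separated 10-base blocks used above. This is a minimal sketch; `gc_content` is a hypothetical helper, and the demo input is only the first three blocks of the gene sequence, so its result is not expected to match the value reported for the full gene.

```python
def gc_content(blocked_sequence: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence.

    Whitespace (the 10-base block separators and line breaks) is ignored.
    Raises ValueError on an empty sequence.
    """
    bases = "".join(blocked_sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return gc / len(bases)


if __name__ == "__main__":
    # First three blocks of the gene sequence, for illustration only.
    prefix = "ATGATGATCT TCACTCTCAA AATTGCATCA"
    print(f"GC fraction of the 30-base prefix: {gc_content(prefix):.3f}")
```

To reproduce the figure for the whole gene, pass the full Gene sequence value, including both wrapped lines, to the same function.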