Gene Information |
Locus tag | Ava_4452 |
Symbol | |
ID | 3680386 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anabaena variabilis ATCC 29413 |
Kingdom | Bacteria |
Replicon accession | NC_007413 |
Strand | - |
Start bp | 5573021 |
End bp | 5576197 |
Gene Length | 3177 bp |
Protein Length | 1058 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 637719806 |
Product | glycoside hydrolase family protein |
Protein accession | YP_324945 |
Protein GI | 75910649 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0383] Alpha-mannosidase |
TIGRFAM ID | |
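The coordinates and lengths in the Gene Information table can be cross-checked with simple arithmetic. The sketch below is only an illustration. It assumes that Start bp and End bp are 1-based inclusive coordinates, and that the CDS is unspliced and ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
# Values copied from the Gene Information table for Ava_4452.
START_BP = 5573021
END_BP = 5576197
GENE_LENGTH_BP = 3177
PROTEIN_LENGTH_AA = 1058


def span_length(start: int, end: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS: number of codons minus one stop codon.

    Assumes the gene length includes exactly one terminal stop codon.
    """
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    length = span_length(START_BP, END_BP)
    protein = expected_protein_length(length)
    print(f"span length: {length} bp (table: {GENE_LENGTH_BP} bp)")
    print(f"expected protein length: {protein} aa (table: {PROTEIN_LENGTH_AA} aa)")
```

Under these assumptions the computed values agree with the Gene Length and Protein Length rows. The strand (-) does not affect the length arithmetic, only the orientation in which the sequence is read.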
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGACCCCTA CCGAATCTCA GTATCAGACA GATTTAATCT CAGACACAAT TGAGAAATTA CGCAGTTGTT GTCAAGTTAA TGTTCAATCT ACTTGGTTAT ACCAGGACTC GAACACAGAA ATTACTGGCG TTGCTACATC TAGTATATCC AATTGGCAAC CTGTAGAGTT AGATACCAAG GGTAACATTG CTTGGACTGG TGGACAGCAA GTACTTTGGC TAGAACAGAA ATTCGTAGTT CCCCAAAATT TACATGATTA TCCTTTGGCG GGGTTGTCTT TGCGGCTGTC TCTACTTTGG TGGGCGGACT CTGCCAAAAT CTACGTGAAT GGGCAATTAG TGCTAGAAGG AGATTTATTT GATTGTTCCC CCAGAGTTTT ACTCAGTCGG GAAGTATCAC CAGGACAAGA ATTTGTAGTG GCTTTGCGGC TGGCGAGTCC TGGTCATTGT GATGGTGCTT TAGTGCGATC GCTCCTTGTC TACGAGTCTA CAGATTATAA TTATCCTGAC CCCGGTTTTA TTGCCGATGA GTTAGCAATA TTACAGCTTT ATTTAGAAAA GTTCGCCCCA GAAAAGTTAA ATATCCTCAC ACAAGCAATC CCAGAAATTC ATCCCTCCAA CCCAGAATCT CTAGTTACCT TCCGTCAAAA CCTGATAAAT CATCTCTCTA TCAGCGACCC AAAATTTAAA ATCTATTTAT TAGGTCACGC TCATTTAGAT TTAGCATGGC TATGGTCAGT CAGTGAAACT TGGAATGCAG CCCAAAACAC TTTTACATCA GTCCTCAAAC TACAACAAGA TTTTCCCGAA TTAATCTTCT GTCATTCCAC CCCAGCCCTG TATGCTTGGA TTGAAGAACA TCGTCCAGAT TTATTTACAG CGATTCAACA AGCTGTAGCT GCCAAAAAAT GGGAAGTTAT CGGCGGTTTT TGGGTAGAAC CTGACCTTAA TTTAATCGCT GGGGAATCCA TAGTCCGTCA GTTACTATAC GGTCAACGCT ATTTCCAAGA AAAATTTGGC AAACTGACGA CTGTAGTTTG GGTTCCCGAC ACCTTTGGTT TTTGTGCAAC CCTACCGCAA TTTTTGGCGA ATGCAGGTAT TGAGTATTTC GTGACGCAAA AATTACGATG GAACGATACT ACTAAATTTG ATTATGGGGC TTTTTGGTGG CGATCGCCTG ACGGTAGTCA AGTATTAAGT GTTATGTCTG CAACCATAGG CGAAGGCATT GACCCCATCA AAATGGCAGC CTATTCTCTA GAATGGCAAA CCCAAACCTG CTTAACTCAA TCTTTATGGC TTCCTGGTGT CGGTGACCAC GGCGGCGGCC CCACCCGTGA TATGTTAGAA ACCGCCCAAC GCTGGCAAAC TTCACCATTT TTCCCAGACC TAGAATTTAT CACCGCCGAA AAGTATCTCC AGCAAATTCA GTCAACGGTC AATGGTCAAC AGTCAATAGT CAATAGTCAA CGGTCAACAT TTCCTATATG GAATGATGAA CTATATCTAG AATTTCATCG TGGTTGTTAC ACTACCCACG CAGACCAAAA ACGTTGGAAT CGGCATTCTG AAAATTTACT ATACGAAGCC GAACTATTTG CTACCTTGGC AACTTTTATC TGTGGCGTGA CATATCCCAA ATCCGACATC GAAACAGCTT GGAAGCAAGT ATTATTTAAC CAATTCCACG ATATTTTACC TGGTTCTTCC ATTACCCAAG TATATACAGA TGCCTTGCCC GAATGGCAGC AAGTCGAACA AACGGGAACC AAAATATTAA AAGAATCATT ACAGGCGATC 
GCATCTCACT TTACTCTACC AGAGCCACCA AAAACCGATA GTCTACCCAT TTTCGTTTTC AATTCTCTCA ACTGGCAGCG CTCTGAGGTA GTATCAGTCA CCCTACCCCC ACCACCACCT AACCAACAAT GGCAAGTCTA CGATACTACT GGCAAACAAA TCATTTCCCA ATTAACTGAA CCATCAACCA TACTATTCCT CGCCGAAGAT ATTCCCTCTG TAGGCTATCG CCTCTTTTGG CTTTCCCCCA CATCGCCCAC ATCTTCCACA TCGCCCACAT CTTCCACATC CCTAGACTAT ATTCTCGAAA ATGAACACCT GCGCGTTATT GTAGATCCTG ATACTGGAGA TTTATCAAGT ATCTATGACA AAACTCATCA ACGAGAAGTA TTGTCTGGTG CGGGTAATCA ACTACAAGCT TTTAAAGACA GTGGTCAATA TTGGGATGCT TGGAATATCG ACCCCAATTA TAGTCAGCAT CCCCTACCAG CAACTAACCT CAAATCTATC CAGTGGTTAG AACAAGGAAC TGTACAGAAT TGTCTCCGCG TAGTGCGTCA ATTGGGTAAG TCGGAATTTT GCCAAGACTA TATCTTGCAA GTCGGCTTAC CCCAATTGAA AATCGTTTCT AGAGTCAATT GGCAAGAAAA GCACGTTTTA GTCAAAGCAG CGTTTCCTCT CAACGTTACA GCCGACTTTG CCACCTACGA AATTCCCTGC GGTGCAATTC GTCGCCCGAC TCAACCCCAA ACCCCGCAGG ATAAAGCAAA ATGGGAAGTC CCAGCTTTAC GTTGGGCTGA TTTAACAGCA GAGACAGATG AGGGTCTTTA CGGTGTTAGT TTACTGAATG ATTGTAAATA TGGTTACGAC AGTCAACCCC AGCTATTAAG GCTAACCTTA CTCCGTAGCC CTACTTGGCC TGACCCAGAA GCTGACACAG GCGGCATACA CGAATTTGCT TATACTGTGT ATCCTCACGC TGATAGCTGG GAATCAGCCC ATACAGTACA AAAGGGATAT GAATTAAACA TTCCCCTGCA AGTAATATTA AACCCAACTC AACACTTCCA ACTCAACACT TCCAAATCAA CACCAAACAC CAGAGACAAA GCAAGTTTTT TAAATTTACC AGCCGAGAAT CTGGTCTTGA TGGCTGTCAA ACCATCGGAA GACGACCAGC AGCAATTAAT TCTGCGCTTT TATGAATCTC ATGGTGTGAC TACAGAATTA TCTTTGCAGA GCGATTTAAA GTTAACCTTG GGTATTCCAG TAGATTTACT GGAACGCCCC ATTAGCCAAT TCTCATCTGG GCAACAAATC TCCACAATTG AACCTTGGAA AATTGCGACT TTTAAAGTTT TAGAGGTCAG AGGCTAG
Protein sequence | MTPTESQYQT DLISDTIEKL RSCCQVNVQS TWLYQDSNTE ITGVATSSIS NWQPVELDTK GNIAWTGGQQ VLWLEQKFVV PQNLHDYPLA GLSLRLSLLW WADSAKIYVN GQLVLEGDLF DCSPRVLLSR EVSPGQEFVV ALRLASPGHC DGALVRSLLV YESTDYNYPD PGFIADELAI LQLYLEKFAP EKLNILTQAI PEIHPSNPES LVTFRQNLIN HLSISDPKFK IYLLGHAHLD LAWLWSVSET WNAAQNTFTS VLKLQQDFPE LIFCHSTPAL YAWIEEHRPD LFTAIQQAVA AKKWEVIGGF WVEPDLNLIA GESIVRQLLY GQRYFQEKFG KLTTVVWVPD TFGFCATLPQ FLANAGIEYF VTQKLRWNDT TKFDYGAFWW RSPDGSQVLS VMSATIGEGI DPIKMAAYSL EWQTQTCLTQ SLWLPGVGDH GGGPTRDMLE TAQRWQTSPF FPDLEFITAE KYLQQIQSTV NGQQSIVNSQ RSTFPIWNDE LYLEFHRGCY TTHADQKRWN RHSENLLYEA ELFATLATFI CGVTYPKSDI ETAWKQVLFN QFHDILPGSS ITQVYTDALP EWQQVEQTGT KILKESLQAI ASHFTLPEPP KTDSLPIFVF NSLNWQRSEV VSVTLPPPPP NQQWQVYDTT GKQIISQLTE PSTILFLAED IPSVGYRLFW LSPTSPTSST SPTSSTSLDY ILENEHLRVI VDPDTGDLSS IYDKTHQREV LSGAGNQLQA FKDSGQYWDA WNIDPNYSQH PLPATNLKSI QWLEQGTVQN CLRVVRQLGK SEFCQDYILQ VGLPQLKIVS RVNWQEKHVL VKAAFPLNVT ADFATYEIPC GAIRRPTQPQ TPQDKAKWEV PALRWADLTA ETDEGLYGVS LLNDCKYGYD SQPQLLRLTL LRSPTWPDPE ADTGGIHEFA YTVYPHADSW ESAHTVQKGY ELNIPLQVIL NPTQHFQLNT SKSTPNTRDK ASFLNLPAEN LVLMAVKPSE DDQQQLILRF YESHGVTTEL SLQSDLKLTL GIPVDLLERP ISQFSSGQQI STIEPWKIAT FKVLEVRG