Gene Information |
Locus tag | Avin_51190 |
Symbol | algE1 |
ID | 7763964 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | - |
Start bp | 5194329 |
End bp | 5198540 |
Gene Length | 4212 bp |
Protein Length | 1403 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 643807944 |
Product | Secreted mannuronan C-5 epimerase |
Protein accession | YP_002802178 |
Protein GI | 226947105 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG5434] Endopolygalacturonase |
TIGRFAM ID | |
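The Start bp, End bp, Gene Length, and Protein Length fields above can be cross-checked against one another. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the unspliced CDS ends in a single stop codon that is not counted in the protein length; neither convention is stated in the record itself. The function names `gene_length` and `protein_length` are hypothetical helpers introduced only for this illustration.

```python
# Values copied from the Gene Information table for Avin_51190 (algE1).
START_BP = 5194329
END_BP = 5198540
REPORTED_GENE_LENGTH = 4212      # bp
REPORTED_PROTEIN_LENGTH = 1403   # aa


def gene_length(start: int, end: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the stop codon, which encodes no residue.
    return cds_length // 3 - 1


if __name__ == "__main__":
    length = gene_length(START_BP, END_BP)
    residues = protein_length(length)
    print("gene length matches record:", length == REPORTED_GENE_LENGTH)
    print("protein length matches record:", residues == REPORTED_PROTEIN_LENGTH)
```

Because the strand is `-`, the gene lies on the reverse strand of NC_012560; the length arithmetic is the same on either strand, so the check does not depend on orientation.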
| |
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.156968 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGATTACA ACGTCAAGGA TTTCGGAGCA CTGGGCGATG GCGTCAGCGA CGACACGGCG GCCATCCAGG CGGCGATCGA CGCCGCCCAC GCGGCGGGCG GCGGCACCGT CTACCTGCCG GCCGGCGAAT ATCGGGTCAG CGGCGGCGAG GAGCCTTCCG ATGGTTGTCT GACCATCAAG AGCAACGTCC ATATCGTCGG CGCCGGGATG GGCGAGACGG TCATCAAGCT GGTCGATGGC TGGGAGCAGG ACGTCACCGG CATGGTGCGC TCGGCTTATG GCGAGGAGAC CAGCAACTTC GGCATGAGCG ACCTGACCCT CGACGGCAAC CGCGACAACG TGTCTGCCAA GGTCGACGGC TGGTTCAACG GCTACATCCC CGGCCAGGAC GGCGCCGACC GCGACGTGAC CCTGGAGCGG GTGGAAATCC GCGAAATGTC CGGTTACGGT TTCGACCCGC ACGAGCAGAC CATCAACCTG ACGATCCGCG ACAGCGTGGC TCACGACAAC GGCCTCGACG GCTTCGTCGC CGACTACCAG GTCGGCGGGG TGTTCGAGAA CAACGTCTCG TACAACAACG ACCGCCACGG CTTCAACATC GTCACCAGCA CCAACGACTT CGTCCTGAGC AACAACGTCG CCTACGGCAA CGGCGGCGCC GGCCTGGTGG TACAGCGCGG GTCATCCGAC CTGCCCCATC CCTACGACAT CCTGATCGAC GGCGGCGCCT ACTACGACAA CGGCCTGGAA GGCGTGCAGC TCAAGATGGC CCACGACGTC ACCCTGCAGA ACGCCGAGAT CTACGGCAAC GGCCTGTACG GGGTGCGCGT CTACGGCGCC CAGGACGTGC AAATCCTCGA CAACCAGATC CACGACAATT CGCAGAACGG CGCCTATGCC GAAGTCCTGC TGCAGTCCTA CGACGACACC GCCGGGGTGT CCGGCAACTT CTACGCCACC ACCGGCACCT GGATCGAGGG CAACACCATC GTCGGCTCGG CCAATTCCAC CTACGGCATC CAGGAGCGCG CCGACGGCAC CGACTACAGC AGCCTCTACG CCAACAGCAT CGACGGTGTG CAGACCGGGG CGGTACGGCT GTATGGCGCC AACTCGACGG TTTCCAGCCA GTCCGGCACC GGCCAGCAGG CGACCCTCGA AGGCAGCGCG GGCAACGATG CGCTGAGCGG GACCGAGGCC CACGAGACGC TGCTCGGCCA GGCCGGCGAC GACCGCCTGA ACGGCGATGC CGGCAACGAC ATCCTCGACG GCGGGGCAGG GCGCGACAAC CTGACCGGCG GCGCGGGCGC CGACACCTTC CGCTTCTCCG CGCGCACCGA CAGCTACCGC ACCGACAGCG CCAGCTTCAA CGACCTGATC ACCGACTTCG ACGCCGACGA GGACAGCATC GACCTGTCCG CGCTGGGCTT CACCGGCCTG GGCGACGGCT ACAATGGCAC CCTGCTGCTG AAGACCAACG CCGAGGGTAC GCGCACCTAC CTGAAGAGCT ACGAAGCGGA CGCCCAGGGC CGGCGCTTCG AGATCGCCCT GGACGGCAAC TTCACCGGTC TGTTCAACGA CAACAACCTG TTGTTCGACG CCGCTCCGGC CACCGGTACC GAGGGCAGCG ACAACCTGCT CGGCACCGAT GCCAACGAGA CCCTCCTGGG CTACGGCGGC AACGACACCC TCAACGGCGG GGCCGGCGAC GACATCCTGG TCGGCGGCGC CGGGCGCGAC ACCCTCACCG GTGGCGCCGG GGCGGACGTG TTCCGCTTCG ACGCGCTGTC CGACAGCCAG 
CGCAACTACA CCACCGGCGA CAACCAGGCC GACCGCATTC TCGACTTCGA CCCGACCCTG GACAGGATCG ACGTATCGGC GCTGGGCTTC ACCGGGCTGG GCAACGGCCG CAACGGCACC CTCGCCGTGG TGCTCAACAG CGCCGGCGAC CGCACCGATC TGAAGAGTTA CGACACCGAC GCCAACGGCT ACAGCTTCGA GCTTTCCCTC GCGGGCAACT ACCAGGGGCA GCTCAGCGCC GAGCAGTTCG TTTTCGCGAC GTCTCAGGGG GGACAGATGA CGATTATCGA AGGCACCGAC GGCAACGATA CCTTGCAGGG CACCGAGGCC AACGAGCGGC TCCTCGGCCT GGACGGCCGG GACAACCTGA ACGGCGGCGC CGGCAACGAC ATCCTCGACG GCGGAGCGGG GCGCGACACC CTGACCGGCG GCACGGGCGC CGACACCTTT CTGTTCTCCA CGCGTACCGA CAGCTACCGC ACCGACAGCG CCAGCTTCAA CGACCTGATC ACCGACTTCG ATCCCACCCA GGACCGCATC GACCTGTCCG GCCTGGGCTT CAGCGGTTTC GGCAACGGCT ACGACGGCAC CCTGCTGCTG CAGGTCAACG CCGCGGGCAC CCGCACCTAC CTGAAGAGTT TCGAGGCCGA TGCCAACGGC CAGCGCTTCG AGATCGCCCT GGACGGCGAC TTCAGCGGCC AACTGGACAG CGGCAACGTG ATCTTCGAGC CCGCCGTGTT CAATGCCAAG GACTTCGGCG CGCTGGGCGA CGGCGCCAGC GACGACCGGC CGGCCATCCA GGCGGCGATC GACGCCGCCT ACGCGGCCGG TGGCGGCACC GTCTACCTGC CGGCCGGCGA GTACCGGGTC AGCCCCACCG GGGAGCCGGG CGACGGCTGC CTGATGCTCA AGGACGGCGT CTACCTGGCC GGCGACGGCA TAGGCGAAAC GGTCATCAAG CTGATCGACG GCTCCGACCA GAAGATCACC GGCATGGTCC GCTCGGCCTA TGGCGAAGAG ACCAGCAACT TCGGCATGCG CGACCTGACC CTCGACGGCA ACCGCGACAA CACCAGCGGC AAGGTCGACG GCTGGTTCAA CGGCTACATC CCCGGCCAGG ACGGCGCCGA CCGCAACGTG ACCATCGAGC GGGTGGAAAT CCGCGAGATG TCCGGCTACG GCTTCGATCC GCACGAGCAG ACCATCAACC TGACGATCCG CGACAGCGTG GCCCACGACA ACGGCCTCGA CGGCTTCGTC GCCGACTATC TGGTCGACAG CGTGTTCGAG AACAACGTCG CCTACAACAA CGACCGCCAC GGCTTCAACG TGGTCACCAG CACCTACGAT TTCGTCATGA CCAACAACGT CGCCTACGGC AACGGCGGCG CCGGCCTGAC GATCCAGCGG GGCTCGGAGG ACCTGGCCCA GCCGACCGAT ATCCTGATCG ACGGCGGCGC CTACTACGAC AACGCCCTGG AAGGCGTGCT GTTCAAGATG ACCAACAACG TCACCCTGCA GAACGCCGAG ATCTACGGCA ACGGCTCCTC CGGCGTGCGC CTGTACGGCA CGGAGGACGT GCAGATCCTC GACAACCAGA TTCACGACAA TTCGCAGAAC GGCACCTATC CGGAAGTCCT GCTGCAGGCC TTCGACGACA GCCAGGTCAC CGGTGAGCTG TACGAGACCC TGAACACCCG GATCGAAGGC AATCTCATCG ACGCTTCGGA CAACGCCAAC TATGCGGTGC GCGAGCGCGA CGACGGCAGC GACTACACCA CGCTCGTGGA CAACGACATC AGCGGCGGCC 
AGGTCGCCTC GGTGCAGCTT TCCGGCGCCC ATTCGAGTCT TTCCGGCGGC ACCGTCGAAG TGCCGCAGGG AACCGACGGC AACGACGTGC TGGTCGGCAG CGATGCCAAC GACCAGCTCT ACGGCGGAGC CGGCGACGAC CGCCTGGACG GCGGCGCCGG TGACGACCAG CTCGACGGCG GAGCGGGGCG CGACGACCTG ACCGGCGGCA CGGGTGCCGA CACCTTCGTG TTCGCCGCGC GTACCGATAG CTACCGCACC GACGCGGGGG TGTTCAACGA CCTGATCCTC GACTTCGACG CCAGCGAGGA CCGCATCGAC CTGTCCGCCC TGGGCTTCAG CGGCTTCGGC GACGGCTACA ACGGCACCCT GCTGGTGCAG CTCAGCAGCG CCGGAACCCG TACCTACCTC AAGAGCTACG AGGAGGACCT CGAGGGCCGG CGCTTCGAGG TCGCCCTGGA CGGCGACCAC ACGGGCGATC TTTCCGCCGC CAATGTGGTT TTCGCCGACG ACGGCTCGGC CGCCGTGGCG AGCAGCGATC CCACCGCCAC ACAGTTGGAG GTGGTCGGCA GCAGCGGCAC CCAGACCGAT CAACTCGCCT GA
|
Protein sequence | MDYNVKDFGA LGDGVSDDTA AIQAAIDAAH AAGGGTVYLP AGEYRVSGGE EPSDGCLTIK SNVHIVGAGM GETVIKLVDG WEQDVTGMVR SAYGEETSNF GMSDLTLDGN RDNVSAKVDG WFNGYIPGQD GADRDVTLER VEIREMSGYG FDPHEQTINL TIRDSVAHDN GLDGFVADYQ VGGVFENNVS YNNDRHGFNI VTSTNDFVLS NNVAYGNGGA GLVVQRGSSD LPHPYDILID GGAYYDNGLE GVQLKMAHDV TLQNAEIYGN GLYGVRVYGA QDVQILDNQI HDNSQNGAYA EVLLQSYDDT AGVSGNFYAT TGTWIEGNTI VGSANSTYGI QERADGTDYS SLYANSIDGV QTGAVRLYGA NSTVSSQSGT GQQATLEGSA GNDALSGTEA HETLLGQAGD DRLNGDAGND ILDGGAGRDN LTGGAGADTF RFSARTDSYR TDSASFNDLI TDFDADEDSI DLSALGFTGL GDGYNGTLLL KTNAEGTRTY LKSYEADAQG RRFEIALDGN FTGLFNDNNL LFDAAPATGT EGSDNLLGTD ANETLLGYGG NDTLNGGAGD DILVGGAGRD TLTGGAGADV FRFDALSDSQ RNYTTGDNQA DRILDFDPTL DRIDVSALGF TGLGNGRNGT LAVVLNSAGD RTDLKSYDTD ANGYSFELSL AGNYQGQLSA EQFVFATSQG GQMTIIEGTD GNDTLQGTEA NERLLGLDGR DNLNGGAGND ILDGGAGRDT LTGGTGADTF LFSTRTDSYR TDSASFNDLI TDFDPTQDRI DLSGLGFSGF GNGYDGTLLL QVNAAGTRTY LKSFEADANG QRFEIALDGD FSGQLDSGNV IFEPAVFNAK DFGALGDGAS DDRPAIQAAI DAAYAAGGGT VYLPAGEYRV SPTGEPGDGC LMLKDGVYLA GDGIGETVIK LIDGSDQKIT GMVRSAYGEE TSNFGMRDLT LDGNRDNTSG KVDGWFNGYI PGQDGADRNV TIERVEIREM SGYGFDPHEQ TINLTIRDSV AHDNGLDGFV ADYLVDSVFE NNVAYNNDRH GFNVVTSTYD FVMTNNVAYG NGGAGLTIQR GSEDLAQPTD ILIDGGAYYD NALEGVLFKM TNNVTLQNAE IYGNGSSGVR LYGTEDVQIL DNQIHDNSQN GTYPEVLLQA FDDSQVTGEL YETLNTRIEG NLIDASDNAN YAVRERDDGS DYTTLVDNDI SGGQVASVQL SGAHSSLSGG TVEVPQGTDG NDVLVGSDAN DQLYGGAGDD RLDGGAGDDQ LDGGAGRDDL TGGTGADTFV FAARTDSYRT DAGVFNDLIL DFDASEDRID LSALGFSGFG DGYNGTLLVQ LSSAGTRTYL KSYEEDLEGR RFEVALDGDH TGDLSAANVV FADDGSAAVA SSDPTATQLE VVGSSGTQTD QLA