Gene Information |
Locus tag | Avin_33710 |
Symbol | algE5 |
ID | 7762266 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | - |
Start bp | 3445248 |
End bp | 3448244 |
Gene Length | 2997 bp |
Protein Length | 998 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 643806232 |
Product | Secreted mannuronan C5-epimerase |
Protein accession | YP_002800496 |
Protein GI | 226945423 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG5434] Endopolygalacturonase |
TIGRFAM ID | [TIGR03661] type 1 secretion C-terminal target domain (VC_A0849 subclass) |
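
The coordinate and length fields above are related by simple arithmetic, which can be checked with a minimal sketch. It assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical helpers for illustration and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the terminal stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the record above (Avin_33710).
length = gene_length(3445248, 3448244)
print(length, protein_length(length))
```

Under these assumptions the computed values agree with the Gene Length (2997 bp) and Protein Length (998 aa) fields in the record.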
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGATTACA ACGTCAAAGA TTTCGGGGCG CTGGGCGATG GCGTCAGCGA CGATACGGCC GCCATCCAGG CGGCGATCGA TGCCGCCTAC GCGGCCGGCG GCGGCACCGT CTACCTGCCG GCCGGCGAAT ACCGGGTCAG CGGTGGCGAG GAGCCTTCCG ACGGTTGCCT GACCATCAAG AGCAACGTCT ACATCGTCGG CGCGGGGATG GGCGAGACGG TGATCAAGCT GGTCGACGGC TGGGATCAGG ACGTCACCGG CATCGTCCGC TCGGCCTATG GCGAGGAGAC CAGCAACTTC GGCATGAGCG ACCTGACCCT CGACGGCAAC CGCGACAACA CCAGCGGCAA GGTCGACGGC TGGTTCAACG GCTACATTCC CGGCGAGGAC GGCGCCGACC GCGACGTGAC CCTGGAGCGG GTGGAAATCC GTGAAATGTC CGGTTACGGT TTCGATCCGC ACGAGCAGAC CATCAACCTG ACGATCCGCG ACAGCGTGGC CCACGACAAC GGCCTCGACG GCTTCGTCGC CGATTTCCAG ATCGGCGGGG TGTTCGAGAA CAACGTCTCG TACAACAACG ACCGCCACGG CTTCAACATC GTCACCAGCA CCAACGACTT CGTCCTGAGC AACAACGTCG CCTACGGCAA CGGCGGCGCC GGCCTGGTGA TCCAGCGCGG CTCCTACGAC GTGGCTCACC CCTACGGCAT CCTGATCGAC GGCGGCGCCT ACTACGACAA CGGCCTGGAA GGCGTGCAGA TCAAGATGGC CCACGACGTC ACCCTGCAGA ACGCCGAGAT CTACGGCAAC GGCCTCTATG GGGTGCGCGT CTACGGCGCC GAGGACGTGC AGATCCTCGA CAACTACATC CACGACAATT CGCAGAGCGG TTCCTACGCG GAAATCCTCC TGCAGTCCTA CGACGATACC GCCGGGGTGT CCGGCAATTT CTACACCACC ACCGGCACCT GGATCGAAGG CAACACCATC GTCGGCTCGG CCAACTCCAC CTACGGCATC CAGGAGCGCG CCGACGGCAC CGACTACAGC AGCCTCTACG CCAACAGCGT CAGCAATGTG CAGAGTGGCT CGGTGCGCCT CTACGGCACC AACTCCGTCG TCTCCGACCT GCCCGGCACC GGCCAGCAGG CGACCCTCGA AGGCACGACC GGCAACGACA CGCTGACCGG CAGCGACGCC CACGAGACGC TGCTCGGCCT GGACGGCGAT GACCGCCTGA ACGGCGGCGC CGGCAACGAC ATCCTCGACG GCGGGGCGGG GCGCGACAAC CTGACCGGCG GCGCGGGCGC CGACCTGTTC CGCGTCTCCG CGCGCACCGA CAGCTACCGC ACCGACAGCG CCAGCTTCAA CGACCTGATC ACCGACTTCG ACGCCGACGA GGACAGCATC GACCTGTCGG CGTTGGGCTT CACCGGGCTG GGCGACGGCT ACAACGGCAC CCTCGCCGTG GTGCTCAACA GCGCCGGGAC CCGCACCTAC CTGAAGAGCT ATGAGGCGGA TGCCGAGGGC CGGCGTTTCG AGATCGCCCT GGACGGCAAC TTCGCCGGCC TGCTCGACGA CGGCAACCTG ATCTTCGAGC GTCCCGTCAT CGAAGGGGAC GCCGGCAACA ACGCCCTGCT CGGCACCTCG GCCGCCGAGA CGCTGCTCGG CCATGCCGGC AACGATACGC TGGACGGCGC CGGCGGCGAC GACATCCTGG TCGGCGGTGC CGGGCGCGAC ACCCTCACCG GTGGGGCCGG GGCCGACCTG TTCCGCTTCG ACGCGCTGTC CGACAGCCAG 
CGCAACTACA CCACCGGCGA CAACCAGGGC GACCGCATCG TCGACTTCAG CGTGGGCGAA GACAAGCTCG ACGTGTCGGC GCTGGGCTTC ACCGGGCTGG GCGACGGCTA CAACGGCACC CTCGCCGTGG TGGTCAACAG CGCCGGCGAC CGCACCTACG TGAAAAGCTA CGAGAACGGC GCCGACGGCT ACCGCTTCGA GTTTTCCCTC GACGGCAACT ATCTGGAGCT GCTCGGCAAC GAGGATTTCA TCTTCGCCAC GCCCAGCGGC CAGCAACTCC TCGAAGGCAG CGCCGGCAAC GACAGCCTGC AGGGCACGGC CGCCGACGAA ATCGTCCACG GTGGGGCAGG GCGCGACACC CTGAGCGGCG GGGCCGGGGC CGACGTGTTC CGCTTCAGCG AACTGACCGA CAGCTACCGC ACCGCGAGTA CCAGCTTCGC CGATCTGATT ACCGACTTCG ATCTGGCCGA CGACCGCATC GACCTGTCCG GGCTCGGTTT CAGCGGCCTG GGCGACGGCT ACGACGGCAC CTTGGCCGTG GTGGTCAACA GCACCGGCAC CCGCACCTAC CTGAAGAGCT ACGAGGCCAA CGCCGCCGGC GAACGCTTCG AGATCGCCCT GGACGGCGAC CTGTCCGCGT TCACCGGGGC CAACCTGATC CTCGACGAGC GCGTCGTGCT GGAAGGCAGC GACGGCAACG ACACGCTCGA CGGCGGCAGT GCGGCCGAGG AATTGCTCGG CGGGGCCGGC AACGACAGCC TGGACGGCGG CGCCGGCAAC GATATCCTCG ACGGCGGCGC CGGGCGCGAC ACTCTCACCG GCGGCAGCGG CGCCGACGTG TTCCGCTACG ACGACGCACT CGACAGCTTC CGCAACTACG GCACCGGCGT GACCGGCACC GACACCATCA CCGACTTCAC CCCCGGCGAG GATCTGATCG ACCTGTCCGC GCTCGGCTAC ACCGGGCTGG GCGACGGTTA CAACGGCACC CTTGCCGTGG TGCTCAACGG CGACGGCACC AGAACCTACC TGAAGGACCG CGAGAGCGAC GCCGAAGGCA ACCAGTTCGA GATCGCCCTG GGCGGCGATC TCGTCGACCG GCTTGATGCG GGCGACTTCA TCTTTGCCGA GGCAGCCGCG ACCACCGCGA TCGAGGTGGT CGGCGGTACG CCGACCGAGG AGCAGTTGGT TGCTTGA
|
Protein sequence | MDYNVKDFGA LGDGVSDDTA AIQAAIDAAY AAGGGTVYLP AGEYRVSGGE EPSDGCLTIK SNVYIVGAGM GETVIKLVDG WDQDVTGIVR SAYGEETSNF GMSDLTLDGN RDNTSGKVDG WFNGYIPGED GADRDVTLER VEIREMSGYG FDPHEQTINL TIRDSVAHDN GLDGFVADFQ IGGVFENNVS YNNDRHGFNI VTSTNDFVLS NNVAYGNGGA GLVIQRGSYD VAHPYGILID GGAYYDNGLE GVQIKMAHDV TLQNAEIYGN GLYGVRVYGA EDVQILDNYI HDNSQSGSYA EILLQSYDDT AGVSGNFYTT TGTWIEGNTI VGSANSTYGI QERADGTDYS SLYANSVSNV QSGSVRLYGT NSVVSDLPGT GQQATLEGTT GNDTLTGSDA HETLLGLDGD DRLNGGAGND ILDGGAGRDN LTGGAGADLF RVSARTDSYR TDSASFNDLI TDFDADEDSI DLSALGFTGL GDGYNGTLAV VLNSAGTRTY LKSYEADAEG RRFEIALDGN FAGLLDDGNL IFERPVIEGD AGNNALLGTS AAETLLGHAG NDTLDGAGGD DILVGGAGRD TLTGGAGADL FRFDALSDSQ RNYTTGDNQG DRIVDFSVGE DKLDVSALGF TGLGDGYNGT LAVVVNSAGD RTYVKSYENG ADGYRFEFSL DGNYLELLGN EDFIFATPSG QQLLEGSAGN DSLQGTAADE IVHGGAGRDT LSGGAGADVF RFSELTDSYR TASTSFADLI TDFDLADDRI DLSGLGFSGL GDGYDGTLAV VVNSTGTRTY LKSYEANAAG ERFEIALDGD LSAFTGANLI LDERVVLEGS DGNDTLDGGS AAEELLGGAG NDSLDGGAGN DILDGGAGRD TLTGGSGADV FRYDDALDSF RNYGTGVTGT DTITDFTPGE DLIDLSALGY TGLGDGYNGT LAVVLNGDGT RTYLKDRESD AEGNQFEIAL GGDLVDRLDA GDFIFAEAAA TTAIEVVGGT PTEEQLVA
|
| |