Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avin_51250 |
Symbol | algE7 |
ID | 7763968 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | - |
Start bp | 5208142 |
End bp | 5210712 |
Gene Length | 2571 bp |
Protein Length | 856 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 643807948 |
Product | Secreted bifunctional mannuronan C-5 epimerase/alginate lyase |
Protein accession | YP_002802182 |
Protein GI | 226947109 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG5434] Endopolygalacturonase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAATACA ACGTTAAGGA TTTTGGTGCC AAGGGTGATG GCAAGACGGA CGACACGGAT GCCATACAGG CGGCGATAGA TGCCGCCCAC AAGGCGGGGG GCGGGACGGT ATACCTGCCG TCCGGCGAAT ATCGGGTCAG CGGTGGCGAC GAGGCCTCCG ACGGCGCTCT GATCATCAAG AGCAACGTCT ATATCGTCGG TGCCGGCATG GGCGAGACGG TGATCAAGCT GGTCGATGGG TGGGACGAAA AGCTCACCGG CATCATCCGC TCGGCCAACG GCGAGAAAAC CCATGATTAC GGTATCAGCG ACCTGACCAT CGACGGTAAC CAGGACAACA CCGAAGGCGA GGTCGACGGC TTCTATACCG GCTATATTCC CGGCAAGAAT GGCGCGGACT ACAACGTCAC GGTCGAACGG GTGGAGATCC GCGAGGTATC CCGCTACGCC TTCGATCCCC ACGAGCAGAC CATCAACCTG ACGATCCGCG ACAGCGTCGC CCACGACAAC GGCAAGGACG GGTTCGTCGC CGACTTCCAG ATCGGCGCCG TGTTCGAGAA CAACGTCTCG TACAACAACG GCCGCCACGG CTTCAACATC GTCACCAGCA GTCACGACAT CGTCTTCACC AACAACGTCG CCTACGGCAA CGGCGCCAAC GGCCTGGTGG TCCAGCGCGG CTCGGAAGAC CGGGACTTCG TCTACAACGT GGAGATCGAG GGCGGCTCCT TCCATGACAA CGGTCAGGAA GGCGTGCTGA TCAAGATGAG CACCGATGTC ACCCTGCAGG GCGCCGAGAT CTACGGCAAC GGCTACGCGG GCGTGCGCGT GCAGGGCGTC GAGGACGTGC GGATCCTCGA CAACTACATC CACGACAACG CACAGAGCAA GGCCAACGCG GAAGTCATCG TGGAATCCTA CGACGACCGC GACGGCCCGT CCGACGACTA CTACGAAACG CAGAACGTCA CGGTCAAGGG CAATACCATC GTCGGTTCGG CCAATTCCAC CTACGGCATC CAGGAGCGCG CCGACGGCAC CGACTACACC AGCATCGGCA ACAACAGCGT CAGCGGCACC CAGCGCGGGA TCGTGCAGCT CTCGGGGACG AACTCGACGT TCTCCGGCAG GTCGGGCGAT GCCTACCAGT TCATCGACGG CAGCACCGGC AATGACCTGC TGACCGGTAC CCCGATCGCC GATCTGATCG TGGGCGGCAG CGGCAACGAC ACCCTGAGCG GCGACGCCGG CAACGACGTT CTCGAAGGCG GTGCCGGCAG CGATCGCCTG ACCGGCGGCG AGGGCGCCGA CATCTTCCGC TTCACGGCGG TCAGCGACAG CTATTACACC GCCAGCAGCA GCGTCGCCGA CCAGATCCTC GACTTCGACG CCAGCAATGA TCGCATCGAC CTCACCGGGC TCGGCTTCAC CGGCCTGGGC GACGGCTACG GCGGCACCCT GGCCGTGCTG GCCAACAGCG ACGGCAGCCG CACCTATCTG CGCAGCTACG AGAAGGACGC CGACGGCCGC TATTTCTCGC TCACCCTGGA CGGCAACTTC GTCGGTCGGC TCGACGACAG CAACCTGGTC TTCAGGCACA AGACCATCGC CGGCACCGAG GGCGACGACA GCCTGACCGG CAACGCGATG GCGGAAATCC TCGACGGCGG CAGCGGCAAC GACAGCCTCG CGGGCGGTCT GGGCAACGAC GTGCTGAGAG GCGGTGCCGG CGACGACATC CTGAACGGCG GCCTGGGGCG CGACCAGCTC AGCGGCGGCG AAGGCGCGGA CATATTCCGC TTCACCAGCG TGGCCGACAG CTACCAGAAC TCGGGCGACA ACTTCTCCGA CCTGATTCTC GATTTCGACC CGGGCGAAGA CCGCATCGAT CTCAGCGGCC TGGGCTTCAG CGGCCTGGGC GACGGCCACA ACGGTACCCT GCTGCTCTGG ACCAGCAGCG AAACCAACCG CACCTATCTC AAGAACTTCG ACACGGATGC CGACGGCCGG CGCTTCGAGA TCGCCCTGGA GGGCGTCTTC TCCGACCTGA GCGAGAAGCA ACTGGTCTTC GAACGCCTGG TACTGGAGGG CACTCGCCTC GGCGACCAGC TTTCCGGCAC CGAGCTGAAC GAGGAACTGC TCGGCGGCGC GGGGCGCGAC ATCCTGAACG GCGGCGCCGG CGACGATATT CTCGATGGCG GTTCCGAACG CGACACCCTG ACCGGCGGCA GCGGCGCGGA CGTGTTCCGC TTCAACGCCA CGCTGGACAG CTTCCGCAAC TACGACAATG GGACGAGCCG GGTCGACGAC ATCACCGACT TCACCGTCGG CGAGGATCTG ATCGACCTCT CCGCCCTCGG CTATAGCGGC TTGGGCAACG GCTACGACGG CACGCTCGCC GTGCTGCTGA ATGCCGACGG CACCAAGACC TACCTCAAGG ACCGCGAAAG CGATGCGGAC GGCAACCACT TCGAGATCGC CCTGGACGGC AACTATGCCG ATCAGCTCTC CAACGGCGAC TTCATCTTCA CCAACCTCGA AGTGATCGGC AGCAGCTCGC AGGCTGCCTG A
|
Protein sequence | MEYNVKDFGA KGDGKTDDTD AIQAAIDAAH KAGGGTVYLP SGEYRVSGGD EASDGALIIK SNVYIVGAGM GETVIKLVDG WDEKLTGIIR SANGEKTHDY GISDLTIDGN QDNTEGEVDG FYTGYIPGKN GADYNVTVER VEIREVSRYA FDPHEQTINL TIRDSVAHDN GKDGFVADFQ IGAVFENNVS YNNGRHGFNI VTSSHDIVFT NNVAYGNGAN GLVVQRGSED RDFVYNVEIE GGSFHDNGQE GVLIKMSTDV TLQGAEIYGN GYAGVRVQGV EDVRILDNYI HDNAQSKANA EVIVESYDDR DGPSDDYYET QNVTVKGNTI VGSANSTYGI QERADGTDYT SIGNNSVSGT QRGIVQLSGT NSTFSGRSGD AYQFIDGSTG NDLLTGTPIA DLIVGGSGND TLSGDAGNDV LEGGAGSDRL TGGEGADIFR FTAVSDSYYT ASSSVADQIL DFDASNDRID LTGLGFTGLG DGYGGTLAVL ANSDGSRTYL RSYEKDADGR YFSLTLDGNF VGRLDDSNLV FRHKTIAGTE GDDSLTGNAM AEILDGGSGN DSLAGGLGND VLRGGAGDDI LNGGLGRDQL SGGEGADIFR FTSVADSYQN SGDNFSDLIL DFDPGEDRID LSGLGFSGLG DGHNGTLLLW TSSETNRTYL KNFDTDADGR RFEIALEGVF SDLSEKQLVF ERLVLEGTRL GDQLSGTELN EELLGGAGRD ILNGGAGDDI LDGGSERDTL TGGSGADVFR FNATLDSFRN YDNGTSRVDD ITDFTVGEDL IDLSALGYSG LGNGYDGTLA VLLNADGTKT YLKDRESDAD GNHFEIALDG NYADQLSNGD FIFTNLEVIG SSSQAA
|
| |