Gene Information |
Locus tag | Avin_51200 |
Symbol | algE4 |
ID | 7763965 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | - |
Start bp | 5198816 |
End bp | 5200477 |
Gene Length | 1662 bp |
Protein Length | 553 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 643807945 |
Product | Secreted mannuronan C-5 epimerase |
Protein accession | YP_002802179 |
Protein GI | 226947106 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG5434] Endopolygalacturonase |
TIGRFAM ID | |
|
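The coordinate and length fields above can be cross-checked against each other. The sketch below is illustrative only: it assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon (the record lists the gene as unspliced). The function names `span_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
# Values copied from the Gene Information table for Avin_51200 (algE4).
START_BP = 5198816
END_BP = 5200477
GENE_LENGTH_BP = 1662
PROTEIN_LENGTH_AA = 553


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the stop codon


if __name__ == "__main__":
    # Both checks should pass if the record is internally consistent.
    assert span_length(START_BP, END_BP) == GENE_LENGTH_BP
    assert protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
    print("coordinates, gene length and protein length agree")
```

Under these assumptions, 5200477 - 5198816 + 1 gives 1662 bp, and 1662 / 3 - 1 gives 553 aa, matching the listed values.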
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.227069 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGATTACA ACGTCAAGGA TTTCGGTGCA TTGGGCGACG GCGTCAGCGA CGACCGGGCC GCCATCCAGG CGGCGATCGA TGCCGCCTAC GCCGCCGGTG GCGGTACCGT CTACCTGCCG GCCGGCGAGT ACCGGGTCAG CGCCGCCGGG GAGCCGGGCG ACGGCTGCCT GATGCTCAAG GACGGCGTCT ACCTGGCCGG TGCCGGCATG GGCGAGACGG TGATCAAGCT GATCGACGGC TCCGACCAGA AGATCACCGG CATGGTCCGC TCGGCCTACG GCGAGGAAAC CAGCAACTTC GGCATGCGCG ACCTGACCCT CGACGGCAAC CGCGACAACA CCAGCGGCAA GGTCGACGGC TGGTTCAACG GCTATATCCC CGGCGGGGAC GGCGCCGACC GCGACGTGAC CATCGAGCGG GTGGAGGTCC GCGAGATGTC CGGCTACGGC TTCGACCCCC ACGAGCAGAC CATCAACCTG ACGATCCGCG ACAGCGTGGC CCACGACAAC GGCCTCGACG GCTTCGTCGC CGACTACCTG GTCGACAGCG TGTTCGAGAA CAACGTCGCC TACGCCAACG ACCGCCACGG CTTCAACGTG GTCACCAGCA CCCACGATTT CGTCATGACC AACAACGTCG CCTACGGCAA CGGCAGCAGC GGCCTGGTGG TGCAGCGGGG TCTGGAGGAC CTCGCGCTGC CCAGCAACAT CCTGATCGAC GGCGGCGCCT ACTACGACAA CGCCCGCGAA GGCGTGCTGC TCAAGATGAC CAGCGACATC ACCCTGCAGA ACGCCGATAT CCACGGCAAC GGCTCCTCCG GGGTGCGCGT CTACGGCGCC GAGGACGTGC AGATCCTCGA TAACCAGATC CACGACAACG CGCAGGCGGC CGCCGTGCCC GAGGTCCTGC TGCAGTCCTT CGACGATACC GCCGGGGCGT CCGGCACCTA CTACACGACC CTGAACACCC GGATCGAGGG CAACACCATC AGCGGCTCGG CCAACTCCAC CTACGGCATC CAGGAGCGCA ACGACGGCAC CGACTACAGC AGCCTGATCG ACAACGACAT CGCCGGGGTG CAACAGCCCA TCCAACTGTA CGGACCTCAC TCGACGGTAT CCGGCGAACC CGGCGCGACA CCGCAACAGC CGTCCGCGGG AAGCGACGGC GAGCCACTGG TCGGCGGCGA CGCGGACGAC CAGCTCCAGG GCGGCTCCGG CGCCGATCGC CTGGACGGCG GGGCCGGCGA CGACATCCTC GACGGCGGCG CCGGGCGCGA CCGGCTGAGC GGCGGCGCGG GCGCCGACAC CTTCGTGTTT TCCGCCCGCG AGGACAGCTA CCGTACCGAC ACGGCGGTGT TCAACGACCT GATCCTCGAC TTCGAGGCCA GTGAGGATCG CATCGACCTG TCCGCGCTGG GCTTTTCCGG TCTGGGCGAC GGCTATGGCG GCACCCTGCT CCTGAAGACC AACGCCGAGG GCACGCGCAC CTACCTGAAA AGCTTCGAGG CGGATGCCGA GGGACGGCGC TTCGAGGTCG CCCTGGACGG CGACCACACG GGCGATCTTT CCGCCGCCAA TGTGGTCTTC GCCGCGACCG GGACGACCAC CGAACTCGAA GTGCTCGGCG ACAGCGGCAC GCAGGCCGGG ACGATCGTCT AG
|
Protein sequence | MDYNVKDFGA LGDGVSDDRA AIQAAIDAAY AAGGGTVYLP AGEYRVSAAG EPGDGCLMLK DGVYLAGAGM GETVIKLIDG SDQKITGMVR SAYGEETSNF GMRDLTLDGN RDNTSGKVDG WFNGYIPGGD GADRDVTIER VEVREMSGYG FDPHEQTINL TIRDSVAHDN GLDGFVADYL VDSVFENNVA YANDRHGFNV VTSTHDFVMT NNVAYGNGSS GLVVQRGLED LALPSNILID GGAYYDNARE GVLLKMTSDI TLQNADIHGN GSSGVRVYGA EDVQILDNQI HDNAQAAAVP EVLLQSFDDT AGASGTYYTT LNTRIEGNTI SGSANSTYGI QERNDGTDYS SLIDNDIAGV QQPIQLYGPH STVSGEPGAT PQQPSAGSDG EPLVGGDADD QLQGGSGADR LDGGAGDDIL DGGAGRDRLS GGAGADTFVF SAREDSYRTD TAVFNDLILD FEASEDRIDL SALGFSGLGD GYGGTLLLKT NAEGTRTYLK SFEADAEGRR FEVALDGDHT GDLSAANVVF AATGTTTELE VLGDSGTQAG TIV
|
| |