Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avin_49400 |
Symbol | algY |
ID | 7763794 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | - |
Start bp | 5002627 |
End bp | 5004225 |
Gene Length | 1599 bp |
Protein Length | 532 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 643807775 |
Product | Secreted mannuronan C5-epimerase-like protein |
Protein accession | YP_002802010 |
Protein GI | 226946937 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG5434] Endopolygalacturonase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGATTTCA ACGTCAAGGA TTTCGGGGCA CTGGGCGATG GCGTCAGCGA CGATCGGGCC GCCATCCAGG CGGCGATCGA TGCCGCCCAC GCGGCCGGCG GCGGTACCGT CCACCTGCCG GCCGGCGAGT ATCGGGTCAG CGGCGGCGAA CGGGGAGTGG ATGGCGCCCT GATGATGAAG AGCAATGTCT ACCTGGCGGG CGCCGGCATG GGCGAAACCG TCGTCAAGCT GCTCGATGGC TGGAACGGGC ATGTCAACGG CATGATCCGC TCGTCCGGAA CCGAGGAGAC GCACGACTTC GGCGTCCGCG ACCTGACCCT CGACGGCAAC CGCGACAACA ATCCCGAAGG CACGGTGTTC GGCTTCTATA CCGGCTACAA GTTCGGCGAC GGCGCCGATC GCAATGTCAT CGTCGAGCGG GTGGAGGCCC GCGAGATGTC CGGCTACGGC TTCGACCCGC ACGCGCGTAC CGTCAACCTG GTGATCCGCG ACAGCGTGGC CCACGACAAC GGCTTCGTCG GTTTCGTCGC CGACCACCAG ATCGACGGCG CGTTCGAGAA CAACGTCGCC TACAACAACG ACCTCCACGG TTTCAACGTG GTCACCAGCA GCCACGACTT CACCCTGAGC GACAACGTCG CCTACGGTAA CGGCGCCGCC GGCCTGGTGG TGCAACGCGG CTCGTACGAC GTGCCCCACG CCTACAATAT CCGGATCGAC GGCGGCTCCT ACCACGACAA CGCTCTGGAA GGCGTGCTGA TCAAGCTGAG CCACGACGTC ACCCTGCAGA ACGCCCATAT CTACGACAAC GGTACGGCCG GCGTACGCAT CGCCGGCGCA CAGGACGTGC AACTCCTGGA CAATCGGATC CACGATAACG TGCAGAACGG CACTTACCCG GAAGTCCTCC TGCAGGCCTT CGATGATTCC GGCATCACGG GGAATGTCTA CGAAACGCTG AACACCCTGA TCGAGGGCAA CCTCATCACC ACCTCGGGCG ACGCCACCTA CATCGTGCAG GAGCGCAACG ACGGCAGCGA CTACACCACG CTTCGCGACA ACGGCATCAG CGGCGGGCGG ATCGCCTCGG TGCAGCTCTC CGGCGCCCAC TCGTCCAGCG GGCCCCTGCG CGGCACGGAC GGCAACGACA CGCTGATCGG CGGCGCGGCC AACGAGCAGC TTCTCGGCGG CGCCGGCGCC GATCTGCTCG ACGGCGGTGC CGGGCGCGAC CGGCTGACCG GCGGCGAGGA GGCCGACACC TTCCGCTTCT CCGCACGCGA GGACAGCTAC CGCACCGCCA GCGAAAACTT CGCCGATCGG ATTCTCGACT TCGAGGCTGG TACGGATCGC ATCGACCTTT CGGCGCTCGG CTTCAGCGGG CTGGGCAACG GTCGCGACGG CACCCTGGCC GTACAGGTGA ACAGTGCCGG CACACGGACC TACCTGAAGA GTTTCGAGGC AAATGCCGCG GGCGAGCGCT TCGAGATCGC CCTGGAGGGC AACCACGCGG GGCTGGACGA ATCCAGCTTG GTCTTCGACG ACAGCGCCAC GGAGCTTGCG CTGGTAGGCA GTGCTCCGCA GACCGACCCG AGCGTCTGA
|
Protein sequence | MDFNVKDFGA LGDGVSDDRA AIQAAIDAAH AAGGGTVHLP AGEYRVSGGE RGVDGALMMK SNVYLAGAGM GETVVKLLDG WNGHVNGMIR SSGTEETHDF GVRDLTLDGN RDNNPEGTVF GFYTGYKFGD GADRNVIVER VEAREMSGYG FDPHARTVNL VIRDSVAHDN GFVGFVADHQ IDGAFENNVA YNNDLHGFNV VTSSHDFTLS DNVAYGNGAA GLVVQRGSYD VPHAYNIRID GGSYHDNALE GVLIKLSHDV TLQNAHIYDN GTAGVRIAGA QDVQLLDNRI HDNVQNGTYP EVLLQAFDDS GITGNVYETL NTLIEGNLIT TSGDATYIVQ ERNDGSDYTT LRDNGISGGR IASVQLSGAH SSSGPLRGTD GNDTLIGGAA NEQLLGGAGA DLLDGGAGRD RLTGGEEADT FRFSAREDSY RTASENFADR ILDFEAGTDR IDLSALGFSG LGNGRDGTLA VQVNSAGTRT YLKSFEANAA GERFEIALEG NHAGLDESSL VFDDSATELA LVGSAPQTDP SV
|
| |