Gene Information |
Locus tag | Avin_51420 |
Symbol | melA |
ID | 7763982 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | + |
Start bp | 5227140 |
End bp | 5228465 |
Gene Length | 1326 bp |
Protein Length | 441 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 643807961 |
Product | alpha-galactosidase |
Protein accession | YP_002802195 |
Protein GI | 226947122 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1486] Alpha-galactosidases/6-phospho-beta-glucosidases, family 4 of glycosyl hydrolases |
TIGRFAM ID | |
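The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # The stop codon is counted in the gene length but encodes no residue.
    return codons - 1 if has_stop_codon else codons


# Values taken from the record for Avin_51420 (melA).
length = gene_length(5227140, 5228465)
print(length, protein_length(length))  # 1326 441
```

Under these assumptions the result agrees with the listed gene length of 1326 bp and protein length of 441 aa.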
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGCTCA AAGCCACCAA GATCGCCCTG ATCGGCGCCG GCTCGACGGT ATTCATGAAG AACCTGCTGG GCGACCTGCT GCAGGTCGAG CTGCTGCGCA ACGCCCACAT CGCACTGATG GACATAGACC CGCACCGCCT GGAAACCTCG CAACTGGTGG CCGGCAAGAT CGCCGAGGCG CTCGGCGCCT CGCCCACCTT CGAGGCCACC ACCGACCGGC GGCGCGCCCT GGACGGCGCC GACTACGTGA TCACCATGAT CCAGGTGGCC GGCTACAAGC CCGGCACCGT CACCGACTTC GAGGTGCCGA AGCGCCACGG CCTGCGCCAG ACCATCGGCG ATACCCTGGG CATCGGCGGC ATCATGCGCG GCCTGCGTAC CGCGCCGGTG CTGGTGGACA TCGCCAGGGA CATGCGCGAA CTCTGCCCGA ATGCGCTGAT GCTGCAATAC GTCAATCCGA TGGCTATCAA TTGCCTGGCG CTGAATCGTT TCACTCCGGA AGTTCGCACC GTCGGACTCT GCCATTCCGT GCAGGGCACC GCCAAGGAAC TGGCCGAGGA CATCGGCGTA CCCATCGAGG AAATCCGCTA CCGCTGCGCG GGCATCAACC ACATGGCCTT CTACCTGTCC TTCGAGCGCC TGCTGCCGGA CGGTCGCACC GAGGACCTCT ACCCGCGCAT CCGCTCGGTG CTCGAGACCG GCAAGGTGCC CGACTGGAAC CGGGTCCGCT ACGAAGTGTT CAGGCACTTC GGCCATTTCG TCACCGAATC CTCCGAGCAC TTCGCCGAAT ACGTGCCCTG GTTCATCAAG CGCCATCGCC CCGACCTGAT CGAGCAGTTC AATATCCCGC TCGACGAATA CATCCGCCGC TGCGAGGTGC AGATCGGCGA CTGGGCCCAG CAGGAAAAAG CCCTGCTGGC CGGCGAAGGC CTCAAGGTCT GCCGCAGCCA CGAATACGCC TCGCAGATCA TCGAAGCCGA ACTGACCGGA AGGCCCACCC TGATCAACGG CAACGTCATG AATACCGGGC TGATCGCCAA CCTGCCGGAA AGCGCCTGCG TCGAGGTGCC CTGCGTGGTC GACCACAACG GCATCCAGCC GACCCGCATC GGCGACCTGC CAGTCGGCCT CGCCGCCCTG ATGCAGACCA ACATCAACGT GCAGACCCTG ACCGTGGAGG CCCTGGCCAG CGGCCGCCGC GAGCACGTCA AGCAGGCGGC GATGCTCGAC CCGCACACCG CCGCCGAACT CAGCCTCGAC GAGATAGACC GCCTGGTCGA CGAACTGATC CGGGCCCACG AAGGCTGGCT GCCGCACTTG AGCTAA
|
Protein sequence | MTLKATKIAL IGAGSTVFMK NLLGDLLQVE LLRNAHIALM DIDPHRLETS QLVAGKIAEA LGASPTFEAT TDRRRALDGA DYVITMIQVA GYKPGTVTDF EVPKRHGLRQ TIGDTLGIGG IMRGLRTAPV LVDIARDMRE LCPNALMLQY VNPMAINCLA LNRFTPEVRT VGLCHSVQGT AKELAEDIGV PIEEIRYRCA GINHMAFYLS FERLLPDGRT EDLYPRIRSV LETGKVPDWN RVRYEVFRHF GHFVTESSEH FAEYVPWFIK RHRPDLIEQF NIPLDEYIRR CEVQIGDWAQ QEKALLAGEG LKVCRSHEYA SQIIEAELTG RPTLINGNVM NTGLIANLPE SACVEVPCVV DHNGIQPTRI GDLPVGLAAL MQTNINVQTL TVEALASGRR EHVKQAAMLD PHTAAELSLD EIDRLVDELI RAHEGWLPHL S
|
| |
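The relationship between the gene sequence, the protein sequence, and the GC content field can be illustrated with a short script. This is a sketch under stated assumptions: it uses the standard codon assignments, which translation table 11 shares with table 1 for internal codons, and it ignores the alternative start codons that table 11 permits (the gene here starts with ATG, so this makes no difference). All names are hypothetical, and the 10-base spacing is stripped before processing.

```python
# Standard codon assignments in NCBI "TCAG" order; table 11 uses the same
# amino acids for internal codons (assumption: alternative starts ignored).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon from the first base, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        amino_acid = CODON_TABLE[s[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three 10-base blocks of the gene sequence in this record.
first_blocks = "ATGACGCTCA AAGCCACCAA GATCGCCCTG"
print(translate(first_blocks))  # MTLKATKIAL
```

Passing the full gene sequence to `translate` and `gc_fraction` would allow a comparison with the listed protein sequence and the 65% GC content; the short fragment is used here only to keep the example compact.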