Gene Avin_51420 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAvin_51420 
SymbolmelA 
ID7763982 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAzotobacter vinelandii DJ 
KingdomBacteria 
Replicon accessionNC_012560 
Strand
Start bp5227140 
End bp5228465 
Gene Length1326 bp 
Protein Length441 aa 
Translation table11 
GC content65% 
IMG OID643807961 
Productalpha-galactosidase 
Protein accessionYP_002802195 
Protein GI226947122 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1486] Alpha-galactosidases/6-phospho-beta-glucosidases, family 4 of glycosyl hydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGCTCA AAGCCACCAA GATCGCCCTG ATCGGCGCCG GCTCGACGGT ATTCATGAAG 
AACCTGCTGG GCGACCTGCT GCAGGTCGAG CTGCTGCGCA ACGCCCACAT CGCACTGATG
GACATAGACC CGCACCGCCT GGAAACCTCG CAACTGGTGG CCGGCAAGAT CGCCGAGGCG
CTCGGCGCCT CGCCCACCTT CGAGGCCACC ACCGACCGGC GGCGCGCCCT GGACGGCGCC
GACTACGTGA TCACCATGAT CCAGGTGGCC GGCTACAAGC CCGGCACCGT CACCGACTTC
GAGGTGCCGA AGCGCCACGG CCTGCGCCAG ACCATCGGCG ATACCCTGGG CATCGGCGGC
ATCATGCGCG GCCTGCGTAC CGCGCCGGTG CTGGTGGACA TCGCCAGGGA CATGCGCGAA
CTCTGCCCGA ATGCGCTGAT GCTGCAATAC GTCAATCCGA TGGCTATCAA TTGCCTGGCG
CTGAATCGTT TCACTCCGGA AGTTCGCACC GTCGGACTCT GCCATTCCGT GCAGGGCACC
GCCAAGGAAC TGGCCGAGGA CATCGGCGTA CCCATCGAGG AAATCCGCTA CCGCTGCGCG
GGCATCAACC ACATGGCCTT CTACCTGTCC TTCGAGCGCC TGCTGCCGGA CGGTCGCACC
GAGGACCTCT ACCCGCGCAT CCGCTCGGTG CTCGAGACCG GCAAGGTGCC CGACTGGAAC
CGGGTCCGCT ACGAAGTGTT CAGGCACTTC GGCCATTTCG TCACCGAATC CTCCGAGCAC
TTCGCCGAAT ACGTGCCCTG GTTCATCAAG CGCCATCGCC CCGACCTGAT CGAGCAGTTC
AATATCCCGC TCGACGAATA CATCCGCCGC TGCGAGGTGC AGATCGGCGA CTGGGCCCAG
CAGGAAAAAG CCCTGCTGGC CGGCGAAGGC CTCAAGGTCT GCCGCAGCCA CGAATACGCC
TCGCAGATCA TCGAAGCCGA ACTGACCGGA AGGCCCACCC TGATCAACGG CAACGTCATG
AATACCGGGC TGATCGCCAA CCTGCCGGAA AGCGCCTGCG TCGAGGTGCC CTGCGTGGTC
GACCACAACG GCATCCAGCC GACCCGCATC GGCGACCTGC CAGTCGGCCT CGCCGCCCTG
ATGCAGACCA ACATCAACGT GCAGACCCTG ACCGTGGAGG CCCTGGCCAG CGGCCGCCGC
GAGCACGTCA AGCAGGCGGC GATGCTCGAC CCGCACACCG CCGCCGAACT CAGCCTCGAC
GAGATAGACC GCCTGGTCGA CGAACTGATC CGGGCCCACG AAGGCTGGCT GCCGCACTTG
AGCTAA
 
Protein sequence
MTLKATKIAL IGAGSTVFMK NLLGDLLQVE LLRNAHIALM DIDPHRLETS QLVAGKIAEA 
LGASPTFEAT TDRRRALDGA DYVITMIQVA GYKPGTVTDF EVPKRHGLRQ TIGDTLGIGG
IMRGLRTAPV LVDIARDMRE LCPNALMLQY VNPMAINCLA LNRFTPEVRT VGLCHSVQGT
AKELAEDIGV PIEEIRYRCA GINHMAFYLS FERLLPDGRT EDLYPRIRSV LETGKVPDWN
RVRYEVFRHF GHFVTESSEH FAEYVPWFIK RHRPDLIEQF NIPLDEYIRR CEVQIGDWAQ
QEKALLAGEG LKVCRSHEYA SQIIEAELTG RPTLINGNVM NTGLIANLPE SACVEVPCVV
DHNGIQPTRI GDLPVGLAAL MQTNINVQTL TVEALASGRR EHVKQAAMLD PHTAAELSLD
EIDRLVDELI RAHEGWLPHL S