Gene Avin_28010 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAvin_28010 
SymbolglgE 
ID7761706 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAzotobacter vinelandii DJ 
KingdomBacteria 
Replicon accessionNC_012560 
Strand
Start bp2886656 
End bp2888659 
Gene Length2004 bp 
Protein Length667 aa 
Translation table11 
GC content67% 
IMG OID643805680 
Productfamily 13 glycosyl hydrolase 
Protein accessionYP_002799948 
Protein GI226944875 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0366] Glycosidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCCGACA CCAGCCAGCC CGGCGTGCAT TGCGAGAACC GTCGTCCGAC CCTGGAGGAA 
GCGCTGCGTC TACCGCGGAT CGCCATCGAG AACACCGAAC CGGTTGTCGA CAACGGACGC
TTCGCCGTCA AGGCGATCGC CGGCCTGCCG GTGACGGTGG CCAGCACCGT CTTCACCGAC
GGCCACGACC AACTCGGCGC CGCCCTCTAC TGGCGCGCCG ACGGCGAGAG CGGCTGGCAC
CGCCTGCGCA TGAAGTTCGT CGGCAACGAC CGCTGGCAGG TCGAGTTCAC GCCGCTGCGC
GTCGCCCGGC ATCTGTTCGT GGTCGAGGCC TGGTGGGATC TCTACGAAAC CTACCGGCAC
GAGTTGCAGA AGAAGCATGC CGCCGGGGTG CCGGTCTCCC TGGAGCTGCA GGAGGGCCGC
CTGCATCTGC AGCGGGCCGC CGAACACGCC CGGGGCGAGG TGCGCGCCAC GCTGGAGGAT
CTCCTGCGGC GTCTGGAGCA GGCCCCGCAC ACCGAGGCCG TGGAGTTGCT GCTGAGCGGC
GAGGCCTCCG CCGCCATGAG CGCCGCCGAC CCGTATCCGC ATCGCACCTA CAGCAATGCC
TTCCCCCTGG ACGTGGAGCG CGAACGGGCC CTGTTCGCTA GTTGGTACGA ACTCTTCCCC
CGCTCGCAGA CCGACAGCCC GCATCGCCAC GGCACCTTCA AGGATGTCAT CGCCCGCCTG
CCGGCGATCC ACGACATGGG TTTCGATGTC CTCTACTTCC CGCCGATCCA TCCGATCGGC
CGGCGCTTTC GCAAGGGCCG CAACAACAGC CTGGAGGCCG GGCCGGACGA TCCCGGCAGC
CCCTATGCCA TCGGCGGCGA GGAGGGCGGC CACGAGGCCA TCCATCCGCA ACTGGGCTCG
CGCGAGGATT TCCGCGAACT GGTCGCCGCC GCCGGCGAGT ACGATCTGGA GATCGCCCTG
GACTTCGCCG TGCAGTGCTC CCAGGACCAC CCCTGGCTGA AGCAGCATCC CGGCTGGTTC
TCCTGGCGCC CGGACGGCAG CATCCGCTAC GCCGAGAATC CGCCGAAGAA ATACCAGGAC
ATCGTCAACG TCGACTTCTA TGCCAAGGAC GCCATCCCCG ACCTCTGGAT GGCGCTGCGC
GACGTGGTGC TCGGCTGGGT CGCCGAGGGG GTGAAGATCT TCCGCGTCGA CAACCCACAC
ACCAAGCCGT TGCCGTTCTG GGAGTGGCTG ATCGGCGACG TGCGCGGCCG GCACCCGGAA
GTGATCTTTC TCGCCGAGGC CTTCACCCGC CCGGCGATGA TGCTGCGCCT GGGCAAGCTC
GGCTTCTCGC AGAGCTACAC CTATTTCACC TGGCGCAATG CCAAGGAAGA ACTGACCGCC
TACTTCGCCG AGCTGAACGA GGCGCCGGCC AGACACTGCG TGCGACCGAA CTTCTTCGTC
AACACGCCGG ACATCAACCC CTTCTTCCTG CAGCACTCCG GGCGCGCCGG CTTCCTGATC
CGCGCCGCGC TGGCGACCAT GGGCTCCGGG CTGTGGGGCA TGTACTCGGG GTTCGAGCTG
TGCGAGGCGG CGGCGCTGCC CGGCAGGGAG GAATACCTCG ACTCGGAGAA GTACGAGATC
CGCCCGCGCG ACTACCGGGC GCCCGGCAAC ATCGTCGCCG AGATCGCCCA GCTCAACCGC
ATCCGCCGCT ACAACCCGGC GCTGCAGACC CATCTCGGCT TCGAGCCGTA CAACATCTGG
AACGACAACA CCCTGCTGTT CGGCAAGCGT ACGCCGGACC TCTCCAACTT CGTCCTGGTC
GCCATCAACC TCGATCCCTG GCACGCCCAG GAGGCCCATT TCGAGCTGCC GCTCTGGGAG
TTCGGCCTGC CCGACCACGC TGACATGCAC GGCGAGGATC TGATGAACGG CCATCGCTGG
ACCTGGCACG GTAAGAAGCA GTGGACGCGC CTCGACCCCG ACTATCTGCC CTTCGGTATC
TGGAAGCTGA CCCCACCGGA GTAA
 
Protein sequence
MSDTSQPGVH CENRRPTLEE ALRLPRIAIE NTEPVVDNGR FAVKAIAGLP VTVASTVFTD 
GHDQLGAALY WRADGESGWH RLRMKFVGND RWQVEFTPLR VARHLFVVEA WWDLYETYRH
ELQKKHAAGV PVSLELQEGR LHLQRAAEHA RGEVRATLED LLRRLEQAPH TEAVELLLSG
EASAAMSAAD PYPHRTYSNA FPLDVERERA LFASWYELFP RSQTDSPHRH GTFKDVIARL
PAIHDMGFDV LYFPPIHPIG RRFRKGRNNS LEAGPDDPGS PYAIGGEEGG HEAIHPQLGS
REDFRELVAA AGEYDLEIAL DFAVQCSQDH PWLKQHPGWF SWRPDGSIRY AENPPKKYQD
IVNVDFYAKD AIPDLWMALR DVVLGWVAEG VKIFRVDNPH TKPLPFWEWL IGDVRGRHPE
VIFLAEAFTR PAMMLRLGKL GFSQSYTYFT WRNAKEELTA YFAELNEAPA RHCVRPNFFV
NTPDINPFFL QHSGRAGFLI RAALATMGSG LWGMYSGFEL CEAAALPGRE EYLDSEKYEI
RPRDYRAPGN IVAEIAQLNR IRRYNPALQT HLGFEPYNIW NDNTLLFGKR TPDLSNFVLV
AINLDPWHAQ EAHFELPLWE FGLPDHADMH GEDLMNGHRW TWHGKKQWTR LDPDYLPFGI
WKLTPPE