Gene Avin_05220 details


Gene Information

Locus tag: Avin_05220
Symbol:
ID: 7759478
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Azotobacter vinelandii DJ
Kingdom: Bacteria
Replicon accession: NC_012560
Strand:
Start bp: 500421
End bp: 501332
Gene Length: 912 bp
Protein Length: 303 aa
Translation table: 11
GC content: 70%
IMG OID: 643803442
Product: 5-dehydro-4-deoxyglucarate dehydratase
Protein accession: YP_002797750
Protein GI: 226942677
COG category: [E] Amino acid transport and metabolism; [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0329] Dihydrodipicolinate synthase/N-acetylneuraminate lyase
TIGRFAM ID: [TIGR03249] 5-dehydro-4-deoxyglucarate dehydratase
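
The length fields in this record follow from the coordinates by simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive (an assumption, though it reproduces the 912 bp reported here) and that the final codon of the CDS is a stop codon. The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


length_bp = gene_length(500421, 501332)
print(length_bp)                  # 912
print(protein_length(length_bp))  # 303
```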


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a

Sequence

Gene sequence
ATGACGCCAC AAGAACTCAA GTCCATCCTT TCCTCCGGGC TGCTGTCCTT CCCGGTCACC 
GATTTCGATG CCGCCGGCGA TTTCGACGCC GAGTCCTACG CCCGCCGCCT GGAATGGCTG
GCGCCCTACG GCGCCAGTGC GCTGTTCGCC GCCGGCGGCA CCGGCGAGTT CTTCTCGCTG
GGCCTCGACG AATACCCGCG GATCATCAAG ACCGCGGTCG ACACCTGCGC CGGCAGCGTG
CCGATCCTCG CCGGCGTCGG CGGCCCGACC CGCCAGGCCA TCCACATGGC CCAGGAGGCC
GAGCGGCTCG GCGCCAAGGG CCTGCTGCTG CTGCCGCACT ACCTGACCGA GGCCAGCCAG
GAGGGCGTCG CCGCGCATGT CGAGGCGGTC TGCAGGGCGG TGAAGATCGG CGTGGTGGTC
TACAACCGCA ACGTCTGCCG GCTGACCCCG GCGCTGCTCG AACAACTGGC CGAGCGTTGC
CCGAACCTGG TCGGCTACAA GGACGGCCTT GGCGAGATCG AGCTGATGGT GTCGGTCCGC
CACCGCCTGG GCGAGCGCTT CGCCTACCTC GGCGGCCTGC CGACCGCCGA GGTCTACGCC
GCGGCCTACA AGGCGCTGGG CGTGCCGGTC TACTCCTCGG CGGTGTTCAA CTTCATCCCG
CGCACGGCGA TGGAGTTCTA CAAGGCGGTG GCCGCCGACG ACCAGGTCAC CGTGGGCCGG
CTGATCGACG ACTTCTTCCT GCCGCTGCTG GAGATCCGCA ACCGCCGCGC CGGTTATGCG
GTGAGCATCG TCAAGGCCGG GGTGAGGGTG ATCGGCCACG ACGCCGGTCC GGTGCGCGCG
CCGCTGACCG ACCTGCTGCC GGACGAGTAC GAGCGTCTCG CCGCGCTGAT CAGGAAGCTC
GGCCCGCAGT GA
 
Protein sequence
MTPQELKSIL SSGLLSFPVT DFDAAGDFDA ESYARRLEWL APYGASALFA AGGTGEFFSL 
GLDEYPRIIK TAVDTCAGSV PILAGVGGPT RQAIHMAQEA ERLGAKGLLL LPHYLTEASQ
EGVAAHVEAV CRAVKIGVVV YNRNVCRLTP ALLEQLAERC PNLVGYKDGL GEIELMVSVR
HRLGERFAYL GGLPTAEVYA AAYKALGVPV YSSAVFNFIP RTAMEFYKAV AADDQVTVGR
LIDDFFLPLL EIRNRRAGYA VSIVKAGVRV IGHDAGPVRA PLTDLLPDEY ERLAALIRKL
GPQ
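
The protein sequence can be checked against the gene sequence by translating it codon by codon. The following is a minimal sketch, not the database's own method: it builds the codon table from the usual TCAG ordering (translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs only in its permitted start codons), strips the spaces that group the sequence into blocks of ten, and stops at the first stop codon. The helper names `clean`, `translate`, and `gc_percent` are hypothetical.

```python
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

# Map each codon to its amino acid using the conventional TCAG ordering.
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove whitespace (block separators, line breaks) and upper-case."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate in frame 1 and stop at the first stop codon."""
    seq = clean(seq)
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[pos:pos + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First three and last two blocks of the gene sequence above.
print(translate("ATGACGCCAC AAGAACTCAA GTCCATCCTT"))  # MTPQELKSIL
print(translate("GGCCCGCAGT GA"))                     # GPQ
```

Passing the full gene sequence to `translate` and `gc_percent` should allow the reported protein sequence and GC content to be compared with the record.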