Gene Avin_08740 details

Gene Information

Locus tag: Avin_08740
Symbol: xylK
ID: 7759824
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Azotobacter vinelandii DJ
Kingdom: Bacteria
Replicon accession: NC_012560
Strand:
Start bp: 829300
End bp: 830349
Gene Length: 1050 bp
Protein Length: 349 aa
Translation table: 11
GC content: 67%
IMG OID: 643803788
Product: 4-hydroxy-2-ketovalerate aldolase
Protein accession: YP_002798090
Protein GI: 226943017
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0119] Isopropylmalate/homocitrate/citramalate synthases
TIGRFAM ID: [TIGR03217] 4-hydroxy-2-oxovalerate aldolase
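
The start, end, gene length, and protein length fields in this record are related by simple arithmetic. The following is a minimal sketch of that consistency check, assuming the coordinates are 1-based and inclusive and that the gene length includes one terminal stop codon. The function names are hypothetical and are not part of any database API.

```python
# Values copied from the record; helper names are illustrative only.
START_BP = 829300
END_BP = 830349


def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count, assuming the CDS ends with a single stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length_bp(START_BP, END_BP)
print(length_bp)                     # 1050
print(protein_length_aa(length_bp))  # 349
```

Under these assumptions the computed values agree with the listed gene length (1050 bp) and protein length (349 aa).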


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACTTTCG ATCCGCACAA GAAGCTGTAC ATCTCCGACG TGACCCTGCG CGACGGCAGC 
CATGCCGTGC GTCACCAGTA CTCGATCAGG AACGTCCAGG ACATCGCCCG CGCACTGGAC
AAGGCGAAAG TGGATTCCAT CGAGGTCGCC CACGGCGACG GCCTGCAGGG TTCGAGCTTC
AACTACGGCT TCGGCGCGCA CAGCGACATC GAGTGGATCG AGGCGGTGGC CGAGGTGGTG
ACTCATGCCA GGATCGCCAC CCTGTTGCTG CCCGGCATCG GCACCGTCCA CCACCTCAAG
GAGGCCTACG ACGCCGGCGC GCGCATCGTC CGGGTGGCCA CCCACTGCAC CGAGGCGGAC
GTGTCCAGAC AGCACATCGC CTACGCGCGC GAGTTGGGCA TGGACACCGT GGGCTTCCTG
ATGATGAGCC ACATGACCAC GCCGCAGAAC CTCGCCGTCG AGGCGAAGAA GATGGAAAGC
TACGGCGCCA CCTGCATCTA CGTGGTCGAC TCCGGCGGGG CCTTGAGCAT GCAGGACGTG
CGCGAGCGCT TCCGCGCGGT CAAGGACCTG CTGGAGCCTT CGACCCAGAC CGGCATCCAC
GCCCACCACA ACCTCAGCCT CGGGGTGGCC AACTCCATCG TCGCGGTGGA GGAGGGCTGC
GACCGCATCG ACGCCAGCCT CGCTGGCATG GGCGCGGGGG CGGGCAATGC GCCGCTGGAG
GTGTTCGTCG CCGCGGCCGA GCGGCTGGGC TGGAACCACG GCACCGACCT CTACACCCTG
ATGGACGCCG CCGACGAGAT CGTCCGGCCG TTGCAGGACC GCCCGGTACG GGTCGACCGC
GAGACGCTGG CGCTGGGTTA TGCCGGGGTC TATTCGAGCT TTCTGCGCCA CGCCGAGGTG
GCGGCGAGCA AATATGGCCT GAGCACCGTG GACATCCTGG TCGAACTGGG CCGGAGGCGG
ATGGTCGGCG GCCAGGAGGA TATGATCGTC GATGTGGCGC TGGATCTGCT GCGCCAGCGG
GGAGACGCTG CCCGGCAGGC CGCGGTGTAA
 
Protein sequence
MTFDPHKKLY ISDVTLRDGS HAVRHQYSIR NVQDIARALD KAKVDSIEVA HGDGLQGSSF 
NYGFGAHSDI EWIEAVAEVV THARIATLLL PGIGTVHHLK EAYDAGARIV RVATHCTEAD
VSRQHIAYAR ELGMDTVGFL MMSHMTTPQN LAVEAKKMES YGATCIYVVD SGGALSMQDV
RERFRAVKDL LEPSTQTGIH AHHNLSLGVA NSIVAVEEGC DRIDASLAGM GAGAGNAPLE
VFVAAAERLG WNHGTDLYTL MDAADEIVRP LQDRPVRVDR ETLALGYAGV YSSFLRHAEV
AASKYGLSTV DILVELGRRR MVGGQEDMIV DVALDLLRQR GDAARQAAV
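
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below builds the codon table by hand and assumes that translation table 11 assigns the same amino acids to internal codons as the standard code, differing only in which start codons it permits. It also includes a GC-fraction helper for comparison with the GC content field. All names are hypothetical, and only the first 60 bases of the gene sequence are used as input.

```python
# Codon table in TCAG order; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the 10-base block layout."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate complete codons in frame 1; a trailing partial codon is ignored.

    Alternative start codons (for example GTG) are not converted to M here.
    """
    dna = clean(seq)
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(seq)
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence above.
first_line = "ATGACTTTCG ATCCGCACAA GAAGCTGTAC ATCTCCGACG TGACCCTGCG CGACGGCAGC"
print(translate(first_line))  # MTFDPHKKLYISDVTLRDGS
```

The output matches the first 20 residues of the protein sequence. Passing the full gene sequence to `translate` should give the complete protein followed by `*` for the terminal TAA codon.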