Gene Avin_09230 details


Gene Information

Locus tag: Avin_09230
Symbol: hpaI
ID: 7759871
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Azotobacter vinelandii DJ
Kingdom: Bacteria
Replicon accession: NC_012560
Strand:
Start bp: 868662
End bp: 869567
Gene Length: 906 bp
Protein Length: 301 aa
Translation table: 11
GC content: 70%
IMG OID: 643803835
Product: 2,4-dihydroxyhept-2-ene-1,7-dioic acid aldolase
Protein accession: YP_002798137
Protein GI: 226943064
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG3836] 2,4-dihydroxyhept-2-ene-1,7-dioic acid aldolase
TIGRFAM ID: [TIGR02311] 2,4-dihydroxyhept-2-ene-1,7-dioic acid aldolase
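
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive (an assumption, though it is consistent with the stated 906 bp) and that the CDS ends in a single stop codon. The function names are illustrative and not part of any database API.

```python
# Values copied from the record above; helper names are hypothetical.
START_BP = 868662
END_BP = 869567
GENE_LENGTH_BP = 906
PROTEIN_LENGTH_AA = 301


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS, excluding the one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons should hold if the record is internally consistent.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

With the values in this record, 869567 - 868662 + 1 gives 906 bp, and 906 / 3 - 1 gives 301 aa.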


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.380791
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCCGCGC CGGTACTCGC GGCGACCTCG CCGGGTGCCG GGCGGGCCAT CCACCTCATC 
AATCCCGCCA TGCCCGCATT CCGCGCGGCT TTCGAGGAGA CACTCATGAA AATGCCGCAC
AACGCCTTCA AGGCGGCGCT GCAACGACCG GAAACCCAAT ACGGCATCTG GGCCGGCTTC
GCCAGCGGCT ATGCCGCCGA AATCGTCGCC GGCACCGGCT ACGACTGGAT GCTGATCGAC
GGCGAGCACG CGCCCAACAG CGTGCCGACC ATCCTGGCCC AATTGCAGAG CGTGGCGCCG
TATCCGACCC AGCCGGTGGT GCGGCCGGTC TGTGGCGATC CGGTACTGAT CAAGCAACTG
CTGGATATCG GCGCGCAGAC GCTGATGGTG CCGATGGTGG AAAGCGCCGA GCAGGCGAGG
GCGCTGGTGC GCGCCATGCG CTACCCGCCG CACGGCATCC GCGGCGTCGG CGGCGGCCTG
GCCCGCGCCA CCCGCTGGGA CGGTGTGCCC GACTACCTGA ACACCGCCCA TGAGGAGCTG
TGCCTGATCG TCCAGGTGGA ATCGCGTGCC GGGGTCGAGA ACGTCGAGGC GATCGCCGCC
GTGGAAGGCG TCGACGCGGT GTTCATCGGC CCGGCCGATC TTTCCATCGG CCTCGGCCAT
CCCGGCGATC CGGGCCATCC GCAGGTGCAG GAGCTTATCC ATCACGCCAT CGAGGCCACC
CGCGCCGCCG GCAAGGCCTG CGGCATCCTC GCCCCGCACG AGGAGGACGC CCGCCGCTAC
CGGGAATGGG GCTGCCGGTT CATCGCCGTC GCCATCGACA TCAGCCTGCT GCGCCAGGGC
GCGCTGGCCG GCCTGGCGCG CTTCCGCGAC ACTCCGGCGT CCGACGCGCC CTCGCGCACC
TACTGA
 
Protein sequence
MPAPVLAATS PGAGRAIHLI NPAMPAFRAA FEETLMKMPH NAFKAALQRP ETQYGIWAGF 
ASGYAAEIVA GTGYDWMLID GEHAPNSVPT ILAQLQSVAP YPTQPVVRPV CGDPVLIKQL
LDIGAQTLMV PMVESAEQAR ALVRAMRYPP HGIRGVGGGL ARATRWDGVP DYLNTAHEEL
CLIVQVESRA GVENVEAIAA VEGVDAVFIG PADLSIGLGH PGDPGHPQVQ ELIHHAIEAT
RAAGKACGIL APHEEDARRY REWGCRFIAV AIDISLLRQG ALAGLARFRD TPASDAPSRT
Y
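
The protein sequence and GC content reported in this record can in principle be recomputed from the gene sequence. The sketch below is one way to do that, assuming the sequence is pasted in as shown (blocks of 10 bases separated by whitespace) and using the amino acid assignments of NCBI translation table 11. It does not handle alternative start codons, which is not a problem here because this gene starts with ATG. The names `gc_fraction` and `translate` are illustrative only.

```python
from itertools import product

# NCBI translation table 11 amino acids, in TCAG codon order
# (first base varies slowest, third base fastest).
BASES = "TCAG"
TABLE_11_AAS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), TABLE_11_AAS)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First 30 bases of the gene sequence listed above.
    print(translate("ATGCCCGCGC CGGTACTCGC GGCGACCTCG"))
```

Passing the full gene sequence to `translate` and `gc_fraction` allows a comparison with the Protein sequence and GC content fields of this record.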