Gene Avin_22040 details

Gene Information

Locus tag: Avin_22040
Symbol:
ID: 7761122
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Azotobacter vinelandii DJ
Kingdom: Bacteria
Replicon accession: NC_012560
Strand:
Start bp: 2202322
End bp: 2203926
Gene Length: 1605 bp
Protein Length: 534 aa
Translation table: 11
GC content: 66%
IMG OID: 643805089
Product: NAD-dependent aldehyde dehydrogenase
Protein accession: YP_002799370
Protein GI: 226944297
COG category: [C] Energy production and conversion
COG ID: [COG1012] NAD-dependent aldehyde dehydrogenases
TIGRFAM ID:
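
The coordinates and lengths listed above are related by simple arithmetic, and the following minimal sketch shows one way to cross-check them. It assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length; neither convention is stated in the record, and the function names are illustrative only.

```python
# Sketch: check that the listed coordinates and lengths agree.
# Assumption: coordinates are 1-based and inclusive (not stated in the record).

def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature spanning start_bp..end_bp inclusive."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length in aa, assuming the last codon is an uncounted stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(2202322, 2203926)
print(length)                     # 1605
print(protein_length_aa(length))  # 534
```

Both values match the Gene Length and Protein Length fields of this record.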


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATAGTTT TTCCTAGTAT TCAGGACATT CCGGAGAAGT ACCGCCTGGG CGCGCCCATC 
GAACAGCGCG ACTACCTGGT CGACGGCGAA CTGCGCCGCT GGGACGGCCC GCTGGCCGCC
GTGCGCAGCC CCATCCACCT GAAGACCGCC AAGGGCGACG AGCAGGTCGT CCTCGGCAGC
ACCCCGCTGC TCGACGCCCA GGCTGCGCTG GGCGCCCTGG ACGCCGCGGT CAGGGCCTAC
GACAACGGCC AGGGCCTGTG GCCGAGCATG CCGGTGGCCG GGCGCATCCA GCACGTCGAG
ACCTTCCTGG CCCGCATGCG TGAACAGCGC GAGGCGGTGG TCAAACTGCT GATGTGGGAG
ATCGGCAAGA ACCTCAAGGA CGCCGAGAAG GAATTCGACC GCACCTGCGA CTACATCGTC
GACACCATCC ACGAACTCAA GGAACTCGAC CGCCGCTCCA GCCGCTTCGA GCTGGAGCAG
GGCACCCTCG GCCAGATCCG CCGCGTGCCG CTGGGCGTGG CGCTGTGCAT GGGCCCCTAC
AACTACCCGC TGAACGAGAC CTTCACCACC CTGATCCCGG CGCTGATCAT GGGCAACACC
GTGGTGTTCA AGCCGGCCAA GTTCGGCGTG CTGCTGATCC GCCCGCTGCT CGAGGCGTTC
CGCGACAGCT TCCCGGCCGG GGTGATCAAC GTCATCTACG GGCGCGGCCG CGAGACCGTC
AGCGCGCTGA TGGAAAGCGG CAAGGTGGAC GTGTTCGCCT TCATCGGCAC CAACAAGGGC
GCCAGCGACC TGAAGAAGCT GCACCCGCGC CCGCACCGCC TGCGCGCAAT CCTCGGCCTG
GACGCCAAGA ACCCCGGCAT CGTCCTGCCC GAGGTGGACC TGGACAACGC GGTCGGCGAG
GCGATCACCG GCGCCCTGTC GTTCAACGGC CAGCGCTGCA CGGCGCTGAA GATCCTCTTC
GTCCACGAAC AAGTGGTCGA CGCCTTCCTG GAGAAGTTCA ACGCCAGGCT GGCCGCGCTC
AAGTCGGGCA TGCCCTGGGA ACCAGGGGTG GCGCTGACCC CGTTGCCGGA GCCGGGCAAG
ACCGATTTTC TCGCCACCCT GGTGGCCGAC GCCCTGGCCA AGGGCGCGAA GGTGGTCAAC
CCCGGCGGTG GCGAGGTGCG CGAGACCTTC TTCTACCCGG CGCTGCTCTA CCCGGTGAGT
CCGCAGATGC GCGTCTACCA GGAGGAGCAG TTCGGCCCGC TGATCCCGGT GGTGCCTTAC
CGCGACCTGC AGACGGTGAT CGACTACGTG CGCGAGTCGG ACTTCGGCCA GCAGCTGTCG
ATCTTCGGCA ACGACCCGCA GCAGGTCGCC AGGCTGGTGG ATGCCTTCGC CAACCAGGTC
GGGCGGATCA ACCTCAACAC CCAGTGCCAG CGCGGGCCGG ACAGCTTCCC GTTCAACGGC
CGCAAGAACT CGGCGGAGGG GACTCTGTCG GTGTACGACG CGCTGCGGGC GTTCTCGATC
CGCACGCTGG TGGCGACCAA GCTTCAGGAG GACAACAAGC AGTTGATCAG CGACATCATC
CGCAACCGCG AGTCGAGCTT CCTGACCACC GATTATCTTT TTTGA
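
The GC content reported under Gene Information could be recomputed from the gene sequence above. The sketch below is one simple way to do that, assuming the sequence is pasted as plain text with the 10-base spacing used here; `gc_percent` is a hypothetical helper name, and the example call uses only the first 20 bases rather than the whole gene.

```python
# Sketch: percentage of G and C bases in a nucleotide string.
# Whitespace (the 10-base grouping and line breaks) is ignored.

def gc_percent(seq: str) -> float:
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# First 20 bases of the gene sequence: 5 of 20 are G or C.
print(gc_percent("ATGATAGTTT TTCCTAGTAT"))  # 25.0
```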
 
Protein sequence
MIVFPSIQDI PEKYRLGAPI EQRDYLVDGE LRRWDGPLAA VRSPIHLKTA KGDEQVVLGS 
TPLLDAQAAL GALDAAVRAY DNGQGLWPSM PVAGRIQHVE TFLARMREQR EAVVKLLMWE
IGKNLKDAEK EFDRTCDYIV DTIHELKELD RRSSRFELEQ GTLGQIRRVP LGVALCMGPY
NYPLNETFTT LIPALIMGNT VVFKPAKFGV LLIRPLLEAF RDSFPAGVIN VIYGRGRETV
SALMESGKVD VFAFIGTNKG ASDLKKLHPR PHRLRAILGL DAKNPGIVLP EVDLDNAVGE
AITGALSFNG QRCTALKILF VHEQVVDAFL EKFNARLAAL KSGMPWEPGV ALTPLPEPGK
TDFLATLVAD ALAKGAKVVN PGGGEVRETF FYPALLYPVS PQMRVYQEEQ FGPLIPVVPY
RDLQTVIDYV RESDFGQQLS IFGNDPQQVA RLVDAFANQV GRINLNTQCQ RGPDSFPFNG
RKNSAEGTLS VYDALRAFSI RTLVATKLQE DNKQLISDII RNRESSFLTT DYLF
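
The protein sequence above is the translation of the gene sequence under translation table 11. A minimal sketch of that translation follows, assuming that table 11 assigns codons to amino acids exactly as the standard code does (the two differ in which codons may act as start codons). The names `CODON_TABLE` and `translate` are illustrative, and the two example calls use only the first and last lines of the gene sequence.

```python
from itertools import product

# Codon-to-amino-acid assignments in the usual TCAG ordering.
# Assumption: translation table 11 shares these assignments with the
# standard code and differs only in its set of start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon
            break
        protein.append(aa)
    return "".join(protein)


first_line = "ATGATAGTTT TTCCTAGTAT TCAGGACATT CCGGAGAAGT ACCGCCTGGG CGCGCCCATC"
last_line = "CGCAACCGCG AGTCGAGCTT CCTGACCACC GATTATCTTT TTTGA"
print(translate(first_line))  # MIVFPSIQDIPEKYRLGAPI
print(translate(last_line))   # RNRESSFLTTDYLF
```

The two outputs correspond to the first 20 and last 14 residues of the protein sequence listed above.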