Gene Avin_22370 details

Gene Information

Locus tag: Avin_22370
Symbol:
ID: 7761155
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Azotobacter vinelandii DJ
Kingdom: Bacteria
Replicon accession: NC_012560
Strand:
Start bp: 2237049
End bp: 2238089
Gene Length: 1041 bp
Protein Length: 346 aa
Translation table: 11
GC content: 68%
IMG OID: 643805122
Product: Aldo/keto reductase
Protein accession: YP_002799403
Protein GI: 226944330
COG category: [C] Energy production and conversion
COG ID: [COG0667] Predicted oxidoreductases (related to aryl-alcohol dehydrogenases)
TIGRFAM ID:
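
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the unspliced CDS ends in a single stop codon that is not counted in the protein length; the function name `cds_lengths` is hypothetical and not part of any database API.

```python
def cds_lengths(start_bp, end_bp):
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and one terminal stop codon.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Subtract one codon for the stop codon, which encodes no residue.
    protein_len = gene_len // 3 - 1
    return gene_len, protein_len


# Values taken from the record: Start bp 2237049, End bp 2238089.
gene_len, protein_len = cds_lengths(2237049, 2238089)
print(gene_len, protein_len)  # 1041 346
```

Under these assumptions the result agrees with the listed Gene Length (1041 bp) and Protein Length (346 aa).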


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATTTACC GCCCACTCGG CAGGACCGAT CTCCAGGTCA GCGCGCTGGC GCTCGGCAGC 
ATGACCTGGG GCGAACAGAA CGGCGAAGCC GAAGGCTTCG AGCAGATCCG CCGCGCCCAG
GCGGCCGGGA TCAATTTCAT CGATACCGCC GAGATGTACC CGGTGCCGCC GAAGGCGGAA
ACCGCCGGCG CCACCGAGAC CATCATCGGC AACTACTTCA AGCGCCACGG CGGACGCGGC
GACTGGGTCC TCGCCACCAA GGCGGCGCCG CCCGGCAACG GCATCACCCA CATCCGCGAC
GGCCGCAACC ACTTCGACCG GGCCAACCTG ACCGCCGCCG TGGACGCCAG CCTCGCGCGC
CTGCAGACCG ACTACATCGA CCTCTACCAG TTGCACTGGC CGGACCGGCA GACCAACTTC
TTCGGCCAGC TCGGCTACCG GCACGACCCG GACGCCCACC TCACGCCGAT CGAGGAGACG
CTGGAGGTCC TCGACGGCCT GGTGAAGAGC GGCAAGATCC GCCATATCGG CCTGTCCAAC
GAAACGCCCT GGGGCGTGCA CCGCTTCCTG CACCTGGCCG AGACCCGCGG CTGGCCGCGG
GTGGTGTCGA TCCAGAACCC CTACAACCTG CTCAACCGCA GCTTCGAGGT GGGCCTGGCG
GAAATCGCCA TCCGCGAACG GGTCGGCCTG CTCGCCTACT CGCCACTGGC CTTCGGCCTG
CTCTCCGGCA AGTACGAGAA CGGCGCGCAA CCGCCCAAAG CACGGCTGAC CCTGTTCGAG
CGCTTCCAGC GCTACAACAG CCCGCAGGCG CGCCGCGCCG CCAGCGCCTA CGTCGCCCTG
GCCCGCGAGC ACGGCGTCGA TCCGGCGCGA CTGGCGCTGG CCTACGTCAC CAGCCGGCCG
TTCCTCACCA GCAACATCAT CGGCGCCACC ACGCTGGAGC AACTGGACAG CGACATCGCC
AGCCTGGAAC TGAAACTCAG CGACGAACTG CTCGCCGGCA TCGAGCGCAT CCACACAGAG
CAACCCAACC CGGCGCCCTG A
 
Protein sequence
MIYRPLGRTD LQVSALALGS MTWGEQNGEA EGFEQIRRAQ AAGINFIDTA EMYPVPPKAE 
TAGATETIIG NYFKRHGGRG DWVLATKAAP PGNGITHIRD GRNHFDRANL TAAVDASLAR
LQTDYIDLYQ LHWPDRQTNF FGQLGYRHDP DAHLTPIEET LEVLDGLVKS GKIRHIGLSN
ETPWGVHRFL HLAETRGWPR VVSIQNPYNL LNRSFEVGLA EIAIRERVGL LAYSPLAFGL
LSGKYENGAQ PPKARLTLFE RFQRYNSPQA RRAASAYVAL AREHGVDPAR LALAYVTSRP
FLTSNIIGAT TLEQLDSDIA SLELKLSDEL LAGIERIHTE QPNPAP
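
The protein sequence can be reproduced from the gene sequence by codon translation. The sketch below is an illustration only: it assumes the amino acid assignments of the standard genetic code, which translation table 11 shares (table 11 differs in its permitted start codons, which does not matter here because the gene begins with ATG). The names `CODON_TABLE` and `translate` are hypothetical helpers, and only the first and last blocks of the gene sequence are used as input.

```python
from itertools import product

# Codons enumerated in NCBI order (T, C, A, G at each position),
# paired with the matching amino acid string; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA string in frame 1, stopping at the first stop codon.

    Whitespace (such as the 10-base blocks used in the record) is ignored.
    """
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, as formatted in the record.
print(translate("ATGATTTACC GCCCACTCGG CAGGACCGAT"))  # MIYRPLGRTD
# Last 21 bases of the gene sequence, ending in the TGA stop codon.
print(translate("CAACCCAACC CGGCGCCCTG A"))  # QPNPAP
```

Both outputs match the corresponding ends of the protein sequence listed above. Passing the full gene sequence to `translate` would extend the same check to the whole protein.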