Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avin_07420 |
Symbol | |
ID | 7759694 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | - |
Start bp | 700559 |
End bp | 701632 |
Gene Length | 1074 bp |
Protein Length | 357 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 643803659 |
Product | Zinc-containing alcohol dehydrogenase superfamily |
Protein accession | YP_002797963 |
Protein GI | 226942890 |
COG category | [E] Amino acid transport and metabolism [R] General function prediction only |
COG ID | [COG1063] Threonine dehydrogenase and related Zn-dependent dehydrogenases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCCATGA TGAAAGCCGC CATGTTCGTC GAGCCCGGCC GCATCGAACT GCAGGACAAG CCCATCCCCG ACGTCGGACC GGACGACGCC CTGGTGCGCA TCACCACGAC CACCATCTGC GGCACCGACG TGCACATCCT CAAGGGCGAA TACCCGGTGG CCTCGGGCCT GACCATCGGC CACGAGCCGG TGGGCATCGT CGAGAAGCTC GGCAGCAACG TGAAGGGCTA CAGCGAAGGC CAGCGGGTCA TCGCCGGCGC CATCTGCCCG AGCTTCACCT CCTATGCCTG CCAGGATGGC TACCCGTCCC AGGACGGCGG CTGCGCCTGC CACGGCTACA AGCCGATGGG CGGCTGGCGC TTCGGCAACA GCATCGACGG TACCCAGGCC GAATACGTGC TGGTGCCCGA CGCCCAGGCC AACCTGGCGC CGGTGCCGGA CGGCCTGAGC GACGAGGAAG TGCTGATGTG CCCGGACATC ATGTCCACCG GCTTCGCCGG CGCCGAGGCG GCGAACATCC GGATCGGCGA CATCGTCGCG GTGTTCGCCC AGGGCCCGAT CGGCCTGTGC GCCACCGCCG GCGCCAAGCT GCGCGGCGCC AGCACCATCA TCGCCATCGA CGGCGTCGAC GAACGCCTGG AGATCGCCCG GCGCCTCGGC GCCGACGTCA CCCTGAACTT CCGCAAGGTG GACGTGGTGG ACGAGATCCT CCGGCTGACC GGCGGGCGCG GGGTGGACGC CTCGATCGAG GCGCTGGGCC TGCAGTCGAC CTTCGAGAGC GCCCTGCGCG TGCTCAAGCC GGGCGGCGCC CTGTCCAGCC TGGGCGTCTA CTCCAGCGAC CTGACTATTC CCCTCGGCGC CTTCCACGCC GGTCTCGGCG ACAACCGCAT CGTCACCTCG CTGTGCCCCG GCGGCAAGGA ACGCATGCGC CGCCTGCTCA ACGTGGTCGC CTCGGGGCGC GTCGACCTCA AGCCGCTGGT CACCCACCAG TACCGGCTGG ACGACATCGA GGCCGCCTAC GACCTGTTCG CCCACCAGCG CGACGGCGTG CTCAAGGTGG CGATCAAGCC CTGA
|
Protein sequence | MPMMKAAMFV EPGRIELQDK PIPDVGPDDA LVRITTTTIC GTDVHILKGE YPVASGLTIG HEPVGIVEKL GSNVKGYSEG QRVIAGAICP SFTSYACQDG YPSQDGGCAC HGYKPMGGWR FGNSIDGTQA EYVLVPDAQA NLAPVPDGLS DEEVLMCPDI MSTGFAGAEA ANIRIGDIVA VFAQGPIGLC ATAGAKLRGA STIIAIDGVD ERLEIARRLG ADVTLNFRKV DVVDEILRLT GGRGVDASIE ALGLQSTFES ALRVLKPGGA LSSLGVYSSD LTIPLGAFHA GLGDNRIVTS LCPGGKERMR RLLNVVASGR VDLKPLVTHQ YRLDDIEAAY DLFAHQRDGV LKVAIKP
|
| |