Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avin_22030 |
Symbol | |
ID | 7761121 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | - |
Start bp | 2201075 |
End bp | 2202202 |
Gene Length | 1128 bp |
Protein Length | 375 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 643805088 |
Product | Zinc-containing alcohol dehydrogenase superfamily |
Protein accession | YP_002799369 |
Protein GI | 226944296 |
COG category | [E] Amino acid transport and metabolism [R] General function prediction only |
COG ID | [COG1063] Threonine dehydrogenase and related Zn-dependent dehydrogenases |
TIGRFAM ID | [TIGR01202] 2-desacetyl-2-hydroxyethyl bacteriochlorophyllide A dehydrogenase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.255526 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAGAAAA CAATAATGAC CTCCGTGGCT AAAGAGAGTA TTTCCACCCG CAGACTGAAT AACATTCCGA AAACCATGAA GGCCGTGATG GCCTACGCAC CTGGCGACTA CCGCCTGGAG GAAGTAGCGG TACCAAAGGC CGGACCGGGC GAAATCATAG CCAAGGTCGA AGCCTGCGGC ATCTGTGCAG GCGATATCAA GTCCTTCGGT GGCGCACCCA GCTTCTGGGG AGACGAAACC CAGCCGGCCT ATATCAAGGC GCCAATGATT CCAGGACATG AGTTTATCGC TCATATCGTC GAACTGGGCA AAGGCGTTGA AGGCTACGAA CTGGGCGATC GCGTGATCTC CGAGCAGATT GTGCCCTGCT GGAACTGTCG TTTCTGTAAA CGTGGCCATT ACTGGATGTG CCAAAAGCAC GACCTCTACG GCTTCCAGAA CAACGTGAAC GGTGCAATGG CCGAGTACAT TAGGTTCACC AAAGAGAGTA TCAACTATAA GGTTCCGCGG GATCTGCCGA TCGAGAAGGC GGTACTTATC GAGCCCTATG CCTGCTCGAT GCATGCGGTA CAGCGTGCGC AAATCCAGTT CGGAGATGTG GTAGTACTCG CGGGCGCCGG TACCCTGGGC TTAGGGATGA TCGGCGCGGC CAAGAAGGCC GGCCCCGGAA AACTGGTCGT GATGGATTTG TTCGAGGATC GACTCGAGCT GGCGAAGAAA TTCGGTGCCG ATCTTGTCAT CAATCCTGCG AAAGAGGATC CCGTGGCCCG GATCAAGGAG ATCACCGATG GTTATGGTTG CGATGTCTAT ATCGAAGCGA CCGGGCATCC GAAGTCCGTG GAACAAGGCT TATCGATGAT CCGCAACCTT GGACGCTTCG TCGAGTTCAG TGTTTTCAAG GACCCGGTAA CCGTCGACTG GAGCATCATC AGTGACCGCA AGGAGCTCGA TGTTCTCGGT GCGCACCTCG GTCCCTATTG CTACCCGCTG GTAATCGAAG GTATTGCCGA CGGTTCCCTG CCGACCGAGG GTGTCGTCAC CCATAACTTC CCCTTAGAGC GCTTCATGGA TGGTTTCAAA CTCGCAATGA GCGGCAAGGA CTCCCTGAAA GTGATTTTGA CTCCTTGA
|
Protein sequence | MEKTIMTSVA KESISTRRLN NIPKTMKAVM AYAPGDYRLE EVAVPKAGPG EIIAKVEACG ICAGDIKSFG GAPSFWGDET QPAYIKAPMI PGHEFIAHIV ELGKGVEGYE LGDRVISEQI VPCWNCRFCK RGHYWMCQKH DLYGFQNNVN GAMAEYIRFT KESINYKVPR DLPIEKAVLI EPYACSMHAV QRAQIQFGDV VVLAGAGTLG LGMIGAAKKA GPGKLVVMDL FEDRLELAKK FGADLVINPA KEDPVARIKE ITDGYGCDVY IEATGHPKSV EQGLSMIRNL GRFVEFSVFK DPVTVDWSII SDRKELDVLG AHLGPYCYPL VIEGIADGSL PTEGVVTHNF PLERFMDGFK LAMSGKDSLK VILTP
|
| |