Gene Avin_22030 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAvin_22030 
Symbol 
ID7761121 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAzotobacter vinelandii DJ 
KingdomBacteria 
Replicon accessionNC_012560 
Strand
Start bp2201075 
End bp2202202 
Gene Length1128 bp 
Protein Length375 aa 
Translation table11 
GC content55% 
IMG OID643805088 
ProductZinc-containing alcohol dehydrogenase superfamily 
Protein accessionYP_002799369 
Protein GI226944296 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG1063] Threonine dehydrogenase and related Zn-dependent dehydrogenases 
TIGRFAM ID[TIGR01202] 2-desacetyl-2-hydroxyethyl bacteriochlorophyllide A dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.255526 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAGAAAA CAATAATGAC CTCCGTGGCT AAAGAGAGTA TTTCCACCCG CAGACTGAAT 
AACATTCCGA AAACCATGAA GGCCGTGATG GCCTACGCAC CTGGCGACTA CCGCCTGGAG
GAAGTAGCGG TACCAAAGGC CGGACCGGGC GAAATCATAG CCAAGGTCGA AGCCTGCGGC
ATCTGTGCAG GCGATATCAA GTCCTTCGGT GGCGCACCCA GCTTCTGGGG AGACGAAACC
CAGCCGGCCT ATATCAAGGC GCCAATGATT CCAGGACATG AGTTTATCGC TCATATCGTC
GAACTGGGCA AAGGCGTTGA AGGCTACGAA CTGGGCGATC GCGTGATCTC CGAGCAGATT
GTGCCCTGCT GGAACTGTCG TTTCTGTAAA CGTGGCCATT ACTGGATGTG CCAAAAGCAC
GACCTCTACG GCTTCCAGAA CAACGTGAAC GGTGCAATGG CCGAGTACAT TAGGTTCACC
AAAGAGAGTA TCAACTATAA GGTTCCGCGG GATCTGCCGA TCGAGAAGGC GGTACTTATC
GAGCCCTATG CCTGCTCGAT GCATGCGGTA CAGCGTGCGC AAATCCAGTT CGGAGATGTG
GTAGTACTCG CGGGCGCCGG TACCCTGGGC TTAGGGATGA TCGGCGCGGC CAAGAAGGCC
GGCCCCGGAA AACTGGTCGT GATGGATTTG TTCGAGGATC GACTCGAGCT GGCGAAGAAA
TTCGGTGCCG ATCTTGTCAT CAATCCTGCG AAAGAGGATC CCGTGGCCCG GATCAAGGAG
ATCACCGATG GTTATGGTTG CGATGTCTAT ATCGAAGCGA CCGGGCATCC GAAGTCCGTG
GAACAAGGCT TATCGATGAT CCGCAACCTT GGACGCTTCG TCGAGTTCAG TGTTTTCAAG
GACCCGGTAA CCGTCGACTG GAGCATCATC AGTGACCGCA AGGAGCTCGA TGTTCTCGGT
GCGCACCTCG GTCCCTATTG CTACCCGCTG GTAATCGAAG GTATTGCCGA CGGTTCCCTG
CCGACCGAGG GTGTCGTCAC CCATAACTTC CCCTTAGAGC GCTTCATGGA TGGTTTCAAA
CTCGCAATGA GCGGCAAGGA CTCCCTGAAA GTGATTTTGA CTCCTTGA
 
Protein sequence
MEKTIMTSVA KESISTRRLN NIPKTMKAVM AYAPGDYRLE EVAVPKAGPG EIIAKVEACG 
ICAGDIKSFG GAPSFWGDET QPAYIKAPMI PGHEFIAHIV ELGKGVEGYE LGDRVISEQI
VPCWNCRFCK RGHYWMCQKH DLYGFQNNVN GAMAEYIRFT KESINYKVPR DLPIEKAVLI
EPYACSMHAV QRAQIQFGDV VVLAGAGTLG LGMIGAAKKA GPGKLVVMDL FEDRLELAKK
FGADLVINPA KEDPVARIKE ITDGYGCDVY IEATGHPKSV EQGLSMIRNL GRFVEFSVFK
DPVTVDWSII SDRKELDVLG AHLGPYCYPL VIEGIADGSL PTEGVVTHNF PLERFMDGFK
LAMSGKDSLK VILTP