Gene Ava_4500 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAva_4500 
Symbol 
ID3680201 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnabaena variabilis ATCC 29413 
KingdomBacteria 
Replicon accessionNC_007413 
Strand
Start bp5636180 
End bp5637358 
Gene Length1179 bp 
Protein Length392 aa 
Translation table11 
GC content45% 
IMG OID637719856 
Productzinc-containing alcohol dehydrogenase superfamily protein 
Protein accessionYP_324993 
Protein GI75910697 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG1063] Threonine dehydrogenase and related Zn-dependent dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value0.658078 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.320847 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAGCAG TTTGCTGGCA CGGGACAAAT GATGTCAGGG TAGAAACTGT ACCCGATCCA 
AAAATTCTTA ACCCGCGCGA CGCAATTATT AAAATTACAT CTACCGCTAT TTGTGGGTCT
GATTTACATA TATATAATGG CTATATTCCC ACAATGCAAA GTGGCGATAT CCTTGGTCAT
GAATTTATGG GGGAAGTTGT CGAGTTAGGT AGTGCTGTAA AAAATGTGAA AGTAGGCGAT
CGCGTGGTTG TCCCTTTCAC TATTTCCTGC GGTTCTTGCT TCTTCTGTCA ACGGGATTTA
TGGTCTTTGT GCGATAACTC CAACCCCAAC GCCTGGATGG TAGAACTGCA AATGGGGCAT
TCTCCAGCAG GTTTATTCGG CTACTCTCAT CTATTTGGCG GCTATGCTGG TGGTCAAGCA
GAATACGCCC GTGTACCTTT TGCAGATGTC GGTTTACTCA AAATCCCCGA TAATCTACCA
GATGAACAAG TATTATTTTT AACTGACATT TTTCCTACCG GCTATATGGC AGCGGAAAAC
TGCAACATCA AACCAGGCGA TATTGTGGCT GTGTGGGGTT GTGGCCCCGT CGGGCAATTT
GCTATCAAGA GTGCATATAT GTTGGGTGCG GAAAGAGTTA TCGCCTTTGA CCGCATCCCT
GAACGCCTAC AAATGGCTAA AGAACAATGT AATGCGGAAG TCCTCAATTA CGAAGAGGTA
AACATTGGGG AAGCACTGAA AGAAATGACT GGTGGACGCG GCCCTGATGC TTGTATAGAT
GCGGTGGGAA TGGAAGCCCA CGGTACAGAT TTGATGGCTT TCTACGACCA AGTAAAGCAA
GCTGTAAGGC TAGAAACAGA CAGACCAACA GCATTACGAC AAGTCATTGT GTCTGCGGCT
AAAGGCGGTC ATGTTTCCCT GGCTGGTGTA TATGGCGGCT TTCTAGACAA AATCCCGATG
GGTTCAGCAA TGAATAAGGG CTTAACTTTC AAGATGGGAC AAACTCATGT GCATAAATAC
TTGAGGCCTT TACTAGAACG CATTCAAAAC GGTGAAATTG ACACCTCATT TGTCATCACC
CACACCCTCC CCCTAGAACA AGCACCCCAC GGTTACGAAA TTTTTAAGCA CAAAAAAGAT
AACTGCATCA AAGTTGTACT CAAACCCTCA GGTAATTAA
 
Protein sequence
MKAVCWHGTN DVRVETVPDP KILNPRDAII KITSTAICGS DLHIYNGYIP TMQSGDILGH 
EFMGEVVELG SAVKNVKVGD RVVVPFTISC GSCFFCQRDL WSLCDNSNPN AWMVELQMGH
SPAGLFGYSH LFGGYAGGQA EYARVPFADV GLLKIPDNLP DEQVLFLTDI FPTGYMAAEN
CNIKPGDIVA VWGCGPVGQF AIKSAYMLGA ERVIAFDRIP ERLQMAKEQC NAEVLNYEEV
NIGEALKEMT GGRGPDACID AVGMEAHGTD LMAFYDQVKQ AVRLETDRPT ALRQVIVSAA
KGGHVSLAGV YGGFLDKIPM GSAMNKGLTF KMGQTHVHKY LRPLLERIQN GEIDTSFVIT
HTLPLEQAPH GYEIFKHKKD NCIKVVLKPS GN