Gene Ava_4744 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAva_4744 
Symbol 
ID3679631 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnabaena variabilis ATCC 29413 
KingdomBacteria 
Replicon accessionNC_007413 
Strand
Start bp5953329 
End bp5954804 
Gene Length1476 bp 
Protein Length491 aa 
Translation table11 
GC content46% 
IMG OID637720100 
ProductShort-chain dehydrogenase/reductase SDR 
Protein accessionYP_325236 
Protein GI75910940 
COG category[I] Lipid transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism
[R] General function prediction only 
COG ID[COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones32 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAAAAGTC AGAGTTGCCA TCAAATTAAT GTTGCCCAAA CTCCTTTATG GGGTATGGGT 
CAGGTCATAG CCCTAGAATA TCCCCACCTT TGGGGAGGAC TAATTGATTT TGGCAAGCAG
GATGTTTTTG CAACTGCCAT GATCGCAGAG ATGACCGCAA AAACTGGAGA AGACCGGGTG
GCATTTCGGG ATGGTAAACG GTATGTTGCC CGCTTGATGC CTATTTCCGC CCCATTACCC
ACACCCCAAC CCTTAATTAG TGATGGCAGT TATTTAATTA CAGGTGGTTT AGGGGCTTTA
GGACTCACTC TAGCAGAATG GTTGGTGCAA CAGGGCGCGC GTCATTTAGT TTTAACTAGT
CGTCAAGGGC TGTTAAATCA ATCTGAGGAG AAACAACAAA AAATCCGTGC CTTAGAAAAC
CAAGGGGCAA CTGTCAAAGT TGTAGCGGCT GATGTGAGTG ATTATCAGCA AATGTCTCAA
CTATTTGCGG AAATACAGTT AAATTCTCCC AAGTTGCGTG GAATTATTCA CGCAGCCGGG
GTATTAAATG ATTGTTCGAT TTCCCAAATG GATTGGGAAA CCTTTGCCAA GGTATTTCAG
CCCAAAGTTA CAGGTGCATG GAATCTCCAC CAACTCACCC AGGACTTATC TTTAGACTTC
TTTGTTTGTT TTTCTTCCAT GTCTGCACTA CTGGGTTCGC GGGGTCAACT TCATTATGCT
GCTGCCAACT CTTTCTTAGA TGGGCTAATT TACCACCGTC AGACCTTAGG TTTACCAGGC
TTAAGCATTA ATTGGGGGCC TTGGGCTGAG GGAGGTATGG CTACCCAAGG TTATGAGGTG
GGCTTAAAAC GGATGGGGAT TGAACCCTTA GAACCCACAG CCGCCCTGCA AGTCCTCGGT
GGTTTGTTGG GGAGTGCATC GGTGCAGACG ATGGTAGCAG CCATTAATTG GTCGGCTTTT
GGGAAGATTG TTGCTGCCAA AGGACGAGTA GCTTTTTTAG CAGCACTATT GACCCAAGAA
AGTCAAGATG GCAGCAATGG TGAGAGTTTT TGTCAAAAAT TAGCAGCCGC ACCCCTTCAT
AGACGAGCTG CTTTACTCAC CACCCAAGTA CAACAAGAAG TTGCCCAGGT ATTAGGTCAC
AGTGGGTCTT ATGTCCCGGA AATTGAGCAG GGCTTTTTCG ATATGGGGAT GGATTCTCTG
ATGTCTGTGC AATTGCGCCA TCGTCTCGAA GCTTTGCTTG CTGTATCTCT ACCCTCAACT
TTGGTATTTG AATGCCCCTC TATTGGTGAT GTGGTGAGTT ATTTAATGCG GGAAGTGTTT
GCTTGGCAAC TCGATGGTAG TGATGGGTCT GCGATGGAAT CACCAGCCAG CCTAGTTATA
GAAAATACGA TCGCCCAACT AGAGGGATTA TCCACAGCAG AAACTGAAGC CCTGATGGAA
CAAGAAATCG CCGAATTACA AGCATTGCTG TCTTAA
 
Protein sequence
MKSQSCHQIN VAQTPLWGMG QVIALEYPHL WGGLIDFGKQ DVFATAMIAE MTAKTGEDRV 
AFRDGKRYVA RLMPISAPLP TPQPLISDGS YLITGGLGAL GLTLAEWLVQ QGARHLVLTS
RQGLLNQSEE KQQKIRALEN QGATVKVVAA DVSDYQQMSQ LFAEIQLNSP KLRGIIHAAG
VLNDCSISQM DWETFAKVFQ PKVTGAWNLH QLTQDLSLDF FVCFSSMSAL LGSRGQLHYA
AANSFLDGLI YHRQTLGLPG LSINWGPWAE GGMATQGYEV GLKRMGIEPL EPTAALQVLG
GLLGSASVQT MVAAINWSAF GKIVAAKGRV AFLAALLTQE SQDGSNGESF CQKLAAAPLH
RRAALLTTQV QQEVAQVLGH SGSYVPEIEQ GFFDMGMDSL MSVQLRHRLE ALLAVSLPST
LVFECPSIGD VVSYLMREVF AWQLDGSDGS AMESPASLVI ENTIAQLEGL STAETEALME
QEIAELQALL S