Gene Ava_0798 details


Gene Information

Locus tag: Ava_0798
Symbol:
ID: 3680738
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anabaena variabilis ATCC 29413
Kingdom: Bacteria
Replicon accession: NC_007413
Strand:
Start bp: 979441
End bp: 980568
Gene Length: 1128 bp
Protein Length: 375 aa
Translation table: 11
GC content: 47%
IMG OID: 637716127
Product: semialdehyde dehydrogenase, NAD-binding
Protein accession: YP_321317
Protein GI: 75907021
COG category: [R] General function prediction only
COG ID: [COG0673] Predicted dehydrogenases and related proteins
TIGRFAM ID:
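
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal Python sketch, assuming that the start and end coordinates are 1-based and inclusive and that the gene length includes one untranslated stop codon; neither convention is stated on this page, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 979441
end_bp = 980568
gene_length_bp = 1128
protein_length_aa = 375

# Assumption: coordinates are 1-based and inclusive, so length = end - start + 1.
computed_gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with a single stop codon that is not translated.
computed_protein_length = computed_gene_length // 3 - 1

print(computed_gene_length == gene_length_bp)        # True
print(computed_protein_length == protein_length_aa)  # True
```

Under these assumptions both reported lengths agree with the coordinates.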


Plasmid Coverage information

Num covering plasmid clones: 44
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.577459
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACTAAAA TCGCTGTTAT CGGGGTAGGA CGCTGGGGAG TACATTTGTT GCGGAATTTT 
TTAGCACATC CGCAAGCGGA GGTCGTGGCA ATAGTTGACC CCCATCCAGA AAGGTTAACG
GTAGTCAAGC AGCAGTTTAA TTTGGCTGAA AGTGTCCTGT TAACCACCCA GTGGTCTGAC
TTACAAACAG TGCCAGAATT AACAGCAGTA GCGATCGCTA CTCCAGCTAC CACTCACTAC
GCTTTAATTA AAGATGCTCT GGCTCAAGGC TATCATGTTC TGGCAGAAAA ACCCCTAACC
TTAGACCCTA CAGAATGCCA AGAACTTTGC CAATTGGCAG AGCGACGGCA ATTAATACTC
ATGGTGGATC ATACCTATTT ATTTCACCCA GCCGTTGAGG AAGGTCAAAC TGTCATTCAG
GCTGGTAAAT TAGGTGAGTT ACGTTACGGC TATGCTACAC GCACCCATTT AGGCCCTGTC
CGTCAAGATG TTGATGCCTT ATGGGATTTA GCCATCCATG ATATCGCCAT CTTTAACAAC
TGGTTAGGTA AAGCACCTGT AAGTGTACAG GCGACGGGTA CAGTTTGGCT GCAAGGTGAG
GGGAAAGAGG CAGGGGGCAG GGGGCAGGGG GCAGGGGAGG CAGGGGGAGA ACTGACTGCA
AGATTTTCGC CCCAGTCCCC AATCCCCAAT CCTCAATCCC CAGTCCCCAG TCCTCAGTCC
CCAGAATTAG CCGATTTAGT TTGGGTAACG TTAACTTATC CAGATGGTTT TAAGGCGTAT
ATTCACCTGT GCTGGTTGAA TAATGATAAA CAGCGCCGTC TGGCGGTGGT AGGAAGCCTT
GGCACTTTAA TTTTTGATGA AATGTCACCA TCATCACAAT TGACTTTATT GCATGGTGAA
TTTGAACGTC AGGGAAATCT ATTTTTGCCT GTAAATCAAA GCCGAGAAGT ATTAGAACTC
AAAGCCGGCG AACCTTTACA ACGAGTTTGC GATCGCTTTA TTACTTCTGT TCTCCAGAAT
ACACCCCCAA GCATTTCTTC TGGTTGGGTA GGTACAGAGT TAGTCAAAAT TCTCTCTGCT
CTAACTACAT CTCTCCAACA AAGCGGCCAA TCTGTTTCTC TTCAATAA
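
The GC content field in the gene information is presumably the fraction of G and C bases in the gene sequence above. A small Python sketch of that calculation follows; the function name `gc_content` is hypothetical, and the database's exact rounding convention is not given here, so the result may differ slightly from the reported value.

```python
def gc_content(sequence: str) -> float:
    """Return the fraction of G and C bases, ignoring whitespace and case."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# First 60 bases of the gene sequence above, with the original spacing.
first_line = "ATGACTAAAA TCGCTGTTAT CGGGGTAGGA CGCTGGGGAG TACATTTGTT GCGGAATTTT"
print(f"{gc_content(first_line):.0%}")
```

Passing the full 1128 bp sequence instead of `first_line` gives the figure to compare against the GC content field.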
 
Protein sequence
MTKIAVIGVG RWGVHLLRNF LAHPQAEVVA IVDPHPERLT VVKQQFNLAE SVLLTTQWSD 
LQTVPELTAV AIATPATTHY ALIKDALAQG YHVLAEKPLT LDPTECQELC QLAERRQLIL
MVDHTYLFHP AVEEGQTVIQ AGKLGELRYG YATRTHLGPV RQDVDALWDL AIHDIAIFNN
WLGKAPVSVQ ATGTVWLQGE GKEAGGRGQG AGEAGGELTA RFSPQSPIPN PQSPVPSPQS
PELADLVWVT LTYPDGFKAY IHLCWLNNDK QRRLAVVGSL GTLIFDEMSP SSQLTLLHGE
FERQGNLFLP VNQSREVLEL KAGEPLQRVC DRFITSVLQN TPPSISSGWV GTELVKILSA
LTTSLQQSGQ SVSLQ
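
The protein sequence above corresponds to the gene sequence translated with translation table 11. The Python sketch below illustrates this for the first and last lines of the gene sequence. It assumes the amino-acid assignments of table 11 match the standard genetic code (the tables differ in permitted start codons, which this sketch does not handle; this gene starts with ATG). The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1, ignoring whitespace and stopping at a stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":  # stop codon: end of the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases and the final line of the gene sequence above.
print(translate("ATGACTAAAA TCGCTGTTAT CGGGGTAGGA"))
# MTKIAVIGVG
print(translate("CTAACTACAT CTCTCCAACA AAGCGGCCAA TCTGTTTCTC TTCAATAA"))
# LTTSLQQSGQSVSLQ
```

Both outputs match the start and end of the protein sequence listed above; the final line can be translated on its own because it begins on a codon boundary (base 1081).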