Gene Ava_5010 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAva_5010 
Symbol 
ID3679023 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnabaena variabilis ATCC 29413 
KingdomBacteria 
Replicon accessionNC_007413 
Strand
Start bp6297095 
End bp6298132 
Gene Length1038 bp 
Protein Length345 aa 
Translation table11 
GC content40% 
IMG OID637720370 
Productaldo/keto reductase 
Protein accessionYP_325502 
Protein GI75911206 
COG category[C] Energy production and conversion 
COG ID[COG0667] Predicted oxidoreductases (related to aryl-alcohol dehydrogenases) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value0.250615 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.714816 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAATATA ACCAATTAGG CAATAGCGAT CTACAAGTTT CTGATATTTG CCTTGGCACA 
ATGACCTATG GGCAGCAAAA TACTATAGAA GAAGCCCATC AACAGCTAGA TTATGCAATT
GCTCAAGGAG TTAATTTCAT CGATGCGGCT GAGATGTATC CAGTACCGAC CAGCGCTGAA
ACATACGGAT TAACTGAGAC TTATATTGGA GAATGGTTAA AAAATCAACA GCGAGAGCAA
CTAATAATTG CTACTAAAAT CGCGGGGCCT GGTCGTGGCT TTAAATGGGT ACGTGATGGA
GCAAAAGCCA TTGACCGTAA TAATATCAAA CAAGCAGTGG ATGATAGTCT GCAAAGATTG
CAGACAGATT ATATTGATTT ATATCAAATT CATTGGCCTG ATCGTTATGT ACCCCGTTTT
GGACAAACAG TTTTCGATCC CACTCAGGTA GGGGAAACAA TTCCCATCAC TGAACAGCTG
GAAGTTTTTG CTGATGTCAT CAATGCCGGA AAGATTCGCT ATATTGGCTT AAGTAATGAA
ACTCCTTGGG GTGTAGCACA ATTTAGTCAT GCGGCTAAAC AATTGGGATT ACCTAAAGTT
GTCTCCATTC AGAATGCTTA TAACTTGCTC AATCGAAATT TTGATGGCGC ACTTGCAGAA
ACAGTTTATT ACGAAAATAT TCCTTTACTA GCTTATAGTC CTTTGGGATT CGGCTATTTA
ACTGGTAAGT ATCTTAACGG TAAACCAGAG AAAGCAAGAG TTACTTTATT TGAAAACTTT
GGTCAGAGAT ATTTAAAACC AAATGTTAGC AAAGCAGTAG CAGCTTATGT AGATATTGCC
AAACGCCATC AACTGAGTCC TGCACAACTA GCGATCGCAT TCGTGCGGAG TCGTTGGTTT
GTTGCTAGTA CGATTATTGG TGCGACTACA CTAGAACAAC TCAAAGAGAA TATAGAAAGC
ATCAATGTAG TTCTTGATAA AGACATCTTG GCGCAATTGG ATGCAGTTCA CACTCAATAT
CCAAATCCAG CACCATAA
 
Protein sequence
MQYNQLGNSD LQVSDICLGT MTYGQQNTIE EAHQQLDYAI AQGVNFIDAA EMYPVPTSAE 
TYGLTETYIG EWLKNQQREQ LIIATKIAGP GRGFKWVRDG AKAIDRNNIK QAVDDSLQRL
QTDYIDLYQI HWPDRYVPRF GQTVFDPTQV GETIPITEQL EVFADVINAG KIRYIGLSNE
TPWGVAQFSH AAKQLGLPKV VSIQNAYNLL NRNFDGALAE TVYYENIPLL AYSPLGFGYL
TGKYLNGKPE KARVTLFENF GQRYLKPNVS KAVAAYVDIA KRHQLSPAQL AIAFVRSRWF
VASTIIGATT LEQLKENIES INVVLDKDIL AQLDAVHTQY PNPAP