Gene Ava_4196 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAva_4196 
Symbol 
ID3680999 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnabaena variabilis ATCC 29413 
KingdomBacteria 
Replicon accessionNC_007413 
Strand
Start bp5255246 
End bp5256655 
Gene Length1410 bp 
Protein Length469 aa 
Translation table11 
GC content37% 
IMG OID637719543 
Productperoxidase 
Protein accessionYP_324690 
Protein GI75910394 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG2837] Predicted iron-dependent peroxidase 
TIGRFAM ID[TIGR01413] Dyp-type peroxidase family 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value0.738897 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00759418 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGCACTGA CTGAAAAAGA TTTGAAACAC CTACCAGAAG ATGGCATTGA TTCAGAAAAT 
CCTGGTAAAT ACCGAAATCT ATTAAATGAT TTACAAGGCA ACATTCTCAA AGGACATGGA
CGAGATCATA GTGTTCATCT ATTTTTACAA TTCAAGCCTG AGCAAGTAGA AGTAGTTAAG
CAGTGGATTC AGAATTTTGC CCAAACTTAT ATAACTTCTG CCAAAAAGCA GTCAGACGAA
GCATTTAAAT ATAGACAAAA AGGCATACCA GGACAGGTAT TTGGTAACTT TTTCTTGTCG
CGTCATGGAT ATGAATATTT AGAAATTGAG CCGTTTCAAA TACCCGGAGA TAAACCATTT
AGGATGGGTA TGAAAAACGA AGAAATTAGA ACTTCTTTGG GCGATCCTAA AATTGAAACC
TGGGATATAG GATTTCAAAA CGAAATTCAT GCCTTAATTT TGCTCGCAGA TGATGACATC
ATAGACTTAT TGCAAATTGT CAATCAAATG ACGCAAGAAC TGCGTCTAAT AGCAGAAATT
GTTCACCGAG AAGATGGATT TATCCTGAGA AATCAGTCCG GACAAATTAT CGAACACTTT
GGCTTTGTGG ATGGTGTAAG TCAACCGTTG TTTATGAAAC GGGACGTTGT GAAGGAGAGG
GTAAACAACT GCGATTTTGA TAAATGGGAC CCAAAAGCTC CTCTTGATAG TATTTTAGTC
GAAGATCCTA ACGGGAATAC AAAAGATAGC TATGGCAGTT ATTTAGTCTA CCGAAAACTC
GAACAGAATG TGAAAGCATT CCGTGAAGAT CAGCGCAAAT TAGCTCAAAA ATTAAACATC
CAAGAAAATT TAGCTGGAGC TTTAATTGTA GGTCGTTTCC CTGATGGCAC TCCAGTAACT
CTTTCAGATA TACCGACTTA TGCAGTTACA CCCACAAATA ACTTCAATTA TGATAATGAT
TTAGCCGCAA CTAAGTGTCC ATTTCACTCT CATACACGTA AAACTAATCC TCGTGGAGAT
ACAGCCAGAT TGTTAACTGC TGATGCTCAC TTTGATGAAG CATTTAAGGA AGAAAAAGGC
CATAGGATTA CTCGTCGTGC AGTTAGTTAT GGCGAAAATA ATCCTAATAA AGAACCAGTT
TTAGGTTCAG GATTACTGTT TCTTTGTTTT CAATCCAACA TTGAAAATCA GTTCAATTTT
ATCCAATCAC GATGGGCTAA TCCTCAAAAT TTTGTTCAGG TGAATACTGG GCCAGATCCG
TTAATTGGTC AACCATCGGG AACTCAGAAA TGGCCAAAGA AATGGGGTGA ACCAGAAACA
GAAGAATATA ATTTTAAACT CTGGATAAAT ATGAAAGGTG GCGAGTATTT TTTCGCTCCT
AGTATCAGTT TTCTCAAAAC CTTGGCATAG
 
Protein sequence
MALTEKDLKH LPEDGIDSEN PGKYRNLLND LQGNILKGHG RDHSVHLFLQ FKPEQVEVVK 
QWIQNFAQTY ITSAKKQSDE AFKYRQKGIP GQVFGNFFLS RHGYEYLEIE PFQIPGDKPF
RMGMKNEEIR TSLGDPKIET WDIGFQNEIH ALILLADDDI IDLLQIVNQM TQELRLIAEI
VHREDGFILR NQSGQIIEHF GFVDGVSQPL FMKRDVVKER VNNCDFDKWD PKAPLDSILV
EDPNGNTKDS YGSYLVYRKL EQNVKAFRED QRKLAQKLNI QENLAGALIV GRFPDGTPVT
LSDIPTYAVT PTNNFNYDND LAATKCPFHS HTRKTNPRGD TARLLTADAH FDEAFKEEKG
HRITRRAVSY GENNPNKEPV LGSGLLFLCF QSNIENQFNF IQSRWANPQN FVQVNTGPDP
LIGQPSGTQK WPKKWGEPET EEYNFKLWIN MKGGEYFFAP SISFLKTLA