Gene Ava_0042 details

Gene Information

Locus tag: Ava_0042
Symbol:
ID: 3683551
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anabaena variabilis ATCC 29413
Kingdom: Bacteria
Replicon accession: NC_007413
Strand:
Start bp: 47745
End bp: 49103
Gene Length: 1359 bp
Protein Length: 452 aa
Translation table: 11
GC content: 38%
IMG OID: 637715369
Product: radical SAM family protein
Protein accession: YP_320563
Protein GI: 75906267
COG category: [R] General function prediction only
COG ID: [COG0641] Arylsulfatase regulator (Fe-S oxidoreductase)
TIGRFAM ID:
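
The start, end, and length values in this record can be cross-checked with simple arithmetic. The Python sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the protein length excludes the terminal stop codon; neither convention is stated in the record, and the function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 47745
END_BP = 49103
GENE_LENGTH_BP = 1359
PROTEIN_LENGTH_AA = 452


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS, assuming the final codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP))    # 1359
    print(protein_length(GENE_LENGTH_BP))   # 452
```

Under these assumptions, 49103 - 47745 + 1 gives the listed 1359 bp, and 1359 / 3 - 1 gives the listed 452 aa.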


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value: 0.467834
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGCTTT CACATTATCA TGTAGTAACA CAACCATTTT TCGATGAAAT TGAAGAACGA 
ACAAAGCGCG TTATCTTTTC TAGTCGAACA TCAAATGTCA GAATTATTGA TGAGCATAGT
TGGCACATTT TAGCTAGTGG TGATTTTGCT CAATTACCTC AATATATATT GTTTGATCTA
GTTGATGTTG AACTAATTGT ACCTGATGAT GAAAACGAAT TACAAACTAT TTTAGATTAC
AATAATGCCT TAGCAATTGA TAACGATGAT CTACATTTAG TTGTTCAACC AACTGCTTTT
TGTCAACTGG GTTGTCATTA CTGTGGTCAG GAACATACTA GCAAAATGAT GACCGAAGAT
GAGCAACAAA AATTTATAGA ACGAACTGCT AAAAAACTCG CCAGCAAGAA CTTTCGTAGT
CTCTCAATTG GTTGGTTTGG TGCAGAACCT TTGGTAGGTC TGCCAGTGAT GAGAACTCTT
ACGCCAAAAC TACAGGCTCT TGCGGCCAGT TTTGGTTGTA GTTATCATGC AAAAGTTGTC
ACCAATGGTT TAGCTTTAAC ACATCAAGTA GCAACGGAAA TTGTTCAAGA ATTAGGCGTA
AATTCTGTTG AAATTACTCT TGATGGCACT GGTGAATATC ATGATGTTCG GCGGATGCAG
AAAAATGGCT TACCTACATT TGAGAAAATT TTTGCTAATA CGGTTGCCTT AGCTCATCGG
CAAGATTTGG ATGTACAAAT TAATATTCGT TGTAATGTTG ATTATCAAAA TTATGAATCT
GTCTCTTTGT TACTACAAAA ATTAGCTGAG GCAGAGATAC AAGATAAGAT TAATTTCTAT
GTTGCACCGA TTCATTCTTG GGGAAATGAT GCTCATACTC GTTCCTTATC GAAAGAAGAA
TTTGCTGATT GGGAAATAAC TTGGCTTGGG GAAATGATTG AGTTAGGTTT CAAGGTTGGG
CTACTACCAG AGCGCAGACC TCTAGTTTGT ATGGCTGTAA TGCCCCATTC GGAATTAGTT
GATGCCTATG GCAATATTTT TAATTGTACA GAGGTGTCTT ATGTTCCTAC ATACGGCACA
CCTAATGAAT ATGCCATTGA TCATTTATCA GGTAAACAGA TGCCCGGTAA AAGGGAACGT
TTAGCTAGTT TCAATGATAA AGTGCGTCAA GGTGCATATC CCTGTTCTAC TTGCCCCATG
CTACCTGTTT GCGGTGGTTC CTGTCCGAAG AGTTGGTTAG AAGGTATTGA ACCATGCCCC
AGTGCTAAAC ATAACATTGA GCAACGTTTA TTACTTACCT ATGCGTTATC TCGGATTGAA
GAAGCAGAAA CCAACGAGGA GGCTTTAGTT TATGCTTAA
 
Protein sequence
MKLSHYHVVT QPFFDEIEER TKRVIFSSRT SNVRIIDEHS WHILASGDFA QLPQYILFDL 
VDVELIVPDD ENELQTILDY NNALAIDNDD LHLVVQPTAF CQLGCHYCGQ EHTSKMMTED
EQQKFIERTA KKLASKNFRS LSIGWFGAEP LVGLPVMRTL TPKLQALAAS FGCSYHAKVV
TNGLALTHQV ATEIVQELGV NSVEITLDGT GEYHDVRRMQ KNGLPTFEKI FANTVALAHR
QDLDVQINIR CNVDYQNYES VSLLLQKLAE AEIQDKINFY VAPIHSWGND AHTRSLSKEE
FADWEITWLG EMIELGFKVG LLPERRPLVC MAVMPHSELV DAYGNIFNCT EVSYVPTYGT
PNEYAIDHLS GKQMPGKRER LASFNDKVRQ GAYPCSTCPM LPVCGGSCPK SWLEGIEPCP
SAKHNIEQRL LLTYALSRIE EAETNEEALV YA
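
The gene and protein sequences above can be checked against each other by translating the nucleotide sequence. The following is a minimal Python sketch, not the method the database used: it relies only on the standard library, strips the 10-character grouping spaces, and uses the codon-to-amino-acid assignments of translation table 11, which are the same as those of the standard genetic code (table 11 differs in which start codons it allows). The helper names `clean`, `translate`, and `gc_content` are hypothetical.

```python
from itertools import product

# Codon table built in TCAG order; assignments shared by translation tables 1 and 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove grouping spaces and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate from the first base in frame 1, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(seq)
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


if __name__ == "__main__":
    # First three 10-base groups of the gene sequence listed above.
    print(translate("ATGAAGCTTT CACATTATCA TGTAGTAACA"))  # MKLSHYHVVT
```

Passing the full gene sequence to `translate` and comparing the result with the protein sequence (after removing its grouping spaces) gives a complete consistency check, and `gc_content` can be compared with the listed GC content in the same way.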