Gene Ava_1504 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAva_1504 
Symbol 
ID3682470 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnabaena variabilis ATCC 29413 
KingdomBacteria 
Replicon accessionNC_007413 
Strand
Start bp1851606 
End bp1852355 
Gene Length750 bp 
Protein Length249 aa 
Translation table11 
GC content41% 
IMG OID637716844 
ProductHAD family hydrolase 
Protein accessionYP_322022 
Protein GI75907726 
COG category[R] General function prediction only 
COG ID[COG0546] Predicted phosphatases 
TIGRFAM ID[TIGR01493] Haloacid dehalogenase superfamily, subfamily IA, variant 2 with 3rd motif like haloacid dehalogenase
[TIGR01509] haloacid dehalogenase superfamily, subfamily IA, variant 3 with third motif having DD or ED
[TIGR01549] haloacid dehalogenase superfamily, subfamily IA, variant 1 with third motif having Dx(3-4)D or Dx(3-4)E 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000000137172 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGGCTTA ATTTTGTGAC TACTATTAAA TGTAGAGATA TCGCATTTAC TAATATTCAA 
GCGATTTTAT TTGATAAAAA CGGTACGCTA GAAAATTCAG AAGTCTATTT GCGATCGCTT
GGACAAAAAG CAGCGAGAAT TATAGATGCC CAAGTACCAG GAATTGGGGA ACCTTTATTA
ATGGCCTTTG GTATTAACGG CGATACACTC GACCCAGCCG GTTTAATGTC GGTAGCCAGT
CGCCGCGAAA CAGAAATAGC CACAGCCGCC TATATTGCCG AAACTGGGAA AGGATGGTTT
GAGTCATTAA AAATAGCGCG TCAAGCCTTA GATGATGCCG AAAAATATAT TGGTGTAACT
CCTGCACCTC TTTTCACTGG TGCATTAGAA GTATTGCAAT CTCTCTCACA AGCCGGACTA
AAACTTGGCA TTGTTTCAGC AGCAACAACA TCAGAAGTAA AAAACTTTGT CGCACAACAT
AACTTAAGTA GTTATATCCA AGCCCAAGTA GGTGTAGATA ACGGCCCCAG TAAGCCAGAC
CCCATATTAT TTTTACAAGC TTGCCAAGCT TTAGGAGTAG AACCAGAAGC CACATTAATG
GTAGGTGATG CTGTAGGTGA TATGCAAATG GCTCGTAACG CTCAAGCCGC AGGTTGTATT
GGTATTACTT GGGTGAATAA ACCAGATAAT GTCCAAGGTG CGGATGTAGT GATTAATCGG
CTAGATGAAA TTCAAATTTT AGAAAGCTAA
 
Protein sequence
MGLNFVTTIK CRDIAFTNIQ AILFDKNGTL ENSEVYLRSL GQKAARIIDA QVPGIGEPLL 
MAFGINGDTL DPAGLMSVAS RRETEIATAA YIAETGKGWF ESLKIARQAL DDAEKYIGVT
PAPLFTGALE VLQSLSQAGL KLGIVSAATT SEVKNFVAQH NLSSYIQAQV GVDNGPSKPD
PILFLQACQA LGVEPEATLM VGDAVGDMQM ARNAQAAGCI GITWVNKPDN VQGADVVINR
LDEIQILES