Gene Ava_3752 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAva_3752 
Symbol 
ID3678995 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnabaena variabilis ATCC 29413 
KingdomBacteria 
Replicon accessionNC_007413 
Strand
Start bp4669141 
End bp4672077 
Gene Length2937 bp 
Protein Length978 aa 
Translation table11 
GC content46% 
IMG OID637719102 
ProductHAD family hydrolase 
Protein accessionYP_324252 
Protein GI75909956 
COG category[G] Carbohydrate transport and metabolism
[R] General function prediction only 
COG ID[COG0637] Predicted phosphatase/phosphohexomutase
[COG1554] Trehalose and maltose hydrolases (possible phosphorylases) 
TIGRFAM ID[TIGR01509] haloacid dehalogenase superfamily, subfamily IA, variant 3 with third motif having DD or ED
[TIGR01990] beta-phosphoglucomutase
[TIGR02009] beta-phosphoglucomutase family hydrolase 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.625857 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.249828 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACACAA AATGTCCTTC TCGTGATTTT ACTTATCAAG ATTGGATATT AACCGAAACT 
AGGTTTAACC CTGAGCAATT GCATTCTAGA AGTACCGTTT TCACCATCGG TAACGGTTAC
TTAGGTACTA GAGGTAGCTT AGAGGAAGGT CACGCCCGTG GTTTACCTGC CACTTTTATT
CATGGTGTCT ATGATGATGT ACCTGTAGTT TATACGGAAC TTGCTAACTG TCCTGATTGG
TTGCCGATGA TAATTGCGAT TAATGGCGAT CGCTTCCGCA TGGATCAGGG AGAAATATTA
CAATATGAAC GTAAACTTGA TGTCAGTCAA GGTCTGCTCA GTCGTTCTCT GCGTTGGCGT
AGCCCCAGTG GTAGCATCAT TGATATCCAC TTTGAACGTT TCGCTAGTCT AGCAGATCAC
CATATATTGG GGCAGCGTTG CCAAATCACA GCTCATGATG GTGATTGCTT AGTTGAAATT
CAAGCTAGTA TCAACGGCTA CGCTGAAAAT CAAGGTTTCA ATCATTGGGA AGGAATAGAC
CAAGGTAAAA CCGAGCCGGG TATCTGGTTG CAAAGTCGTA CCCGTGGAAC CCAGATTGAA
ATCGGTATGG CAGCCAAGAT GACCATATCA GGAGTAGAGG CCGCGTTACA AGTGAGCATC
GTCCCTGGAT ATCCCACTAT CAGCGCCAGT TTTTTAGCTA AATCTCAACA AACTATCACT
GTGGAAAAGC TAGTGACAGT TTTTACCTCA AGGGAAGTCA ATAAACCAGT TACCGCCGCT
CAAGAAAAGC TGGCACAACT ACCAGACTAC ACAACTCTAC TCACGGCGAA TAAGCAAGCC
TGGGACGAAG TATGGCAAAA AAGCGATATA TACATAGAAG GAGATCCTAC AGCCGCTTTT
GCTGTCCGCT ACAACCTGTT TCAGTTGCTA ATTGCTGCCC CATACCATGA TGAAAAAGTC
AGTATTCCCG CTAAAACCCT TTCCGGCTTT GGTTATCGTG GACATATTTT CTGGGATACA
GAGATTTTTA TCTTGCCCTT TTTCACCTTC ACACAACCAG CTTTAGCCCG GAACTTACTC
AGCTACCGCC ACCACACCAT CAACGGGGCG CGACGCAAAG CCACCCATTA CGGCTTTAAA
GGGGCGATGT ATGCCTGGGA AAGTGGTGAT ACTGGGGATG AAGTTACGCC GCGTTGGGCA
TTGCCTGATA ATTATTATGG CGAAGATGTG CGTATTTGGT GCCGCGATCG CGAAATTCAT
AACAGTGCAG ATATTGCCTA TGCTGTCTGG CAATACTGGC AAGCTACTAG TGATGATGTT
TGGATGCGCG ATTATGGTGC AGAAATTATC TTAGATGCCG CTATTTTTTG GAGTAGCCGA
GTTGAGTATA ACTCCCAAGG CGATCGCTAT GAAATCCGTG GGGTGATTGG TACGGATGAA
TACCACGAGT TTGTACACAA CAACACCTTC ACGAACCGAA TGGTGCAATG GCATCTAGAA
AAGGCGCTGA AAGTTGCTGA CTGGTTGCGT CATACCTTCC CCGAAGGAGC CAAAGAACTA
GAAGAAAAAC TGCAACTTAC TCCAGAGTTA GAAACTCACT GGCAAGACAT CATCAAGAAA
ATTTGCATTT TTTACGACTC CTCAACCGGA TTAATCGAAC AATTTGAGGG ATTTTTCCAA
TTAAAAGATA TCAACTTGGA AGACTACGAA CCACGCCAGC GTTCTATGCA AGCCATCTTG
GGTATAGAAA CCACTAACCA ACACCAAGTC CTCAAGCAAC CAGATGTCTT GATGCTGCTG
TACCTGATGC GCCTATCAGC AGAGTTTCCC TACAACGAAA AAGCTTTAAA AAGTAATTGG
GATTATTACG CACCCCGGAC GGATATTACC TACGGTTCAT CACTAGGCCC GGCAATTCAT
GGCATATTAG CCTCAGATTT GGGGAAATCA GCCACCGCCT ACGAACGGTT TATGCAGGCA
CTTATGGTTG ATTTAGAAGA TAGCCGAGGT AACACCAATG ACGGTATTCA CGGAGCCAGT
GCTGGTGGCA TTTGGCAGGC AGTAATTTTC GGTTTTGGTG GAATCCAACT CACAGAACAA
GGCCCCATCG CCAACCCCCA TTTACCCCCA AATTGGACAC GTCTGAAGTT TCAACTACAT
TGGCGTGGGC AGTGGTATCC GTTTGATTTG CCTGGGGGGG TAGGGATTGG GGACTGGGGA
CTGGGGACTG GGGGAGTCAC GAGTACCCAA TCCCCGACCC CTCATACCCA TTCCCCAGAC
ATTCGAGGAT TTATTTTTGA TTTGGATGGT GTATTAACTG ATACGGCAGA ATATCATTAT
TTAGGGTGGC AGAGGCTGGC TGATGAGGAA GGGATTCCTT TTAATAGAAA GGCTAACGAA
GCCTTGCGGG GGGTGTCTCG TCGTGAGTCC TTGATGCGGA TTATTGGGGA TAGACCTTAT
TCAGAAGCAC AAATTCAAGA GATGATGGAG CGTAAAAATT GCTACTATGT AGAACTAATT
GAACACATTA CACCTAAAGA TTTATTACCA GGAGCGATCG CTCTTTTAGA TGAACTACGG
CAAGCGGGAA TTAAACTAGG TATCGGTTCA GCTAGTAAAA ATGCTCACAC AGTGATCGAA
AGGCTAGGGC TTGCTGATAA AGTAGATGCG ATCGCTGATG GTTACAGTGT CCAAAAACCC
AAGCCAGCAC CAGATTTATT TCTCTTCGCT GCCCATCAGT TAGGACTAGA ACCAAAACAA
TGTGTAGTCG TGGAAGATGC CGCCGCAGGT GTGGAAGCCG CTTTAGCTGG GGGAATGTGG
GCAGTAGGAC TTGGCCCCGT CGAGCGTGTA GGTGCAGCTC ATGTAGTCCT CCCCAGTCTT
GCAGGGGTGA CATGGACAGA CTTACGCACC AAATTAAATG AAGCTGCGGG GGTGTGA
 
Protein sequence
MNTKCPSRDF TYQDWILTET RFNPEQLHSR STVFTIGNGY LGTRGSLEEG HARGLPATFI 
HGVYDDVPVV YTELANCPDW LPMIIAINGD RFRMDQGEIL QYERKLDVSQ GLLSRSLRWR
SPSGSIIDIH FERFASLADH HILGQRCQIT AHDGDCLVEI QASINGYAEN QGFNHWEGID
QGKTEPGIWL QSRTRGTQIE IGMAAKMTIS GVEAALQVSI VPGYPTISAS FLAKSQQTIT
VEKLVTVFTS REVNKPVTAA QEKLAQLPDY TTLLTANKQA WDEVWQKSDI YIEGDPTAAF
AVRYNLFQLL IAAPYHDEKV SIPAKTLSGF GYRGHIFWDT EIFILPFFTF TQPALARNLL
SYRHHTINGA RRKATHYGFK GAMYAWESGD TGDEVTPRWA LPDNYYGEDV RIWCRDREIH
NSADIAYAVW QYWQATSDDV WMRDYGAEII LDAAIFWSSR VEYNSQGDRY EIRGVIGTDE
YHEFVHNNTF TNRMVQWHLE KALKVADWLR HTFPEGAKEL EEKLQLTPEL ETHWQDIIKK
ICIFYDSSTG LIEQFEGFFQ LKDINLEDYE PRQRSMQAIL GIETTNQHQV LKQPDVLMLL
YLMRLSAEFP YNEKALKSNW DYYAPRTDIT YGSSLGPAIH GILASDLGKS ATAYERFMQA
LMVDLEDSRG NTNDGIHGAS AGGIWQAVIF GFGGIQLTEQ GPIANPHLPP NWTRLKFQLH
WRGQWYPFDL PGGVGIGDWG LGTGGVTSTQ SPTPHTHSPD IRGFIFDLDG VLTDTAEYHY
LGWQRLADEE GIPFNRKANE ALRGVSRRES LMRIIGDRPY SEAQIQEMME RKNCYYVELI
EHITPKDLLP GAIALLDELR QAGIKLGIGS ASKNAHTVIE RLGLADKVDA IADGYSVQKP
KPAPDLFLFA AHQLGLEPKQ CVVVEDAAAG VEAALAGGMW AVGLGPVERV GAAHVVLPSL
AGVTWTDLRT KLNEAAGV