Gene Information |
Locus tag | Ava_3752 |
Symbol | |
ID | 3678995 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anabaena variabilis ATCC 29413 |
Kingdom | Bacteria |
Replicon accession | NC_007413 |
Strand | - |
Start bp | 4669141 |
End bp | 4672077 |
Gene Length | 2937 bp |
Protein Length | 978 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 637719102 |
Product | HAD family hydrolase |
Protein accession | YP_324252 |
Protein GI | 75909956 |
COG category | [G] Carbohydrate transport and metabolism; [R] General function prediction only |
COG ID | [COG0637] Predicted phosphatase/phosphohexomutase; [COG1554] Trehalose and maltose hydrolases (possible phosphorylases) |
TIGRFAM ID | [TIGR01509] haloacid dehalogenase superfamily, subfamily IA, variant 3 with third motif having DD or ED; [TIGR01990] beta-phosphoglucomutase; [TIGR02009] beta-phosphoglucomutase family hydrolase |
|
|
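The coordinate and length fields in the table above can be cross-checked against each other. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the gene length includes a terminal stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 4669141
END_BP = 4672077
GENE_LENGTH_BP = 2937
PROTEIN_LENGTH_AA = 978


def span_length(start, end):
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Residues encoded by an unspliced CDS, assuming the last codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under those assumptions both checks agree with the record: the coordinate span gives 2937 bp, and 2937 / 3 - 1 gives 978 residues.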
Plasmid Coverage Information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.625857 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage Information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.249828 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACACAA AATGTCCTTC TCGTGATTTT ACTTATCAAG ATTGGATATT AACCGAAACT AGGTTTAACC CTGAGCAATT GCATTCTAGA AGTACCGTTT TCACCATCGG TAACGGTTAC TTAGGTACTA GAGGTAGCTT AGAGGAAGGT CACGCCCGTG GTTTACCTGC CACTTTTATT CATGGTGTCT ATGATGATGT ACCTGTAGTT TATACGGAAC TTGCTAACTG TCCTGATTGG TTGCCGATGA TAATTGCGAT TAATGGCGAT CGCTTCCGCA TGGATCAGGG AGAAATATTA CAATATGAAC GTAAACTTGA TGTCAGTCAA GGTCTGCTCA GTCGTTCTCT GCGTTGGCGT AGCCCCAGTG GTAGCATCAT TGATATCCAC TTTGAACGTT TCGCTAGTCT AGCAGATCAC CATATATTGG GGCAGCGTTG CCAAATCACA GCTCATGATG GTGATTGCTT AGTTGAAATT CAAGCTAGTA TCAACGGCTA CGCTGAAAAT CAAGGTTTCA ATCATTGGGA AGGAATAGAC CAAGGTAAAA CCGAGCCGGG TATCTGGTTG CAAAGTCGTA CCCGTGGAAC CCAGATTGAA ATCGGTATGG CAGCCAAGAT GACCATATCA GGAGTAGAGG CCGCGTTACA AGTGAGCATC GTCCCTGGAT ATCCCACTAT CAGCGCCAGT TTTTTAGCTA AATCTCAACA AACTATCACT GTGGAAAAGC TAGTGACAGT TTTTACCTCA AGGGAAGTCA ATAAACCAGT TACCGCCGCT CAAGAAAAGC TGGCACAACT ACCAGACTAC ACAACTCTAC TCACGGCGAA TAAGCAAGCC TGGGACGAAG TATGGCAAAA AAGCGATATA TACATAGAAG GAGATCCTAC AGCCGCTTTT GCTGTCCGCT ACAACCTGTT TCAGTTGCTA ATTGCTGCCC CATACCATGA TGAAAAAGTC AGTATTCCCG CTAAAACCCT TTCCGGCTTT GGTTATCGTG GACATATTTT CTGGGATACA GAGATTTTTA TCTTGCCCTT TTTCACCTTC ACACAACCAG CTTTAGCCCG GAACTTACTC AGCTACCGCC ACCACACCAT CAACGGGGCG CGACGCAAAG CCACCCATTA CGGCTTTAAA GGGGCGATGT ATGCCTGGGA AAGTGGTGAT ACTGGGGATG AAGTTACGCC GCGTTGGGCA TTGCCTGATA ATTATTATGG CGAAGATGTG CGTATTTGGT GCCGCGATCG CGAAATTCAT AACAGTGCAG ATATTGCCTA TGCTGTCTGG CAATACTGGC AAGCTACTAG TGATGATGTT TGGATGCGCG ATTATGGTGC AGAAATTATC TTAGATGCCG CTATTTTTTG GAGTAGCCGA GTTGAGTATA ACTCCCAAGG CGATCGCTAT GAAATCCGTG GGGTGATTGG TACGGATGAA TACCACGAGT TTGTACACAA CAACACCTTC ACGAACCGAA TGGTGCAATG GCATCTAGAA AAGGCGCTGA AAGTTGCTGA CTGGTTGCGT CATACCTTCC CCGAAGGAGC CAAAGAACTA GAAGAAAAAC TGCAACTTAC TCCAGAGTTA GAAACTCACT GGCAAGACAT CATCAAGAAA ATTTGCATTT TTTACGACTC CTCAACCGGA TTAATCGAAC AATTTGAGGG ATTTTTCCAA TTAAAAGATA TCAACTTGGA AGACTACGAA CCACGCCAGC GTTCTATGCA AGCCATCTTG GGTATAGAAA CCACTAACCA ACACCAAGTC CTCAAGCAAC CAGATGTCTT GATGCTGCTG 
TACCTGATGC GCCTATCAGC AGAGTTTCCC TACAACGAAA AAGCTTTAAA AAGTAATTGG GATTATTACG CACCCCGGAC GGATATTACC TACGGTTCAT CACTAGGCCC GGCAATTCAT GGCATATTAG CCTCAGATTT GGGGAAATCA GCCACCGCCT ACGAACGGTT TATGCAGGCA CTTATGGTTG ATTTAGAAGA TAGCCGAGGT AACACCAATG ACGGTATTCA CGGAGCCAGT GCTGGTGGCA TTTGGCAGGC AGTAATTTTC GGTTTTGGTG GAATCCAACT CACAGAACAA GGCCCCATCG CCAACCCCCA TTTACCCCCA AATTGGACAC GTCTGAAGTT TCAACTACAT TGGCGTGGGC AGTGGTATCC GTTTGATTTG CCTGGGGGGG TAGGGATTGG GGACTGGGGA CTGGGGACTG GGGGAGTCAC GAGTACCCAA TCCCCGACCC CTCATACCCA TTCCCCAGAC ATTCGAGGAT TTATTTTTGA TTTGGATGGT GTATTAACTG ATACGGCAGA ATATCATTAT TTAGGGTGGC AGAGGCTGGC TGATGAGGAA GGGATTCCTT TTAATAGAAA GGCTAACGAA GCCTTGCGGG GGGTGTCTCG TCGTGAGTCC TTGATGCGGA TTATTGGGGA TAGACCTTAT TCAGAAGCAC AAATTCAAGA GATGATGGAG CGTAAAAATT GCTACTATGT AGAACTAATT GAACACATTA CACCTAAAGA TTTATTACCA GGAGCGATCG CTCTTTTAGA TGAACTACGG CAAGCGGGAA TTAAACTAGG TATCGGTTCA GCTAGTAAAA ATGCTCACAC AGTGATCGAA AGGCTAGGGC TTGCTGATAA AGTAGATGCG ATCGCTGATG GTTACAGTGT CCAAAAACCC AAGCCAGCAC CAGATTTATT TCTCTTCGCT GCCCATCAGT TAGGACTAGA ACCAAAACAA TGTGTAGTCG TGGAAGATGC CGCCGCAGGT GTGGAAGCCG CTTTAGCTGG GGGAATGTGG GCAGTAGGAC TTGGCCCCGT CGAGCGTGTA GGTGCAGCTC ATGTAGTCCT CCCCAGTCTT GCAGGGGTGA CATGGACAGA CTTACGCACC AAATTAAATG AAGCTGCGGG GGTGTGA
|
Protein sequence | MNTKCPSRDF TYQDWILTET RFNPEQLHSR STVFTIGNGY LGTRGSLEEG HARGLPATFI HGVYDDVPVV YTELANCPDW LPMIIAINGD RFRMDQGEIL QYERKLDVSQ GLLSRSLRWR SPSGSIIDIH FERFASLADH HILGQRCQIT AHDGDCLVEI QASINGYAEN QGFNHWEGID QGKTEPGIWL QSRTRGTQIE IGMAAKMTIS GVEAALQVSI VPGYPTISAS FLAKSQQTIT VEKLVTVFTS REVNKPVTAA QEKLAQLPDY TTLLTANKQA WDEVWQKSDI YIEGDPTAAF AVRYNLFQLL IAAPYHDEKV SIPAKTLSGF GYRGHIFWDT EIFILPFFTF TQPALARNLL SYRHHTINGA RRKATHYGFK GAMYAWESGD TGDEVTPRWA LPDNYYGEDV RIWCRDREIH NSADIAYAVW QYWQATSDDV WMRDYGAEII LDAAIFWSSR VEYNSQGDRY EIRGVIGTDE YHEFVHNNTF TNRMVQWHLE KALKVADWLR HTFPEGAKEL EEKLQLTPEL ETHWQDIIKK ICIFYDSSTG LIEQFEGFFQ LKDINLEDYE PRQRSMQAIL GIETTNQHQV LKQPDVLMLL YLMRLSAEFP YNEKALKSNW DYYAPRTDIT YGSSLGPAIH GILASDLGKS ATAYERFMQA LMVDLEDSRG NTNDGIHGAS AGGIWQAVIF GFGGIQLTEQ GPIANPHLPP NWTRLKFQLH WRGQWYPFDL PGGVGIGDWG LGTGGVTSTQ SPTPHTHSPD IRGFIFDLDG VLTDTAEYHY LGWQRLADEE GIPFNRKANE ALRGVSRRES LMRIIGDRPY SEAQIQEMME RKNCYYVELI EHITPKDLLP GAIALLDELR QAGIKLGIGS ASKNAHTVIE RLGLADKVDA IADGYSVQKP KPAPDLFLFA AHQLGLEPKQ CVVVEDAAAG VEAALAGGMW AVGLGPVERV GAAHVVLPSL AGVTWTDLRT KLNEAAGV
|
| |
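The sequence fields relate to the GC content, strand, and translation table entries in a way that is easy to check by hand. The following is a minimal sketch, not the database's own method. It assumes the sequences are stored in space-separated blocks of ten characters as shown, and that translation table 11 uses the same codon-to-amino-acid assignments as the standard code (it differs mainly in the permitted start codons, which does not matter here because this gene starts with ATG). All function names are hypothetical.

```python
# Codon table built from the NCBI-style layout (bases ordered T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the block spacing and line breaks used in the record."""
    return "".join(seq.split()).upper()


def gc_content(seq):
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def reverse_complement(seq):
    """Reverse complement, e.g. to map between coding and replicon strands."""
    return clean(seq).translate(str.maketrans("ACGT", "TGCA"))[::-1]


def translate(seq):
    """Translate in frame 1 and stop at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence from the record.
prefix = "ATGAACACAA AATGTCCTTC TCGTGATTTT"
print(translate(prefix))  # MNTKCPSRDF
```

The translated prefix matches the first block of the protein sequence. The gene sequence is listed starting with ATG, so it appears to be given in coding orientation already; because the gene lies on the minus strand, `reverse_complement` would be the step needed to compare it with the plus strand of the replicon. Passing the full gene sequence to `gc_content` should give a value close to the 46% reported in the record, whereas the short prefix alone is only about 33% GC.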