Gene Information |
Locus tag | Ava_0111 |
Symbol | |
ID | 3683386 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anabaena variabilis ATCC 29413 |
Kingdom | Bacteria |
Replicon accession | NC_007413 |
Strand | - |
Start bp | 147337 |
End bp | 149697 |
Gene Length | 2361 bp |
Protein Length | 786 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 637715438 |
Product | sulfatase |
Protein accession | YP_320632 |
Protein GI | 75906336 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
|
|
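The coordinate and length fields in the table above can be cross-checked against each other. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the stop codon; the function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 147337
END_BP = 149697
REPORTED_GENE_LENGTH = 2361     # bp
REPORTED_PROTEIN_LENGTH = 786   # aa


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, includes_stop: bool = True) -> int:
    """Number of residues encoded by an unspliced CDS of the given length.

    Assumes the CDS is a whole number of codons and, by default,
    that the final codon is a stop codon that is not translated.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    return codons - 1 if includes_stop else codons


if __name__ == "__main__":
    length = gene_length(START_BP, END_BP)
    print(length == REPORTED_GENE_LENGTH)
    print(protein_length(length) == REPORTED_PROTEIN_LENGTH)
```

Under these assumptions the reported values are consistent: 149697 - 147337 + 1 = 2361 bp, which is 787 codons, or 786 residues once the stop codon is excluded.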
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.00321161 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGAAGTTTA AATCCAAGAA TGGGAATTTG TTTTATAGAA TAGCAAAGGC GATCGCTTTA TCTTTGCTCA TCACCTTATT GATGGTCAAT GGCCCCGCCC TAGCAGCTAC ATCTGAGGTA TTACCCCTGC CCTTACCAGC GTTTAAGGGG AAAATTGGCC TCACCTATAA AGAATCACAA CCAGACTTTC CCCAACCCAT CACCGCCCCA GCCAACGCCC CCAACGTCTT GCTAGTCATA TTAGATGATG TGGGCTTTGG ACAAGCCAGT ACCTTTGGCG GCCCTGTGGA TACTCCCAAC TTAACGCACC TAGCCGAAAC AGGATTACGC TACAACCAAT TTCACACCAC TGCCCTGTGT TCGCCTACCA GGGCAGCTTT ATTAACTGGA CGCAATCATC ATTCAGTGAA TACAGGGGTA GTTGAGGAAT TAGCCACAGG TTATCCTGGC TACACAACAG TTTTGCCTAA AAGTGCTGCC ACTGTTGCCG AAATCCTGCG ACAAAATGGT TATAACACAG CCGCTTTTGG TAAATGGCAT AATACACCAG ACTTTGAAAC CAGTGCTGTC GGGCCTTTTG ATCGCTGGCC TACAGGGTTA GGATTTGAGT ATTTTTACGG CTTCCTTGGT GGTGATACTA ATCAGTGGAG TCCGGCTTTA GTGGAAAACA CTAAGCGTGT AGATAAACCC AACAAGCCAG ATTATCACCT AACACCAGAC TTAGTAGACC ATGCGATCGC CTGGATTCGC AACCAACAGT CCATCGCTCC AGAAAAACCC TTTTTCGCCT ATCTCGCCAC CGGCGCTACC CACGCACCCC ACCACGCCCC CAAAGAGTGG ATTGACAAAT ATCAAGGCAA ATTTGACCAA GGCTGGGATA AATTACGCGA AGAAACCTTT GCCCGACAAA AGCAACTAGG TGTAATTCCA GCCAATGCCC AACTCACTCC CCGCCCCCCA GAACTCCCAG CTTGGGATTC CCTCTCAGCA GAACAGCAAA AACTCTATGC CCACATGGCT GAAGTATTTG CGGGATTTTT AGCCCATACA GATTATGAAG TCGGCAGGTT AATTAATGCC GTTGACCAAC TCGGTGAACT GGATAACACC TTAGTCATAT ATGTGGTGGG AGATAACGGT GCTAGTGCCG AAGGCGGTTT AACAGGTAGC GTCAACGAAC TGCAAGTTTT CAACGCTGTA CCCGAAAATC TCCAACAACT GCTAGCTGCT TATGATGATT TAGGTAGCCC CAAAACATTC AACCATTTTC CGGCTGCTTG GGCATGGGCA GTTAACACTC CCTTCCAATG GACAAAGCAA ATAGCCTCTC ATTTTGGTGG TACTCGCAAC CCCTTAGTAA TCTCCTGGGG CGCAAATATT AAAGACCAAG GTGGCATTCG CAGCCAATTC CATCATGTAA TTGATATTAC ACCCACAATT TTAGAAGTAG CGGGAATTAC TGTACCGAAA GAAGTGAATG GTGTGAAGCA ACGACCAATA GAAGGTACTA GCCTCGCATA TACGTTTAAT AATCCTAATG CGCCTTCTCA TCGGGAAACT CAGTATTTTG AGATGCTGGG TAACCGAGCC ATTTACGATG AAGGTTGGGT AGCTGCGGCG CGTCATGGTC GCTTACCTTG GGAACGAACT GTTAAAGGTA GCTTTGATAC AGATGAGTGG GAACTGTACA ACATTGCCGA AGATTTCAGC GAGGCGAATA ATCTAGCTAA GAAAAACCCC CAGAAGCTAG AAAAACTGCA AAAGTTGTTT TTAAAAGAAG CCAAGAAGCA TAACGTCTTA 
CCCCTAGACG ATCGCATTGC GGAAAGATTT GATGTTAAAA TTCGTCCCAG CCTCACCAGA GGACGCACAA CATTCACCTA CTATCCAGGT ACAGTCGGCA TTCCCGAAGG TAGCGCACCA AATTTGAAAA ATCGCTCCTT TACTATTACA GCTAATGTAG AAGTTCCAGA AAAAGGGGCA GAAGGTGTAA TTTTAACCCA AGGCGGTCGT TTTGCAGGTT GGAGTTTTTT CCTAGAGGAT AACAAGCCTA CTTATGTTTA CAACTATGCC AATACTGCCC GCTATACCAT TCAATCACCA GAAAAATTAC CCTCCGGTAA ATCTACAATC CGGTTCAACT TTGATTATGA CGGTGGTGTA GGTGCAGGTG GCATCGGCAA ATTATTCATT AACGATCAAC AGGTAGCTGA AGGTCGAGTA GATAAAACCA TCGCTTACCG TTTAGCCCTA GACGAAACCT TTGATGTAGG CAGAGATACA GGTACTCCTG TAGTTGACAC TTACCAAGTA CCCTTTAATT TCACGGGTAA CTTACAACAA GTCAGCCTGG ATTTGAAGTA A
|
Protein sequence | MKFKSKNGNL FYRIAKAIAL SLLITLLMVN GPALAATSEV LPLPLPAFKG KIGLTYKESQ PDFPQPITAP ANAPNVLLVI LDDVGFGQAS TFGGPVDTPN LTHLAETGLR YNQFHTTALC SPTRAALLTG RNHHSVNTGV VEELATGYPG YTTVLPKSAA TVAEILRQNG YNTAAFGKWH NTPDFETSAV GPFDRWPTGL GFEYFYGFLG GDTNQWSPAL VENTKRVDKP NKPDYHLTPD LVDHAIAWIR NQQSIAPEKP FFAYLATGAT HAPHHAPKEW IDKYQGKFDQ GWDKLREETF ARQKQLGVIP ANAQLTPRPP ELPAWDSLSA EQQKLYAHMA EVFAGFLAHT DYEVGRLINA VDQLGELDNT LVIYVVGDNG ASAEGGLTGS VNELQVFNAV PENLQQLLAA YDDLGSPKTF NHFPAAWAWA VNTPFQWTKQ IASHFGGTRN PLVISWGANI KDQGGIRSQF HHVIDITPTI LEVAGITVPK EVNGVKQRPI EGTSLAYTFN NPNAPSHRET QYFEMLGNRA IYDEGWVAAA RHGRLPWERT VKGSFDTDEW ELYNIAEDFS EANNLAKKNP QKLEKLQKLF LKEAKKHNVL PLDDRIAERF DVKIRPSLTR GRTTFTYYPG TVGIPEGSAP NLKNRSFTIT ANVEVPEKGA EGVILTQGGR FAGWSFFLED NKPTYVYNYA NTARYTIQSP EKLPSGKSTI RFNFDYDGGV GAGGIGKLFI NDQQVAEGRV DKTIAYRLAL DETFDVGRDT GTPVVDTYQV PFNFTGNLQQ VSLDLK
|
| |
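The relationship between the gene sequence and the protein sequence can be illustrated with a short translation routine. This is a sketch, not the database's own pipeline: it assumes the gene sequence is listed in coding orientation (it begins with ATG and ends with TAA, even though the gene lies on the minus strand), and it uses the standard codon assignments, which translation table 11 shares for elongation codons. The helper names `clean`, `translate`, and `gc_percent` are hypothetical.

```python
from itertools import product

# Standard codon assignments in TCAG order; translation table 11 uses
# the same amino acid assignments (it differs in permitted start codons).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding-orientation DNA string up to the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


if __name__ == "__main__":
    # First three and last three 10-base blocks of the gene sequence above.
    print(translate("ATGAAGTTTA AATCCAAGAA TGGGAATTTG"))  # MKFKSKNGNL
    print(translate("GTCAGCCTGG ATTTGAAGTA A"))           # VSLDLK
```

The two fragments reproduce the first ten and last six residues of the listed protein sequence. Passing the full gene sequence to `translate` and `gc_percent` would allow the 786 aa protein and the reported GC content to be checked in the same way.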