Gene Ava_0111 details

Gene Information

Locus tag: Ava_0111
Symbol:
ID: 3683386
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anabaena variabilis ATCC 29413
Kingdom: Bacteria
Replicon accession: NC_007413
Strand:
Start bp: 147337
End bp: 149697
Gene Length: 2361 bp
Protein Length: 786 aa
Translation table: 11
GC content: 46%
IMG OID: 637715438
Product: sulfatase
Protein accession: YP_320632
Protein GI: 75906336
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG3119] Arylsulfatase A and related enzymes
TIGRFAM ID:
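
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only. It assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 147337
END_BP = 149697


def gene_length_bp(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


length = gene_length_bp(START_BP, END_BP)
print(length)                     # 2361
print(protein_length_aa(length))  # 786
```

Under these assumptions the computed values agree with the listed Gene Length (2361 bp) and Protein Length (786 aa).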


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00321161
Fosmid Hitchhiker: No
Fosmid clonability: decreased coverage
 

Sequence

Gene sequence
ATGAAGTTTA AATCCAAGAA TGGGAATTTG TTTTATAGAA TAGCAAAGGC GATCGCTTTA 
TCTTTGCTCA TCACCTTATT GATGGTCAAT GGCCCCGCCC TAGCAGCTAC ATCTGAGGTA
TTACCCCTGC CCTTACCAGC GTTTAAGGGG AAAATTGGCC TCACCTATAA AGAATCACAA
CCAGACTTTC CCCAACCCAT CACCGCCCCA GCCAACGCCC CCAACGTCTT GCTAGTCATA
TTAGATGATG TGGGCTTTGG ACAAGCCAGT ACCTTTGGCG GCCCTGTGGA TACTCCCAAC
TTAACGCACC TAGCCGAAAC AGGATTACGC TACAACCAAT TTCACACCAC TGCCCTGTGT
TCGCCTACCA GGGCAGCTTT ATTAACTGGA CGCAATCATC ATTCAGTGAA TACAGGGGTA
GTTGAGGAAT TAGCCACAGG TTATCCTGGC TACACAACAG TTTTGCCTAA AAGTGCTGCC
ACTGTTGCCG AAATCCTGCG ACAAAATGGT TATAACACAG CCGCTTTTGG TAAATGGCAT
AATACACCAG ACTTTGAAAC CAGTGCTGTC GGGCCTTTTG ATCGCTGGCC TACAGGGTTA
GGATTTGAGT ATTTTTACGG CTTCCTTGGT GGTGATACTA ATCAGTGGAG TCCGGCTTTA
GTGGAAAACA CTAAGCGTGT AGATAAACCC AACAAGCCAG ATTATCACCT AACACCAGAC
TTAGTAGACC ATGCGATCGC CTGGATTCGC AACCAACAGT CCATCGCTCC AGAAAAACCC
TTTTTCGCCT ATCTCGCCAC CGGCGCTACC CACGCACCCC ACCACGCCCC CAAAGAGTGG
ATTGACAAAT ATCAAGGCAA ATTTGACCAA GGCTGGGATA AATTACGCGA AGAAACCTTT
GCCCGACAAA AGCAACTAGG TGTAATTCCA GCCAATGCCC AACTCACTCC CCGCCCCCCA
GAACTCCCAG CTTGGGATTC CCTCTCAGCA GAACAGCAAA AACTCTATGC CCACATGGCT
GAAGTATTTG CGGGATTTTT AGCCCATACA GATTATGAAG TCGGCAGGTT AATTAATGCC
GTTGACCAAC TCGGTGAACT GGATAACACC TTAGTCATAT ATGTGGTGGG AGATAACGGT
GCTAGTGCCG AAGGCGGTTT AACAGGTAGC GTCAACGAAC TGCAAGTTTT CAACGCTGTA
CCCGAAAATC TCCAACAACT GCTAGCTGCT TATGATGATT TAGGTAGCCC CAAAACATTC
AACCATTTTC CGGCTGCTTG GGCATGGGCA GTTAACACTC CCTTCCAATG GACAAAGCAA
ATAGCCTCTC ATTTTGGTGG TACTCGCAAC CCCTTAGTAA TCTCCTGGGG CGCAAATATT
AAAGACCAAG GTGGCATTCG CAGCCAATTC CATCATGTAA TTGATATTAC ACCCACAATT
TTAGAAGTAG CGGGAATTAC TGTACCGAAA GAAGTGAATG GTGTGAAGCA ACGACCAATA
GAAGGTACTA GCCTCGCATA TACGTTTAAT AATCCTAATG CGCCTTCTCA TCGGGAAACT
CAGTATTTTG AGATGCTGGG TAACCGAGCC ATTTACGATG AAGGTTGGGT AGCTGCGGCG
CGTCATGGTC GCTTACCTTG GGAACGAACT GTTAAAGGTA GCTTTGATAC AGATGAGTGG
GAACTGTACA ACATTGCCGA AGATTTCAGC GAGGCGAATA ATCTAGCTAA GAAAAACCCC
CAGAAGCTAG AAAAACTGCA AAAGTTGTTT TTAAAAGAAG CCAAGAAGCA TAACGTCTTA
CCCCTAGACG ATCGCATTGC GGAAAGATTT GATGTTAAAA TTCGTCCCAG CCTCACCAGA
GGACGCACAA CATTCACCTA CTATCCAGGT ACAGTCGGCA TTCCCGAAGG TAGCGCACCA
AATTTGAAAA ATCGCTCCTT TACTATTACA GCTAATGTAG AAGTTCCAGA AAAAGGGGCA
GAAGGTGTAA TTTTAACCCA AGGCGGTCGT TTTGCAGGTT GGAGTTTTTT CCTAGAGGAT
AACAAGCCTA CTTATGTTTA CAACTATGCC AATACTGCCC GCTATACCAT TCAATCACCA
GAAAAATTAC CCTCCGGTAA ATCTACAATC CGGTTCAACT TTGATTATGA CGGTGGTGTA
GGTGCAGGTG GCATCGGCAA ATTATTCATT AACGATCAAC AGGTAGCTGA AGGTCGAGTA
GATAAAACCA TCGCTTACCG TTTAGCCCTA GACGAAACCT TTGATGTAGG CAGAGATACA
GGTACTCCTG TAGTTGACAC TTACCAAGTA CCCTTTAATT TCACGGGTAA CTTACAACAA
GTCAGCCTGG ATTTGAAGTA A
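
The gene sequence above is printed in blocks of ten bases, so it has to be cleaned before any calculation. The following is a minimal sketch of how a GC content figure such as the one in the Gene Information section could be recomputed. The helper names are hypothetical, and the exact rounding used by the source database is not known.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence above, used only as a short illustration;
# paste the full sequence to evaluate the whole gene.
first_line = "ATGAAGTTTA AATCCAAGAA TGGGAATTTG TTTTATAGAA TAGCAAAGGC GATCGCTTTA"
seq = clean_sequence(first_line)
print(len(seq))
print(f"{gc_fraction(seq):.0%}")
```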
 
Protein sequence
MKFKSKNGNL FYRIAKAIAL SLLITLLMVN GPALAATSEV LPLPLPAFKG KIGLTYKESQ 
PDFPQPITAP ANAPNVLLVI LDDVGFGQAS TFGGPVDTPN LTHLAETGLR YNQFHTTALC
SPTRAALLTG RNHHSVNTGV VEELATGYPG YTTVLPKSAA TVAEILRQNG YNTAAFGKWH
NTPDFETSAV GPFDRWPTGL GFEYFYGFLG GDTNQWSPAL VENTKRVDKP NKPDYHLTPD
LVDHAIAWIR NQQSIAPEKP FFAYLATGAT HAPHHAPKEW IDKYQGKFDQ GWDKLREETF
ARQKQLGVIP ANAQLTPRPP ELPAWDSLSA EQQKLYAHMA EVFAGFLAHT DYEVGRLINA
VDQLGELDNT LVIYVVGDNG ASAEGGLTGS VNELQVFNAV PENLQQLLAA YDDLGSPKTF
NHFPAAWAWA VNTPFQWTKQ IASHFGGTRN PLVISWGANI KDQGGIRSQF HHVIDITPTI
LEVAGITVPK EVNGVKQRPI EGTSLAYTFN NPNAPSHRET QYFEMLGNRA IYDEGWVAAA
RHGRLPWERT VKGSFDTDEW ELYNIAEDFS EANNLAKKNP QKLEKLQKLF LKEAKKHNVL
PLDDRIAERF DVKIRPSLTR GRTTFTYYPG TVGIPEGSAP NLKNRSFTIT ANVEVPEKGA
EGVILTQGGR FAGWSFFLED NKPTYVYNYA NTARYTIQSP EKLPSGKSTI RFNFDYDGGV
GAGGIGKLFI NDQQVAEGRV DKTIAYRLAL DETFDVGRDT GTPVVDTYQV PFNFTGNLQQ
VSLDLK
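
The protein sequence above corresponds to a translation of the gene sequence using translation table 11. The sketch below shows one way to reproduce that translation without external libraries. It assumes that table 11 assigns the same amino acids to codons as the standard genetic code and differs only in which codons may act as start codons. That distinction does not matter here because this gene begins with ATG. The `translate` helper is hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI codon order (TTT, TTC, TTA, TTG, TCT, ...).
# Assumption: table 11 uses the same codon-to-amino-acid assignments
# as the standard code; "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 and last 21 bases of the gene sequence above.
print(translate("ATGAAGTTTA AATCCAAGAA TGGGAATTTG"))  # MKFKSKNGNL
print(translate("GTCAGCCTGG ATTTGAAGTA A"))           # VSLDLK
```

Both fragments are in frame, and their translations match the first ten and last six residues of the listed protein sequence.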