Gene Ava_4559 details

Gene Information

Locus tag: Ava_4559
Symbol:
ID: 3680124
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anabaena variabilis ATCC 29413
Kingdom: Bacteria
Replicon accession: NC_007413
Strand:
Start bp: 5713371
End bp: 5716094
Gene Length: 2724 bp
Protein Length: 907 aa
Translation table: 11
GC content: 46%
IMG OID: 637719915
Product: glycoside hydrolase family protein
Protein accession: YP_325052
Protein GI: 75910756
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1449] Alpha-amylase/alpha-mannosidase
TIGRFAM ID:
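
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive coordinates and that the CDS ends in a single stop codon; neither is stated in the record, although the listed values are consistent with both. The function names are illustrative only.

```python
# Values copied from the record above.
START_BP = 5713371
END_BP = 5716094
GENE_LENGTH_BP = 2724
PROTEIN_LENGTH_AA = 907


def span_length(start, end):
    """Length of a closed interval [start, end] (assumed 1-based, inclusive)."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Number of codons minus one, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

With the listed values, 5716094 - 5713371 + 1 = 2724 bp, and 2724 / 3 - 1 = 907 aa, so the four fields agree with one another.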


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACATCTG CTGAAATGCC GGCTAACCCT GGCTCAATAT CAACATCCCA TATAACTAGC 
AAAGATACGC ACCACAGCGA TCCCCGGAAT CAAGCTGTAG GCGTATATGT GACGGTGCAT
GGTCACTTTT ACCAGCCACC GCGCGAAAAC CCTTATCTAG ACTCCATTGA ACGTCAACCC
AGCGCTGCGC CTTTCCACGA TTGGAATGAG CGCATCCACT GGGAATGTTA CCGTCCCAAT
GCTTTTGCTA GGGTGTTGAA TGACCAAGGC GAAGTGACGG GGATCGTCAA TAATTACGAA
TATATGAGCT TTAACATCGG CCCGACGCTG ATGTCATGGC TGGAACGGTA TGATGTCGAG
GTTTATCAAC GGATTTTGGA GGCGGACGCA AAAAGCTGCC AACGTCTGCA AGGTCATGGC
AATGCGATCG CGCAAGTATA TAATCACATC ATCATGCCCT TGGCGAACGA ACGGGATAAA
CGCACCCAAA TTCGCTGGGG TAAAGAAGAC TTCCGCACCC GCTTTGGCCG TGATCCCGAA
GGAATGTGGT TGGCAGAAAC AGCCGTAGAT TACGCAACCT TAGAAGCTTT AGTGGCTGAA
GGTATTCGCT TTATTGTCTT AGCACCATCC CAAGCGCAAA GGTGTCGTCC TTTCCCCACA
GCCAATGATC CCCAACCAGA ATGGCATGAA GTAGGCGGGA GTCAAATTGA TCCCACCCGT
CCTTATCGTT GCTATCTGAA GTCTAGACTT AGCAATAAAT CTGCTGTTTT GAGTTCAGTA
AAAGATAGGG CAATTGCGGA CACCAATCCT CAATCTCTGC AAGATTTACC TTATATAGAT
ATCTTCTTCT ACGATGGCCC AATTTCACGG GATATGGGTT TTAGTGATGT TGTCTATAAC
TCCCACCACT TTGCCGGACG GATTGGTTCC GCAGTCCGTG GGGATCATCG GCAATCACAA
CTGATTTCCG TCGGTACTGA TGGGGAAACC TTCGGACACC ACAAAAAAGG CACAGAAAAA
ACTCTCGCCT ATGCCTTTAT CAAGGAGTTT CCTAGTCAAG GTTGGACTGT AACCAACTTT
GCCCATTACC TCAGCTTATG TTCTCCTACC TGGGAAGTAG AAATCAAGCC TGTGACAGCT
TGGAGTTGCG CCCACGGTGT AGATAGATGG CAAGAAGATT GTGGTTGCGG TGGTGAAGGC
GGTATTTGGC ATCAAAAATG GCGGCGGCCG TTGCGGAATG CTTTAAATTG GCTGCGGGAT
CAATTAGTTG AAGTCTTTGA GGAATATGGT AATCAATTAT TCCGTGATCC CTGGCAAGCA
CGAGACGAGT ATATTCAAGT CATCCGCGAT CGCTCTCCTG CCAATATTAG CCGTTTTCTC
TCCCGTCATC AAACCCACAA ACTCACCGCC GCCGAACAAG TAGACGCTTT GCGCTTATTA
GAAATGCAGC GTCACGCTTT ATTTATGTTC ACCAGTTGCG GTTGGTTTTT TGAAGAACTT
TCTCGCCCCG AAGGAACCCA GATTCTGCGT TACGCCGCCC GCGCTTTGGA ACTGGGTGGG
GATGTAGCTG GTGTGCAGTT GGAAAAAGGC TTCCTCAAAC GTTTGGGTTT AGCACCTAGT
AATGTAGATT CTTTTAAACA CGGTGGAGAA GTTTACCGCC AACTAGTACA AACAGCCCAG
ATTAGTTTTA AACAAGTAGC CGCCCATTAC GCGATTTCCT CCCTGTTCAA CAATCACAAA
CAGGTAGAGA CACTGCATAC CAAGCCTCTA CCTGGCACAA AACAACCCCA TCCTCACCAA
AAACGGGTTT ATTGCTACAC CGTCAATGAG GTAGATTACC AACTACAACG CATGGGATCA
TTAACCCTAG CAGTTGGTAA CTTAAAACTC GTGTCGGAAA TTACCTGGGA AAGCGAAAAT
TTAGTCTTTG CTGTCCTGCA TTTAGGCGGT TGGGATTTCC ACTGCTGCAT TCAACTATTT
ACTGGACGAC GTGATTACAG CCAATTAAAA GAAAAGCTGT TTACATCACT GCAACAGGCT
AGCGCCGCCC AAACTATTTT GGCGATGACC CAAGTATTTG GTGATGAAAC CTTCAACTTG
CAAAATCTGT TTGCGGAAGA ACGTCATCGG ATCATGCGCT TATTGAGTCA AGAAACCCTG
ACAAGGTTAG ACCAGTTATA TACTCAAGCA TACCGGGATA ATTATGGTGT GTTGATGGCA
TTTCACCGTG ATGAACTAGC CGTACCACAA GAATTACAAG TGGCGGCGGA GATTGCCTTG
GGTTATCGCT GTATGACAAC ATTGCGATCG CTAGACCAAG ATATCACCGA ACCCCAACTG
AGTTGGAATC ACATAGTAGA ATTAGAAGCG ATCGCCACTG AAGCTAAACA TCTGCGTTGC
CAATTAAATA TTCCTGAAGG TAAGCAGATG CTGGAACAGC TGATTCTGCG CTTGCTTTGG
AGATTACTAC ATGATACTAA TGGCAATTTT GCCATAGAGA TGCAATGCTT AGAACGGCTA
ATTAACGTTA GCTATCAGCT AAATATTGGC ATTTCCTTAC ATCAATCCCA AGAACTGTAC
TTCAGTTGTC TACAAAATCA AATACTACCT TTGTGTTTGA CTACTCTTTC TGATAAAGAA
GAAACGAGTC AATGTCTACA ATTGCTGAAA TTGGGACAGA AATTAGCAGT TGATGTTAGT
GCAATTCTCA ACCAATTCAA GTAG
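
The gene sequence is printed in blocks of ten bases, so whitespace has to be removed before any computation. As a minimal sketch, the GC content field could be recomputed as follows; the helper names are hypothetical, and the record does not say how its 46% figure was rounded or derived.

```python
def clean_sequence(text):
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(text):
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


if __name__ == "__main__":
    # Paste the full gene sequence here to compare with the GC content field.
    first_blocks = "ATGACATCTG CTGAAATGCC GGCTAACCCT"
    print(len(clean_sequence(first_blocks)), gc_percent(first_blocks))
```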
 
Protein sequence
MTSAEMPANP GSISTSHITS KDTHHSDPRN QAVGVYVTVH GHFYQPPREN PYLDSIERQP 
SAAPFHDWNE RIHWECYRPN AFARVLNDQG EVTGIVNNYE YMSFNIGPTL MSWLERYDVE
VYQRILEADA KSCQRLQGHG NAIAQVYNHI IMPLANERDK RTQIRWGKED FRTRFGRDPE
GMWLAETAVD YATLEALVAE GIRFIVLAPS QAQRCRPFPT ANDPQPEWHE VGGSQIDPTR
PYRCYLKSRL SNKSAVLSSV KDRAIADTNP QSLQDLPYID IFFYDGPISR DMGFSDVVYN
SHHFAGRIGS AVRGDHRQSQ LISVGTDGET FGHHKKGTEK TLAYAFIKEF PSQGWTVTNF
AHYLSLCSPT WEVEIKPVTA WSCAHGVDRW QEDCGCGGEG GIWHQKWRRP LRNALNWLRD
QLVEVFEEYG NQLFRDPWQA RDEYIQVIRD RSPANISRFL SRHQTHKLTA AEQVDALRLL
EMQRHALFMF TSCGWFFEEL SRPEGTQILR YAARALELGG DVAGVQLEKG FLKRLGLAPS
NVDSFKHGGE VYRQLVQTAQ ISFKQVAAHY AISSLFNNHK QVETLHTKPL PGTKQPHPHQ
KRVYCYTVNE VDYQLQRMGS LTLAVGNLKL VSEITWESEN LVFAVLHLGG WDFHCCIQLF
TGRRDYSQLK EKLFTSLQQA SAAQTILAMT QVFGDETFNL QNLFAEERHR IMRLLSQETL
TRLDQLYTQA YRDNYGVLMA FHRDELAVPQ ELQVAAEIAL GYRCMTTLRS LDQDITEPQL
SWNHIVELEA IATEAKHLRC QLNIPEGKQM LEQLILRLLW RLLHDTNGNF AIEMQCLERL
INVSYQLNIG ISLHQSQELY FSCLQNQILP LCLTTLSDKE ETSQCLQLLK LGQKLAVDVS
AILNQFK
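
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below builds a codon table from the standard amino acid assignments, which translation table 11 shares for internal codons, and stops at the first stop codon. It does not model the alternative start codons that table 11 permits, which does not matter here because this gene begins with ATG. The `translate` function is an illustrative helper, not the tool used to produce the record.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(text):
    """Translate a DNA sequence in frame 1, stopping at the first stop codon."""
    seq = "".join(text.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First 30 bases of the gene sequence in this record.
    print(translate("ATGACATCTG CTGAAATGCC GGCTAACCCT"))  # MTSAEMPANP
```

The first 30 bases translate to MTSAEMPANP, which matches the start of the protein sequence, and the final four codons (CAA TTC AAG TAG) give QFK followed by a stop, which matches its end.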