Gene Ava_3035 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAva_3035 
Symbol 
ID3681153 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnabaena variabilis ATCC 29413 
KingdomBacteria 
Replicon accessionNC_007413 
Strand
Start bp3758292 
End bp3760526 
Gene Length2235 bp 
Protein Length744 aa 
Translation table11 
GC content45% 
IMG OID637718381 
Productglycoside hydrolase family protein 
Protein accessionYP_323540 
Protein GI75909244 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1449] Alpha-amylase/alpha-mannosidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones34 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTCATC CGCTATACGT TGCTTTTATT TGGCATCAAC ATCAGCCATT GTATAAATCT 
CCTGGTAGCA GCCTTTCCGT GCCTTCTAGT CAGCAATATC GCCTGCCTTG GGTGCGCTTA
CATGGTACAA AAGATTATTT AGATTTAATA CTAATTTTGG AAAAGTATCC AAAATTACAT
CAAACAGTAA ACTTAGTTCC TTCCTTAATT CTGCAACTGG AAGATTATAT TGCTGGTACA
GCGTTTGACC CTTACCTCAA AGCCAGTCTG ACACCAACTG AGCAACTGAC TCAGCAACAA
CGAGAGTTTA TCATTCAGCA CTTTTTTGAT GCTAATCACC ACACCTTGAT TGACCCTCAC
CCTCGTTACG CCGAGTTGTA CTATCAAAGG CAGGAAAAAG GGCCGGCTTG GTGTTTGGCG
AATTGGCAAT TAGCAGATTA CAGCGACTTG TTGGCGTGGC ACAATTTGGC ATGGATAGAC
CCACTGTTTT GGGATGATCC AGAAATTGCG GCTTGGTTAC AGCAGGGGCG TAACTTTACC
TTAGGCGATC GCCAGCGCAT TTATTCCAAA CAACGTGATA TTCTCAGCCG CATTATTCCT
CAGCACCGGA AAATGCAGGA ATCTGGGCAA TTAGAAGTTA CCACCACGCC CTACACTCAC
CCGATTTTGC CTTTGTTAGC TGACACTAAT TCTGGTCGGG TGGCTGTGCC AAATATGGCA
TTACCTGAGT CCCGGTTTCA GTGGTCAGAA GATATTCCTC GTCATTTGAG AAAAGCTTGG
GAACTATATA CAGAAAGATT TGGGCAGGAA CCCAAGGGCT TATGGCCGTC CGAACAGTCA
GTTAGTCCAG ATATATTACC GTATATTATC AAACAAGGAT TTCAGTGGAT TTGCTCAGAT
GAAGCAGTCT TAGGGTGGAC ACTGAAACAC TTCTTTCATC GGGATGGGGC AGGGAATGTA
CAGCAGCCAG AACTGTTATA TCGACCTTAT CGCCTAGCAA CTCCAGCCGG AGATTTGGCA
ATTGTCTTCC GTGACCACAG ATTATCAGAT TTAATAGGCT TTACCTATGG GGCAATGCCC
GCCAAACAGG CAGCCGCTGA CCTGGTGGGA CACCTACAAG CGATCGCCAA AATGCAACGA
GAACGGCCAA GCGAACAGCC TTGGTTAGTG ACTATCGCCT TAGATGGCGA AAACTGCTGG
GAATTTTACC CCCAAGATGG CAAACCATTC CTAGAAGCTT TATATCAAAG TTTAAGTAAC
GAATCCCATA TCAAACTCGT TACCGTCTCC GAATTTATCG AGGAATTTCC CGCCACAGCC
ACTATTCCCG CAGAACAACT ACATAGCGGT TCTTGGGTTG ATGGTAGCTT TACCACCTGG
ATTGGTGATC CTGCCAAAAA CCGGGCTTGG GATTACCTCA CCGAAGCGAG AATCATGTTG
GCAAATCATC CCGAAGCAAC AGAAGAAAAT AACCCCGAAG CTTGGGAAGC TTTATATGCT
GCCGAAGGTT CAGACTGGTT TTGGTGGTTT GGTGAAGGAC ATTCCTCAAA TCAAGATGCC
ATTTTTGACC AATTGTTTCG AGAACATTTG TGTGGCATCT ATAAAGCTTT GAATGAACCC
ATACCCGCAT ATCTCAAGAA TCCAGTGGAG GTTCATGCAG CCAGAGCAGA TCATTCTCCT
GAAGGCTTCA TTCATCCTGT AATTGATGGC AGAGGAGATG AGCAAGATTG GGACAAAGCT
GGACGGATAG AAATTGGTGG GGCGAGGGGG ACAATGCACA ACAGCAGCAT AGTTCAGCGC
CTATGGTATG GGGTAGATCA CCTGAATTTC TATTTGCGAG TAGATTTTAA AAGCGGTGTT
ACCCCTGGAC ATGGTTTGCC CCCAGAGTTA AACCTGTTGT GGTTTTATCC AGATCGAACA
ATGCACAATA GTCCGATTCC TTTAGCTGAT GTGCCGGACA CAGCCCCACT TAATTATCTA
TTCCATCATC ATTTGGAAAT TAACTTGCTG ACCCAATCAA TTCAGTTTCG GGAAGCAGCA
GAAAATTATC AATGGCATCC CCGTTTCAGC CGCGCTCAAG TCGCCTTAGA AAATTGTTTA
GAAGTGGCGA TACCCTGGGC AGATTTGCAA GTTCCGCCAG ATTATCCTCT GCGGCTAATT
CTAGTACTTG CTGATGAGGG ACGTTTTAGT AAATATTTAC CAGAAGATAC TTTAATTCCG
ATTGAAGTGC CGTAA
 
Protein sequence
MTHPLYVAFI WHQHQPLYKS PGSSLSVPSS QQYRLPWVRL HGTKDYLDLI LILEKYPKLH 
QTVNLVPSLI LQLEDYIAGT AFDPYLKASL TPTEQLTQQQ REFIIQHFFD ANHHTLIDPH
PRYAELYYQR QEKGPAWCLA NWQLADYSDL LAWHNLAWID PLFWDDPEIA AWLQQGRNFT
LGDRQRIYSK QRDILSRIIP QHRKMQESGQ LEVTTTPYTH PILPLLADTN SGRVAVPNMA
LPESRFQWSE DIPRHLRKAW ELYTERFGQE PKGLWPSEQS VSPDILPYII KQGFQWICSD
EAVLGWTLKH FFHRDGAGNV QQPELLYRPY RLATPAGDLA IVFRDHRLSD LIGFTYGAMP
AKQAAADLVG HLQAIAKMQR ERPSEQPWLV TIALDGENCW EFYPQDGKPF LEALYQSLSN
ESHIKLVTVS EFIEEFPATA TIPAEQLHSG SWVDGSFTTW IGDPAKNRAW DYLTEARIML
ANHPEATEEN NPEAWEALYA AEGSDWFWWF GEGHSSNQDA IFDQLFREHL CGIYKALNEP
IPAYLKNPVE VHAARADHSP EGFIHPVIDG RGDEQDWDKA GRIEIGGARG TMHNSSIVQR
LWYGVDHLNF YLRVDFKSGV TPGHGLPPEL NLLWFYPDRT MHNSPIPLAD VPDTAPLNYL
FHHHLEINLL TQSIQFREAA ENYQWHPRFS RAQVALENCL EVAIPWADLQ VPPDYPLRLI
LVLADEGRFS KYLPEDTLIP IEVP