Gene Ava_3685 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAva_3685 
Symbol 
ID3679104 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnabaena variabilis ATCC 29413 
KingdomBacteria 
Replicon accessionNC_007413 
Strand
Start bp4592517 
End bp4593815 
Gene Length1299 bp 
Protein Length432 aa 
Translation table11 
GC content42% 
IMG OID637719036 
Productextracellular solute-binding protein 
Protein accessionYP_324186 
Protein GI75909890 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATTCAAT TGCAAAAATT AAAAAAATTT ATTGCTTGTG CAATGCTGGG TTTAATCACC 
AGTTGGATGG TCAGTTGCAG CACAGGAAAC GTTAATACAA ATACACAGCA AACTGCTTCG
GGAACAGCAA ATATTGAGTT TTGGACTATG CAACTCCAAC CTCAATTTAA TAACTACTTC
CAAAGTCTGA TTGCGAATTT TGAAACCCAA AATCCGGGTA TAAAGGTGAA CTGGGTAGAT
GTACCCTGGG CGGCAATGGA GAACAAAATT TTAACAGCTG TCTCCGCGAA AACGCCACCT
GATGTTGTTA ACCTCAATCC AGATTTTGCT TCTCAACTAG CAGGACGAAA TGCCTGGTTA
GATTTGAATA CAAAAGTCCC GCAGGAAGCA CGTTCCTCCT ATCTTCCGAA CATTTGGAAA
GCCAGTACTC TGAATGGCAA AAGTTTTGGC GTTCCTTGGT ACCTCACAAC GAGGCTAACC
ATTTATAACA CTGATTTATT AAAACAGGCA GGTATCAGTA AACCACCTGC TACTTACGCA
GAATTAGCAC AAGCGGCGCA ACAAATTAAA GATAAGACAG GGAAATACGC GTTTTTTGTG
ACGTTTGTAC CGCAAGATTC CGGTGAAGTC TTACAGTCTT TTGTACAGAT GGGAGCAACC
CTAGTAGATA CTGAAGGCAA AGCTGCTTTT AATTCACCCC AAGGAAAAGC CGCGTTTCAG
TATTGGGTAG ACCTTTACAA AAAAGGCTTA TTACCAAAAG AAGCACTAAC TCAAGGACAT
CGCCACGCAA TTGATTTATA TCAATCTGGA GAAACGGCCT TTCTAGCTTC TGGGCCTGAA
TTTCTGAAAA CGATCGCCAC CAACGCCCCA AAAATTGCTC AAGCTTCGGC GATCGCTCCC
CAACTCACTG GCGATACAGG TAAGAAAAAT GTAGCTGTGA TGAATATCGT CGTTCCCCGC
GATACAAAAC AACCAGATGC GGCGGTGAAA TTTGCTTTAT TTGTCACCAA TGACGAAAAT
CAATTAGCTT TTGCTAAAGC TGCAAATGTT TTACCATCCA CAACTAAAGC ACTAGCTGAT
AGTTATTTTA AAGATATTCC GGCTGATGCT TCCACAGTAG AAAAAGCGCG AGTTGTCAGC
GCGCAGCAAT TACAACAAGC AGAAATTCTC ACCCCGGCTT TAAAGGATAT TAAGAAGCTA
CAAAAGGCGA TTTATGACAA CTTACAAGCT GCAATGTTAG GGGAAAAAAC GGTAGATAAA
GCTGTAGAGG ATGCGTCGCA GGAGTGGAAT AATCGTTAG
 
Protein sequence
MIQLQKLKKF IACAMLGLIT SWMVSCSTGN VNTNTQQTAS GTANIEFWTM QLQPQFNNYF 
QSLIANFETQ NPGIKVNWVD VPWAAMENKI LTAVSAKTPP DVVNLNPDFA SQLAGRNAWL
DLNTKVPQEA RSSYLPNIWK ASTLNGKSFG VPWYLTTRLT IYNTDLLKQA GISKPPATYA
ELAQAAQQIK DKTGKYAFFV TFVPQDSGEV LQSFVQMGAT LVDTEGKAAF NSPQGKAAFQ
YWVDLYKKGL LPKEALTQGH RHAIDLYQSG ETAFLASGPE FLKTIATNAP KIAQASAIAP
QLTGDTGKKN VAVMNIVVPR DTKQPDAAVK FALFVTNDEN QLAFAKAANV LPSTTKALAD
SYFKDIPADA STVEKARVVS AQQLQQAEIL TPALKDIKKL QKAIYDNLQA AMLGEKTVDK
AVEDASQEWN NR