Gene Ava_4289 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAva_4289 
Symbol 
ID3680840 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnabaena variabilis ATCC 29413 
KingdomBacteria 
Replicon accessionNC_007413 
Strand
Start bp5374782 
End bp5376182 
Gene Length1401 bp 
Protein Length466 aa 
Translation table11 
GC content40% 
IMG OID637719638 
Productextracellular solute-binding protein 
Protein accessionYP_324783 
Protein GI75910487 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones37 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTGTAA GGGTAGATGA TTGTTTTTGG TTACGGGTTT TTCCTGTGAT TTTGAATTCC 
CTGAAGGGAT TGACAGCTAA TTATGTGATG AATATCCAAA TTTGGGTATC CCAAATCACA
CAATTTTTCC GTCATTTAAT TAGCGCTAAT AGTCGGATTG GTTTACTGAG TATCTTTTTG
AGTTTGTGTC TATTATTTTT ATCTGGTTGC CAAGGTACAG TCAAGCAGGA GAACGGAGTA
ATTCATTTAA CACTGTGGCA AGGAATCAAC CCGCCAGCAA ATCGAGATGT ATTTGAAAAA
TTAGTTGCAA AATTCAACCA GACTCATGCG GATGTGCAAA TAGAATCTAT CTATGCAGAA
GGATTGCCTA AAATACTGAC GGCAGTTGTG GGTAATGTGC CTCCTGATAT CTTGTTATTT
TATCCACAAA CTACAGGTCA GTTAGTAGAG TTAGGAGCAG TCCGCCCTTT AGATGACTGG
CTAGACAAAT TGCCAGAGAA ATCACAGATT GTTCCCAGTC TATTTGAAGA AATGCAGTTA
GATGGTCACA TTTGGTCAGT CCCACTGTAC ACCGCGAACA TGGGTGTTTT CTATCGTCCT
CGGCTGTTTG AGGAGGCAGG AATCACAGAA ACACCAAAGA CTTGGGAAGA ACTACGACAA
GTTGCTAAAA AATTAACCAT AGACCGTAAT GGTGACAAAC GACCGGAACA ATATGGCATA
CTCCTACCCT TGGGAAAAGG AGAATGGACT GTTTTTAGTT GGTTGCCATT TCTCTGGGGT
GCGGGAGGGG AAATAGTCAC AAATAAGCAA CCCAATTTAA CTAGTCAAGC TGCTGTCACA
GCCTTACAAT TCTGGCAAGA CCTTATCAAA GATGGTTCAG CCATGCTTTC GTCGCCAGAA
CGAGGTTATG AAGAAGATGC TTTTGTTGGG GGTCGTGTGG CGATGCAAAT TACAGGGCCT
TGGACTTACA TCATGAAATC TAACATTGAT TATCAAGCCT TCCCTATCCC TGGCAATATT
AAATCAGCTA CAGCAATTGC TGGCAGTAAT TTTTATGTTA TGAAAACTCA GCCAGCAAGA
GAAGAAGCTG CACTCAAATT TTTAGAATAT GTTTTAAGCG AAGAGTTTCA AACGGAATGG
AGTATTGGCA CTGGTTTTTT ACCTGTGAAT ATTAAAGCTG CTCAAAGTGA AGCTTATCAG
CAATTCATTG ACAAACAACC AGTGTTAAAG GTGTTTCTAG AGCAAATGTC AGTAGCACAA
ACTCGACCGA TAATTTCTCA ATATAATCGT TTATCTGATA GTCTTGGTCG AGCAATAGAA
TCAAGTTTAT TGGGTGAATC AGTGCAACAA GCTCTCCAAA CATCTCAAAA GCGGTTAGAA
CTTATTTGGG TTGAGAAATG A
 
Protein sequence
MSVRVDDCFW LRVFPVILNS LKGLTANYVM NIQIWVSQIT QFFRHLISAN SRIGLLSIFL 
SLCLLFLSGC QGTVKQENGV IHLTLWQGIN PPANRDVFEK LVAKFNQTHA DVQIESIYAE
GLPKILTAVV GNVPPDILLF YPQTTGQLVE LGAVRPLDDW LDKLPEKSQI VPSLFEEMQL
DGHIWSVPLY TANMGVFYRP RLFEEAGITE TPKTWEELRQ VAKKLTIDRN GDKRPEQYGI
LLPLGKGEWT VFSWLPFLWG AGGEIVTNKQ PNLTSQAAVT ALQFWQDLIK DGSAMLSSPE
RGYEEDAFVG GRVAMQITGP WTYIMKSNID YQAFPIPGNI KSATAIAGSN FYVMKTQPAR
EEAALKFLEY VLSEEFQTEW SIGTGFLPVN IKAAQSEAYQ QFIDKQPVLK VFLEQMSVAQ
TRPIISQYNR LSDSLGRAIE SSLLGESVQQ ALQTSQKRLE LIWVEK