Gene Information |
Locus tag | Ava_4289 |
Symbol | |
ID | 3680840 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anabaena variabilis ATCC 29413 |
Kingdom | Bacteria |
Replicon accession | NC_007413 |
Strand | - |
Start bp | 5374782 |
End bp | 5376182 |
Gene Length | 1401 bp |
Protein Length | 466 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 637719638 |
Product | extracellular solute-binding protein |
Protein accession | YP_324783 |
Protein GI | 75910487 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
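The Start bp, End bp, Gene Length, and Protein Length fields above are related by simple arithmetic. The following is a minimal sketch of that relationship, assuming 1-based inclusive coordinates and a CDS that includes its terminal stop codon. Both assumptions are consistent with the values in this record but are not stated by it, and the function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming it ends in a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


length = gene_length_bp(5374782, 5376182)
print(length)                     # 1401
print(protein_length_aa(length))  # 466
```

The strand does not affect this calculation: for a gene on the minus strand the coordinates still describe the same interval on the replicon.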
|
|
Plasmid Coverage information |
Num covering plasmid clones | 37 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTGTAA GGGTAGATGA TTGTTTTTGG TTACGGGTTT TTCCTGTGAT TTTGAATTCC CTGAAGGGAT TGACAGCTAA TTATGTGATG AATATCCAAA TTTGGGTATC CCAAATCACA CAATTTTTCC GTCATTTAAT TAGCGCTAAT AGTCGGATTG GTTTACTGAG TATCTTTTTG AGTTTGTGTC TATTATTTTT ATCTGGTTGC CAAGGTACAG TCAAGCAGGA GAACGGAGTA ATTCATTTAA CACTGTGGCA AGGAATCAAC CCGCCAGCAA ATCGAGATGT ATTTGAAAAA TTAGTTGCAA AATTCAACCA GACTCATGCG GATGTGCAAA TAGAATCTAT CTATGCAGAA GGATTGCCTA AAATACTGAC GGCAGTTGTG GGTAATGTGC CTCCTGATAT CTTGTTATTT TATCCACAAA CTACAGGTCA GTTAGTAGAG TTAGGAGCAG TCCGCCCTTT AGATGACTGG CTAGACAAAT TGCCAGAGAA ATCACAGATT GTTCCCAGTC TATTTGAAGA AATGCAGTTA GATGGTCACA TTTGGTCAGT CCCACTGTAC ACCGCGAACA TGGGTGTTTT CTATCGTCCT CGGCTGTTTG AGGAGGCAGG AATCACAGAA ACACCAAAGA CTTGGGAAGA ACTACGACAA GTTGCTAAAA AATTAACCAT AGACCGTAAT GGTGACAAAC GACCGGAACA ATATGGCATA CTCCTACCCT TGGGAAAAGG AGAATGGACT GTTTTTAGTT GGTTGCCATT TCTCTGGGGT GCGGGAGGGG AAATAGTCAC AAATAAGCAA CCCAATTTAA CTAGTCAAGC TGCTGTCACA GCCTTACAAT TCTGGCAAGA CCTTATCAAA GATGGTTCAG CCATGCTTTC GTCGCCAGAA CGAGGTTATG AAGAAGATGC TTTTGTTGGG GGTCGTGTGG CGATGCAAAT TACAGGGCCT TGGACTTACA TCATGAAATC TAACATTGAT TATCAAGCCT TCCCTATCCC TGGCAATATT AAATCAGCTA CAGCAATTGC TGGCAGTAAT TTTTATGTTA TGAAAACTCA GCCAGCAAGA GAAGAAGCTG CACTCAAATT TTTAGAATAT GTTTTAAGCG AAGAGTTTCA AACGGAATGG AGTATTGGCA CTGGTTTTTT ACCTGTGAAT ATTAAAGCTG CTCAAAGTGA AGCTTATCAG CAATTCATTG ACAAACAACC AGTGTTAAAG GTGTTTCTAG AGCAAATGTC AGTAGCACAA ACTCGACCGA TAATTTCTCA ATATAATCGT TTATCTGATA GTCTTGGTCG AGCAATAGAA TCAAGTTTAT TGGGTGAATC AGTGCAACAA GCTCTCCAAA CATCTCAAAA GCGGTTAGAA CTTATTTGGG TTGAGAAATG A
|
Protein sequence | MSVRVDDCFW LRVFPVILNS LKGLTANYVM NIQIWVSQIT QFFRHLISAN SRIGLLSIFL SLCLLFLSGC QGTVKQENGV IHLTLWQGIN PPANRDVFEK LVAKFNQTHA DVQIESIYAE GLPKILTAVV GNVPPDILLF YPQTTGQLVE LGAVRPLDDW LDKLPEKSQI VPSLFEEMQL DGHIWSVPLY TANMGVFYRP RLFEEAGITE TPKTWEELRQ VAKKLTIDRN GDKRPEQYGI LLPLGKGEWT VFSWLPFLWG AGGEIVTNKQ PNLTSQAAVT ALQFWQDLIK DGSAMLSSPE RGYEEDAFVG GRVAMQITGP WTYIMKSNID YQAFPIPGNI KSATAIAGSN FYVMKTQPAR EEAALKFLEY VLSEEFQTEW SIGTGFLPVN IKAAQSEAYQ QFIDKQPVLK VFLEQMSVAQ TRPIISQYNR LSDSLGRAIE SSLLGESVQQ ALQTSQKRLE LIWVEK
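The sequences above are printed in space-separated blocks of ten characters, so they need cleaning before any check against the GC content or Translation table fields. The sketch below is one hedged way to do that. The helper names are hypothetical, and the stop-codon set (TAA, TAG, TGA) is the standard set for translation table 11.

```python
STOP_CODONS_TABLE_11 = {"TAA", "TAG", "TGA"}


def clean_sequence(raw: str) -> str:
    """Remove all whitespace (block separators, newlines) and upper-case."""
    return "".join(raw.split()).upper()


def gc_percent(raw: str) -> float:
    """Percentage of G and C bases in the cleaned sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


def has_stop_codon(raw: str) -> bool:
    """True if the cleaned sequence ends with a table 11 stop codon."""
    return clean_sequence(raw)[-3:] in STOP_CODONS_TABLE_11


# Only the first two blocks of the gene sequence, to show the block format.
fragment = "ATGAGTGTAA GGGTAGATGA"
print(clean_sequence(fragment))  # ATGAGTGTAAGGGTAGATGA
print(gc_percent(fragment))      # 40.0 (for this fragment only)
```

Passing the full Gene sequence value to `gc_percent` and `has_stop_codon` works the same way, since whitespace is stripped first.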