Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ava_1563 |
Symbol | |
ID | 3681106 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anabaena variabilis ATCC 29413 |
Kingdom | Bacteria |
Replicon accession | NC_007413 |
Strand | - |
Start bp | 1924290 |
End bp | 1926056 |
Gene Length | 1767 bp |
Protein Length | 588 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 637716903 |
Product | extracellular solute-binding protein |
Protein accession | YP_322081 |
Protein GI | 75907785 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.176735 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.279007 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGTATCT TGAATAAATT TCGTCAGGGT CAAATTTTTA CTTGGTCGTT ACTAAATATA TTTTTCCTTG CTGGTTGTTC TTTTCCTCAG GCGGAAACTC CTACAAACTC CACACCCGTA ACAAACACTA GTACAGGCGA AACCTTACGT CTACTTTATT GGCAAGCACC AACTATTCTC AATCCTCACC TAGCGCAGGG AACTAAAGAC TTTGAAGCTA GTCGTATCGT CTATGAACCC CTCGCCAGCC ACGATAAAGA CGGTAAATTG GTTCTGTTTT TAGCCGCAGA GGAACCTACT CTAAAAAATG GTGGTATAGC CAAAGATGGT AAATCAGTTA CCTGGAAACT CAAGCAAGGA GTCAAATGGT CTGATGGTCA ACCTTTTACA GCTGCGGATG TGGTATTTAC TTACAAATTT CTTTCCAATC CGGCTGTCGG TGCTACCACT TCCGCTAATT ACGAAGCTGT GCAAAGCGTC GAAGCCATCG ACGATTACAC TGTCAAAATT AATTTCCAGA GTCCTAACCC AGCTTGGTCA CTACCTTTTG TGGGCTTAAA TGGAATGATT ATTCCCCGTC ACATTTTTGA GAAATTTAAC GGTAGTAACG CTAGGGAAGC GCCAGGTAAT TTGATTCCTA TAGGTACAGG CCCTTATAAA GTCGGAGAAT TTAAACCCGG TGACACCATT ATCTATGAAG CTAATTCTGT GTTCCGTGAA GCCAATAAAC CTTTCTTTAA GCGAGTAGAA CTCAAGGGAG GTGGTGATGC GACATCAGCC GCGAGAGCAG TACTACAAAC TGGAGATGTA GACTACGCTT GGAACCTGCA AGTAGAAGCC CCGATTCTCA AGCAACTAGA AGCAGCAGGC AAAGGGAAAT TAAAAATTAG TTTTGGTTCT TTTTTAGAAC GGATTACCAT CAATCATACC GACCCTAATA AACAAACAAA AGACGGCGAA CGTTCTAGCA CTGAATTTCC TCATCCATTT TTTCAAGACA TCAAAGTGCG TCAAGCATTT AACTATGCAA TTGATCGGGA CACAATAAAT CAACAATTAT ATGGTTCTAG TGGTCGTCCT GCCGCCAATA TCCTATTAGC ACCAGAGATT TATAACTCAC CTAATACTAA ATATGAATTT AGCCCCAAGA AAGCTACTGA TTTATTAGAT GAAGCTGGAT GGAAAGATAC AAATGGTAAT GGTATTCGAG ACAAAAATGG TGTAGAGATG AATGTTTTGC TTCAGACATC TGTAAATCCA GTGAGACAGA AAACTCAGGA AATTATTAAA CAAGGATTAA CCTCTATTGG TGTTGGGGTG GAACTAAAAA GTATTGATGG TAGTATCTTC TTTTCTGGAG ACCCATCGAA CCCAGACACC TTGGGAAGGT TTCAAGCTGA TTTACAAATG TTTAGTACGG GTAGTACGAA TGTAGATCCT GGTGCTTATA TGAAAGGCTT TACTTGTAGC GAAATTCCCC AGAAAAAGAA TAACTGGTCA AAATCCAATC ATTCACGTTA CTGTAATCCT GAATATGATA AGCTCTGGCA ACAGTCCAAC ACAGAATTAA ATCCTGAAAA ACGTCGGCTG CTATTTATTC AGATGAATGA TCTGCTATTC AAAGATATTG CTTTAATTCC CTTGATTGCC CGTGCTGATG TCAATGGCGT GAGCAATAGA CTGGTCGGTG TAGATTTGAC CCCTTGGGAT ACTGATACAT GGAATATTAA AGATTGGCAA CAAGTCCAGT CTCTTGGTAA TAGGTAA
|
Protein sequence | MGILNKFRQG QIFTWSLLNI FFLAGCSFPQ AETPTNSTPV TNTSTGETLR LLYWQAPTIL NPHLAQGTKD FEASRIVYEP LASHDKDGKL VLFLAAEEPT LKNGGIAKDG KSVTWKLKQG VKWSDGQPFT AADVVFTYKF LSNPAVGATT SANYEAVQSV EAIDDYTVKI NFQSPNPAWS LPFVGLNGMI IPRHIFEKFN GSNAREAPGN LIPIGTGPYK VGEFKPGDTI IYEANSVFRE ANKPFFKRVE LKGGGDATSA ARAVLQTGDV DYAWNLQVEA PILKQLEAAG KGKLKISFGS FLERITINHT DPNKQTKDGE RSSTEFPHPF FQDIKVRQAF NYAIDRDTIN QQLYGSSGRP AANILLAPEI YNSPNTKYEF SPKKATDLLD EAGWKDTNGN GIRDKNGVEM NVLLQTSVNP VRQKTQEIIK QGLTSIGVGV ELKSIDGSIF FSGDPSNPDT LGRFQADLQM FSTGSTNVDP GAYMKGFTCS EIPQKKNNWS KSNHSRYCNP EYDKLWQQSN TELNPEKRRL LFIQMNDLLF KDIALIPLIA RADVNGVSNR LVGVDLTPWD TDTWNIKDWQ QVQSLGNR
|
| |