Gene Information |
Locus tag | Ava_1810 |
Symbol | |
ID | 3681977 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anabaena variabilis ATCC 29413 |
Kingdom | Bacteria |
Replicon accession | NC_007413 |
Strand | - |
Start bp | 2254064 |
End bp | 2255851 |
Gene Length | 1788 bp |
Protein Length | 595 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 637717150 |
Product | extracellular solute-binding protein |
Protein accession | YP_322327 |
Protein GI | 75908031 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
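
The coordinate and length fields in this record are mutually consistent, and the relationship can be checked with a few lines of arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption that matches the reported gene length, but is not stated in the record), and that the final codon of the CDS is a stop codon that does not appear in the protein. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 2254064
end_bp = 2255851

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# An unspliced CDS should be a whole number of codons.
codons, remainder = divmod(gene_length, 3)

# Assumption: the last codon is a stop codon and is not translated.
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # prints: 1788 0 595
```

The strand value (-) affects which end of the interval carries the start codon, not the length calculation.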
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 30 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGCCATATA CAAATAATCT CATTTCCTTA ATTAAGCGTT TTTGGATACT CATAATTTTG GCTGCATTTA CAGCAGTTAC AGTTGCAGCT TGTAACCCAT CGAATTTTAA AAGTTCGGCT GCTCAAATCC CGCAATTAGT AACTAGTATT CTCAGTGATC CTAAAACTTT TAACTATCCT TTAAGTTCGG AATCACCTAA TGTTTTTGGT TTGATTTATG AGGGATTAAT CAGCGAAAAT TATGATACTG GTGAAGTGGA ACCAGCTTTA GCAGAATCTT GGACAATTTC TGATGATAAA TTAAAAATTG TTTTTACTCT CCGTGAGGGT TTAAAGTGGT CGGATGGGCA ACCACTAACT GTAGATGATG TTGTATTTAC TTACAATGAC ATTTACTTTA ACGAAGCCAT TCCTACAGAT GTTAGAGATA TTATGAGGAT TGGTGAAAGT CGGAAACTGC CAACTGTGAG AAAAGTTGAT AGTCGTCGAG TTGAGTTTGC TGTTCCCGAA CCGTTTGCGC CTTTTTTACG TAGTGCGACG AGTGCGGCAA TCTTACCAGC CCATGCACTG CGAGAATCTA TACAAACCAA AGATAGTGAG GGTAAGCCTA AGTTTCTGCA AAAATGGGGG GTAGATACAC CACCAGACCA AATCGTCGGG AATGGTTTGT ACAAATTGGA GCGTTATGAC ACCAGTGAAC GTGTAGTTTT CCGACGTAAT CCTTACTATT GGCGTAAAGG GCCTAAAGGT GAAGCTCAAC CTTATATTGA ACGATTAGTG TGGCAAATTG TCGAATCAAC AGATACCTCG TTACTCCAGT TTCGCTCTGG TGGTTTGGAT AGTATTGGTG TTTCCCCAGA CTATTTTTCT CTGCTGAAGG TGCAAGAAAA GCAAGGCAAT TTCAAGATAT ATAATGGCGG CCCGGCGGCT GGGACAACTT TTATATTATT TAACTTGAAT AAGGGTCAAA GAGACGGTAA ACCACTGGTT GATCCAGTAA AGTCTCGTTG GTTTAATACG GTAGAATTTC GCCAAGCTGT GGCTTATGCA GTTGACCGCC AAACGATGAT TAATAATATT TATCGGGGTT TGGGTCAAAC GCAAGATTCA CCAATTTCTG TGCAGAGTCC TTATTATTTG TCGCCCAAAG AAGGGTTAAA GGTTTACGAT TACAACTTAG AAAAAGCCAA GCAATTATTA TTGAAAGCGG GCTTTAAATA TAATGCTCAA AATCAGTTGT TAGACTCTGA CGGTAATCGA GTCCGCTTTA CACTGCTGAC GAATGCTGGT AACAAGATTC GTGAGGCCAT GGGTTCGCAA ATTAAACAGG ACTTGAGCAA AATCGGCATA CAGGTAGATT TTACACCCTT GGCATGGAAT ACTTATACAG ACAAGCTGGC GAATACTTTA GATTGGGAAG CTTCTATGCT GGGTTTGACT GGCGGTTTAG AACCGAATGA TGGTGCTAAC GTCTGGAATC CCGAAGGGGG ATTACATATG TTTAACCAAA AGCCCCAAGC AGGACAAAAA CCCATCACAG GTTGGGAAGT AGCACCGTGG GAAGCGAAAA TTGGTCAACT ATACATTCAA GGTGCTAGGG AGTTGGACGA AACCAAACGT AAAACAATCT ATGCCGAAAC CCAAAAAATC ACTCAAGAGA ATTTACCATT CATTTACTTG GTAAATCCAT ATTCATTATC CGCAGTACGC GATCGCTTTG CAGGTATTCG CTTCTCAGCA CTAGGCGGCG CATTCTGGAA CTTGTACGAA ATCAAAATAG TTAAGTAG
Protein sequence | MPYTNNLISL IKRFWILIIL AAFTAVTVAA CNPSNFKSSA AQIPQLVTSI LSDPKTFNYP LSSESPNVFG LIYEGLISEN YDTGEVEPAL AESWTISDDK LKIVFTLREG LKWSDGQPLT VDDVVFTYND IYFNEAIPTD VRDIMRIGES RKLPTVRKVD SRRVEFAVPE PFAPFLRSAT SAAILPAHAL RESIQTKDSE GKPKFLQKWG VDTPPDQIVG NGLYKLERYD TSERVVFRRN PYYWRKGPKG EAQPYIERLV WQIVESTDTS LLQFRSGGLD SIGVSPDYFS LLKVQEKQGN FKIYNGGPAA GTTFILFNLN KGQRDGKPLV DPVKSRWFNT VEFRQAVAYA VDRQTMINNI YRGLGQTQDS PISVQSPYYL SPKEGLKVYD YNLEKAKQLL LKAGFKYNAQ NQLLDSDGNR VRFTLLTNAG NKIREAMGSQ IKQDLSKIGI QVDFTPLAWN TYTDKLANTL DWEASMLGLT GGLEPNDGAN VWNPEGGLHM FNQKPQAGQK PITGWEVAPW EAKIGQLYIQ GARELDETKR KTIYAETQKI TQENLPFIYL VNPYSLSAVR DRFAGIRFSA LGGAFWNLYE IKIVK
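
The sequences above are printed in space-separated blocks of 10 characters, so they need to be joined before any length or composition check. The following is a minimal sketch of how that might be done. The helper names (`join_blocks`, `gc_percent`, `looks_like_complete_cds`) are hypothetical and not part of any database tool, and the stop codon set assumes the standard stops used by translation table 11.

```python
def join_blocks(text):
    """Remove the whitespace separating the 10-character blocks."""
    return "".join(text.split()).upper()


def gc_percent(dna):
    """Return the percentage of G and C bases in a DNA string."""
    dna = join_blocks(dna)
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * (dna.count("G") + dna.count("C")) / len(dna)


def looks_like_complete_cds(dna, stops=("TAA", "TAG", "TGA")):
    """Check that the length is a multiple of 3 and the last codon is a stop.

    Assumption: stop codons are TAA, TAG and TGA (translation table 11).
    The start codon is not checked, because bacterial genes may use
    alternative start codons.
    """
    dna = join_blocks(dna)
    return len(dna) >= 3 and len(dna) % 3 == 0 and dna[-3:] in stops


# Small illustration using the first two blocks of the gene sequence.
first_blocks = "ATGCCATATA CAAATAATCT"
print(len(join_blocks(first_blocks)))  # prints: 20
print(gc_percent(first_blocks))        # prints: 25.0
```

Passing the full gene sequence to `gc_percent` and `looks_like_complete_cds` would allow a comparison against the reported GC content and gene length. The same `join_blocks` helper applies to the protein sequence.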