Gene Information |
Locus tag | Ava_0044 |
Symbol | |
ID | 3683553 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anabaena variabilis ATCC 29413 |
Kingdom | Bacteria |
Replicon accession | NC_007413 |
Strand | + |
Start bp | 50145 |
End bp | 51302 |
Gene Length | 1158 bp |
Protein Length | 385 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 637715371 |
Product | thiosulphate-binding protein |
Protein accession | YP_320565 |
Protein GI | 75906269 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1613] ABC-type sulfate transport system, periplasmic component |
TIGRFAM ID | [TIGR00971] sulfate/thiosulfate-binding protein |
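The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes that Start bp and End bp are 1-based, inclusive positions on the replicon and that the CDS ends with a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record for Ava_0044.
length = gene_length(50145, 51302)
print(length)                  # 1158
print(protein_length(length))  # 385
```

Both results agree with the Gene Length (1158 bp) and Protein Length (385 aa) fields.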
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.195735 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGAATAAGT CGCAATATTG GGTGCATTTA TTAACTTGCC AGATACAGCG AATACATACT TGCGTTGTGA AGGCTGGGCA AACTGTGCAG CTTTCGGCTG AAAAAATTAT GCAGATTACC GGGAATAGAA GCTCTAATCA AAAGTATATG AGCCTATTCT TGATAGGTAC TATATTAAGT GTAGCGATCG CCTCCTGCGC TGGAAGTGGT TTTGCTAACA AAGGTGATGT AAAACTCAAA CTCGTTTCCT TCTCTGTCAC CAAAGCCGCA CATGATCAGA TAATTCCTAA GTTTGTGGAG AAATGGAAAC GGGAACATAA CCAAAACGTC ACAGTTGAAG CGACTTACGG AGGTTCTGGC GCTCAAACGG CGGCGGTTAT TGCAGGCTTG CAAGAAGCAG ACATAGTACA TTTGGCACTA CCTTTAGATG TCATCAAACT CCAACAAGCA GGTTTAATTA AATCAGGTTG GGAAATCAAA ACTCCCAGAA ATGGTATTGT TAGTAAATCA GTGGCGGCGA TCGTTACTCG TGAGGGTAAT CCTAAAAACA TTAAAACTTG GGCAGATTTA GCCAAAGATG GGGTGAAAAT AATTGCAGCT AACCCAAAAA CTTCTGGTAT TGCTATTTGG GAATTCTTAG CTTTTTGGAG TTCCGTCAGC CTAACGGGTG GTGATGAAAC AGCAGCATTA GATTATGTCA CTAAGGTTTA CAAGAATATT CCTGTGCTAA CGAAAGATGC TCGTGAAGCT AGTGATTTAT TTTTCCAGAA AGGCGAAGGA GATGTTTTAA TTAATTACGA AAATGAGGTT ATTTTGGCAG GTAAAAATGG TAACGAACTG CCTTATATTG TCCCTCAAGT CAATATTTCT ATTGATAATC CCGTGGCTCT CGTTGATAAA AACGTTAATA AACACGGTAC AAGAGAAGTT TCACAAGCAT TTGTTGATTT TCTTTATTCA ACAGAAGCGC AACGGGAATT TGCTAAATTG CAATATCGTC CTGTAAATCC AACTGTTACT CAAGAAGTCG TGTCACAGCA ACCGCCAGTT AAAACTTTAT TCACCTCACA AGATTTAGGT GGTTGGGAGC TTATCCAGAA AAAATTTTTT GAAGATGGGG CAATTTTTGA CAAAATTCAA GCTGCAAAAA AAGCGTAA
Protein sequence | MNKSQYWVHL LTCQIQRIHT CVVKAGQTVQ LSAEKIMQIT GNRSSNQKYM SLFLIGTILS VAIASCAGSG FANKGDVKLK LVSFSVTKAA HDQIIPKFVE KWKREHNQNV TVEATYGGSG AQTAAVIAGL QEADIVHLAL PLDVIKLQQA GLIKSGWEIK TPRNGIVSKS VAAIVTREGN PKNIKTWADL AKDGVKIIAA NPKTSGIAIW EFLAFWSSVS LTGGDETAAL DYVTKVYKNI PVLTKDAREA SDLFFQKGEG DVLINYENEV ILAGKNGNEL PYIVPQVNIS IDNPVALVDK NVNKHGTREV SQAFVDFLYS TEAQREFAKL QYRPVNPTVT QEVVSQQPPV KTLFTSQDLG GWELIQKKFF EDGAIFDKIQ AAKKA
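The relationship between the gene sequence, the protein sequence, and the GC content field can be illustrated with a minimal sketch. It assumes the codon-to-amino-acid assignments of the standard genetic code, which translation table 11 shares, and it ignores the alternative start codons that table 11 allows, since this gene begins with ATG. The helper names (`clean`, `gc_content`, `translate`) are hypothetical. Only the first 30 bases of the gene sequence are used here; the full sequence can be passed in the same way.

```python
# Codon table built from the NCBI base ordering (T, C, A, G at each position).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces that separate the ten-letter blocks and uppercase."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from the first base, stopping at a stop codon."""
    seq = clean(seq)
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First three blocks of the gene sequence from the record.
fragment = "ATGAATAAGT CGCAATATTG GGTGCATTTA"
print(translate(fragment))  # MNKSQYWVHL
print(f"{gc_content(fragment):.0%}")
```

The translated fragment matches the first block of the protein sequence in the record. The GC value printed for this short fragment is not expected to equal the 39% reported for the whole gene.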