Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ava_4715 |
Symbol | |
ID | 3679734 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anabaena variabilis ATCC 29413 |
Kingdom | Bacteria |
Replicon accession | NC_007413 |
Strand | + |
Start bp | 5891084 |
End bp | 5892220 |
Gene Length | 1137 bp |
Protein Length | 378 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 637720071 |
Product | thiosulphate-binding protein |
Protein accession | YP_325207 |
Protein GI | 75910911 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1613] ABC-type sulfate transport system, periplasmic component |
TIGRFAM ID | [TIGR00971] sulfate/thiosulfate-binding protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 0.250615 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.353735 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACACCTA AAATTCTATC AAGACTTAAC TCAATTGCCG AGGCGAAAAA GCAACTATTC ACTACGTGGA AGCCACCTCT GAGTCAACCT TCAGCCCTGA TACTAGCAGC CGGTTTGGGT TTAAGTGCTT TAATGCCCTT TACCACGACA AATTCTCAAG TTGCATCTGC AAACCAAAAG ACTCAACTGA TTAGCCAAGG AGGCCAGAAA GTTGAAATCA CGCTGGTTTC TTATGCAGTG ACAAAGGCTG CTTACGAGAA AATTATTCCT CAGTTTGTCG CTAAATGGAA GCGAGAAAAA GGCCAGGATA TAGTCATCAA GCAGAGTTAC GGTGGTTCTG GCTCTCAAAC CCGTGCTGTA ATAGATGGTT TAGAAGCAGA TGTTGTTGCT TTAGCGTTGG CGCTAGATAC CAAAAAAATT GAACAAGCAG GGTTAATTAA GCCAGGTTGG GAGCAAGAAG CGCCGAATAA TTCTATTGTG ACTCGTTCAG TAGTAGCGCT AGAAACCAGA GAAGGAAATC CAAAAAATAT TAAAACTTGG AATGATTTAG CTAAACCGGG AGTCAAGGTA ATTACAGCCA ACCCGAAAAC TTCAGGTGGT GCGCGCTGGA ACTTCTTGGC TTTGTGGGGT GCGATCGCTA AAAATAAGGG TAGTGAAGCT CAAGCTCTAG ATTTTGTGAC TAAAGTCTAT AGAAATGTGC CTGTATTACC TAGAGATGCG CGAGAAGCCA GCGATGCCTT TTACAAGAAA GGTCAAGGGG ATGTATTACT TAACTATGAA AACGAAGTTA TCCTAGCTGC ACAACAGGGG AAAACATCAC CGTCCTATAC TATTCCTCAA ACCAACATTT CCATAGATGG CCCAGTTGCA GTTGTTGATA AGGTCGTCGA TAAACGTGGT ACTCGTGCAG TTTCTGAGGC TTTTGTGAAA TTTCTTTTCA CTCCAGAAGC CCAACGCGAG TTTTCCAAAG TTGGTTTTAG ACCGGTTAAC TCTGCTGTAG CTAAAGAAGT CGGGAAAAAA TTCCCTAAAG TTGCCAATCT TTACAATGTT CAAAGTTTAG GCGGCTGGAA CACTGTACAG AAAAAATTCT TCGATGACGG AGCTATATTT GACAAAATTC AAAGTGGACG ACGTTGA
|
Protein sequence | MTPKILSRLN SIAEAKKQLF TTWKPPLSQP SALILAAGLG LSALMPFTTT NSQVASANQK TQLISQGGQK VEITLVSYAV TKAAYEKIIP QFVAKWKREK GQDIVIKQSY GGSGSQTRAV IDGLEADVVA LALALDTKKI EQAGLIKPGW EQEAPNNSIV TRSVVALETR EGNPKNIKTW NDLAKPGVKV ITANPKTSGG ARWNFLALWG AIAKNKGSEA QALDFVTKVY RNVPVLPRDA REASDAFYKK GQGDVLLNYE NEVILAAQQG KTSPSYTIPQ TNISIDGPVA VVDKVVDKRG TRAVSEAFVK FLFTPEAQRE FSKVGFRPVN SAVAKEVGKK FPKVANLYNV QSLGGWNTVQ KKFFDDGAIF DKIQSGRR
|
| |