Gene Information |
Locus tag | Ava_4714 |
Symbol | |
ID | 3679733 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anabaena variabilis ATCC 29413 |
Kingdom | Bacteria |
Replicon accession | NC_007413 |
Strand | - |
Start bp | 5889386 |
End bp | 5890531 |
Gene Length | 1146 bp |
Protein Length | 381 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 637720070 |
Product | thiosulphate-binding protein |
Protein accession | YP_325206 |
Protein GI | 75910910 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1613] ABC-type sulfate transport system, periplasmic component |
TIGRFAM ID | [TIGR00971] sulfate/thiosulfate-binding protein |
|
|
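The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: the helper name `cds_lengths` is hypothetical, and it assumes 1-based inclusive coordinates, an unspliced CDS, and a final codon that is a stop codon and is not counted in the protein length.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a terminal stop codon.
    """
    gene_len = end_bp - start_bp + 1  # inclusive span on the replicon
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # drop the stop codon
    return gene_len, protein_len


# Start bp and End bp as listed for Ava_4714
gene_len, protein_len = cds_lengths(5889386, 5890531)
print(gene_len, protein_len)  # 1146 381
```

Under these assumptions the result agrees with the listed Gene Length (1146 bp) and Protein Length (381 aa).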
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.217544 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.582939 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTGAGT GGCAGCGTCC CCTGAAAAAA TTGTGGTTGC TAGCAGAGCA GAGGACATAT AGGTTCAGAT TTAACTCCCT GAAAAGCTTT GTGTCACTGG TATTAGTTGG GGCTGTTTTG AGTGTGGCGC TTGCAGCCTG CACTGGTGGT AGTGAAAACA ATACTTCCAC CGCAAACCCC GTTGCTAGTC CTGTTGCAGC CAACAAACCA AACGTGGAAT TAACTCTAGT ATCCTTTGCT GTTACCAAGG CAGCACATGA GGCAATTATT CCTAAATTTG TAGAAAAATG GAAGCAAGAA CATAACCAAA CTGTCACATT TAAACAGAGT TATGGCGGTT CTGGTTCTCA AACCCGTGCC GTTATCGATG GTTTAGAAGC AGATGTTGTC CACTTAGCAC TGGCTTTAGA CACGCAAAAA ATTGAGAAAG CAGGGTTAAT TCAACCAGGA TGGGAAAAAG AACTTCCTAA TGATGGAATT GTCTCCAAAT CTGTAGCAGC AATCATTACC CGTCCAGGCA ACCCCAAAGG TATCAAAACA TGGGCAGATT TGGCTAAGGA CGATGTTAAA GTAATTACTG CTGATCCTAA AACCTCAGGT ATTGCTCGCT GGAACTTCTT AGCTTTGTGG AACTCCGTGA TCAAAACTGG TGGTGATGAA GCTAAGGCAA CAGAGTTTGT GACTAAGGTT TATGGAAACG TGCCAATTTT AACCAAGGAT GCGCGGGAAG CAACTGACGC ATTTGCCAAA CAAGGCCAAG GAGACGCTCT GATTAATTAC GAAAATGAAG TCATTTTGGC ACAGCAAAAA GGTGAAAAGT TGGATTATGT AATTCCTAGT GTCAATATTT CTATTGATAA TCCTATCGCT GTCGTAGATC AGAATGTTGA TAAGCATGGT AATCGAGAAG TAGCCGAAGG ATTTGTCAAA TTCTTATATA CCCCAGAAGC ACAAGAAGAG TTTGTGAAAT TAGGTTTTCG ACCAGTAGAT GAGAAAGTTG CTCAAACGAA AGAAGTAACA GATAAGTTTC CCAAAGTAGA TACTCTAGGT ACTGTTCAAG ACTTAGGAGG CTGGGCAACA ATTGACAAAA AATTCTTTGC TGATGGTGGT GTTTTTGACC AAATTCAAGC CAAAAACAAG CGGTAA
|
Protein sequence | MSEWQRPLKK LWLLAEQRTY RFRFNSLKSF VSLVLVGAVL SVALAACTGG SENNTSTANP VASPVAANKP NVELTLVSFA VTKAAHEAII PKFVEKWKQE HNQTVTFKQS YGGSGSQTRA VIDGLEADVV HLALALDTQK IEKAGLIQPG WEKELPNDGI VSKSVAAIIT RPGNPKGIKT WADLAKDDVK VITADPKTSG IARWNFLALW NSVIKTGGDE AKATEFVTKV YGNVPILTKD AREATDAFAK QGQGDALINY ENEVILAQQK GEKLDYVIPS VNISIDNPIA VVDQNVDKHG NREVAEGFVK FLYTPEAQEE FVKLGFRPVD EKVAQTKEVT DKFPKVDTLG TVQDLGGWAT IDKKFFADGG VFDQIQAKNK R
|
| |
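The sequences above are printed in space-separated blocks of ten characters. A minimal sketch of how such a field might be cleaned up before further analysis, for example to recompute GC content, follows. The function names `clean_sequence` and `gc_fraction` are hypothetical, and the example input is only the first two blocks of the gene sequence, not the full record.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(raw.split()).upper()


def gc_fraction(raw: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First two blocks of the Gene sequence field, used only as a small example
example = "ATGAGTGAGT GGCAGCGTCC"
print(len(clean_sequence(example)), gc_fraction(example))
```

Applying `gc_fraction` to the complete Gene sequence field would allow a comparison against the listed GC content; the cleaned sequence length can likewise be compared against the listed Gene Length.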