Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ava_1229 |
Symbol | |
ID | 3683269 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anabaena variabilis ATCC 29413 |
Kingdom | Bacteria |
Replicon accession | NC_007413 |
Strand | + |
Start bp | 1514019 |
End bp | 1515404 |
Gene Length | 1386 bp |
Protein Length | 461 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 637716567 |
Product | extracellular solute-binding protein |
Protein accession | YP_321748 |
Protein GI | 75907452 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.0257979 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.868479 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAGTTA AAAAATTGAT TAATCTGTTT TATCAAAACT GTGCGCGGTT TTTTTTCGCC TCTGATGATT CTAGGGTAAA AAGAAAAAAT TGGCGATTTT ACCGATATCG TTGGCTATCG CTGTTTCTAG TAGGAATATT GGTTACTAGT GGATGTCAGG TAATCCAAAA TGGCATGAAG GCAGATAAAG TCATTCATTT AACTTTATGG CATGGGGTGA ATCCGCCACC TAATCGGGAT GTCTTTCAAA AGCTGGTAGA TAAGTTTAAC CAAAGCCATA CAGATATTCA GGTAGATGCG ATTTACGCTG GACAACAAGA CCAACAAATG CCGAAGATTT TGGCAGCTGT CGTAGGTAAT GCGTCCCCGG ATATTTTGTG GTACAACCCA GCAACTACAG GGCAACTAGT AGAACTAAAT GCCTTGGTTC CTTTGGACGA AATGTTGTCA AGTTCACTAA TTAAAGATGA AATTGACCCC AGCTTATACC CGGCTGTGGA ATACCAGGGT AAAACTTGGG CAGTACCTTT TGCTACAAAT AATGTTGGCG TTTTTTATCG TCCCAGTTTG TTTAAGGCGG CGGGAATTAC ACAATTACCT AAGACTTGGG AAGAGTTTCG GGAAGTTGCC AAAAAATTGA CCCGTGATAC CGATGGTGAT GGAAGAATTG ATCAACATGG GATGGTACTA CCTTTAGGCA AAGGTGAGTT TACCGTTTTT ACTTGGCTAC CATTTATGTG GAGTGGAGGC GGCGAGTTGG TAAGCAAAGA TTCACAGAAT GCGGCTGGTG TCACTTTAGA AAATAATGCT GGGGCGATCG CTGCTTTACA ATTGTGGCGT GATTTAATAA CAGATGGTTC GGCTGTGTTA TCAGGCCCAG AAAGAGGTTA TGAAACCAAC GATTTGCTAG CTGGTAAAGT AGCGATGCAA TTAACCGGGC CTTGGACTTT AGGGGAGTTT CAAACTAGTG GGGTTGATTT TGATGTATTC CCTATTCCGG TGGGGCAAAG ACCTGCTACT GTAATTGGCG GCGAAAATCT TTATATTTTT AAATCTAAAC CGGAAAGAGA AAAGGCAGCT TTTAAATTTC TTGAATATGC GGCTGGTGAG GAATTTCAAA CAGAGTTAGC ATTGGGAACC GGTTATTTAC CAATCAATTT AAAATCCCGC GAAAGTGCAA AATATCAGGC ATTTGTGAAA AAAGTACCCC AAGCAAAGGT ATTTTTAGAA CAGGCTAAAT ATGCGCGATC GCGTCCTACT TTCCCCGGTT ATAATCGAAT CTCTGATAGT GTGGGTCGGG CAGTGGAAAC TGTATTGATG GGTAAAAGTT CCCCAGCAGA TGCCCTAAAA GCTAGTCAGC AGCGCTTAGA TTTGATTTTC AAATAA
|
Protein sequence | MKVKKLINLF YQNCARFFFA SDDSRVKRKN WRFYRYRWLS LFLVGILVTS GCQVIQNGMK ADKVIHLTLW HGVNPPPNRD VFQKLVDKFN QSHTDIQVDA IYAGQQDQQM PKILAAVVGN ASPDILWYNP ATTGQLVELN ALVPLDEMLS SSLIKDEIDP SLYPAVEYQG KTWAVPFATN NVGVFYRPSL FKAAGITQLP KTWEEFREVA KKLTRDTDGD GRIDQHGMVL PLGKGEFTVF TWLPFMWSGG GELVSKDSQN AAGVTLENNA GAIAALQLWR DLITDGSAVL SGPERGYETN DLLAGKVAMQ LTGPWTLGEF QTSGVDFDVF PIPVGQRPAT VIGGENLYIF KSKPEREKAA FKFLEYAAGE EFQTELALGT GYLPINLKSR ESAKYQAFVK KVPQAKVFLE QAKYARSRPT FPGYNRISDS VGRAVETVLM GKSSPADALK ASQQRLDLIF K
|
| |