Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ava_3042 |
Symbol | |
ID | 3681160 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anabaena variabilis ATCC 29413 |
Kingdom | Bacteria |
Replicon accession | NC_007413 |
Strand | + |
Start bp | 3765632 |
End bp | 3766852 |
Gene Length | 1221 bp |
Protein Length | 406 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 637718388 |
Product | extracellular solute-binding protein |
Protein accession | YP_323547 |
Protein GI | 75909251 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.0658677 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 31 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGATCG CGATCGCCAT TGTTGCTTAC CATAGTTGGC AAATACAGAG TTCGCCTTCA GCATCTGCCC CAGTCACTGT TAAACTCAGT GGCTGGGGTG GTTCTCCGGT TGAGCAAAAA CTATTGAAAC AGGTTTTACA AGACTTTGAA GCACAGCATC CTCTGATCAA GGTCAAATAC GAGGTGATTT CTGACCAATA CATGGACGTG ATCAAAACCC GCTTGATTGG AGAAGCTGCC CCTGATGTCT TCTACTTGGA TGCTCTGGAG GCTCCTTTTT TGATGAGCCA GAATGTGCTG GAACCATTAG AAAGTTACAT CACTCCCGCA TTTGACTTAA GTGACTTTGA AACCAATCTC CTAGATAGCT TTAAATACCA GAACCATATT TACGGTTTCC CTAAGGACTA TTCCACTTTG GCGCTGTTTT ATAACAAAAA AGCCTTTGCT GCTGCTGGTT TGAGCAGCCC CCCTGCGACA TGGCAAGAAT TACGCACCTA CTCCCGAAAA TTAACAGGTA AACTCAACAA GTATGGCTTT GGGGAAGCAC CGGAATTAGC TCGCCAAGTT TACAAAATTA AAGCTTTTGG TGGAGAAGTT ATCAATCAAA ATGGTCATGC TACCTTTGCG AGTGAAGCTG GTTTACGGGG ATTGCAATTA GTGATAGACC AGTATCAAAA AGATAAATCA TCTGCTCAAA AATCTGACGT AGGGACAAAC TCAGGTAGCG AAATGTTTGG TCAGGAGAAA GTGGCAATGG TAATTGAAGG TAATTGGGCA ATTCCATACT TACAAGAAAC CTTTCCGCAA TTGGAGTTTG CAACTGCACA ATTACCAACG ATTAATCAAA AAAAAGGCAC AATGGTATTC ACTGTTGCCT ATGTCATGAG TAAGCAATCA CAGCATAAAG CTGAAGCATG GGAGTTAATT TCCTATCTCA CAGGTAAAGC CGGAATGCAG AAATGGACAA GTACAGGCTT TGCTTTACCT ACACGCAAAT CAGTATCCCA AAAACTAGGA TATGAGCAAG ATCCCTTGCG ATCGCCTTTA GTTGCCGGTG TTGATGATGC TACACCCTGG CAGGTTGGTA AATACCCAGC CCCAATTGTG AACAATTTTG ATAATCAATT TGTCAGTGCC TTACTAGGAC AACAACCATT AAAACAGGCG ATGCTCAGGG CGCAGAATCA GGCAAATAAG CAAATTCAAG CAATGGAGTG A
|
Protein sequence | MAIAIAIVAY HSWQIQSSPS ASAPVTVKLS GWGGSPVEQK LLKQVLQDFE AQHPLIKVKY EVISDQYMDV IKTRLIGEAA PDVFYLDALE APFLMSQNVL EPLESYITPA FDLSDFETNL LDSFKYQNHI YGFPKDYSTL ALFYNKKAFA AAGLSSPPAT WQELRTYSRK LTGKLNKYGF GEAPELARQV YKIKAFGGEV INQNGHATFA SEAGLRGLQL VIDQYQKDKS SAQKSDVGTN SGSEMFGQEK VAMVIEGNWA IPYLQETFPQ LEFATAQLPT INQKKGTMVF TVAYVMSKQS QHKAEAWELI SYLTGKAGMQ KWTSTGFALP TRKSVSQKLG YEQDPLRSPL VAGVDDATPW QVGKYPAPIV NNFDNQFVSA LLGQQPLKQA MLRAQNQANK QIQAME
|
| |