Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Aazo_2522 |
Symbol | |
ID | 9340321 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | 'Nostoc azollae' 0708 |
Kingdom | Bacteria |
Replicon accession | NC_014248 |
Strand | + |
Start bp | 2628894 |
End bp | 2629961 |
Gene Length | 1068 bp |
Protein Length | 355 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | |
Product | family 1 extracellular solute-binding protein |
Protein accession | YP_003721544 |
Protein GI | 298491367 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTTAAAT CTGCTTTTAT ATTAGCGAGC GCTCCTCTAT TCTTTACCCT TGCTGCTTGT AGTGGTGAGT CAGGAACACA GGTAAATACA GCCAGCGCTC CTGCAACCCA AGTATCTCGT AATATTGGTA GTACAGTTAA AAACCGTGGC AAGTTGATTT GTGGTGTTAG CGGTGAATTA CCAGGATTTA GCTTTGTGGG AACTGACGCT AAATATAGCG GTATTGATGT AGATGTTTGT CGTGCTGTAA CTGCGGCTTT GTTTGATAAT CCAGATGCAG TAGAATTTCG CAACCTGAAT ACTAAAGAAA GATTTACAGC TTTACAAACT GGGGAAATAG ATTTACTCAG CCGCAATACT ACTTGGACAA TGAGTAGAGA GACTTCAGTT GGTTTGCAAT TTGCACCTGT GATTTTTTAT GATGGTCAAG CAATTATGGT TAGTAAAAAT AGTGGAATCA AATCTTTAGC AGATCTGAAA AATAAAGCAA TTTGCACTCA AACTGGTACA ACTACAGAAC AAAATTTAGC AGACCAAATG CGGAAACGTT CTATTACTTA TAAACCCGTT GTTTTTGAAG ACGTTAATAT TACATTTGCA ACCTATGCTG AAGGACGTTG TGATGGTGTG ACTACTGATC GTTCGGCTTT GATTTCTCGA CGTACGACTT TACCCAAACC GGAAGATAAT ATTATTTTGG ATGAATTACT GTCTTCAGAA CCTCTAGCAC CAGCAGTTGC TAAAGGAGAT CCTCAGTGGA ATGATATCGT TAAATGGGTT GTTTATTCAT TGGTAAAGGC TGAAGAGTTG GGAATTAATT CTCAGAATAT CGCACAACTT ACTAATAGTA ATGACCCAGA TATTAAGCGT TTTTTGGGAA CAGAAGGAAA TTTAGGTGAA GGACTTGGCT TAACAAATGA TTTCGCAGCG AGGATAGTCA AGCACGTTGG TAACTATGGT GAAGTTTACG ATCGTAACCT GGGTGGAAAA ACAAAACTCA ATCTCCCCCG TGGTCAAAAT CAACTTGCTA TAAAAGGTGG ATTACTTTAT TCTCCACCAT TTCGGTAA
|
Protein sequence | MLKSAFILAS APLFFTLAAC SGESGTQVNT ASAPATQVSR NIGSTVKNRG KLICGVSGEL PGFSFVGTDA KYSGIDVDVC RAVTAALFDN PDAVEFRNLN TKERFTALQT GEIDLLSRNT TWTMSRETSV GLQFAPVIFY DGQAIMVSKN SGIKSLADLK NKAICTQTGT TTEQNLADQM RKRSITYKPV VFEDVNITFA TYAEGRCDGV TTDRSALISR RTTLPKPEDN IILDELLSSE PLAPAVAKGD PQWNDIVKWV VYSLVKAEEL GINSQNIAQL TNSNDPDIKR FLGTEGNLGE GLGLTNDFAA RIVKHVGNYG EVYDRNLGGK TKLNLPRGQN QLAIKGGLLY SPPFR
|
| |