Gene Information |
Locus tag | Aazo_3417 |
Symbol | |
ID | 9341222 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | 'Nostoc azollae' 0708 |
Kingdom | Bacteria |
Replicon accession | NC_014248 |
Strand | - |
Start bp | 3482403 |
End bp | 3483554 |
Gene Length | 1152 bp |
Protein Length | 383 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | |
Product | family 1 extracellular solute-binding protein |
Protein accession | YP_003722186 |
Protein GI | 298492009 |
COG category | |
COG ID | |
TIGRFAM ID | |
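The coordinate and length fields above agree with each other, and the arithmetic can be sketched as follows. This is a minimal sketch that assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length. Both assumptions are inferred from the numbers in this record, not stated by it. The function names `gene_length` and `protein_length` are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length_bp = gene_length(3482403, 3483554)
print(length_bp)                  # 1152
print(protein_length(length_bp))  # 383
```

The strand field does not enter this calculation: the span is the same whichever strand the gene lies on.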
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATCGAC GCTCTTTTTT GTTAGGTGCA AGCGGACTGG TATTTTCCCA GATACTCATG GGTTGTGCTG GTAAAAACCA GACACATCTA AATGTACAGT TATTAAAAGG TTCTATACCT GGTCAGGTGG TTAATCAATT TCGTAAAACT CTAGAGTCAG ATGCAAATTT AAAGTTTGTC CCTATTAATC AAATTTTAGA TTTATTTAAG CAATTACAAA TATGGCAAAA ACCAGAAACT AAAGATCAAC AGGGATGGAA AAGTTATATT CCCTTGATGC AAAGTCAACA ACAATCTAAA GCTGATTTAG TGACATTGGG AGATTTTTGG CTAAAAGCAG CAATTGAACA GAAACTGATT CAACCACTAG AAACAGAAAA AATTAAACAA TGGTTAAGTT TAAATCTCAG GTGGCAGCAA TTAGTAAGAC GCGATGATCA AGGAAATATA GATCCACAAG GAAAGATTTG GGCTGCACCT TATCACTGGG GTAATACAGT CATTATTTAT AATGGGGAAA AATTTCAAAA ATTTGATTGG CAACCAAAAG ACTGGAGCGA CTTATGGCGG AGTGAATTGC GATCGCGTAT TTCCTTACTT AATCATCCCA GAGAAGTCAT TGGTTTGGTT TTAAAAAAAC TAGGAGAATC CTACAATACG GAAAATATTA CTCAAATCCC CGACTTAAAA GCAGAATTAC TGGCACTACA CCAACAAGTA AAATTTTATG ATTCTACTAC CTATCTAGAA CCACTACTCA CCGGAGATAC TTGGTTAGCT GTGGGTTGGT CAAATGACGT TATGCCTATA CTCAGTCGTT ATCCAAAACT TACCGCAGTC ATTCCCCAAT CAGGAACTGC AATGTGGGCA GACTTATGGG TAAGTCCGGC TGAAGTTGAG CAAAACACCT TAGCATCTGA TTGGATTAAT TTTTGTTGGC AACCAAATAT AGCCAAACAA ATTGCCATAC TGACTAAAAA TAATTCGCCT ATAACAAATA TTGTAGCCTC TGATCTTCAG AAACCATTAC AAAACTTGTT ACTAAATAAT CAGGAATTAT TTGATAAAAG TGAATTTTTA CTCCCCTTAC CAGCATCAGT CAATAAGGGT TATAAGTATT TATTTAACAA AATAAAAAAT TCCGAACAAT GA
|
Protein sequence | MNRRSFLLGA SGLVFSQILM GCAGKNQTHL NVQLLKGSIP GQVVNQFRKT LESDANLKFV PINQILDLFK QLQIWQKPET KDQQGWKSYI PLMQSQQQSK ADLVTLGDFW LKAAIEQKLI QPLETEKIKQ WLSLNLRWQQ LVRRDDQGNI DPQGKIWAAP YHWGNTVIIY NGEKFQKFDW QPKDWSDLWR SELRSRISLL NHPREVIGLV LKKLGESYNT ENITQIPDLK AELLALHQQV KFYDSTTYLE PLLTGDTWLA VGWSNDVMPI LSRYPKLTAV IPQSGTAMWA DLWVSPAEVE QNTLASDWIN FCWQPNIAKQ IAILTKNNSP ITNIVASDLQ KPLQNLLLNN QELFDKSEFL LPLPASVNKG YKYLFNKIKN SEQ
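The sequences above are printed in space-separated blocks of ten letters, so the blocks have to be joined before any length or composition check. The sketch below shows one way to do that and to compute a GC fraction. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the example runs on only the first two blocks of the gene sequence, so its result is not the 36% reported for the full gene.

```python
def clean_sequence(raw: str) -> str:
    """Join whitespace-separated blocks into one uppercase sequence string."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First two ten-letter blocks of the gene sequence in this record.
prefix = clean_sequence("ATGAATCGAC GCTCTTTTTT")
print(len(prefix))          # 20
print(gc_fraction(prefix))  # 0.35
```

Applied to the full gene sequence, `len(clean_sequence(...))` would be expected to match the Gene Length field, and the same cleaning step works for the protein sequence.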