Gene Information |
Locus tag | Aazo_2734 |
Symbol | |
ID | 9340535 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | 'Nostoc azollae' 0708 |
Kingdom | Bacteria |
Replicon accession | NC_014248 |
Strand | - |
Start bp | 2820794 |
End bp | 2822035 |
Gene Length | 1242 bp |
Protein Length | 413 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | |
Product | S-layer domain-containing protein |
Protein accession | YP_003721725 |
Protein GI | 298491548 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.115763 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTTACTT TAAATCCCTT GCAATCCAAA ATAGCCTTAT TCATGGCTGT GAGTATCACG GTAGGTACTG TAGCACCTTT GATGACTGTT GCACCTTCTT TTGCTCAAAC CAGTTTTTCT GATGTTTCAT CTAACTACTG GGCATCACAA TTTATACAAC AGTTATCACA ACGGGGTGTA ATTGCAGGAT TTCCAGATGG TACATTCCGT GCAGAAGAGC CTGTGACACG GGCACAATTT GCTGCCATGA TTAATAAAGC TTTTAGCAAG TCTGCACAGC GGCAGCCAAT CAATTTTAAC GATGTACCCA GTAATTATTG GGCATATAGC GCCATTAGAC AAGCTTATAC CATCGGTTTC TTATCTGGAT ATCCGGGCAA TACTTTCAAA CCTAATCAAG CTATTCCCCG TGAACAGGTT TTAGTTTCCC TGGCTAACGG TTTGGACTAT GTTGCTACCA GTAATGTGGA ATATACTTTA CAGTACTACA ACGATTCTGC TAGTATCTCT GGCTATGCTC GCAGTCCGAT CGCAGCAGCA ACTGATAGAA AAATTGTTGT TAACTATCCT AATGTTAAGT TTCTCAATCC TGGTGTGACT GCTACTAGGG CGCAGGTAGC AGCTTTTATT TATCAAGCAT TGGTTAGTTC TAATCAAGCT TCTGCGATTA ATTCACCTTA TATTGTTTCT CTAGTTACGC CACCACCACC ACCCATATCT GTAACCATTC CTCAAGGAAC TATTATTCCT GTGAAGTATG AAAAGGCTGG CAAAATCCTC GTTACCAAAG ATGAAACTGC ACCTTTAACT CTAACAACAT CACAAAATGT CATTACCCAA GATGGTACAG TAGTTATTCC TGCTGGTAGT GAGATAATAG GTGAACTTAG ACCTGGTCAA GGTGGTTCTC AATTTATTGC TCAAAAATTA ATTTTGACGA CAGGAAAAGA ATATAATCTT GTTGCTAGTT CTGATGTGAT CACCAAAACT GAAACTGTTA AAAAAGGAAT TAGCACCAGT TCAATTATCA AAAATACTGT ATTGGGTGCA GGTGCAGCAG CAGCAGTATC TGCGGTGACA GGCGATCGCG CTATTGCCAC AGAAGAAGTC CTTGGAGGTG CTGGTATTGG GGCGCTAATT GGTATTTTAT TTGGTAGAAA TAGCGTTGAT TTAATTGCCA TTGAACCTAA TACAGATTTA GAAATGACAA TTAATCAAAA CTTAGTAGTT TCTCTACAAT AG
|
Protein sequence | MFTLNPLQSK IALFMAVSIT VGTVAPLMTV APSFAQTSFS DVSSNYWASQ FIQQLSQRGV IAGFPDGTFR AEEPVTRAQF AAMINKAFSK SAQRQPINFN DVPSNYWAYS AIRQAYTIGF LSGYPGNTFK PNQAIPREQV LVSLANGLDY VATSNVEYTL QYYNDSASIS GYARSPIAAA TDRKIVVNYP NVKFLNPGVT ATRAQVAAFI YQALVSSNQA SAINSPYIVS LVTPPPPPIS VTIPQGTIIP VKYEKAGKIL VTKDETAPLT LTTSQNVITQ DGTVVIPAGS EIIGELRPGQ GGSQFIAQKL ILTTGKEYNL VASSDVITKT ETVKKGISTS SIIKNTVLGA GAAAAVSAVT GDRAIATEEV LGGAGIGALI GILFGRNSVD LIAIEPNTDL EMTINQNLVV SLQ
|
| |
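The length fields in the record above are mutually consistent, and a short Python sketch can check them. The function names here are hypothetical and not part of any database tool. The sketch assumes that Start bp and End bp are 1-based inclusive coordinates and that the unspliced CDS ends in a stop codon that is not counted in the protein length (the gene sequence above ends in TAG).

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_percent(seq: str) -> float:
    """Percent G+C in a nucleotide string; whitespace between blocks is ignored."""
    cleaned = "".join(seq.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    gc = sum(1 for base in cleaned if base in "GC")
    return 100.0 * gc / len(cleaned)


# Values taken from the Gene Information table for Aazo_2734.
length_bp = gene_length(2820794, 2822035)   # 1242, matches "Gene Length"
length_aa = protein_length(length_bp)       # 413, matches "Protein Length"
```

Passing the space-separated gene sequence to `gc_percent` gives a value that can be compared against the reported GC content of 40%, which the record presumably rounds to a whole percent.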