Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Aazo_4118 |
Symbol | |
ID | 9341923 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | 'Nostoc azollae' 0708 |
Kingdom | Bacteria |
Replicon accession | NC_014248 |
Strand | + |
Start bp | 4185725 |
End bp | 4187356 |
Gene Length | 1632 bp |
Protein Length | 543 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | |
Product | family 1 extracellular solute-binding protein |
Protein accession | YP_003722682 |
Protein GI | 298492505 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.794373 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACACGGT GGGGTAGAAT AGCGAAATTT CTATCTTTAT TCTCTATCTG TTTATTGTTG ACTGTAAGCT GTACCCCTCC TCAACAGATA ACTACTCCAA CATCTGGTGC TGTCAATACT CCTGCCAGTG ATGGACGGAT TACTATCGGT ACGACTCAGA AACCTCGTAC CCTTGATCCG GCTGATGCGT ATGAATTAGC ATCTATGGGT TTGGTGTTTA ATATGAGTGA TCGCCTATAT ACTTACGAAC CAGGGAGTAT AGAAATTAAA CCCCAACTGG CTACAGCTTT ACCTAAAGTT AGTGCAGATG GTTCAACATA CACCATACCT ATACGTCAAG GAGTGCTCTT TCACGATGGT ACACCTTTTA ACGCTAAAGC AATGGAATTT ACCATCCAGC GTTTTATCGA AAATAAAGGT AAACCATCTT TCTTACTATC AGATACGGTA GATTCAGTGA AAGCTACAGG GGATTATGAA TTAACAATTA AGCTGAAAAA GCCCTTTGCA GCTTTTCCTT CACTGTTAGC ATTTTCTGGA GTGTGTGCTG TCTCTCCGAG AGCTTACGAA ATAGGTGCAG GTAAATTTCA ACCCAATATC TTTGTGGGAA CTGGCCCTTA TAAATTAGCC CAGTATGGGA CTGATTCTCT CAGATTTGAT GTATTTGATA AATATTGGGG AGAAAAACCA GCGAATAAAG GTATTAATGT CCAGATTCAA AGCAGTCCAG TGAATTTGTT CAATGCTTTT AAAACTAGTG CGGTAGACGT TGCTTATCTA TCTTTACAAC CAGACCAAAT TCGTAGTTTA GAAGAAGGTG CTAAAAAAGG AGATTGGCAA ACCATCACTG CCCAAGGTAG TGTAGTGAGT TATATGGTGT TGAATCGCAA TCAGAAACCT TTGGATAAAC CAGAAGTTAG AAGAGCGATC GCATCACTCA TTAATCGTCA ATTATTCAAT GAGCGAGTTT TGTTTAATCA GGCAGATCTA CTTTACACCA TGATTCCCAC TACCTTTAAT GTTTCCCAGC CATTATTTCA AGCTAAATAT GGTGATGGTA ACTTTGAAGA AGCTAAAAAG TTGTTAACTA CCGTTGGTTT TTCCCAACAA AATCCCGCTA AAGTGCAAGT TTGGTATCCT GCGAGTTCAC CAACTCGAAG TTTAGCCGCA CAAACACTCA AATCCTTGGC TGATACTAAA ATGGATGGGA TATTACAATT GGAAGTAAAA ACCGTAGAAG GTGCTACATT TTTTAAAGAA ATTTCCAAAA GTTTATATCC AGTAGCTTTA CTAGATTGGT ATCCAGACTT TTTAGACCCA GATAATTACG TACAACCATT TTTAGCTTGT GAAAAAGGTT CAGAATCAAA AGGCTGTGAA GACGGAGGAA GTCAAATGCA AGGGTCATTT TACTATAGCG AAACAATGAA TAAACTCATT GATCAACAAC GTAAGGAACA AAACCCAGAA GCCAGAAAAA GAATATTTAC TGAGATTCAA AGCCAAGTAG TTAATGATGT CCCTTATATT CCTTTATGGC AAAACAAAGA TTATGTATTT GCCCAAAAAG GTGTAAGTAA TGTAAAACTT GATCCTACCC AGAACTTGAT TTACAAGAAC ATTAAAAAGT AG
|
Protein sequence | MTRWGRIAKF LSLFSICLLL TVSCTPPQQI TTPTSGAVNT PASDGRITIG TTQKPRTLDP ADAYELASMG LVFNMSDRLY TYEPGSIEIK PQLATALPKV SADGSTYTIP IRQGVLFHDG TPFNAKAMEF TIQRFIENKG KPSFLLSDTV DSVKATGDYE LTIKLKKPFA AFPSLLAFSG VCAVSPRAYE IGAGKFQPNI FVGTGPYKLA QYGTDSLRFD VFDKYWGEKP ANKGINVQIQ SSPVNLFNAF KTSAVDVAYL SLQPDQIRSL EEGAKKGDWQ TITAQGSVVS YMVLNRNQKP LDKPEVRRAI ASLINRQLFN ERVLFNQADL LYTMIPTTFN VSQPLFQAKY GDGNFEEAKK LLTTVGFSQQ NPAKVQVWYP ASSPTRSLAA QTLKSLADTK MDGILQLEVK TVEGATFFKE ISKSLYPVAL LDWYPDFLDP DNYVQPFLAC EKGSESKGCE DGGSQMQGSF YYSETMNKLI DQQRKEQNPE ARKRIFTEIQ SQVVNDVPYI PLWQNKDYVF AQKGVSNVKL DPTQNLIYKN IKK
|
| |