Gene Information |
Locus tag | Aazo_0287 |
Symbol | |
ID | 9338071 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | 'Nostoc azollae' 0708 |
Kingdom | Bacteria |
Replicon accession | NC_014248 |
Strand | - |
Start bp | 287602 |
End bp | 288885 |
Gene Length | 1284 bp |
Protein Length | 427 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | |
Product | carboxyl-terminal protease |
Protein accession | YP_003719998 |
Protein GI | 298489821 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
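The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and are not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming its final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record for Aazo_0287.
length = gene_length(287602, 288885)
print(length)                  # 1284
print(protein_length(length))  # 427
```

Under these assumptions both results agree with the Gene Length (1284 bp) and Protein Length (427 aa) fields.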
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTAATTA CAAAAAGTAG ACTTGTTTTA GGTGCTACGG CAGTGACGCT TTCTACAATT GCTGTTACTA GTCTTGGCAT TCACTCCCGT GGTCAGGCTT TATTTAAAGC AAGCCCCAAG GAATTGATAG ACGAAGTTTG GCAAATTGTT TACCGTCAAT ATGTAGACGG GACGTTTAAT CAGGTAGATT GGCAAGCTGT TCGTAAAGAA TATTTAAGCA AGTCCTACAC CAACCAGGAA GAAGCTTATA AGTCGATCCG GGAAATGCTG AAAAAGTTAG AAGATCCTTA CACCCGGTTT ATGAACCCAG AGGAATTCAA GAATATGCAG GTTGATACCT CTGGAGAACT CACAGGGATT GGTATCACGA TCAGTCAGGA TGAAAAAACT AAGCAATTAG TTGTGATTGC CCCGATTGAG GATACACCCG CCTTTAAAAT GGGAGTTATA GCTAAGGATG TGATCCTGGA AATTGATGGC AAAAGCACTG AAGGCATGGA TACTAACCAG GCTGTATCTT TGATTCGCGG TGAAGCGGGA ACTAAGGTCA GATTGAAAAT TTTGCGGAAT GGTCAGAAAA AACAATTTGA TATCACACGG GCCAGGATTG AAATCCATCC GGTTAAGTGT TCTGAAAAAC AAACTCCAGC GGGTAATCTT GGTTACATTC GTCTAAATCA GTTCAGTGCT AATGCCGCCA AGGAAATGAA AGATGCAATT AGTAAATTAG AGACTAAAAA CGTATCTGGT TATATTTTGG ATCTGCGGGG CAATCCTGGT GGTTTATTAT TCTCCAGTGT GGACATTGCC CGAATGTGGT TAGATAAAGG AACTATTGTC TCTACTATTG ACCGTCAAGG TGAACAGGAG AGGGAAATTG CTAAAGGTCG TGCTTTAACT ACTAAACCTT TAGTGGTGTT AGTTGATAAG GGTTCAGCTA GTGCTAGTGA AATTCTTTCC GGTGCTTTGC AGGATAATAA ACGTGCGACC ATAGTGGGTA CGCAAACCTT TGGTAAGGGT TTGGTCCAAT CTGTACGACC CTTGGAAGAT GGTTCAGGGT TAGCAGTGAC TATTGCTAAG TATCATACCC CTAGCGGTAA AGATATTAAT AAGCATGGTA TTGATCCTGA TGTAAAAGTG GATTTAACTG ATGCCCAAAG ACAAGATCTG TGGTTAAAGG AACGGGATAA ACTAGCCACT TTAGAAGATC CCCAATTTGC CAAAGCTGTG GAAATTTTAG GTAAACAAGC TGCTAAAAAT AGTAAGACTA CAAACAAGAA TTAA
|
Protein sequence | MVITKSRLVL GATAVTLSTI AVTSLGIHSR GQALFKASPK ELIDEVWQIV YRQYVDGTFN QVDWQAVRKE YLSKSYTNQE EAYKSIREML KKLEDPYTRF MNPEEFKNMQ VDTSGELTGI GITISQDEKT KQLVVIAPIE DTPAFKMGVI AKDVILEIDG KSTEGMDTNQ AVSLIRGEAG TKVRLKILRN GQKKQFDITR ARIEIHPVKC SEKQTPAGNL GYIRLNQFSA NAAKEMKDAI SKLETKNVSG YILDLRGNPG GLLFSSVDIA RMWLDKGTIV STIDRQGEQE REIAKGRALT TKPLVVLVDK GSASASEILS GALQDNKRAT IVGTQTFGKG LVQSVRPLED GSGLAVTIAK YHTPSGKDIN KHGIDPDVKV DLTDAQRQDL WLKERDKLAT LEDPQFAKAV EILGKQAAKN SKTTNKN
|
| |
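The sequences above are displayed in blocks of ten characters separated by spaces. The following is a minimal sketch, assuming that layout, of how such a string might be normalized and its GC percentage computed for comparison with the GC content field. The helper names are hypothetical, and only the first three blocks of the gene sequence are used here for brevity.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in an already cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First three ten-base blocks of the gene sequence, copied from the record.
fragment = clean_sequence("ATGGTAATTA CAAAAAGTAG ACTTGTTTTA")
print(len(fragment))         # 30
print(gc_percent(fragment))  # GC percentage of this fragment only
```

Passing the full Gene sequence string through the same two helpers would give a figure to compare against the reported 41%, allowing for rounding in the record.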