Gene Information |
Locus tag | Aazo_1705 |
Symbol | |
ID | 9339498 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | 'Nostoc azollae' 0708 |
Kingdom | Bacteria |
Replicon accession | NC_014248 |
Strand | + |
Start bp | 1770465 |
End bp | 1771706 |
Gene Length | 1242 bp |
Protein Length | 413 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | |
Product | carboxyl-terminal protease |
Protein accession | YP_003720978 |
Protein GI | 298490801 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
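The coordinate and length fields above are consistent with each other, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical helpers introduced here for illustration; they are not part of the record or of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


# Values taken from the record for Aazo_1705
length_bp = gene_length(1770465, 1771706)
print(length_bp)                  # 1242
print(protein_length(length_bp))  # 413
```

Both results agree with the Gene Length (1242 bp) and Protein Length (413 aa) fields of the record.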
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.880099 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGGTTCA TGAACAAACA GGTTTTTCGG TTGGGATTTT CATTACTTTT GGCTTTTTGT TTGGGTTTTG GCTCGCTTGT TTCACCTGCA ATGGCTTTAA CACAGGAGCA AAAGCTAGTT TCTGAGGTTT GGCGAATTGT TAATCGCTCT TATCTGGATG AAACATTTAA TCATCAAAAC TGGGCTGATG TACGTCAACA GGCGCTAAGG AAACCACTGC CAAATGACCA AGCAGCATAC AGGGCTATTC AGAAGATGCT AAAAAGCCTT GATGACCCTT TTACCAGGTT TTTAGACCCA GAGCAATACC GCAGTTTGCA AGTTAATACT TCTGGAGAAC TGACCGGAGT GGGTTTACAA ATTGCGCTCA ATCCCCAGAC GGGTGGATTG GAGGTAATTA CACCTATAGA GGGTTCACCG GCTGAGAAAG CAGGGTTAAG ACCTCGCGAT CGCATCTTGA AAATCGAAGG ATTATCTACA GAAAATCTGA CTCTTGATGA AGCTGCTAAA CGGATGCGCG GTCCCGTTGG TAGTGTTGTA ACTCTCTTGA TTGCACGAGA GGGAAAGGAA TACAAAGAAG TGATATTAGT GCGCGATCGC ATAGAACTTA ATCCTGTAGT AGCTGAATTG CGTTTATCCC CCGAAGGAAA ACCCATTGGC TACTTACGCC TAACTCAATT TAATGCTAAT GTGGTAATCA GGTTGGCAGA CGCTCTTAAT AGCCTAGAAA AAAAAGGCGC AGTTGCCTAC ATTCTTGATT TGCGAAATAA TCCTGGTGGG CTATTACAAG CCGGAATTGA AGTTGCCCGT CAGTGGTTAG ATTCAGGCAC AATTGTCTAC ACTGTCAATC GTCAAGGTAT TCAGGGCAAT TTTGAAGCCC TTGGCCCAGC GTTAACACAA GATCCCTTGG TGATTTTGGT GAATGAAGGA ACTGCTAGTG CTAGTGAAAT CCTTGCTGGT GCCCTACAAG ACAATAAACG CGCCCAGTTA GTAGGTGAAA CGACCTTTGG TAAAGGTCTA ATTCAATCTT TGTTTGAATT ATCAGATGGT TCAGGTTTAG CAGTGACAAT TGCCAAGTAT GAAACTCCCA AGCACAGAGA CATTAACAAG TTAGGTATTA AACCAGACAA ACTAATTCCC CAACAACCAA TTACACGGGA GCAAATTGGG ACGGAAGGGG ATAGTCAATA TCAAGCTGCA ATGGAACTGC TAACCAAAGA TTTGGTTGTA GCTGGTTCGT AG
|
Protein sequence | MGFMNKQVFR LGFSLLLAFC LGFGSLVSPA MALTQEQKLV SEVWRIVNRS YLDETFNHQN WADVRQQALR KPLPNDQAAY RAIQKMLKSL DDPFTRFLDP EQYRSLQVNT SGELTGVGLQ IALNPQTGGL EVITPIEGSP AEKAGLRPRD RILKIEGLST ENLTLDEAAK RMRGPVGSVV TLLIAREGKE YKEVILVRDR IELNPVVAEL RLSPEGKPIG YLRLTQFNAN VVIRLADALN SLEKKGAVAY ILDLRNNPGG LLQAGIEVAR QWLDSGTIVY TVNRQGIQGN FEALGPALTQ DPLVILVNEG TASASEILAG ALQDNKRAQL VGETTFGKGL IQSLFELSDG SGLAVTIAKY ETPKHRDINK LGIKPDKLIP QQPITREQIG TEGDSQYQAA MELLTKDLVV AGS
|
| |
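The gene and protein sequences above are related through translation table 11, and the GC content field is a simple base count. The following is a minimal sketch of both calculations, not the method the database itself uses. It assumes the sequence is pasted in as plain text with the space-separated 10-base blocks shown in the record. The names `CODON_TABLE`, `translate` and `gc_content` are hypothetical. Table 11 shares its amino acid assignments with the standard code and differs only in the start codons it allows, so the sketch does not handle alternative start codons such as GTG or TTG; this gene starts with ATG.

```python
# Codon table in the conventional NCBI base order T, C, A, G.
# The amino acid string is the same for the standard code and table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO_ACIDS)
)


def clean(dna: str) -> str:
    """Remove the whitespace between 10-base blocks and upper-case the sequence."""
    return "".join(dna.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    seq = clean(dna)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[i:i + 3]]
        if residue == "*":  # stop codon: end of the protein
            break
        protein.append(residue)
    return "".join(protein)


def gc_content(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(dna)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First three blocks of the gene sequence from the record
print(translate("ATGGGGTTCA TGAACAAACA GGTTTTTCGG"))  # MGFMNKQVFR
```

The ten residues produced match the first block of the protein sequence in the record. Passing the full gene sequence to `translate` and `gc_content` gives a way to compare against the Protein sequence and GC content fields.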