Gene Aazo_0287 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAazo_0287 
Symbol 
ID9338071 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism name'Nostoc azollae' 0708 
KingdomBacteria 
Replicon accessionNC_014248 
Strand
Start bp287602 
End bp288885 
Gene Length1284 bp 
Protein Length427 aa 
Translation table11 
GC content41% 
IMG OID 
Productcarboxyl-terminal protease 
Protein accessionYP_003719998 
Protein GI298489821 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTAATTA CAAAAAGTAG ACTTGTTTTA GGTGCTACGG CAGTGACGCT TTCTACAATT 
GCTGTTACTA GTCTTGGCAT TCACTCCCGT GGTCAGGCTT TATTTAAAGC AAGCCCCAAG
GAATTGATAG ACGAAGTTTG GCAAATTGTT TACCGTCAAT ATGTAGACGG GACGTTTAAT
CAGGTAGATT GGCAAGCTGT TCGTAAAGAA TATTTAAGCA AGTCCTACAC CAACCAGGAA
GAAGCTTATA AGTCGATCCG GGAAATGCTG AAAAAGTTAG AAGATCCTTA CACCCGGTTT
ATGAACCCAG AGGAATTCAA GAATATGCAG GTTGATACCT CTGGAGAACT CACAGGGATT
GGTATCACGA TCAGTCAGGA TGAAAAAACT AAGCAATTAG TTGTGATTGC CCCGATTGAG
GATACACCCG CCTTTAAAAT GGGAGTTATA GCTAAGGATG TGATCCTGGA AATTGATGGC
AAAAGCACTG AAGGCATGGA TACTAACCAG GCTGTATCTT TGATTCGCGG TGAAGCGGGA
ACTAAGGTCA GATTGAAAAT TTTGCGGAAT GGTCAGAAAA AACAATTTGA TATCACACGG
GCCAGGATTG AAATCCATCC GGTTAAGTGT TCTGAAAAAC AAACTCCAGC GGGTAATCTT
GGTTACATTC GTCTAAATCA GTTCAGTGCT AATGCCGCCA AGGAAATGAA AGATGCAATT
AGTAAATTAG AGACTAAAAA CGTATCTGGT TATATTTTGG ATCTGCGGGG CAATCCTGGT
GGTTTATTAT TCTCCAGTGT GGACATTGCC CGAATGTGGT TAGATAAAGG AACTATTGTC
TCTACTATTG ACCGTCAAGG TGAACAGGAG AGGGAAATTG CTAAAGGTCG TGCTTTAACT
ACTAAACCTT TAGTGGTGTT AGTTGATAAG GGTTCAGCTA GTGCTAGTGA AATTCTTTCC
GGTGCTTTGC AGGATAATAA ACGTGCGACC ATAGTGGGTA CGCAAACCTT TGGTAAGGGT
TTGGTCCAAT CTGTACGACC CTTGGAAGAT GGTTCAGGGT TAGCAGTGAC TATTGCTAAG
TATCATACCC CTAGCGGTAA AGATATTAAT AAGCATGGTA TTGATCCTGA TGTAAAAGTG
GATTTAACTG ATGCCCAAAG ACAAGATCTG TGGTTAAAGG AACGGGATAA ACTAGCCACT
TTAGAAGATC CCCAATTTGC CAAAGCTGTG GAAATTTTAG GTAAACAAGC TGCTAAAAAT
AGTAAGACTA CAAACAAGAA TTAA
 
Protein sequence
MVITKSRLVL GATAVTLSTI AVTSLGIHSR GQALFKASPK ELIDEVWQIV YRQYVDGTFN 
QVDWQAVRKE YLSKSYTNQE EAYKSIREML KKLEDPYTRF MNPEEFKNMQ VDTSGELTGI
GITISQDEKT KQLVVIAPIE DTPAFKMGVI AKDVILEIDG KSTEGMDTNQ AVSLIRGEAG
TKVRLKILRN GQKKQFDITR ARIEIHPVKC SEKQTPAGNL GYIRLNQFSA NAAKEMKDAI
SKLETKNVSG YILDLRGNPG GLLFSSVDIA RMWLDKGTIV STIDRQGEQE REIAKGRALT
TKPLVVLVDK GSASASEILS GALQDNKRAT IVGTQTFGKG LVQSVRPLED GSGLAVTIAK
YHTPSGKDIN KHGIDPDVKV DLTDAQRQDL WLKERDKLAT LEDPQFAKAV EILGKQAAKN
SKTTNKN