Gene Aazo_1099 details


Gene Information

Locus tag: Aazo_1099
Symbol:
ID: 9338895
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: 'Nostoc azollae' 0708
Kingdom: Bacteria
Replicon accession: NC_014248
Strand:
Start bp: 1182463
End bp: 1183950
Gene Length: 1488 bp
Protein Length: 495 aa
Translation table: 11
GC content: 39%
IMG OID:
Product: peptidase S8 and S53 subtilisin kexin sedolisin
Protein accession: YP_003720572
Protein GI: 298490395
COG category:
COG ID:
TIGRFAM ID:
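
The length fields in this record agree with its coordinates if Start bp and End bp are read as 1-based, inclusive positions and the final codon is a stop codon. The page does not state either convention, so both are assumptions here, and the function names are illustrative only. A minimal sketch of the check:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Coordinates taken from the record above.
length = gene_length(1182463, 1183950)
print(length, protein_length(length))  # prints "1488 495"
```

Both values match the Gene Length and Protein Length fields listed above.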


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0536343
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAATTTG ATACTACTAA CAACTTCTCT GCTCAGAAAG AACTCAATAT TGCTGGTATT 
TCCACAATTG ATACATTTCA AACTCAGGCT GACAGTAGTG TAAGTTGGGG TAATCGTAGT
AGTTTACCAT CACAGGAAAC TAGTGAGTTT GTTGTCACCA CTAGTAGCTA TAACTCCAAC
AATGGCTATG GCTTAGTCAA TGCAGGATCA GCAGTCAGTA AAGCCGCTGA AGATAGTCCT
TATACAGATG CTCCTAAACT GGGTAGAAAT AGTTGGGGTG CTGATTTAAT AAATGCTCCC
ACAGCGTGGG AACATGGATA TACAGGCCAG TCCATTATTG TTGCCGTTTT AGATACTGGA
ATTGACTACA ACCATAATGA TTTGAATGAT AATATCTGGA CAAATAATAA AGAAATTGCT
GGTAATGGTA TAGATGATGA TGGCAATGCT TATATTGATG ACTTTCAAGG TTGGAACTTT
GATAGTAATA CCAATAATGT TTTCGATGAC AATGGTCATG GAACTCATGT TTCTGGAACT
ATTGCCGGAG AAAATAACAG TGATGGTGTG ACTGGTATTG CCTATAATTG CAAAATTATG
GCAGTAAAAG TTTTAGATAA AAGTGGTTCA GGTTCTTATG CAAATATCGC TAATGGTATC
CGTTATGCCG TAGATAATGG CGCAAATGTG ATTAACCTTA GCTTAGGAGG TAATGTTTCT
AACAACACTC TCAAAATAGC TATTGAATAT GCTGGCAGTA ACGGGGTAAT TGTTGTTATG
TCCGCAGGTA ACGATGGCGA CTCTACACCA TCCTATCCGG CTCGTTATGC CAATGATTCA
GGAATTGCTG TTGGGGCAGT AAATCAAAAT AATCAACTGA CTGATTTTTC TAACCGTTCT
GGTTCTCAAG AAATCAAATA TGTCACTGCT CCAGGTGAGA ATATTTACTC CACACTGCCA
GGTAATAAAT ATGGTAATTA CACTGGCACT TCTATGGCTG CTCCCCATGT AGCTGGGGTA
GTGGCGCTGA TGCTTAGTGC TAACCCCAAC CTGTCAGAAA GCCAAGTGCG CGACATGATC
ACAAGTACAG CTCAAAATGG GACAAACTCT CAAGAACCTA GTCAGCCTTC AAATCCAATG
CCTTCTATAC CCTCTAACTT TCCACCTCTA GATTCTTTCT TTCCTATCAA TTCGATCAAT
ATAGGTTCTC AATTTCCTTT TGATATAGGT TCACTATTTC CGCTAGGTTC TCAAACACAA
TCAGCTACGC AATTACCACC AATTATTTTG TCTGTTGGTG ATAATAGTTT ACAGTTGAAG
TTTGGTAATG TTAGTACGGG AACTTCCACA ACTGCATACT TCACTTATGA CAGTGAAAAT
AAACATAGGT GGGCATTGCG TTATTTTGAC ACCACTGGGG TTACATACAC TTTTGTTAAT
GATGGGGATA TTGAAGCGGA AGATTTGTTT AAGAACTATT ATCCCTAG
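
The gene sequence is printed in blocks of 10 bases, so the spaces and line breaks have to be stripped before any computation. The sketch below shows one way a GC fraction such as the "GC content" field could be derived. The helper names are hypothetical, and the page does not say how its own figure was computed or rounded.

```python
def clean_sequence(blocked: str) -> str:
    """Remove all whitespace from a blocked sequence listing and upper-case it."""
    return "".join(blocked.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C (0.0 to 1.0)."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# Usage on the first two blocks of the listing; paste the full
# listing in the same way to evaluate the whole gene.
fragment = clean_sequence("ATGAAATTTG ATACTACTAA")
print(len(fragment), f"{gc_content(fragment):.0%}")
```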
 
Protein sequence
MKFDTTNNFS AQKELNIAGI STIDTFQTQA DSSVSWGNRS SLPSQETSEF VVTTSSYNSN 
NGYGLVNAGS AVSKAAEDSP YTDAPKLGRN SWGADLINAP TAWEHGYTGQ SIIVAVLDTG
IDYNHNDLND NIWTNNKEIA GNGIDDDGNA YIDDFQGWNF DSNTNNVFDD NGHGTHVSGT
IAGENNSDGV TGIAYNCKIM AVKVLDKSGS GSYANIANGI RYAVDNGANV INLSLGGNVS
NNTLKIAIEY AGSNGVIVVM SAGNDGDSTP SYPARYANDS GIAVGAVNQN NQLTDFSNRS
GSQEIKYVTA PGENIYSTLP GNKYGNYTGT SMAAPHVAGV VALMLSANPN LSESQVRDMI
TSTAQNGTNS QEPSQPSNPM PSIPSNFPPL DSFFPINSIN IGSQFPFDIG SLFPLGSQTQ
SATQLPPIIL SVGDNSLQLK FGNVSTGTST TAYFTYDSEN KHRWALRYFD TTGVTYTFVN
DGDIEAEDLF KNYYP
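
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below is a minimal illustration, not the database's own pipeline. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the tables differ in permitted start codons), and it assumes the input contains only A, C, G and T. All names are illustrative.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (standard code; table 11
# uses the same assignments and differs only in alternative start codons).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the 10-base block spacing
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for pos in range(0, len(seq), 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":  # stop codon: end of the protein
            break
        residues.append(amino_acid)
    return "".join(residues)


# First three blocks of the gene sequence listed above.
print(translate("ATGAAATTTG ATACTACTAA CAACTTCTCT"))  # prints "MKFDTTNNFS"
```

The output matches the first block of the protein sequence. A gene that begins with an alternative start codon such as GTG or TTG would need its first residue set to M separately, which is not required here because this gene starts with ATG.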