Gene Aazo_0766 details

Gene Information

Locus tag: Aazo_0766
Symbol:
ID: 9338552
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: 'Nostoc azollae' 0708
Kingdom: Bacteria
Replicon accession: NC_014248
Strand:
Start bp: 809512
End bp: 810891
Gene Length: 1380 bp
Protein Length: 459 aa
Translation table: 11
GC content: 42%
IMG OID:
Product: peptidase M24
Protein accession: YP_003720325
Protein GI: 298490148
COG category:
COG ID:
TIGRFAM ID:
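
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record: Start bp 809512, End bp 810891.
length = gene_length_bp(809512, 810891)
print(length)                     # 1380
print(protein_length_aa(length))  # 459
```

Both results agree with the Gene Length and Protein Length fields listed in the record.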


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.498025
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a

Sequence

Gene sequence
ATGCACACCT CCAGCATCTT TCTAGAAACC CTCCACCATC GTCGTCAAAG ACTGGCAGAA 
CTGATAGATT TTCCAGCAAT TCTCTGGTCT GGTGGTAGCA GTTCCCGCAA CTTTCCAGCT
AATGTCTTCC CCTTTCGCCC TAGTAGTCAT TTCCTCTATT TTGGAGGAAT TCCTCTCCAA
AATGCTGCCA TTCGCCTAGA AAGTGGGAAG CTACAACTAT TTATAGATGA CCCTAACCCC
AGTAGCACCC TGTGGCACGG AGAAACACCA ACCCGAGAGG AAATAGCCGC AAATATAGGT
GCAGATGATG CTAGACCGAT CGCAGAATTA GAAGATTATT TGGAGAATGC TGCCACTATT
CCTGTTCAAG ATGCGGCAAC TTCGACAGAG CAATCACTAT TATTACATAG ATGGCTTTTA
CCCCAACAAC CACCCCAAGG AATTGATTTA GAATTAGCTA AAGCTATTGT TTCCTTGCGT
CTCACCCACG ATGCAGCGGC ATTAGTAGAA TTGCGTAAAG CTGTGGCGGT GAGTGTGGAA
GCACACAAAG CGGGAATGGT TGTTACATCT ACAGCAAAAC TAGAAGCAGA AGTTCGGGCG
GCAATGGAAG CAGTGATTAT AGGTTATAAT ATGACAACTG CTTACGCCAG CATTGTGACA
GTGCATGGTG AATTCTTACA CAATAACCAC TATTATCACT CGTTAGAACC CGGAGATTTA
CTTTTAGCCG ATGTGGGTGC AGAAACTGAA ACAGGTTGGG CTGCTGATAT TACCCGGACG
TGGCCTGTAT CTGGTAAGTT TTCATCTACC CAAAGAGATA TTTATGATAT TGTATTAGCT
GCCCATGATG CTTGTTTTGA AAAAATAGCT CCTGGTGTGG AATATGGGGA AATTCATCTT
ATAGCTGCAA CTGTGATTAC GGAAGGTTTG GTGGATTTGG GAATTTTACA AGGTAAGCCA
GAAGATTTGG TAAAAATGGA TCTTCATGCA TTATTTTTCC CCCACGGAAT TGGGCACTTA
TTAGGTTTAG ATGTCCATGA TATGGAGGAT TTGGGGGATT TAGCTGGGTA TGAAGAGGGA
AGAAAAAGAA GTGATCGATT TGGGTTAAGT TACCTGCGTT TGAATCGTCC TCTGCGTGCA
GGAATGTTAG TAACTATTGA ACCTGGATTT TATCAAGTTC CCGGAATTTT AAATGATCCA
AAAATTCGTG ATCAATATGA ATATTTAATC AATTGGGAAC GCTTAGAACA ATTTGCAGAT
GTGCGTGGAA TTCGCATTGA AGATGATGTT CTAGTTACAG AATCAGGTAG CGAAGTCTTA
ACAGCCGCAT TACCAAATCA AGCTAATAAT ATAGAAGATT TGTTAAAACT TCCAAAATAA
 
Protein sequence
MHTSSIFLET LHHRRQRLAE LIDFPAILWS GGSSSRNFPA NVFPFRPSSH FLYFGGIPLQ 
NAAIRLESGK LQLFIDDPNP SSTLWHGETP TREEIAANIG ADDARPIAEL EDYLENAATI
PVQDAATSTE QSLLLHRWLL PQQPPQGIDL ELAKAIVSLR LTHDAAALVE LRKAVAVSVE
AHKAGMVVTS TAKLEAEVRA AMEAVIIGYN MTTAYASIVT VHGEFLHNNH YYHSLEPGDL
LLADVGAETE TGWAADITRT WPVSGKFSST QRDIYDIVLA AHDACFEKIA PGVEYGEIHL
IAATVITEGL VDLGILQGKP EDLVKMDLHA LFFPHGIGHL LGLDVHDMED LGDLAGYEEG
RKRSDRFGLS YLRLNRPLRA GMLVTIEPGF YQVPGILNDP KIRDQYEYLI NWERLEQFAD
VRGIRIEDDV LVTESGSEVL TAALPNQANN IEDLLKLPK
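
The sequences above are printed in space-separated blocks of ten characters. As a minimal sketch of how such a listing could be processed, the Python below strips the block spacing, computes a GC fraction, and translates DNA using the amino acid assignments of NCBI translation table 11 (identical to the standard code for codon-to-residue mapping). The helper names are hypothetical, and the sketch assumes an unambiguous A/C/G/T sequence read in frame from its first base; alternative start codons are not handled.

```python
from itertools import product

# Amino acid assignments for NCBI translation table 11, in TCAG codon order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove the block spacing and line breaks used in the listing."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    if not dna:
        raise ValueError("empty sequence")
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 21 bases of the gene sequence as printed in the record.
first_block = clean_sequence("ATGCACACCT CCAGCATCTT T")
print(translate(first_block))  # MHTSSIF
```

The translated fragment matches the first seven residues of the protein sequence listed above. Applying the same functions to the full gene sequence is left to the reader.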