Gene Aazo_3784 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAazo_3784 
Symbol 
ID9341589 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism name'Nostoc azollae' 0708 
KingdomBacteria 
Replicon accessionNC_014248 
Strand
Start bp3842577 
End bp3845153 
Gene Length2577 bp 
Protein Length858 aa 
Translation table11 
GC content40% 
IMG OID 
Productpeptidase M1 membrane alanine aminopeptidase 
Protein accessionYP_003722442 
Protein GI298492265 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCGCATC TTTACTTCGA TACAGAGAAT AACGGACATA AATCTTTTGA GTTACCAGGG 
GCTAAACCTC ACTACAATCC TGATAGACCA GGACAGGTAG AGCATATTTT TCTTGACCTC
ACCTTGAATA TCCCCAACCA AAGTTACTAT GGTAATTGTA GTATTCGGTT ATTGCCAATC
CGTAATAATA TTGATTGCTT GACCTTGGAT GCTGTAAATT TGAATATCCA ATCTGTACAG
GTAGACGAAG CAGAACAGAA GTTTGAATAT GACGGAGAAA AACTTGCAAT TCTTCTTTCT
GAGCCTACAC AAATTGGTCA GCGTTTGTTA ATTGCGATCG CTTACTCTGT AGAAAAACCC
CAACGAGGCA TTTACTTCAT TCAACCAGAC AAACACTACC CCCACAAGCC TACCCAAGTC
TGGACACAAG GAGAAGACGA AGATTCGCGC TTCTGGTTCC CCTGCTTTGA CTACCCAGGA
CAGCTATCCA CATCAGAAAT CTGTGTCCGT GTTGCCAAAC CCCTGATTGC TATTTCTAAT
GGTGAACTAA TTGATACCTT TGAAGATGGG AATCAGCAAA TTTACCATTG GTCACAGCAG
CAGGTTCATC CCACCTACTT AATGACCCTA GCAGTAGGTG ATTTTGCAGA AATTCGGGAT
GCGTGGCAAG GTAAGCCCGT TACGTACTAT GTAGACAAGG GACGAGAAGC AGATGCTAAA
CGCAGCATGG GCAAAACTCC CCGCATGATT GAATTTCTGA GCGAAAAGTA TGGTTATCCC
TATCCATTCC CGAAATATGC CCAAGTTTGT GTAGATGACT TCATCTTTGG GGGCATGGAA
AACACTTCCA CCACCTTATT AACAGATCGA TGTTTGTTAG ATGAAAGAGC ATCATTAGAT
AACTTCAACA CCGAAAGCTT AGTTGTCCAT GAACTAGCAC ATCAATGGTT TGGTGATTTA
ATCGTCATTA AACATTGGTC TCATGCTTGG ATTAAGGAAG GAATGGCTTC CTATTCGGAA
GTAATGTGGA CTGAACACGA ATATGGCAAA GATGATGCAG CTTATTATCA GTTATTACAA
GCTCGCAGTT ATTTGAATGA AGATAGTAGT CGTTATCGTC GGCCAATGGT AACTCACGTT
TACCGGGAAG CAATAGAGCT TTATGACCGC CACATTTATG AAAAAGGATC TTGTGTTTAT
AACATGATTC GCACAGAATT AGGCGATGAA TTATTTTGGA CAGCTATTCA AACATTTGTT
CAAGATCATG CTCACCAAAC TGTAGAAACA GTAGATTTAT TGCGGGCAAT TGAAAAAGCG
ACAGGACTTA ATCTCACCTT CCTATTTGAC CAGTATGTCT ATCGTGGTGG TCATCCTGAT
TTTAAAGTTG CTTATTCTTG GGATGGGGAT TCTAAATTGG CCAAGGTAAC TGTTACCCAA
ACCCAAGTCA ACAGTAATAA TAAGGATTTA TTTGATTTAA AAATCCCTAT CGGCTTTGCT
TACAGCCAAC AAACATTAAA CCTGACAAGT TTCACAGTGC GGGTCAATGA AAAAGAACAA
AGTTTCTATT TCCCATTAGC AGAGAAGCCA GATTTTATCA GCTTTGATGT AGGTAATAAT
TATCTGAAAA CTGTAACCTT AGAATACCCC ATACCAGAGT TGAAAGCCCA ATTAGAATTT
GACCCTGATC CTATTTCTCG TATTTATGCA GCCGAAGCTT TAGCGAAAAA AGGTGGATTA
GAAGTTACCA AAGTTCTATC AACAGCATTA AAAAATGATC CCTTTTGGGG TGTGCGTGTA
GAAGTAGCCC AAAAATTATC AGAAATTCAG TTATATCAGG CCTTTGATGG TTTAGTAATT
GGGTTACAAG ACCCTAGCCC CTTTGTCCGC AAAGCAGTAG TTAGTTCCTT ATCGCAAATC
AAAACTCACC CTAGTTACAA GGCTGTCAAA GCTGTAGTGC AGAATGGAGA TCCAAGCTAC
AATGTAGAAG CAGCAGCTTG TCGAACTATT GGTGCAATCG CAGCGGCTCA TTTAGAAGAA
AAACCCCATG AAGAAAAAGT CATCAAGCTG CTCAAATCAG TGTTAGAAGA AAGAGCAGGT
TGGAATGAAG TAATACGTAG TGGGGCGATT TCTGGTTTAG CTGAATTTAA AACTTCAGAA
ACCGCTTTAA ATTTGCTATT GGAATATACT AAAATCGGCG TGTCCCAACC GCTACGGCTA
TCTACTACTC GTGCTTTAGG AAAAATTTCT GTTGGTCAAA CTCCTACTAA TGTAGAACGA
ATTTTAGACC GATTAGCAGA AATAGCCAAA GAAGCATTCT TTTTAACCCA AGTTGCAGTA
GTAACAGCAT TAGGACAAAT GGAAACCCCT AAAGCAATTG GAATTTTGCA ATCTTTAGCC
AGTCAAACAA CAGATGGACG GGTGCGTCGT TATGCTGAAG AAGAAATTAT CAAAGTTCAG
AAAAATATTG GACCAGAAAA AACTTTGCGT CAGTTACGTG AAGAACTTAA CCAACTCAAA
CAACAAAATC AAGAACTAAA AAGCCGCTTG GAAAATTTGG AGGCTAAATC AAAGTAA
 
Protein sequence
MSHLYFDTEN NGHKSFELPG AKPHYNPDRP GQVEHIFLDL TLNIPNQSYY GNCSIRLLPI 
RNNIDCLTLD AVNLNIQSVQ VDEAEQKFEY DGEKLAILLS EPTQIGQRLL IAIAYSVEKP
QRGIYFIQPD KHYPHKPTQV WTQGEDEDSR FWFPCFDYPG QLSTSEICVR VAKPLIAISN
GELIDTFEDG NQQIYHWSQQ QVHPTYLMTL AVGDFAEIRD AWQGKPVTYY VDKGREADAK
RSMGKTPRMI EFLSEKYGYP YPFPKYAQVC VDDFIFGGME NTSTTLLTDR CLLDERASLD
NFNTESLVVH ELAHQWFGDL IVIKHWSHAW IKEGMASYSE VMWTEHEYGK DDAAYYQLLQ
ARSYLNEDSS RYRRPMVTHV YREAIELYDR HIYEKGSCVY NMIRTELGDE LFWTAIQTFV
QDHAHQTVET VDLLRAIEKA TGLNLTFLFD QYVYRGGHPD FKVAYSWDGD SKLAKVTVTQ
TQVNSNNKDL FDLKIPIGFA YSQQTLNLTS FTVRVNEKEQ SFYFPLAEKP DFISFDVGNN
YLKTVTLEYP IPELKAQLEF DPDPISRIYA AEALAKKGGL EVTKVLSTAL KNDPFWGVRV
EVAQKLSEIQ LYQAFDGLVI GLQDPSPFVR KAVVSSLSQI KTHPSYKAVK AVVQNGDPSY
NVEAAACRTI GAIAAAHLEE KPHEEKVIKL LKSVLEERAG WNEVIRSGAI SGLAEFKTSE
TALNLLLEYT KIGVSQPLRL STTRALGKIS VGQTPTNVER ILDRLAEIAK EAFFLTQVAV
VTALGQMETP KAIGILQSLA SQTTDGRVRR YAEEEIIKVQ KNIGPEKTLR QLREELNQLK
QQNQELKSRL ENLEAKSK