Gene Aazo_3762 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAazo_3762 
Symbol 
ID9341567 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism name'Nostoc azollae' 0708 
KingdomBacteria 
Replicon accessionNC_014248 
Strand
Start bp3820612 
End bp3822396 
Gene Length1785 bp 
Protein Length594 aa 
Translation table11 
GC content36% 
IMG OID 
Productpeptidase M61 domain-containing protein 
Protein accessionYP_003722425 
Protein GI298492248 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0112026 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATTGAAC TAATAGCAGT TAGTTTGAAA ACTCAAGTTC AGGAAATTGA ACCAGCAATT 
CATTACTGGG TAGCAATGCC CCAACCAGAA AATCATCTGT TTGAGGTGAC TTTACATCTT
GTAGGCTACC CATTACCAAT TCTTGATTTG AAAATGCCAG TATGGACACC AGGGTCTTAC
TTGGTGCGAG AATACGCTAA GAATTTACAA AACTTTGCTG CCTTTGCTGG GTCTAAACCT
TTAAATTGGC GAAAAATTAG TAAAAATCAT TGGCAAATTG AAAAGGGAGA TGTTTCTGAA
GTAGTTCTGG GTTACCGCGT TTTTGCAAAT GAGTTGTCAG TACGCACAAA TCATTTGGAT
GCTACCCATG GTTATTTTAA CGGTGCGGCG CTGTTTTTAC GAATTCCTGG TTGGGAAGAA
CAACCAATTC ATATTACCAT TGTCCCACCA AACCCTGAAT GGCAAATAAC GACAGGTTTA
TCATCAATTA CTGAAGAAAC TAATACTTTT TTAGCTGCGG ATTTTGATAC TCTTGTTGAT
ACTCCTTTTG AGATTGGTAA CCATCAATTG TTTAATTTTG AGGTATTGGG AAAACCTCAT
GAGTTAGCAA TCTGGGGACA GGGAAACTGT AAACCCCAAA AGATATTAGA GGACTTTAAG
AGAATTATTG AATATGAAGC AGAAATATTT GGCGATTTGC CATATCAAAA GTATGTGTTT
CTGCTGCATT TATTCAACCA AGCTTATGGT GGATTAGAAC ATAAAAATTC CTGTTCATTA
CTTTATCATC GGTTTGGATT TCGTCTGAAA GATAAATATG AACGTTTTAT TCAATTAGTA
GCGCATGAAT TTTTCCATTT GTGGAATGTG AAGCGAATTC GCCCCAAAGA TTTCGAGGTT
TTTAATTATG ATCAAGAGAA CTATACACAG TCTCTTTGGT TTTGTGAGGG AACCACAAGT
TACTATGATT TGATAATTCC TTTCCGGGCA GGAATTTATG ATATCAAATC TTATTTTCAT
CATTTAGATC AAGAAATTAC CAAATATCAA TTAACACCAG GACGAAACGT ACAACATCTT
TCTGAGTCCA GTTTTGATGC TTGGATTAAA CTTTATCGTC CAGATGCTAA TAGTGCTAAT
TCCCAAATTT CTTACTATTT GAAGGGCGAA ATGGTATCGC TATTGCTGGA TTTATTGATT
CGTTCTCGTC ATCATAATCA GCTTTCTCTT GATGATGTTA TGCTGAAAAT GTGGGAACAA
TTTGGTAAGG CTGAAATTGG TTATACTCCA GAACAACTAC AAGAAGTCAT TGAATCTGTG
GCTGGAATGG ATTTATCGGA TTTCTTTAAA AGCTACATTC ATGGACTAGA TGATTTACCT
TTTAATGATT ATTTAGAACC TTTTGGGTTG CAATTGGTAG AAGAATCTGA ACAAGAACCT
TATTTGGGTG TGAAAATAAA AACTGAATAT GGACGAGAAA TAATTAAGTT TGTGGAAATG
GGGTCCCCTG CAAACATTGT GGGAATTGAT GCTGGTGATG AGTTATTAGC AATTGATGGA
ATTAAGGTAG GAACAAGCCA GTTGAGTGAT CGTTTGCACG ATTACCAACC TTACGATACT
ATCCAAATCA CGGTTTTCCA TCAAGATGAA TTGCGTAACT ATTCTGTAAG TTTAGGAAAA
GAACATCCGA CTAAATATCA GTTGCGGCCA GTAAAAAATC CTAATACTAC TCAGCAAGAA
AATTTTTCGG GTTGGTTAGG TGTGCAGTTG TCGAGTTTTT GGTAA
 
Protein sequence
MIELIAVSLK TQVQEIEPAI HYWVAMPQPE NHLFEVTLHL VGYPLPILDL KMPVWTPGSY 
LVREYAKNLQ NFAAFAGSKP LNWRKISKNH WQIEKGDVSE VVLGYRVFAN ELSVRTNHLD
ATHGYFNGAA LFLRIPGWEE QPIHITIVPP NPEWQITTGL SSITEETNTF LAADFDTLVD
TPFEIGNHQL FNFEVLGKPH ELAIWGQGNC KPQKILEDFK RIIEYEAEIF GDLPYQKYVF
LLHLFNQAYG GLEHKNSCSL LYHRFGFRLK DKYERFIQLV AHEFFHLWNV KRIRPKDFEV
FNYDQENYTQ SLWFCEGTTS YYDLIIPFRA GIYDIKSYFH HLDQEITKYQ LTPGRNVQHL
SESSFDAWIK LYRPDANSAN SQISYYLKGE MVSLLLDLLI RSRHHNQLSL DDVMLKMWEQ
FGKAEIGYTP EQLQEVIESV AGMDLSDFFK SYIHGLDDLP FNDYLEPFGL QLVEESEQEP
YLGVKIKTEY GREIIKFVEM GSPANIVGID AGDELLAIDG IKVGTSQLSD RLHDYQPYDT
IQITVFHQDE LRNYSVSLGK EHPTKYQLRP VKNPNTTQQE NFSGWLGVQL SSFW