Gene Aazo_4674 details

Gene Information

Locus tag: Aazo_4674
Symbol:
ID: 9342481
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: 'Nostoc azollae' 0708
Kingdom: Bacteria
Replicon accession: NC_014248
Strand:
Start bp: 4777397
End bp: 4780405
Gene Length: 3009 bp
Protein Length: 1002 aa
Translation table: 11
GC content: 42%
IMG OID:
Product: hypothetical protein
Protein accession: YP_003723010
Protein GI: 298492833
COG category:
COG ID:
TIGRFAM ID:
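
The coordinate and length fields in this record are tied together by simple arithmetic. The sketch below checks that relationship, assuming that Start bp and End bp are 1-based inclusive positions and that the gene length includes one terminal stop codon. The function name lengths_from_coordinates is illustrative only and is not part of any database API.

```python
from typing import Tuple


def lengths_from_coordinates(start_bp: int, end_bp: int) -> Tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumptions (not stated in the record): coordinates are 1-based and
    inclusive, and the CDS ends with exactly one stop codon.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # the stop codon encodes no residue
    return gene_len, protein_len


gene_len, protein_len = lengths_from_coordinates(4777397, 4780405)
print(gene_len, protein_len)  # prints "3009 1002"
```

Under these assumptions the result agrees with the Gene Length and Protein Length fields listed in this record.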


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.597863
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGAATTGA AAAATCACTG GTTGACTCCC AATAGAGGCG CTTTGGCACA AATGCTTAAG 
TGGGTGAATC TCCGACCAGA GGAGTTAGAA CGCACTTGGA CGATGTTTGC CTTCTACACG
ATTGTATCTG TGGGATTGCG ATGGGCAGAG GATAGCACTC TAGCTCTGTT TTTGGATCAA
TATGGTGCGG AAAAACTACC TTGGATTTAT ATTGCCAGTG CGGTGACAGG TGCTGGACTG
GTAGTTTTAT ATTCTTGGTT ACAAAAGGTT TTTCCCTTAC GTGCAGTGAT TGTGGCGATC
ACACCGGGCA TGGTGGTGCC ATTATTTCTG TTAGTGTTTT TGTATGGAGG GATAAATATT
CCCTACCTAG CAGTAATTAT CATCTTTCTG CTCAGGTTAT GGGTAGATGC CTGCTATGTG
GTCAATGACC TTAACACATC TATTGTTGCC AACCAACTAT TCAACATTCG CGAGATTAAG
CGCACCTACC CGTTAGTCAG TAGTGGCATA TTAGTAGCTG ATGTGATCAG TGGCTTTAGT
TTACCTTGGT TACTCAAATT CACCAACCTA AATACAGTTA TTCTCATCGC CTGTTTTGTC
ATTTTATTCG GCTCAGGAAT TTTATTTTAT TTAACTTATA AATATCCGAC AGCTTTCCCC
CATACGCCAC AACGGGAGAT TACGGAAGAA CAAGCCTCTC GTCATCGCCG CTTAGAAACA
CCTCTAAAAC GCTATGTTTG GCAGTTGTTT GCATTTTTTG CACTGTTACA AGTGATTGGC
TTGTTAATAG ATTTTCAGTA TTTGCGCGAA CTGAATTCTA GTTTAGGTCA ACAAGAACTA
GCCAGCTTTT TAGGGGTGTT TGGTGGCATT GTAGGACTGT GTGAATTAGT TACCCAATGG
TTTATTTCTA GCCGATTGAT TGAAAAGGTG GGAGTATTTT TCACAGCCGC ACTTTTACCA
ATTACTGTGG GCTTTTTATT ACCAGGTGGA ATCTCAGTTT TGAACTTATT TCCAGCGATT
GAAGAGCCTG GATTTTTTTG GGGTTTAATG AGTCTCAAGT TCTGTGATGA ACTCTTGCGT
TATACCTTTG TGATTAGTAG CGGTCCGGCA CTTTATCAAC CTATTCCTGA CCGAATTCGT
AGCCGGATGC AAGCTTTATC CAGTGGGACA GCCGAAGCGA TCGCCTCTGG TTTGACGGGA
TTAGTCATTT TTGGTAGTGT ATCGCTGATT GATCAATTTG TCCCTCAATC ACTGCAAAAG
TGGGTATTAA TAGGTGAAAT AGTCATCGTT GCGGCCACCT GTTTAAAAGT TGTTTGGGAA
TTGCGATCGC GCTATGTGGA CGTATTAGTT TTAAGTGCAG AACGGGGTGG ATTGAGTCCC
GTAACTGTGG GTTTGCGAGC CTTCCAACAG GGAATAGTTA AAGCTTTGGT AGAAACAGGA
ACTACAGCCG ATAAAAGCTC TTGCTTAGAA CTTTTAGCCC AAATTGATCT CCCCAGCGCA
GGACAAGTTT TAGCACCATT ACTGCTTAAA TTACCTACAG ATTTACAATC CCAAAGCCTA
GAATTAATGC TCAAATCAGG TATAAATCCC CTTTATGTAC CTGAAGTCCG TCGATTATTA
GACCAACCCC AAGCAACCAT TGACCCAGAA GTGTTTGCTT TGGCTATGCG TTACCTTTGG
CTGGCCGAAG AAAATCTCAA TTTAAATCTT GTAGAAGAAT ACCTGCATCA ACAAAACCAT
CCATTGATCC GCGCCACCGC AGCTTCATTA CTACTGCGTC AGGGAACAGA ACAACAAAAA
ACAACAGCCA CCAAAACAAT GCGTCGGATG CTCACCCATC AACAAGAACG AGAACGGATT
AATGGAGTCA GGGGACTCAA AGAATTGGTT TATTTACAAG CTCTGCGGGT TCACATTCCC
AATTTATTGC AGGATGATTC ATTGCGGGTA CGCTGTGCTG TATTAGAAAT GATTACCGCT
ACCTGCTTGG AGGAATACTA TCCCACACTC TTAGCAGCAC TCTACTACAA ATCCACTCGA
CAAACAGCCA TGCAATGCTT GGTGAGTTTG GAAAATGAAG CCATCCCGAT GTTGTTGAAA
CTAACCACAA ATATTTACAA ACCAGACGTT GTGAGAATAT GCGCTTGGCG TATCATTGCT
CAAATCGGCA CACTAGAAGC AAGAGAAACT CTATGGCTAC ATTTGGAAAC ATCCAGGGGT
AATACGAGGG ACTATATTCT CCAAAGCTTA CTGAAGATTC AGCAAAAACC AGGAAATATC
AATGTCGTAG ATCAATTCTA TGAAAGTAGG GTAGAAATTT TAATTGAGCA AGAATTACGA
TTTCTCGGTG AAATTTATGC TGCGTACATA GACTTCCAAA ATCTATACTC TCTAGAAAAT
TATCAAGGAA ATGAGAGGCT TTTGACTATT GCTCAATTGC TGCAACGCTC ACTACTAGAA
TTAGCATTGG ATGTGAGAGA TAGGTTATTG CTGTTGTTGA AGTTGCTTTA TCCCGCAGAA
AAAATGCAAG CCGCAGCCTT CAATCTCCAA TCTAAGTCAT TAATCAATTT AGCAAGAGGC
TTAGAAATAT TAGATCACAC TGTAAATTTG CCTTGTAAGT CTTTGTTGTT GAATATTTTA
GATCGACGAC CAGAACATGA AAAGCTCAAA TATCTGATCG AAGCGGGATT TTGGCAAAAT
GAAAATATGC CAGTCAAAAA GCGCCTCTCT AAGATGATAT CTCAGGGACA TTTGCTTTCT
GATTGGTGTT TAGCTTGTTG CTTGCACTTT GCACAAGCTG CTTATGTTCG ACTCACGACT
GCGGAAATTT TAGCAAATTT GCGCCATCCT ACAGGGTTTG TTAGGGAAGC AACAATTTCA
TACTTGAGTG TAGTTTCCCA GCGCGTTCTT CAGGAAATTC TCCCCCATTT AAAAACAGAT
CCCCATCCAC TGGTGGCTGC TCAAGTCAAC GAGTTGATAG CAAAATATAA AGAAGGACTT
CAGGATTAA
 
Protein sequence
MELKNHWLTP NRGALAQMLK WVNLRPEELE RTWTMFAFYT IVSVGLRWAE DSTLALFLDQ 
YGAEKLPWIY IASAVTGAGL VVLYSWLQKV FPLRAVIVAI TPGMVVPLFL LVFLYGGINI
PYLAVIIIFL LRLWVDACYV VNDLNTSIVA NQLFNIREIK RTYPLVSSGI LVADVISGFS
LPWLLKFTNL NTVILIACFV ILFGSGILFY LTYKYPTAFP HTPQREITEE QASRHRRLET
PLKRYVWQLF AFFALLQVIG LLIDFQYLRE LNSSLGQQEL ASFLGVFGGI VGLCELVTQW
FISSRLIEKV GVFFTAALLP ITVGFLLPGG ISVLNLFPAI EEPGFFWGLM SLKFCDELLR
YTFVISSGPA LYQPIPDRIR SRMQALSSGT AEAIASGLTG LVIFGSVSLI DQFVPQSLQK
WVLIGEIVIV AATCLKVVWE LRSRYVDVLV LSAERGGLSP VTVGLRAFQQ GIVKALVETG
TTADKSSCLE LLAQIDLPSA GQVLAPLLLK LPTDLQSQSL ELMLKSGINP LYVPEVRRLL
DQPQATIDPE VFALAMRYLW LAEENLNLNL VEEYLHQQNH PLIRATAASL LLRQGTEQQK
TTATKTMRRM LTHQQERERI NGVRGLKELV YLQALRVHIP NLLQDDSLRV RCAVLEMITA
TCLEEYYPTL LAALYYKSTR QTAMQCLVSL ENEAIPMLLK LTTNIYKPDV VRICAWRIIA
QIGTLEARET LWLHLETSRG NTRDYILQSL LKIQQKPGNI NVVDQFYESR VEILIEQELR
FLGEIYAAYI DFQNLYSLEN YQGNERLLTI AQLLQRSLLE LALDVRDRLL LLLKLLYPAE
KMQAAAFNLQ SKSLINLARG LEILDHTVNL PCKSLLLNIL DRRPEHEKLK YLIEAGFWQN
ENMPVKKRLS KMISQGHLLS DWCLACCLHF AQAAYVRLTT AEILANLRHP TGFVREATIS
YLSVVSQRVL QEILPHLKTD PHPLVAAQVN ELIAKYKEGL QD
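
The gene and protein sequences in this record are printed in space-separated blocks of ten characters, so they need to be joined before any calculation. The following is a minimal sketch of that cleanup together with a GC-content calculation. The helper names clean_sequence and gc_percent are hypothetical, and the short input string is a toy example, not data from this record.

```python
def clean_sequence(text: str) -> str:
    """Join a block-formatted sequence into one uppercase string."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Toy input in the same blocked layout (hypothetical, for illustration only).
toy = "ATGC GGCC"
seq = clean_sequence(toy)
print(seq, gc_percent(seq))  # prints "ATGCGGCC 75.0"
```

Pasting the Gene sequence lines of this record into clean_sequence and passing the result to gc_percent would give a value to compare against the GC content field.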