Gene Aazo_4588 details


Gene Information

Locus tag: Aazo_4588
Symbol:
ID: 9342394
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: 'Nostoc azollae' 0708
Kingdom: Bacteria
Replicon accession: NC_014248
Strand:
Start bp: 4681799
End bp: 4684831
Gene Length: 3033 bp
Protein Length: 1010 aa
Translation table: 11
GC content: 42%
IMG OID:
Product: hypothetical protein
Protein accession: YP_003722962
Protein GI: 298492785
COG category:
COG ID:
TIGRFAM ID:
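The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. Both are assumptions about this record's conventions, although the listed values are consistent with them. The variable names are illustrative only.

```python
# Values copied from the gene record above.
start_bp = 4681799
end_bp = 4684831

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS is an uninterrupted reading frame whose last codon
# is a stop codon that does not contribute an amino acid.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, "bp")     # 3033 bp
print(protein_length, "aa")  # 1010 aa
```

Under these assumptions the computed values match the recorded Gene Length (3033 bp) and Protein Length (1010 aa).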


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.200354
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCAGGAAA TGTACATATT GTGGTGGATG GCACAAATAC CTGTAAACAC CCCCAACGTC 
ACACCAGCAC AGGCATCGGT TCTTACTTCC GGTCCGCGCT TTTTTGTGGC TTTAATTTCT
GGGGTAATTC TCGCTTTTGC CTTCCAATTA GTGTTAACAA ATCTCTCAAT TGCCGCTGGG
ATTTCCTATT TGGGGCATCC GTCAGAGTCG CAGGAAGTTG AAAGTTTCGG GGGTACGATT
CGCAAAATTG GGACGAGGTT GGGAATTTGG ACTTTAGTTA CGGTAACAGT TGCTTTATTC
ATAGCTTGTT TCTTGGCTGT AAAATTGAGT CTATTAATTT TAGATCCCAG ATTAGGTGCA
ATTTTGGGCT TGGTAATTTG GGGTGCATAC TTTTTATTAC TAATGTGGGT CAGCACAACT
ACGGTAGGAT CTTTAATTGG TTCGGTCGTG AATACGGCCA CTTCTAGTTT TCAGGCGATT
ATGGGAACTG CTACGGCTGC TTTGGGTACT AGGGCTGTAA ATCAGCAAGT GGTGGCAACG
GCGGAAGCGG CTGCATCGGC TGTGCGGCGG GAGTTAGGGA GTGCATTGAC ACCAGCAAAT
ATTCGCCAAA ATATAGAAGA GTACATAGAA AAGCTACGTC CTCCAGAAAT TGATATATCG
AGGATTCGTT CTGAATTTGA AAGGTTACTG AGTGAACCGC AATTTAAAGC CTATGCCAGT
AGTTCAGACC TCCGTAATAT TGACCGTCAG CGGTTTATTG ATTTAGTCAG CAGTCGCACG
GATTTATCTA AACGAGAAGT TAACCGTGTC GCTGATTCTT TATATGATGC TTGGCAACAA
GTAATCACTC ATAACGTACC CGCAAATAAG CGCTTGGCTG AGTTGGTTGA TTATCTCAAA
TCCCTGCCGC CAGGACAAAC TAAAACAGAC GAACTCAATG CTAAACTTGA CCAGTTAATT
ACAGAAATTC ATTCTTCCAA AGAAACAGAA CAAAAGCCTG GTATGATAAA ACAGGCAATA
TCAGCACTGA GTGCGGTAGT TTTAGATAGG GCAGATTTGT CTGATTTAGA TGTAGAAAAA
ATTTGGGATT CCCTAGCTAC TGCTAGAGAA AAATTCACAA AACAAATTAT ACAACAGCCT
TATAATCCAA TTCGCGCTGA TATTGAAAAT TACTTACTCA ATACCCATCC TTGGCAATTA
AGTCCGAAGA ATATTGTTCA AGAATCTCGA GATGTGATCT ATGATCCTGC TGCTGATCCT
GGTGTAATAC GTAGCGAATT AGAGAAAATT ACCCGTCAAG ATTTTGTCAA AATTTTGCAA
GCCAAGGGAC TGTTAACTCA AGGTCAAATT CAGGAAATTG CTGATCAATT GGTAGCAGTA
AAAAATGAAG TATTGATAAC AGTAATTGCC CAGGAAGAAA GAGAGATTGT CCAAGATTTG
CAAAGAAGAG TTGAAAGTTA TTTACTTGTG ACTCCTAAAG CAGACTTGAC ATCAGCAGGA
ATTGAGGAAA ATTTTAAACC TTTGTTAGCA GATTCAGAGG CAGATTATCA ATCTTTATCC
CGGCGATTAG CACAAGTTGA ACGGGAACAA ATGGGAGAAA TCCTGTTAGG ACGTAATGAT
ATTCAGGAGT GGGAATTAGA CCCGATTTTA GATGAGTTGG AAATGCAGCG TGATCGCGTG
TTGTTAGAGT CTCTCAGCAT GTCCAAACAG GCACAACATC AAGTGGAAAC TCTGTGGTTG
AATGTTGAAT CATATCTACG TAACACGGGT AGGCAAGAAT TAAATCCTGA TGCTATCCGC
ACTGATCTGA AACGACTGTT AGAAGACCCC CAAGGGGGAA TTATGGCTAT TCAGGCGCGG
TTGTCTCGCT TTGACCGAGA TACTTTGGTG CAATTGTTGA GTCAACGCCA AGATTTAAAT
GAAGCACAAG TTAATCAAAT TATTGACGTG GTGGAAGAAA TATGGGGTGG TATTCTCCAC
ACACCACAAA AAGCAAAAGA ACAATATGAT TCTATCACCT CTACTATTGC AGATTATCTG
CGAAATACTG GTAAGGAAGA ATTGAATCCT GAAGCTATTC AGCAAGATTT AACTAGGTTA
TTTGCACATC CCAGAGAAGG TGTTGTCGCA CTGCGTCACC GTTTATCGCA CATTGACAGG
GATACTTTGG TGAAGTTGTT GACTCAACGT CAAGATTTGA GTGAGGAACA AGTTAATCAA
ATTATTGATA GTGTACAGAC ATCAATTAGA AATATTATCC GTGCGCCGCG TCGGTTAGCA
AGCAGGACAC AACAAAGAAT ACAAACTTTT CAAACTTATT TGCAGGAGTA TTTACGATTA
ACTGGTAAAG CAGAATTGAA CCCCGAAGGC ATTAAGCGGG ATGTGCAATT ATTGTTGCAT
GATCCACGAG TGGGGATGGA AAGTTTGAGT GATCGCCTCT CGCATTTCGA CAGAGACACA
ATTATTGCGT TGTTGAAAAT CCGGGAAGAT ATAAGTGATG AAGAAGCAGG GAGAATTGCT
GATAATATTA TCCTCGTGCG TGATCAATTT GTAGAACAGG TTCGGGGTAT TCAACGACGT
ATTCAAGATG TAATTGAGGG GATTTTTGCC AGTATTCGCA ATTATCTTAA TTCTCTAGAA
CGCCCGGAAC TTAATTATGA TGCCATTAAG CATGATATCC GCCAATTGTT TGAAGATCCC
CAAGCCGGGT TTGATGCATT GCGCGATCGC CTCTCATCTT TCAATCATGA TACCTTGATA
GCTATTTTAA GTTCTCGTGA GGATATCTCT GAGGACGATG CTAAACACAT TATTGACCAA
ATTGAACGCG CCCGAAATAC TGTTTTACAA CGGGCTGAAC AGCTACAGCA CGAAGCGCAG
CATCGGCTAG AACAGGTGAA ACATCAGGCA CAGCGTCAAG CCGAGGAAAC GCGCAAAGCA
GCTGCTAATG CCTCTTGGTG GTTGTTTGCA ACAGCGGTTG TTTCCGGTAT TTCTGCGGCT
TTAGGAGGTG CAATCGCTGT GGTGTTGATT TAA
 
Protein sequence
MQEMYILWWM AQIPVNTPNV TPAQASVLTS GPRFFVALIS GVILAFAFQL VLTNLSIAAG 
ISYLGHPSES QEVESFGGTI RKIGTRLGIW TLVTVTVALF IACFLAVKLS LLILDPRLGA
ILGLVIWGAY FLLLMWVSTT TVGSLIGSVV NTATSSFQAI MGTATAALGT RAVNQQVVAT
AEAAASAVRR ELGSALTPAN IRQNIEEYIE KLRPPEIDIS RIRSEFERLL SEPQFKAYAS
SSDLRNIDRQ RFIDLVSSRT DLSKREVNRV ADSLYDAWQQ VITHNVPANK RLAELVDYLK
SLPPGQTKTD ELNAKLDQLI TEIHSSKETE QKPGMIKQAI SALSAVVLDR ADLSDLDVEK
IWDSLATARE KFTKQIIQQP YNPIRADIEN YLLNTHPWQL SPKNIVQESR DVIYDPAADP
GVIRSELEKI TRQDFVKILQ AKGLLTQGQI QEIADQLVAV KNEVLITVIA QEEREIVQDL
QRRVESYLLV TPKADLTSAG IEENFKPLLA DSEADYQSLS RRLAQVEREQ MGEILLGRND
IQEWELDPIL DELEMQRDRV LLESLSMSKQ AQHQVETLWL NVESYLRNTG RQELNPDAIR
TDLKRLLEDP QGGIMAIQAR LSRFDRDTLV QLLSQRQDLN EAQVNQIIDV VEEIWGGILH
TPQKAKEQYD SITSTIADYL RNTGKEELNP EAIQQDLTRL FAHPREGVVA LRHRLSHIDR
DTLVKLLTQR QDLSEEQVNQ IIDSVQTSIR NIIRAPRRLA SRTQQRIQTF QTYLQEYLRL
TGKAELNPEG IKRDVQLLLH DPRVGMESLS DRLSHFDRDT IIALLKIRED ISDEEAGRIA
DNIILVRDQF VEQVRGIQRR IQDVIEGIFA SIRNYLNSLE RPELNYDAIK HDIRQLFEDP
QAGFDALRDR LSSFNHDTLI AILSSREDIS EDDAKHIIDQ IERARNTVLQ RAEQLQHEAQ
HRLEQVKHQA QRQAEETRKA AANASWWLFA TAVVSGISAA LGGAIAVVLI
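
The relationship between the gene sequence and the protein sequence can be illustrated with a small translation sketch. It is a minimal example, not the tool used to produce this record: the `translate` helper and the `CODON_TABLE` name are hypothetical. It uses the standard codon-to-amino-acid assignments, which translation table 11 shares for the codons involved (table 11 differs mainly in which codons may act as start codons). Only the first and last 10-base groups of the gene sequence are used as input.

```python
from itertools import product

# Standard codon assignments, ordered with bases T, C, A, G at each position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(dna):
    """Translate a DNA string in frame 0, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used above) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":  # stop codon ends the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence.
print(translate("ATGCAGGAAA TGTACATATT GTGGTGGATG"))      # MQEMYILWWM
# Last 33 bases of the gene sequence, including the TAA stop codon.
print(translate("TTAGGAGGTG CAATCGCTGT GGTGTTGATT TAA"))  # LGGAIAVVLI
```

The two outputs match the first and last ten residues of the protein sequence listed above. Each 60-base line of the gene sequence holds exactly 20 codons, so any whole line can be translated on its own in the same way.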