Gene Aazo_4950 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAazo_4950 
Symbol 
ID9342756 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism name'Nostoc azollae' 0708 
KingdomBacteria 
Replicon accessionNC_014248 
Strand
Start bp5070389 
End bp5073316 
Gene Length2928 bp 
Protein Length975 aa 
Translation table11 
GC content41% 
IMG OID 
Productmethyl-accepting chemotaxis sensory transducer 
Protein accessionYP_003723204 
Protein GI298493027 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCAACAA CAAATATAGA TGATCAAAAT TCTAAATATC AGCAGGCAGT AACAGCGTAT 
ACACAAGGAG ATTATGAGGT TGCGGCTACT TTAGTGGACC AAGCGGTAAA TAATCTACCA
GATGATCCTA ATTCTCATCT GTTGCGGGGT CATATATACT ATGTTTTGCA GCATTTTGAA
ACAGCAAAAG CCGAATATGA GCAGGTGTTT CATTTAACAG ATGATGAAGG AATTATTGGT
TCTGCCACTG GTTATCTTGA AAATATCAAT CTATATCTAC AATCTTTGCG TGAAGACGCA
GCAGAAATAG AGTTCGAACA ACTAATAAAT TCTGAGAAAG TATCAAACGG GCTGGTAACT
GATGAAGGAG AATTACTAGA TTTAGGAGTT GATATAGACT TGGATAACTC CAGTTTTAAT
CTAAAATCTT TTGGTGATTA TGAAGTATCT GAAGAAGCTC TCGAAGATAT ATCTGTGAGT
AATCCGTTTG ATAGTTCATT AGACAGTCTT AACTTTGATC GAATATCCAG TAACACAGTA
GATTTTGGTA ATGATCCTTT TGCTTTAGAT CAACCATTTA CTGAACTACA AGCAAGTAGC
AGTGGCAGTG AAGCATTAGG AGAATTGGAA TTATCTCCTG TCTGCCAAAA AAATAATCAT
TTCTCTGACC TTGATGGAAA TTCTGATCAA GCGGATTCAG CCATGAATGA TGTTCAAATG
AACTCAGATT TTGGAGATAT TAATTTATCA GATGACTATA ACGCTTCATC GCTGGCTGAA
TCATTCAGGC AACCAGAAGA TTCATCAGAA TTTGAGAATA TAAATATGGG GAATTTGGGC
TTCTCCAGCA ATAGTCAGAA AAGTCTTCAT GTCATCATTG ATGAACAAGA ACTCCATGAT
GATTTCACTA GAGAAGATTT TATAGACATC AGTGACGAGA CTTCTTTGAG TAGTGCAGAC
AGTTGGTTAC AAAAAATTCC AGAGGAACTA AACAATAACT CAGCTTCTAA CTTTAATAAT
CAAGTATCTT CATCTTTAAA TAACAATTCT GATCATACCG ATCAGCAGGC TAATTCAGAT
GGGAAACAAA ATCAACATAA CTTTGATGAT GATAGTTTCG ATTTAGAAGC ATTTGAGTCT
GCCTTTGGCT CAGAAAGCTT TGCTAGTGAT GAAGATAATA ACCGTTTGCA GGGAGAAACT
CCACAAAATA TTCAGTTTTT GGATGATTTT AACGAATTTG ATCATTTAGG CAGTATTCCT
AGCTTTGATC TCAGTGCAGA TTCTAACTAC AATGATATAG AGTTATTGTC AGACACCAGG
GAAAATAGAG TTAGTAAAAC CTGCGGTAGC AGTAGTTTTA CTGATAATGC TTCACTGAAT
AACGTCGTTG ACCGTGATGA AGAATTATTC ACAATTACTG GAGCGCAAGA AGCAGTTCCT
GTTTTTACAG AAACAGATGC TTCTAAACTA GAACCAACTG TCAATATAGA TCAAGGCTTG
TTTGGATTTT TTGAGAATGC GTCTCTAGAG AAGAAGCAAT GGTATATCGC TGGCACAGTG
GGGCTAACCT CAGCCGTGGT AGCAGCTTTG ATTAGCTTTG GCGCTACAGA ATTTTCAGAA
CCCTCACAAC GGGAATCGGT GCGGAATACA GGTTGGGCAA TGGCTTTGGC CGCAGGAGTA
GCTGGTGGTA TGACCGCTGG CTTTATGGGT AATCTCGCAC TTAAGCAAAT TCGGCGCACT
ACTAAAGATT TACAAGCTCA GTTTGATGCT GTCCGAGAGG GAAATCTCAA TGTTCAAGCC
ACAGTTTACT CAGAAGATGA ATTAGGCTAC TTATCCACGA GTTTCAACGA TATGGCGCGG
GTAATTTTCA CAACTACCAA TGAAGCGCAA CGCAAGGCGG TGGAACAAGA GGAAGCAAAA
GAAAACTTGC AACGCCAAGT TATCCGCCTC TTAGACGACG TAGAAGGCGC GGCTAGAGGA
GATTTAACAG TTCAAGCTGA GGTGACAGCA GACGTACTAG GTGCTGTTGC GGATGCCTTT
AACCTGACAA TTCAAAACTT GCGGGATATT GTGCAGCAGG TGAAAGTAGC TGCACGAGAA
GTAACCAAAG GCTCGACTAA TTCAGAAACT TTTGCTAGAG CATTATCTGG GGATGCTTTG
CGTCAAGCAG AAGAGTTAGC AGTGACGTTG AATTCCGTAC AGGTAATGAC CGAATCTATT
CAACGCGTAG CAGTAGCGGC AAAAGAAGCA GAAACTGTAG CTCGTGATGC CAGTGCGATC
GCTCTCAAAG GAGGAGAAGC AGTAGAAAAT ACAGTGGCGG GGATTTTGGA AATTCGGGAA
ACAGTGGCAG AAACTACCCG CAAAGTGAAG CGATTAGCAG AATCTTCTCA AGAAATTAAC
TCTATCGTCG CCTTGGTATC CCAGATTGCT TCTAGAACCA ACTTATTGGC GCTCAATGCC
AGTATTGAGG CAGCAAGAGC AGGAGAAGCG GGAAGAGGAT TTGCGATTGT AGCCGATGAA
GTCCGCCAAT TAGCTGATAA ATCTGCCAAA TCTTTAAAAG AAATCGAACA AATCGTGATG
CAAATCCAGA GTGAAACTAG CTCTGTAATG ACAGCGATGG AAGAAGGTAC ACAACAAGTA
ATTAAAGGAA CAAAACTAGC AGAAGAAGCC AAGCGAGCGC TAGAAAACAT TATCCAAGTA
GCTGATCATA TTGATAGTCT AGTACGCTCA ATTACCAGCG ATACTGTAGA ACAAACCGAA
ACCTCTCTTG CTGTCGCTCA GGTCATGCAA TCAGTGGAAC TCACAGCCCA AGAAACATCC
CAAGAAGCAC AGCGAGTCTC AGCCGCTTTA CAACACCTGG TGGGAGTATC CCGCGACTTG
ATTGCTTCCG TAGAACGCTT CCGGGTGGAA ACCATTGAAT CTAAATAA
 
Protein sequence
MATTNIDDQN SKYQQAVTAY TQGDYEVAAT LVDQAVNNLP DDPNSHLLRG HIYYVLQHFE 
TAKAEYEQVF HLTDDEGIIG SATGYLENIN LYLQSLREDA AEIEFEQLIN SEKVSNGLVT
DEGELLDLGV DIDLDNSSFN LKSFGDYEVS EEALEDISVS NPFDSSLDSL NFDRISSNTV
DFGNDPFALD QPFTELQASS SGSEALGELE LSPVCQKNNH FSDLDGNSDQ ADSAMNDVQM
NSDFGDINLS DDYNASSLAE SFRQPEDSSE FENINMGNLG FSSNSQKSLH VIIDEQELHD
DFTREDFIDI SDETSLSSAD SWLQKIPEEL NNNSASNFNN QVSSSLNNNS DHTDQQANSD
GKQNQHNFDD DSFDLEAFES AFGSESFASD EDNNRLQGET PQNIQFLDDF NEFDHLGSIP
SFDLSADSNY NDIELLSDTR ENRVSKTCGS SSFTDNASLN NVVDRDEELF TITGAQEAVP
VFTETDASKL EPTVNIDQGL FGFFENASLE KKQWYIAGTV GLTSAVVAAL ISFGATEFSE
PSQRESVRNT GWAMALAAGV AGGMTAGFMG NLALKQIRRT TKDLQAQFDA VREGNLNVQA
TVYSEDELGY LSTSFNDMAR VIFTTTNEAQ RKAVEQEEAK ENLQRQVIRL LDDVEGAARG
DLTVQAEVTA DVLGAVADAF NLTIQNLRDI VQQVKVAARE VTKGSTNSET FARALSGDAL
RQAEELAVTL NSVQVMTESI QRVAVAAKEA ETVARDASAI ALKGGEAVEN TVAGILEIRE
TVAETTRKVK RLAESSQEIN SIVALVSQIA SRTNLLALNA SIEAARAGEA GRGFAIVADE
VRQLADKSAK SLKEIEQIVM QIQSETSSVM TAMEEGTQQV IKGTKLAEEA KRALENIIQV
ADHIDSLVRS ITSDTVEQTE TSLAVAQVMQ SVELTAQETS QEAQRVSAAL QHLVGVSRDL
IASVERFRVE TIESK