Gene Aazo_5067 details


Gene Information

Locus tag: Aazo_5067
Symbol:
ID: 9342876
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: 'Nostoc azollae' 0708
Kingdom: Bacteria
Replicon accession: NC_014248
Strand:
Start bp: 5190162
End bp: 5193170
Gene Length: 3009 bp
Protein Length: 1002 aa
Translation table: 11
GC content: 35%
IMG OID:
Product: multi-sensor hybrid histidine kinase
Protein accession: YP_003723286
Protein GI: 298493109
COG category:
COG ID:
TIGRFAM ID:
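
The start, end, gene length, and protein length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are hypothetical and not part of the source record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(5190162, 5193170)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 3009 1002
```

Under those assumptions the computed values match the Gene Length (3009 bp) and Protein Length (1002 aa) fields listed in the record.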


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAAATA AATACTTTAA CATACCGGAA AGTCTCGATT ATAGTAAATA TGAACAACAA 
CCAATTCACA TCCCTGGTTA TATACAGGCA CATGGTGTAC TCCTAGCTTT ACAAGAACCA
GATTTAACAA TCTTACAAAT TAGTGAAAAT ACAAATCATT TTTTTGGGAT TCCCCCAGAG
TCTTTACTGG GGCAAAACTT AAACTGCCTC TTATCTTCAG AACAAATAGA ATCAATTACT
CATTCGTTAC AAGAGGATAA ACCCGTAGTT ATCCATTCAT TAGAAATTAC AATTTACAAC
CAAAAAAAGT CTTCAAGATT ATTAGGAATC CTGCATCGTT CTAATGGAGT TCTAATCCTA
GAGTTAGAAC CTCATGCAGC TATCACAACA AATGCATTAG TAGGATATAC TTATCTTTTA
GAAACAGCAA TTTTCAACAT ACGTAACGCC TTCCATTTAT CGGATATTCA TCAAATAATT
GCCCAAGAAG TGAGAAAAAT AACTCAGTTT GATAGGGTAA TGATATACAG CTTTACCCCA
GACTACAGTG GAGTTGTTAT TGCCGAAGAT AAACAAAGTA ATTTAGAAAG TTATTTAGGA
TTACATTATC CTGCTTTTGA CGTTCCTAAT CAAGCCAGAC AACTTTACTA CAAAAATCGG
CTGCGAATCA TTCCAGATGT TAATTATCAA CCTGTAAAAA TTATTCCCAG CAATCATCCT
CTTAGCAATG AACCCCTTGA TTTAAGTTGT GCAATATTAA GAAATATCTC ATCCTGTCAT
GTAGAATACT TACAAAATAT GGGTGTGACT GCTTCAATGT GCATTTCACT CATTCATGAA
AATAAGCTTT GGGGTTTAAT TGCTTGTCAC CATTACTCCC CCAGATATAT TGATTATGAG
ATTCGGAAAA TTTGTGAAGT AATCGGTCAA TTCTCATCTA TTGAATTAGT TAATCATCGC
GAACGAGCCT TAAATTTCTA CCGTCAAAAA GTCAAATTAA TTCAACAAGA ACTAAATCAC
AATTTAACCA ATGAATTAGA TCACATCGGT AAGGTCTTCC AGCGGAATAA AACAAATTTA
TTAAAATTAT TAAAAGCGAA TGGCACTGCT GTTTTATTAC AAGGACAATT AATATTAATT
GGTAATACCC CATCAGCAGA AGATACGCGG GATCTAGTAG AGTGGATATT AAATCAAAAA
AATCAACAAG AAATTTTTTA CACAGATTCG CTCCCACCAC TATATCCAAA CGCTAGCAAG
TTTCAAGATA TAGCAACTGG TATTTTAGCA ATTTCAATAT TTCTCCATCA GCAGTCTTAT
CATATTATCT GGTTCAGACC AGAGCAAGTT CAAACTGTAA ATTGGGCAGG TAATCCCCAT
GAAGCAATAG CTGTGAACTC ACAGCAAGAA TATTACCTCA CCCCACGTAA ATCATTTGAA
CTATGGAAAC AGACAGTTAA ACACAAATCA CTTCCTTGGC AAATATTTGA AATTGAAGTG
GCACAAGAGA TGAGAAATAG CCTCATGCTT GCTGCACTAG AGTTTTCCCA AGTAGCATTA
CAACAAGCAG CTGAACAGGC AGAAATTGCC AACCGTGCTA AAAGTCAATT CTTGGCCAAG
ATGAGTCATG AATTACGAAC TCCACTCAAT GCCATCTTGG GTTTCACTCA AATATTAACC
CGTAATAGCT CATTATCTCA AGAAGATCAA GAAAATTTAG ACATCATTAG TCGCAGTGGT
GAGCATTTGC TCTCATTAAT TAATGATGTA TTGGAAATGT CTAAGATTGA AGCTGGTAAA
CCAACCCTAA ATGAAACCTA TTTTGACTTG TATCGTCTCA TTTACTCAAT TCAAGAAATG
TTTGCTCTCA AAGCTGCAGA TAAGGGAATA AATTTAATTA CTGAATTTGG TGTCGAGGTG
AATAAATATG TTTTTGGAGA TGAAGGCAAA CTCAAACAAA TACTAATTAA TTTAATAGGT
AATGCTATAA AATTTACCGT TGTTGGTCAT GTCGCTATCA GGGTATCCTG TCCTCCAAAT
TATGCACCTT TAGCTGTAAC AAATGGCAAA GTTCTAATTG AATTAGAAGT GGAAGATACA
GGTGCCGGTA TTGCTCTTGA AGACCAAGAA TTAATTTTTG AAGCATTCCT CCAGTCCAAA
GGTGGAAGAC AGTTTATGCA AGGGACTGGA CTGGGATTAG CTATTAGCCG TCAATTTGCT
AGACTCATGG GAGGTGACAT AACAGTTAAG AGTATTTTAG GTGAGGGTAC AACCTTTACT
TGTCGTGTTC AACTCAGTCT TGCTAATAAG GGAGATATCA TTTCTCCTAT TAAAAATACC
AAAAGAGTTG TTGGTCTAGA ACCAGGACAA CCTAGATACA GAATCTTAAT TGTTGAAGAC
ATTTTAGAAA ATCGTCTTTT ACTAGTTAAA CTACTAGAAT CTGTTGGGTT AGATGTACGT
GCAGTAGAAA ATGGTTTACA AGCAATTGAT ATTTGCCAAG AGTGGGAACC CCATCTGATT
TGGATGGATA TACAAATGCC AGTGATGAAC GGGTATGAAG CAACCAAGCA AATTAGAGCC
ATGAATCAAG GAAAAAATAT CATCATGATT GCTCTAACTG CAAGTGCTTT TGAAGAAGAT
AAACAAGCTA TTTTACAAGT CGGTTGTGAC GATATTATTT CTAAGCCTTT TGAAGAAAAG
ATGCTCTTTG AAAAAATGGC TCAGTATCTA AATTTACGCT ATCTCTATGC AGAGATTCAT
CAGCAACAAA AACAAAATAA GCTGAAAACT GATCATAGAT TAATAAAATT GAGTAACTCT
GATTTACAGG TTATGGCACC ATTTTGGATT GCTCAGGTTC ATGCAGCAGC AATAGTGATT
GACGATGCAC AACTTTATGA ACTGTTTGAA CAAATACCCG CGGAACACAG ACAACTAGCT
GATGGTTTAA AAAATTTAGT TTATAACTTC CATATAGAAA CAATTATTAA CCTCACTGCT
CCTGACTAA
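
The gene sequence is printed in blocks of 10 bases, 60 per row, so it has to be joined into one string before any analysis. The sketch below shows one way to do that and to compute a GC percentage comparable to the GC content field of the record. The helper names are hypothetical, and only the first row of the sequence is used as sample input.

```python
def clean_sequence(blocked: str) -> str:
    """Join a sequence printed in whitespace-separated blocks into one string."""
    return "".join(blocked.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First row of the gene sequence, copied from the record.
first_row = "ATGAAAAATA AATACTTTAA CATACCGGAA AGTCTCGATT ATAGTAAATA TGAACAACAA"
seq = clean_sequence(first_row)
print(len(seq))  # 60
```

Passing the full cleaned gene sequence to `gc_percent` gives a value that can be compared with the reported GC content; the record presumably rounds that figure to a whole percent.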
 
Protein sequence
MKNKYFNIPE SLDYSKYEQQ PIHIPGYIQA HGVLLALQEP DLTILQISEN TNHFFGIPPE 
SLLGQNLNCL LSSEQIESIT HSLQEDKPVV IHSLEITIYN QKKSSRLLGI LHRSNGVLIL
ELEPHAAITT NALVGYTYLL ETAIFNIRNA FHLSDIHQII AQEVRKITQF DRVMIYSFTP
DYSGVVIAED KQSNLESYLG LHYPAFDVPN QARQLYYKNR LRIIPDVNYQ PVKIIPSNHP
LSNEPLDLSC AILRNISSCH VEYLQNMGVT ASMCISLIHE NKLWGLIACH HYSPRYIDYE
IRKICEVIGQ FSSIELVNHR ERALNFYRQK VKLIQQELNH NLTNELDHIG KVFQRNKTNL
LKLLKANGTA VLLQGQLILI GNTPSAEDTR DLVEWILNQK NQQEIFYTDS LPPLYPNASK
FQDIATGILA ISIFLHQQSY HIIWFRPEQV QTVNWAGNPH EAIAVNSQQE YYLTPRKSFE
LWKQTVKHKS LPWQIFEIEV AQEMRNSLML AALEFSQVAL QQAAEQAEIA NRAKSQFLAK
MSHELRTPLN AILGFTQILT RNSSLSQEDQ ENLDIISRSG EHLLSLINDV LEMSKIEAGK
PTLNETYFDL YRLIYSIQEM FALKAADKGI NLITEFGVEV NKYVFGDEGK LKQILINLIG
NAIKFTVVGH VAIRVSCPPN YAPLAVTNGK VLIELEVEDT GAGIALEDQE LIFEAFLQSK
GGRQFMQGTG LGLAISRQFA RLMGGDITVK SILGEGTTFT CRVQLSLANK GDIISPIKNT
KRVVGLEPGQ PRYRILIVED ILENRLLLVK LLESVGLDVR AVENGLQAID ICQEWEPHLI
WMDIQMPVMN GYEATKQIRA MNQGKNIIMI ALTASAFEED KQAILQVGCD DIISKPFEEK
MLFEKMAQYL NLRYLYAEIH QQQKQNKLKT DHRLIKLSNS DLQVMAPFWI AQVHAAAIVI
DDAQLYELFE QIPAEHRQLA DGLKNLVYNF HIETIINLTA PD
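
The protein sequence can be checked against the gene sequence by translating the latter. Translation table 11 assigns the same amino acid to each codon as the standard genetic code (the two differ in which start codons they allow), so a standard codon table is enough for this check. The following is a minimal sketch with hypothetical function names, applied only to the first row and the final codons of the gene sequence.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in TCAG-ordered codon order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(blocked_dna: str) -> str:
    """Translate a blocked DNA string in frame 1, stopping at the first stop codon."""
    dna = "".join(blocked_dna.split()).upper()
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


first_row = "ATGAAAAATA AATACTTTAA CATACCGGAA AGTCTCGATT ATAGTAAATA TGAACAACAA"
print(translate(first_row))    # MKNKYFNIPESLDYSKYEQQ
print(translate("CCTGACTAA"))  # PD
```

The two outputs match the first 20 residues and the last two residues of the protein sequence listed above.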