Gene Aazo_4843 details

Gene Information

Locus tag: Aazo_4843
Symbol:
ID: 9342650
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: 'Nostoc azollae' 0708
Kingdom: Bacteria
Replicon accession: NC_014248
Strand:
Start bp: 4956503
End bp: 4958227
Gene Length: 1725 bp
Protein Length: 574 aa
Translation table: 11
GC content: 39%
IMG OID:
Product: integral membrane sensor signal transduction histidine kinase
Protein accession: YP_003723119
Protein GI: 298492942
COG category:
COG ID:
TIGRFAM ID:
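
The listed gene and protein lengths follow from the coordinates above. The sketch below reproduces that arithmetic. It assumes the Start bp and End bp values are 1-based and inclusive, which the record does not state but which matches the listed 1725 bp, and that the final codon of the CDS is a stop codon. The function names are illustrative only.

```python
# Assumption: Start bp and End bp are 1-based, inclusive coordinates.
START_BP = 4956503
END_BP = 4958227


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of an inclusive, 1-based interval."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1725
print(protein_length(length_bp))  # 574
```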


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCAAATC AAAAGCCAAC CTTTGTTGAC AAAAGTTCGG AACTCAAAAA AGTGTCAACT 
CAAGAGCGGA AAATAGAAGA ACTCCCAACT ATAGAATTTC CCTCTCGTCG GAAACTCAAA
GCCAGTTCCT GGCGTATCCA TCAAAAAATT GGCTATGGTT ACTTTGTGGC TATTGCTATT
GGCTTTTTTG GCTCATTGAC AGGATTGGTA CTGGCTAACT ATTACCGAGG TAGAGAAGTC
AGGCAGTTCA ATCAAGCTAA TGAGCAAGAA CGATTACTAA ATAATTATAA AGATGCAGTA
ATTGAGGCTC AACTCCATAG CTCAAGCTTG ATAGCGGTAT TGAACGACCC AGAAAAATTA
AAAATTAAAA AAGATAAGTT TCTCAAGTCT GTTGCTCAAG CACAAAAATT AGAAGAAAGC
ATTGATACAT ATATCAACAG CGATCCTCAG ATCTTAGCAG CAAACAAATC GACTTTAGAA
ATTTTATTTC TCAATTATTC TAAGTATTTA GAAGAATTTA TTAACCAGAT AGAACCTATT
TTGACGAGAA TTGACACGCC AAGAATACAA CCGGAACAAG TAGAGATAGC GCGAGAGCAG
TTACTGGGAG TTATGAATAG TCAGATAGTT GAGCAGCTCA ATCTTTTATC GGATCAATTA
ACTAGAATTT TAGAGACTGC TCAAACACAA GAGCTAAAGA GACGGACAGA TGTAGAGCAA
GCTAGATTTG TTGAGAGAGC TATTGTCATA GTTAGTATGC TGGTTTCGGT TGCTGTTGCT
GCTGTTGTGG CATGGCGTAC CAGTCGAGCG CTCGCAGAAC CTGTAATTAC ATTAACACAA
GTAGCTGAAC AGGTAGCTAG AAAATCCAAT TTTGATTTAC GCGCTCCCAT TACTACTGAC
GATGAAATTG GGTTGCTAGC TAAATCGCTA AATCGCTTAA TTGAGCGAGT ATCTGAGCGT
ACTAGACAAC TCCAACAAGC CAAAGAATTA GCCGAAGCTG CTAGTAAAGC CAAAAGCCAA
TTTTTGGCCA ATGTTAGTCA TGAGTTACGT ACACCTTTAA ATGCAGTAAT TGGTTTAAGT
CAACTGCTAA AAGATGATGC TGTTGATATT GGTGCATCGC CAGATTTTAT CACCGACTTA
GAAACTATCA ATTCTGCTGG TAGACATTTA CTAGAACTAA TTAACGACAT CCTAGATGTA
TCAAAAATCG AAGCGGGGAA AATGACTCTC TACCCAGAGA CATTTGACAT CATAACTCTG
ATTAATAATG TTGTTCTGAC AGTGAAACCA GCAATAGAAA AAAATGGCAA CACCTTAGTA
TTAGAATGTG ATGAGTATTT GGGGACGATG TATGCTGACC AAACGAGGAT GCGACAGGTA
TTATTAAATT TACTGAGCAA TGCAGCCAAG TTTACTACCA ACGGCAAAGT AACACTAACT
GTTAAGATTG ATAAAACAAT CATCCTTAAA GAAGCACCCT TCGGATCAAT TATTTTCACC
GTAACTGATA ACGGAATCGG AATGTCACCC AGTCAAGAGC AGAAATTGTT TCAACCCTTT
ATACAAGGGG ATATTTCCAC TACCAAAAAA TATGGTGGTA CCGGCTTGGG ATTAGCTATT
AGCCGTCATT TTTGCCAGAT GATGGGGGGT GAAATTCTTG TTAACAGTCA GCTTGGGGTT
GGTTCTAATT TTAAGGTAAG TTTACCTTTG ACAGTGCGGC AATGA
 
Protein sequence
MPNQKPTFVD KSSELKKVST QERKIEELPT IEFPSRRKLK ASSWRIHQKI GYGYFVAIAI 
GFFGSLTGLV LANYYRGREV RQFNQANEQE RLLNNYKDAV IEAQLHSSSL IAVLNDPEKL
KIKKDKFLKS VAQAQKLEES IDTYINSDPQ ILAANKSTLE ILFLNYSKYL EEFINQIEPI
LTRIDTPRIQ PEQVEIAREQ LLGVMNSQIV EQLNLLSDQL TRILETAQTQ ELKRRTDVEQ
ARFVERAIVI VSMLVSVAVA AVVAWRTSRA LAEPVITLTQ VAEQVARKSN FDLRAPITTD
DEIGLLAKSL NRLIERVSER TRQLQQAKEL AEAASKAKSQ FLANVSHELR TPLNAVIGLS
QLLKDDAVDI GASPDFITDL ETINSAGRHL LELINDILDV SKIEAGKMTL YPETFDIITL
INNVVLTVKP AIEKNGNTLV LECDEYLGTM YADQTRMRQV LLNLLSNAAK FTTNGKVTLT
VKIDKTIILK EAPFGSIIFT VTDNGIGMSP SQEQKLFQPF IQGDISTTKK YGGTGLGLAI
SRHFCQMMGG EILVNSQLGV GSNFKVSLPL TVRQ
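
Both sequences above are printed in space-separated blocks of ten characters. The sketch below shows one way to strip that layout, compute a GC fraction, and translate a cleaned gene sequence so it can be compared with the listed protein. It is a minimal illustration, not the database's own method: the helper names are hypothetical, and the codon table handles only the amino-acid assignments of translation table 11 (which match the standard code), not its alternative start codons.

```python
# Codon table built from the conventional TCAG ordering.
# Assumption: only the standard codon assignments are needed; the special
# handling of alternative start codons in table 11 is not implemented.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(1 for base in seq if base in "GC") / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        amino = CODON_TABLE[seq[pos:pos + 3]]
        if amino == "*":
            break
        residues.append(amino)
    return "".join(residues)


# First 30 bases of the gene sequence, as formatted in the record.
head = clean_sequence("ATGCCAAATC AAAAGCCAAC CTTTGTTGAC")
print(translate(head))  # MPNQKPTFVD
```

Passing the full cleaned gene sequence to `translate` and comparing the result with the cleaned protein sequence gives a quick consistency check on a copied record. `gc_fraction` can be compared with the listed GC content in the same way.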