Gene Aazo_2202 details

Gene Information

Locus tag: Aazo_2202
Symbol:
ID: 9340001
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: 'Nostoc azollae' 0708
Kingdom: Bacteria
Replicon accession: NC_014248
Strand:
Start bp: 2290516
End bp: 2292357
Gene Length: 1842 bp
Protein Length: 613 aa
Translation table: 11
GC content: 39%
IMG OID:
Product: histidine kinase
Protein accession: YP_003721328
Protein GI: 298491151
COG category:
COG ID:
TIGRFAM ID:
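
The coordinates and lengths listed above are mutually consistent, which a short Python sketch can confirm. It assumes that Start bp and End bp are 1-based and inclusive, and that the final codon of the CDS is a stop codon that adds no residue. The function names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
START_BP = 2290516
END_BP = 2292357


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residue count for an unspliced CDS that ends in a stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1  # the stop codon encodes no amino acid


length = gene_length(START_BP, END_BP)
print(length)                  # 1842
print(protein_length(length))  # 613
```

Both results match the Gene Length (1842 bp) and Protein Length (613 aa) fields.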


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.143763
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCAGCAAC CTTCAAGTTT ACCAGAACAG AACTCACAGA TAGACGGAAC GCCAACTCTT 
TTGCAAACAA TCCAGCAACT GCGGGATGAC TTGTGGTTAG AAAGTAGCTT GAATCAGTTG
CAAAGCCGAT TGTATGAATG CCTGCTTTCA GCTGCTAACA CTATGTCACA GGTGATAACC
CCAGAAGCAG AGATTTTTCA AACCGTTGTA AATGAACTGC ATCAGGCTTT AAATGGCAGT
TGGATAGGGT GTACGCATTG TGCCGTAGGC ATAGCTCAAT GTCAACCACA AGAAAAAAAC
GGTAGAATTT GTTATGTTTC TAGGTTTTCT ACCGTGGAAA CACGAACTAC AGAAATTAGC
TATAAACAAC GGACAAAGCT GGAGTTTAAA TTAGACGCTG CTATCAGATT AGAAGATTTA
CAGCAAATGG AGAGACAAAA ACCACGGATT GCTTGGCCTT TAGTTGATGA TTCTAGAGAT
GTGATGGCAT GGCTAATTAT TGCCACAGCT AAACCACGCG ATCATCGTGA GCAAATTCAT
CCATTACAGA TTCAACTGCG ATCACAATTA ATCACAAAGA CAATCCAGCA CTTTAATACA
GCCTTAGCAC AACTGAGGCA AATTCAATTT TGGCAACAAC GTTGTCAACA GTTAGCTAAC
TTTAATCAGG AACTAGAACG CACCAATCAA CTCAAAAACC AGTTTCTGGC CAATACCAGC
CACGAAATTC GCACCCCGCT TAGTTCTATT ATTGGATTCA CCCATCTACT GCTGGCACAA
GGTTACGAAC CAGAAAGACA ACGCCATCAA GAATATTTAC ACATCATCCA ATCTAGTGGT
AAGCATTTGC TAGACCTGAT TAACGATATT TTGGATCTCT CTAAAATTGA AGCTAACCAA
CTAGAAGTGC AATGGGAAAT AGTCGAAGTA TCAACATTAT GCCGTAATGT TTTAGCTTTG
GTAAAAGAGA AAGCTGCTAA CAAAGGGTTG AAACTACGGT TGGAAATAGA GAATGATGTC
ACAAATTTAG TAGTCGATCC CTTGCGACTC AAGCAAATGC TATTAAATTT ATTATTTAAC
GCTGTAAAAT TTACAAATTT AGGAAGTGTT GGTTTACGGG TATCTATAAA AGATTTATAC
TTACGATTTT CAGTTTGGGA TACTGGTATT GGCATTTCCC AGGAAAACCA AACACGCTTG
TTTCGTCCCT ACAGTCAAAT TATTAATCCT GGTGCTGGAA GCAATGAAGG TAGCGGTTTG
GGGTTAGTGG TGACACAGCA ACTTGCAGAA ATTCATGGTG GTTATTTAGA ATTGGAATCA
GCAATTGATC AAGGTTCATG TTTTACCATT GTCCTTCCTC TCAAGCCACA GGGGAGAGCT
TCGGAAGCTA TCGATATCAA GGCAGAAGAA GATTCAGAAA TTAGTCAGGA GGTGAAAAAA
TATTCTGATT CCCAATCTTC TTCTCGTATA TTCATAAATA CCTCACGAGT AATTTTATTG
GTAGAAGATG ATTTAGCCAA TTCCGAATTA ATAAGAGTTT ATCTTGGTAG ATTGGGTTAT
CAAGTAACTT GTGTTAAGAA TGCTCAGGAA ATGTGGACAA TGCTGCCACA AATAGAACCA
GCAGTTATTT TAATGGATGT GAGTCTACCA AATGCCAATG GTTTGAACTT GGTAAAACAA
CTCAGAGACA ATATTGAATA TCAGCAGATA CCAATAATTG CTCAAACAGC AATGACAATG
AAAGGAGATA GAGAAACTTG TCTAGCCGCC GGTGTGGATG ACTATATTTC TAAACCTATA
GATTTACAAC TTTTAGGTAG TATAGTGGCA AAATATAGCT AA
 
Protein sequence
MQQPSSLPEQ NSQIDGTPTL LQTIQQLRDD LWLESSLNQL QSRLYECLLS AANTMSQVIT 
PEAEIFQTVV NELHQALNGS WIGCTHCAVG IAQCQPQEKN GRICYVSRFS TVETRTTEIS
YKQRTKLEFK LDAAIRLEDL QQMERQKPRI AWPLVDDSRD VMAWLIIATA KPRDHREQIH
PLQIQLRSQL ITKTIQHFNT ALAQLRQIQF WQQRCQQLAN FNQELERTNQ LKNQFLANTS
HEIRTPLSSI IGFTHLLLAQ GYEPERQRHQ EYLHIIQSSG KHLLDLINDI LDLSKIEANQ
LEVQWEIVEV STLCRNVLAL VKEKAANKGL KLRLEIENDV TNLVVDPLRL KQMLLNLLFN
AVKFTNLGSV GLRVSIKDLY LRFSVWDTGI GISQENQTRL FRPYSQIINP GAGSNEGSGL
GLVVTQQLAE IHGGYLELES AIDQGSCFTI VLPLKPQGRA SEAIDIKAEE DSEISQEVKK
YSDSQSSSRI FINTSRVILL VEDDLANSEL IRVYLGRLGY QVTCVKNAQE MWTMLPQIEP
AVILMDVSLP NANGLNLVKQ LRDNIEYQQI PIIAQTAMTM KGDRETCLAA GVDDYISKPI
DLQLLGSIVA KYS
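
The gene and protein sequences above can be cross-checked by translating the nucleotide text. The Python sketch below is illustrative only: it assumes the gene sequence is printed 5' to 3' in the coding frame (the Strand field in the record is empty), and it uses the standard codon assignments, which translation table 11 shares for sense codons. The helper names `clean` and `translate` are hypothetical. Only the first and last printed lines of the gene sequence are used, to keep the example short.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
# Table 11 differs from the standard code in its start codons, not here.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to print 10-letter blocks."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First and last printed lines of the gene sequence above.
first_line = "ATGCAGCAAC CTTCAAGTTT ACCAGAACAG AACTCACAGA TAGACGGAAC GCCAACTCTT"
last_line = "GATTTACAAC TTTTAGGTAG TATAGTGGCA AAATATAGCT AA"

print(translate(first_line))  # MQQPSSLPEQNSQIDGTPTL
print(translate(last_line))   # DLQLLGSIVAKYS
```

The two outputs match the first 20 and the last 13 residues of the protein sequence. The last line can be translated on its own because every preceding line holds 60 nucleotides, so it begins in frame.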