Gene Aazo_2995 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAazo_2995 
Symbol 
ID9340798 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism name'Nostoc azollae' 0708 
KingdomBacteria 
Replicon accessionNC_014248 
Strand
Start bp3085455 
End bp3088463 
Gene Length3009 bp 
Protein Length1002 aa 
Translation table11 
GC content42% 
IMG OID 
Productmulti-sensor hybrid histidine kinase 
Protein accessionYP_003721911 
Protein GI298491734 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.175208 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAAGGAA AGGACAATTA CCAAGAGCAA CAGGAAATTA ATTCTACTGA TGAAGAGGGG 
ATGATTTTCC AGCTTGCTGA TACCACTATA CAGGCGTGTA ATGCTGCTGC TGAGAGAATA
ATTGGATATA AAAATGAGCA AATATTGGGA AAAACTCCCT TTAAACCTCC ATGGCAAATA
GTTCATGAAG ATGGAACAAT TTTTACGCCA GAAACAGTTC CACCTATAGT TGCTTTACAA
ACAGGTAAGC CATGCAAAAA CGTCGTCATG GGGCTTTATC AGCCAACTGG TCAGCTAGTT
TGGTTATTGC TTAACTCCCA GCCTTTATTT CAAACTAACA AATCTACTCC CTATGCAGTT
GTCACCACAT TTACCGACTT TACAAAAGTG AAATGTCGGC AATCTAGGAG TGATGCTTGT
ATTACCAATA ATAAAAAACT AATAGTAACT TGGAATCAAG ATCAAGCTGT AGAAATTTCT
CCAAATACGG ATAAGCAAAA GCTAAACCAA CCAGTTGCAG AACCTATATC TGACGATTGC
ACCAAAGCCT TCTTATCCAT GCAAAACCAT ATTTTAGAAC TAATTGCTAC GGGTAATTCA
CTCTCTGAGA TCCTCGAAAA GTTAGCTCAG TCAATAGAAG CGCAGTTGGC TGAAGTTTTC
TGTTCCATCA TGTTACTAAA TGATACAGGC ACAAAATTAT ATCCTGGCGT AAGTTCTAGT
CTACCAGAAA AATATATTCA AGAATTAAAG AATGGTGTTC CCGTAGGTGC AGATGTAGGT
AGTTGTGGCA CATCAGCCTA TACCAAACAA ACAGTAATAG TCTCAGACAT TGCCAGCGAT
CACAAATGGG TAAAATACCG TGACTTGACA TTGACGAATA ACCTCAAAGC TTGTTGGTCT
ACACCCATTT TGGACAGTCA AGGCCATGTT TTAGGAACAT TTGCACTATA TTATCCCACA
GTTCGCATTC CTCAAAAATC AGATCTGCAA CTCATTGAGA GAGCATCTCA CCTAGCAGGT
GTAGCAATTG AACGTCAAAG ATGGCAACTA GCTTTGCAAC AAAGTGAATA TCAATTGCGA
TTAATGACGG AAACAATTCC CCAACAGGTA TGGACAGCAC AACCTAACGG TCTGGTTGAT
TATTATAACC AGCGGTGGTG TGACTTTACA GGTAAAACGC AAGAACAACT GAAAACTGAG
AGTTGGGAGC AGATTATTCA CCCAGAAGAC TTACCAAAGG TAGAAGAATT GTGGGCTAAA
GCGGTACAAA CCGGCAGTGA CTATGAAGTA GAAACTCGGA TGCTCTCTGG GAGTGGAGAG
TACCATTGGA TTTTGGGACA GGCACGTCCT TTGCGAAATC AACAGGGACA GATTGTTAAA
TGGTACGGTA CCAATACCGA CATTACAGAC CATATCCAAG CCAGAGAAGC TTTACGAGAA
AGTGAGCTAA ACTTCCGCAC CTTGGCAGAT ACCATGCCGC AGATGATCTG GACTGCTAAA
GGAGATGGCT GGTTAGAATA CTGTAACCAA CGTTGGTTGG ACTATACTGG TATGACACTG
GAGGAAACTC AAGGTTGGGG TTGGCAACCT GTACTGCATC CCGATGATGC ACAAATCTGT
ATAGATATTT GGAGTGAGTC AGTTCGCACA GGTCAAAATT ATCAAATTGA GTACCGCTTA
CGTCGCGCTA GAGATGGGCA ATATCGTTGG CATCTAGGTC GAGCCTTTCC ACTACGCAAT
CAAAAGGGAC AGATTATCAA ATGGTTTGGT TCCTGTACTG ATATTGACGA CCACAAGCGG
TCTGAGGAAG CATTGCGAAA TGCCCTCCAA GAAAAGCAAG CAGCTTATGA AGCCGCTGAA
ACAGCGAATC GCGTTAAAGA TGAGTTTCTG GCTATCCTTT CCCACGAACT CCGTACCCCG
CTTAATCCGA TCCTGGGATG GTCTCAATTG TTAAAAACTG GCAAACTGGA CAAAGATAAG
ACAGATGAAG CCCTCAAAAG TATTGAGCGT AATGCAAAAT TGCAGGTTGA CCTGATTGAC
GATTTACTAG ATGTGTCACG CATCCTCCGA GGCAAACTAA CTCTGAATGT GACTACAGTT
AATTTGGCAG ATATAATTTC TGCAGCATTG GAAACAATGA GTTTAGCAGC CTCTGTTAAA
CAAATTGCCG TTCAAACCCT AATTGAACCG AATATAGGAA AAGTTAGAGG TGATCCGGCT
CGCTTGCAAC AAATTGTCTG GAATCTTTTG TCTAATGCTA TTAAATTTAC ACCTCAAGGC
GGAGAGGTAG TAGTACAGCT AGAGCAAATT GGTTCAATGG CTCATATTAT AGTTAGTGAT
CAAGGTAAAG GTATCAACCG TAAATTTCTA CCTCATATAT TTGAACACTT CCGGCAAGAG
GATGGCTCAA CTACCAGAAA ATTTGGTGGT CTGGGATTGG GACTAGCAAT AGTCCGCCAT
TTAGTAGAAT TACACGGTGG TACAGTTGAG GCCGATAGTC CAGGAGAAGG ACAGGGCACA
ACTTTAAAAG TTATACTACC GTTAATATCT TCCTTGCCAG TGACAGTTAC GAATGGTCAA
TTTGAGGACG TAGCTTTAAG TTTAAAAGGA ATTAAAATTT TGGTTGTGGA TGATGATGTT
GATTCGCTGG CTTTTGTTAA AGTTGTGCTG GAATTGTATG AGGCAACTGT GACCACAGCG
GCATCAGCAC CAGAAGCATT ACAAATTGGG GTACAGTTAA AGCCTGATGT TTTGATCAGT
GATATTGGAA TGCCGGAGAT GGATGGTTAT GAACTGTTAA GAAGAATGAG AGCTTTGTCG
TCAGACTTGG GTGGAGACAT TCCAGCAGTA AGCAAAGCTA TACCGAAGGC GATCGCTCTA
ACAGCATTTG CTGGAGAACT TGACGAACAA CAGGCAATGG CGGCAGGTTT TCAAATACAC
ATACCTAAAC CAGTAGAACC AGAGGCGTTA GCAGCAGCTG TTGCCCAAGT TCTGAAAATA
AATGTTTAG
 
Protein sequence
MQGKDNYQEQ QEINSTDEEG MIFQLADTTI QACNAAAERI IGYKNEQILG KTPFKPPWQI 
VHEDGTIFTP ETVPPIVALQ TGKPCKNVVM GLYQPTGQLV WLLLNSQPLF QTNKSTPYAV
VTTFTDFTKV KCRQSRSDAC ITNNKKLIVT WNQDQAVEIS PNTDKQKLNQ PVAEPISDDC
TKAFLSMQNH ILELIATGNS LSEILEKLAQ SIEAQLAEVF CSIMLLNDTG TKLYPGVSSS
LPEKYIQELK NGVPVGADVG SCGTSAYTKQ TVIVSDIASD HKWVKYRDLT LTNNLKACWS
TPILDSQGHV LGTFALYYPT VRIPQKSDLQ LIERASHLAG VAIERQRWQL ALQQSEYQLR
LMTETIPQQV WTAQPNGLVD YYNQRWCDFT GKTQEQLKTE SWEQIIHPED LPKVEELWAK
AVQTGSDYEV ETRMLSGSGE YHWILGQARP LRNQQGQIVK WYGTNTDITD HIQAREALRE
SELNFRTLAD TMPQMIWTAK GDGWLEYCNQ RWLDYTGMTL EETQGWGWQP VLHPDDAQIC
IDIWSESVRT GQNYQIEYRL RRARDGQYRW HLGRAFPLRN QKGQIIKWFG SCTDIDDHKR
SEEALRNALQ EKQAAYEAAE TANRVKDEFL AILSHELRTP LNPILGWSQL LKTGKLDKDK
TDEALKSIER NAKLQVDLID DLLDVSRILR GKLTLNVTTV NLADIISAAL ETMSLAASVK
QIAVQTLIEP NIGKVRGDPA RLQQIVWNLL SNAIKFTPQG GEVVVQLEQI GSMAHIIVSD
QGKGINRKFL PHIFEHFRQE DGSTTRKFGG LGLGLAIVRH LVELHGGTVE ADSPGEGQGT
TLKVILPLIS SLPVTVTNGQ FEDVALSLKG IKILVVDDDV DSLAFVKVVL ELYEATVTTA
ASAPEALQIG VQLKPDVLIS DIGMPEMDGY ELLRRMRALS SDLGGDIPAV SKAIPKAIAL
TAFAGELDEQ QAMAAGFQIH IPKPVEPEAL AAAVAQVLKI NV