Gene Aazo_0416 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAazo_0416 
Symbol 
ID9338201 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism name'Nostoc azollae' 0708 
KingdomBacteria 
Replicon accessionNC_014248 
Strand
Start bp422073 
End bp425006 
Gene Length2934 bp 
Protein Length977 aa 
Translation table11 
GC content37% 
IMG OID 
ProductCheA signal transduction histidine kinase 
Protein accessionYP_003720093 
Protein GI298489916 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACTAGTG ATAAAGAATT GGAAATCCAG ATGCAATTTC TGGAAGAAGC AACCGATTAT 
TTGAATACTT TAGAAACTAT CTTGTTGGAA ATTGATACGA GTAAACATAT CGAATTAGAG
AAAATTAATG CAGCCATGCG AACTGCCCAT TCTATTAAGG GTGGTGCAGC GATGATGGGT
TTTAAAGTAT TGAGTAATTT GGCGCATCGT TTAGAAGATT CTTTTAAAGT TTTAAAAACT
CGCAATAAAT CTCTAGAGAT TGATACTCAT TTGCAGAGTC TATTGTTATC TGGAGTTGAC
TGGTTAAGGC AAATTGTGGA TTTGTTATCA GAGAAAAAAT CTCTAGATGA TAAGTGGTTA
AAAACTTTTT GTTATCCCAT TTTTGATGAA CTGCATCAGC GTTTGGGTGA TCCTTCTCCA
GAGGATGTCA CCACAATGCT ATCACCGGAA GATGGGCAAG AAATTGTTCC TTTGCTGTTT
GAAACTGAGG TGGAAGAATG TTTACAACAT TTAGAATCTT TATTAGAAAG TCATCCAACA
AATGATTTAA AAACAGCAGT AGATGTGATG GCTTCCGAGT TGGGTGGGTT AGGAGAAATG
CTTCAGGTAC ATGCTTTTGT CCGGCTTTGT AAATCAGTTA ATCACTATTT AGAAAGTCAG
CCTGATCGTT ATTTAGAAAT TAGTCAATTA GCATTGCAAG CTTGGAGGCG ATCGCAAGCT
TTAGTCCTGA CAAATCAAAG GGATAGATTA CCTACAGAAA TTAAATTAGG TGAAGTAGTT
ATTAACCTCA CTCCTCAGCA AATCAATATT CCCCCAATAG CTATCAATCA AGAAGATACC
TTGGTAGCTG AAAAGCAAGT TCCTGATTTT GAGTCTTTAG AAATAGAGTT ACCACCAGAG
ATTCCTTCTT TAGATTATAA ACATATTGAA CGTAAAGGAG AAATTATTGG TGTTTCTAAA
GATAAAGAAA ATCATGAAAA TACAGTTAGA GTTCCCAGTA AGCAATTAGA GGAAATTAAT
GATTTATTTG GGGAAATAAT TATTCAACGT AATGGCTTAA ATTCTCAATT AGAAAGATTA
CGTAAACTAG TTTTAGGACT GAGCCAAAGA GTGCAAACTC TTGACCAGGA GCATCGAGAA
GTGTGTTCAG CATATCAAAA ACTTTTTCAC CAAACTATGT CTTCTGGAGT ATTAACACCA
GATGAGCAGG TTACAGACTC TGAGGTAATT AGTTTAGAAA TAGATCGCTA TCAAAAATTA
AACCTGCTAT CTCAGGAGTT GATGGAAACT ATTGTACAGG TAGCAGAAGT TGCCAGTGAT
ATTCAACTTA GTGTGGATGA TACAGATCAA ATTGCGCGGA AGTTAAATAA AACTTCTAAG
CAAGTGCAGA GAAAGTTGAC ACAGGTGCGG ATGCGTCCTT TATCTGATTT GGTAGAAAGA
TTTCCCAGAG CTATCCGCGA TTTAAATATT GAGTATGATA AAAATGTTCA ATTAAAAATT
GAAGGTGGTA AAACATTAAT TGAACGCAGC ATCTTAGAGG CTTTGAATGA GCCTTTAATG
CATTTATTAC GTAATGCTTT TGATCATGGA ATTGAAGATC ATGCTACCCG TCATGCTCAA
GGGAAACCTG AACAAGGATT GATTGAAATT AAAGGATATC ATCGGAGCGA TCGCACCATC
ATAGCTATCA CTGATGATGG TAGAGGTATT TCTCTAGAAA AAATCCGCCA ACGTGCTATA
GCTATGGGTT TGGATACAGC ATTAATAGCT GCAGCTAGTG AAGAAGAACT ATTATCACTG
ATTTTTGAAC CTGGGTTTAC CACCTCTGAT AAAGTTACGG CTTTATCTGG TCGTGGTGTG
GGAATGGATG TAGTTCGTAA TAACCTACAA CTCGTCCGGG GCGATATTAA AGTTGATACA
CAGCCAGGAA TTGGTACAAC TTTTACTTTA TCAGTCCCAT TTACACTGTC TGTAGCGCGA
GTTTTGTTAG TAGAAAGTGA AACAGGTAGC GGTTTAAATC ACCGCATGAT TTTGGCATTT
CCTACAGATC TAGTTGCAGA AATCTTTTTA CTAGGAAGTG ATCAGGTTTT CGCTATGGAT
GGTGGGGAGT TTCTCAAATG GCAAGATACC ATGCTACCTT TAATGCGACT TGGGAATTAC
TTTGATTTTA ACTGTTCCCA CTACAATAAT TTAGAATTAG AAAGTCCTAC GGGAATTAAT
GCCAGCAGTG TGTTAATTAT CAAAAATGAT CATCAACCCG TAGCTGTACA AATAGATCGC
TGTTGGGGTG AGCAAGAAGT GGCCATTCGT CAAGTTGAGG GAAAAATCCC TTTACCTGAT
GGTTTTAGTA ACTGTACAAT GCTCGGTGAT GGTCGGGTAG TACCATTAAT AAACACTAAT
GAATTAGTAT CTTGGATTAC TAACAATCAA CGCCCCCATA GAAGTAGTCA GTTAAGTAAT
AAATCACCTG CAACTAAATT AAAAACAGCT TTTCTCAAAC CACAAAAGCA TAAACCCATT
ACTAGCCCTA CTCACCAAAA AGGCATGATT TTAATCGTCG ATGACTCAAT TAATGTCCGA
CGTTATTTAG CTTTAACTCT CGAAAAAGGA GGGTATCAAG TTGAACAGGC TAAAGATGGT
CAAGATGCTT GGGAAAAGTT AGAAAGTGGT TTAAAAGTTC AAGCTGTAAT CTGTGATATT
GAAATGCCAC GTCTTGATGG CTATAGTTTT TTAGAACGAG TTAAATCTAA TGATATTTTA
AGGAATATTC CTGTTGCTAT GTTGACTTCT CGTAGTAGTA ATAAACATCG TCAACTAGCA
ATGCAATTAG GAGCAAGAGC TTATTTTTCT AAACCTTATA ATGAACAAGA TTTGTTGAGA
ACATTGGAAA AAATGATTTT TAGGATTGTG GAAAGTGGCT CTGTAAATAA TTGA
 
Protein sequence
MTSDKELEIQ MQFLEEATDY LNTLETILLE IDTSKHIELE KINAAMRTAH SIKGGAAMMG 
FKVLSNLAHR LEDSFKVLKT RNKSLEIDTH LQSLLLSGVD WLRQIVDLLS EKKSLDDKWL
KTFCYPIFDE LHQRLGDPSP EDVTTMLSPE DGQEIVPLLF ETEVEECLQH LESLLESHPT
NDLKTAVDVM ASELGGLGEM LQVHAFVRLC KSVNHYLESQ PDRYLEISQL ALQAWRRSQA
LVLTNQRDRL PTEIKLGEVV INLTPQQINI PPIAINQEDT LVAEKQVPDF ESLEIELPPE
IPSLDYKHIE RKGEIIGVSK DKENHENTVR VPSKQLEEIN DLFGEIIIQR NGLNSQLERL
RKLVLGLSQR VQTLDQEHRE VCSAYQKLFH QTMSSGVLTP DEQVTDSEVI SLEIDRYQKL
NLLSQELMET IVQVAEVASD IQLSVDDTDQ IARKLNKTSK QVQRKLTQVR MRPLSDLVER
FPRAIRDLNI EYDKNVQLKI EGGKTLIERS ILEALNEPLM HLLRNAFDHG IEDHATRHAQ
GKPEQGLIEI KGYHRSDRTI IAITDDGRGI SLEKIRQRAI AMGLDTALIA AASEEELLSL
IFEPGFTTSD KVTALSGRGV GMDVVRNNLQ LVRGDIKVDT QPGIGTTFTL SVPFTLSVAR
VLLVESETGS GLNHRMILAF PTDLVAEIFL LGSDQVFAMD GGEFLKWQDT MLPLMRLGNY
FDFNCSHYNN LELESPTGIN ASSVLIIKND HQPVAVQIDR CWGEQEVAIR QVEGKIPLPD
GFSNCTMLGD GRVVPLINTN ELVSWITNNQ RPHRSSQLSN KSPATKLKTA FLKPQKHKPI
TSPTHQKGMI LIVDDSINVR RYLALTLEKG GYQVEQAKDG QDAWEKLESG LKVQAVICDI
EMPRLDGYSF LERVKSNDIL RNIPVAMLTS RSSNKHRQLA MQLGARAYFS KPYNEQDLLR
TLEKMIFRIV ESGSVNN