Gene Aazo_0561 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAazo_0561 
Symbol 
ID9338347 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism name'Nostoc azollae' 0708 
KingdomBacteria 
Replicon accessionNC_014248 
Strand
Start bp583941 
End bp585863 
Gene Length1923 bp 
Protein Length640 aa 
Translation table11 
GC content37% 
IMG OID 
ProductGAF sensor signal transduction histidine kinase 
Protein accessionYP_003720181 
Protein GI298490004 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.393536 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTTAGTT CTCCCGATTT GAGCTTTTCT CGAACCTTGC CTCTGAGTGT ATTTAATCGG 
CTTGGGGAAT TGTTGCAGCA GATGGCTCAA GCAGTGGGTA CTGCCGCTTT GATGCTAACA
GAAGCTGTGT TGCAGCGAAT GCCCATTCCC GCTGAATGGG AAAGGCAACG GTTTACGCTG
GTAATTTCTG AACAATTTAG TGCGCTGCTG TTGGGAAATA TAGAAGCAGG GGAACAGGAG
ACTAGGGGAC TAGAAAAGCC AGAGAATCCC TCGATAAATA CTAGTTTGAC ATTTAATCCA
GATGAGATCG CTTCTTTTAT CTCAGAAGTC ATAAATTTGT TTGACTGCAA TTCTTGCATT
TATCAAAACA TCGCTCGATA TCAACAAGTT ATCGCCCCTA ATGATGCTAA ACTGCAAAGT
AAACTTACCC TGTTATTATT AGAATATATG CTACCCATTA TTAGTAATGA GGTAACAGTA
TCTTCAACTA TAACTCCTGG TGAAGCTTAT AGTTATCAAG GAGTAGAAGA CGCTCTAAAT
AAGCAAATTG CTCAAGAACG ACTGTTAAAT GAAGTAACAA GCCAAATCCG TAAAAGCTTG
GATTTGCCAA TCATCATGAA AACGGCAATT ACCCAAATAC GTGAGTTTAT GGTATTAGAT
AGATTAGTAA TTTATAAATT TGCATCGGCT CAAGTTAAGA GTCAATACTC ATCATTCAAT
GACCAATATT TACCTAACTC TCAACATTTA CCACAAAATT ATCAAAATTA CCAGGGTTGT
ATAGTTTATG AATCTCTGAG CACAGATGAT ATTCCCTCAG TATTAAATTA TCAAGAAGAA
ACTTGTTTGA TGCGAACTTC CTTGTGTTGG GAGAAATACC GTCAAGGTTT TGTTTTAGCT
GTGGATGATA TAGAAAAAAC ATATCCTTTG GAAGAATGTT TATTGAATTT CCTAAGAAGA
AGTAAAGTCC ATGCGAAATT AGTAGCACCG ATAATTTTTG AAGAAAAACT GTGGGGATTA
TTAATTGCTC ATCAATGTAA TGCTCCCCGT GAATGGACTG AAAGCGAAAA AAGTTTGCTA
ACTTCAGTTA CCAAACAGTT AGCGATCACA ATTCATCAAA CAGAGTTAAT GGCATCTCTT
ACTAAACAAA AACAAACCCT AGAACAACGG GTTATTGAAC GCACAATGGC ACTGCAAGAA
GCCTTACTAG CAGCGGAAGC AGCCAATCGT CTCCGAAGTG AATTTCTTGC TACCATCAGC
CATGAATTAT TAACTCCTTT AACCTATGTA ATTGGGATGT CTTCAACATT ATTGCTTTGG
CCTTTAGGCG AATTAAGCAA ACGACAACGG GATTATTTAC AAACCATCCA TGACAGTGGA
GAACATTTAT TAGAAATGAT TAATGACATT CTGGATTTAT CACAAGTTGA AGCTGGTAGG
ACAGTATTAA ATATTACCGA TTTTTCCTTA GTTAAATCAG CACAAAACAC TTTAGATTCT
CTCTTAGAAA AAGCCAGAAG CGAAAAAGTA ACACTTAAAT TAGACTTGCA AATTAATCCC
TCACATGATA GCTTTACCGG CGATTCTACA AGAGTAGAAC AAATTCTCTG GAATTTATTA
ACAAATGCAA TCAAATTCAC TCCCGAAGGT GGTAATGTCA CCTTACGTTT GTGGGTAGAA
GATGCAACCG CCATTTTTCA AGTAGAAGAT ACTGGTATTG GTATTCCAGA AGAACAATTA
CCACTTTTGT TTGAGAAATT TCAACAACTT GACACACCCT ACCGTCGTCG CTACGAAGGT
ACAGGAGTTG GTTTGGCTTT AACCAAACAA CTTGTAGAAT TACATCGAGG TCGAATTGAA
GTAGAATCTA CCGTAGGTAT AGGTTCTATT TTTACTGTTT GGATACCTAA TCAATTAAAG
TAA
 
Protein sequence
MLSSPDLSFS RTLPLSVFNR LGELLQQMAQ AVGTAALMLT EAVLQRMPIP AEWERQRFTL 
VISEQFSALL LGNIEAGEQE TRGLEKPENP SINTSLTFNP DEIASFISEV INLFDCNSCI
YQNIARYQQV IAPNDAKLQS KLTLLLLEYM LPIISNEVTV SSTITPGEAY SYQGVEDALN
KQIAQERLLN EVTSQIRKSL DLPIIMKTAI TQIREFMVLD RLVIYKFASA QVKSQYSSFN
DQYLPNSQHL PQNYQNYQGC IVYESLSTDD IPSVLNYQEE TCLMRTSLCW EKYRQGFVLA
VDDIEKTYPL EECLLNFLRR SKVHAKLVAP IIFEEKLWGL LIAHQCNAPR EWTESEKSLL
TSVTKQLAIT IHQTELMASL TKQKQTLEQR VIERTMALQE ALLAAEAANR LRSEFLATIS
HELLTPLTYV IGMSSTLLLW PLGELSKRQR DYLQTIHDSG EHLLEMINDI LDLSQVEAGR
TVLNITDFSL VKSAQNTLDS LLEKARSEKV TLKLDLQINP SHDSFTGDST RVEQILWNLL
TNAIKFTPEG GNVTLRLWVE DATAIFQVED TGIGIPEEQL PLLFEKFQQL DTPYRRRYEG
TGVGLALTKQ LVELHRGRIE VESTVGIGSI FTVWIPNQLK