Gene Aazo_4098 details


Gene Information

Locus tag: Aazo_4098
Symbol:
ID: 9341903
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: 'Nostoc azollae' 0708
Kingdom: Bacteria
Replicon accession: NC_014248
Strand:
Start bp: 4164549
End bp: 4166054
Gene Length: 1506 bp
Protein Length: 501 aa
Translation table: 11
GC content: 41%
IMG OID:
Product: histidine kinase
Protein accession: YP_003722668
Protein GI: 298492491
COG category:
COG ID:
TIGRFAM ID:
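
The coordinate and length fields above are related by simple arithmetic, which the following minimal sketch checks. It assumes the Start bp and End bp values are 1-based and inclusive, and that the gene length includes the stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose length includes the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminating stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(4164549, 4166054)
print(length)                  # 1506
print(protein_length(length))  # 501
```

Both results agree with the Gene Length and Protein Length fields in the record.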


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGTACGAAT GGATCTTGCC AAGTCTGAGA GAAGTTTTAG CCTATAGCCA ATCAACTATG 
GCTGAGTGTT CATCTGCCAA AGCAGAGCAG CAATGGCGCA TCAGTTTAGC TGCTACTGAA
CATCTACTAC TCAAAACTTT AGCACCCACT ACCCCTAATA TCACTCAAGG TTTGGTTCTA
ACTGCACCAG CACCCTTATT TAGTCAGCCA AAACTAACTC AAAGTTTACA AACAGTAACT
TTTACAGCCA AACCTTTTAA CCCGTTGGCT TTGATGCCGT TTCATATCTC ACCAGCGATG
GTGCAGAGTG GTGGGAAAGA GGTAATGTGT ACTCTGGAAA ACTCACCGGG AATGACAGAC
TGTGCTTACG CACACACAGA AATCAATCCA GAGGAATCCA TATTACCTTT ATTACCTGCC
GATCCTCTGG GATCAGAACA ATTTTGCTTG GTATTCACAG AAAAATTTAG ATTAGTTCTG
GTTTTGTCAG AACACATAAG CGGTAAAAAA GAATTTTTAT TTTCATTTGA ACCAGAAGTA
GTACAACAGG CTTGGCACGC ATTAGGTGCA AGGGTTGTTC TGACTAATCC AGATTTATTC
GCTGAGTTGG ATGTTTTAGT TCAGCAATAT TCCCCAGTTG TAGCAGATTA TCAAACGGTA
ATTCAATTTA GCCAGTTGTT GCTTCAGGAA TTAGCAGAGC CAGAAGCAGA TAAAGCAGTA
CATAATCCTC CCATTTCTCC ACTTCCTCAT ATTCCCACTT CCCCATCACC AAAACTATCT
TCCCGTTCTG ATGTAGAATT ACTACAAGCC TTTGCTCACG AAGTCCGCAC ACCATTAGCG
ACTATTCGCA CTCTTACTCG TCTGCTACTG AAGCGGCGGG ACTTATCTAT TCCCGTAATT
AAGCGATTAG AAGTAATTGA CCACGAGTGT ACTGAGCAAA TTGACCGCAT GGAGTTATTG
TTTAAAGCGG CAGAATTACA AACTTGTTCT GCCGCAAAAT CTGCCAATAC ACAATTAACT
CCCATGTCTT TGGATCAAGT ATTACAGCAA AGTATCCCTC GTTGGCAACA AGCAGCAACA
CGACGGAATT TAACTTTAGA TGTGGCTTTA CCCCAGCAAC TGCCAACTGT GGTCAGTAAT
CCCGCTATGC TAGACCGGGT ACTTACGGGT TTAATGGAGA ACTTTACCCG CAGTTTACCC
CCTGGAAGTT CTATTCAAGT TCAAGTTATT CCCGCTGGTG ATCAACTCAA ATTACAATTA
TCTCCTCAAT TAGATTGCCA AGATACAACT AGAACTGCAA CACTACCAAT TCGTAAATCT
CTTGGTCAGC TATTAATGTT TCAACCAGAA ACTGGAACAA TTAGTTTAAA TATTGCTGCA
ACTAAGCATC TATTTCAGGC AATTGGTGGT AAGTTAATTG TCCGTCAAAA TCCCAAGTAT
GGAGAAGTAT TGACGATTTT TTTACCTTTG GAAGTCAACA GCAAACAAAA GGTAAAATTC
ACTTAA
 
Protein sequence
MYEWILPSLR EVLAYSQSTM AECSSAKAEQ QWRISLAATE HLLLKTLAPT TPNITQGLVL 
TAPAPLFSQP KLTQSLQTVT FTAKPFNPLA LMPFHISPAM VQSGGKEVMC TLENSPGMTD
CAYAHTEINP EESILPLLPA DPLGSEQFCL VFTEKFRLVL VLSEHISGKK EFLFSFEPEV
VQQAWHALGA RVVLTNPDLF AELDVLVQQY SPVVADYQTV IQFSQLLLQE LAEPEADKAV
HNPPISPLPH IPTSPSPKLS SRSDVELLQA FAHEVRTPLA TIRTLTRLLL KRRDLSIPVI
KRLEVIDHEC TEQIDRMELL FKAAELQTCS AAKSANTQLT PMSLDQVLQQ SIPRWQQAAT
RRNLTLDVAL PQQLPTVVSN PAMLDRVLTG LMENFTRSLP PGSSIQVQVI PAGDQLKLQL
SPQLDCQDTT RTATLPIRKS LGQLLMFQPE TGTISLNIAA TKHLFQAIGG KLIVRQNPKY
GEVLTIFLPL EVNSKQKVKF T
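
The gene sequence begins with GTG while the protein begins with M. This is consistent with translation table 11, where GTG can serve as an initiation codon and is then read as methionine. The sketch below illustrates this on the first and last codons of the sequence. The `translate` function is a hypothetical helper, and the set of start codons is a simplified subset assumed for illustration, not the full table 11 definition.

```python
from itertools import product

# Standard codon assignments in TCAG order; table 11 uses the same
# sense-codon assignments and differs only in its permitted start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: a simplified subset of the alternative start codons of table 11.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(dna: str) -> str:
    """Translate a CDS, reading an initial start codon as M and stopping at '*'."""
    dna = dna.replace(" ", "").upper()
    residues = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in START_CODONS:
            residues.append("M")  # initiator codon is read as methionine
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break  # stop codon ends the protein
        residues.append(aa)
    return "".join(residues)


# First 30 bases and last 12 bases of the gene sequence above.
print(translate("GTGTACGAAT GGATCTTGCC AAGTCTGAGA"))  # MYEWILPSLR
print(translate("AAATTC ACTTAA"))                     # KFT
```

The first output matches the opening ten residues of the protein sequence, and the second matches its final three residues followed by the TAA stop codon.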