Gene Aazo_2093 details

Gene Information

Locus tag: Aazo_2093
Symbol:
ID: 9339887
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: 'Nostoc azollae' 0708
Kingdom: Bacteria
Replicon accession: NC_014248
Strand:
Start bp: 2179509
End bp: 2180915
Gene Length: 1407 bp
Protein Length: 468 aa
Translation table: 11
GC content: 35%
IMG OID:
Product: histidine kinase
Protein accession: YP_003721257
Protein GI: 298491080
COG category:
COG ID:
TIGRFAM ID:
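
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, and that the final codon is a stop codon not counted in the protein length. The function names are illustrative only and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(2179509, 2180915)
print(length)                  # 1407
print(protein_length(length))  # 468
```

Both values agree with the Gene Length and Protein Length fields of this record.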


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.578999
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCTGGCTA TGCATAGCAT GAAAGGTGCT TATAAGTTGA ATCTAAAACT AGAGTCGACT 
TTGGAGGAGT TACCAGTTTG GACTGTTCAA ATCGAACTAA ATCAGCCAGG AAATGAACTA
GCTAAACTTT TTGATCAAGA GCCTTTATTA CCAGGAATTA TCTTGACTAA AAATCAGAAA
TGTGTAGGCA TGATTTCCCG ACCACTATTT TTTGAGCAAA TGAGCCGTCC TTATGGGTTA
GGGCTATTTG CTGGTCGGCC TATTGAATAT CTTTACAATG TTCTGCTACC AGAGATATGT
ATCCTATCTG AGGATACTCC CATTGTGGAT GCAACTCAGA CAGCATTAGA ACGATCCGCT
AAACTTGTTT ATGAACCGAT TGTAATTACA GATCAATCTA GTCAGTATGG ATTACTCGAT
TTCCATCACT TACTTTTAGC TTACTCTCAA ATCCATGTTT TAACACTTAA TCGACTCCAG
AAAGAAAAAG AACTTACTAG CATAGCAAGA GCAGACTTCC GCAACCTTCA ACACAACTAT
ACTCGACTAT TACAAAATGA AAAAATGATT GCCCTGGGAC AACTTGTAGC GGGTATTGCC
CACGAAATCA ATAATCCCAT GAACTTTATC TATGGTAATC TTAACTATGC CACTAAATAT
GTACAAGACC TCATATACCT AGTTGAATGT TATCAAGAAG AACCTTCTTA CTCTGAAGTA
GTATTGCGAG CCAAAAAAAG AGGTATTGAA ATTGAATTTA TCATGGAAGA TTTGCCAAGA
TTGTTATCTT CTATGAAAGT TGGTGCAACC AGAATTAATG AAATTGTCTT ATCTCTACGG
AATTTTTCCA GACTAGATCA AGCAGAAATA AAATCAGTTG ATATTCATGA AGGGATAGAA
AATACATTAA CAATTCTTCA GCATAACTTG AAAGCTAAAC CTGATCGTCC AGAAATTAAA
GTAATTAAAG ATTATGGTAA TCTGCCATTA GTAGAATGCT ATGCTGGTCA ACTTAATCAG
GTATTTATGA ATATAATATC TAATGCTATT GATGCCCTTT CCGAAAGTTA TCAAATATGT
ATTTGCGGTC ATCATTTAAC TTCCCATAAT CAAAAAGATA TGACAATTAC TATTCGTAGT
GAAGTCAATA AGGACAATCA GGTAATGATT GGCATTGCTG ATAATGGACC AGGAGTACCA
GAGAATATCC AAAAACGTGT ATTTGATCCA TTTTTCACTA CCAAGTCTGT AGGTAAAGGA
ACTGGATTAG GATTATCCAT CAGCCACCAA ATTGTGGTAC AAAAACATGG TGGTCAAATT
TACTGTTTAT CGCAGCCAGG ACAAGGCTCT GAATTTATTA TTAAAATTCC CATTATTTCG
GACAAAAATA ATTTAAATAA TAAGTAG
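
The gene sequence is printed in space-separated blocks of 10 bases, so it has to be cleaned before any computation such as the GC content reported above. A minimal sketch follows, assuming the sequence has been copied into a string. `clean_sequence` and `gc_fraction` are hypothetical helper names, and only the first line of the sequence is used here for brevity.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks and upper-case the bases."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence only; paste the full sequence to
# compare the result against the GC content field of the record.
first_line = "ATGCTGGCTA TGCATAGCAT GAAAGGTGCT TATAAGTTGA ATCTAAAACT AGAGTCGACT"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(f"{gc_fraction(seq):.1%}")
```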
 
Protein sequence
MLAMHSMKGA YKLNLKLEST LEELPVWTVQ IELNQPGNEL AKLFDQEPLL PGIILTKNQK 
CVGMISRPLF FEQMSRPYGL GLFAGRPIEY LYNVLLPEIC ILSEDTPIVD ATQTALERSA
KLVYEPIVIT DQSSQYGLLD FHHLLLAYSQ IHVLTLNRLQ KEKELTSIAR ADFRNLQHNY
TRLLQNEKMI ALGQLVAGIA HEINNPMNFI YGNLNYATKY VQDLIYLVEC YQEEPSYSEV
VLRAKKRGIE IEFIMEDLPR LLSSMKVGAT RINEIVLSLR NFSRLDQAEI KSVDIHEGIE
NTLTILQHNL KAKPDRPEIK VIKDYGNLPL VECYAGQLNQ VFMNIISNAI DALSESYQIC
ICGHHLTSHN QKDMTITIRS EVNKDNQVMI GIADNGPGVP ENIQKRVFDP FFTTKSVGKG
TGLGLSISHQ IVVQKHGGQI YCLSQPGQGS EFIIKIPIIS DKNNLNNK
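
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The sketch below assumes the codon-to-amino-acid assignments of translation table 11, which are the same as those of the standard genetic code (the tables differ in which codons may act as start codons). It does not handle alternative start codons, which is not an issue here because the gene starts with ATG. The `translate` helper is illustrative, not part of any library.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases and last 27 bases of the gene sequence (both in frame).
print(translate("ATGCTGGCTA TGCATAGCAT GAAAGGTGCT"))  # MLAMHSMKGA
print(translate("GACAAAAATA ATTTAAATAA TAAGTAG"))     # DKNNLNNK
```

The two fragments reproduce the first ten and last eight residues of the protein sequence listed above.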