Gene Aazo_1036 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAazo_1036 
Symbol 
ID9338831 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism name'Nostoc azollae' 0708 
KingdomBacteria 
Replicon accessionNC_014248 
Strand
Start bp1107856 
End bp1109733 
Gene Length1878 bp 
Protein Length625 aa 
Translation table11 
GC content41% 
IMG OID 
Productserine/threonine protein kinase 
Protein accessionYP_003720521 
Protein GI298490344 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.535363 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACAACAA AACTGTTAAA CAACCGCTAT CAAGTTATCC AATTACTCGG TGCAGGTGGC 
TTTGGGGAAA CCTCTCTTGC AGAAGATACC CATTTACCTT CTCGTCGTCG CTTTCTTATC
AAAGAACTCA AGCCAATTAA CAATGATCCA GAAACTTCTC AAATAATTCA ACAACGGTTT
GAAAGAGAAG CGGCTATTTT GGAAAATTTG GGAGAAACTA GTGATCAAAT TCCCAAACTT
TACGCTTATT TTCCAGAAAA TGGCAAATTT TATCTTGTCC AAGAATGGAT TGAAGGTCAA
ACGCTCACCA ATATTATCCA ATCAAAAGGC AAATTAAAAG AAACGATTGT TCGAGAAATT
CTTTTAAGTT TGCTACCAGT TTTAGATTAT GTTCACAGCA AAGGTATCAT TCATCGAGAT
ATAAAACCAG ATAATATAAT CCTTCGCTCC CAAGATAATA AACCAGTTCT AATTGATTTC
GGTGCTGTTA AAGAAACAAT CCGTACAGTC ATCAATCCTT CAGGAAATCC CCTACAATCC
ATAGTCATAG GTACACCAGG GTATATGCCC AGTGAGCAAG CTATCGGTCG TCCAGTTTAT
GCTACGGATA TCTATAGTTT AGGCTTGACG GCAATTTATC TGCTCACAGG TAAACAACCC
CAAGACTTAG AAACTCATCC CCAAACGGGT CAAGTACTTT GGCAACAATA CGCTGCTGGT
ATATCACCAG AAATGGTACA GATACTTACT CAAGCTATTG AACCACGTCC GAGCGATCGC
TACACCACGG CCAGTAAAAT GCTCTATGCT TTAAAATCTG GTCATAATAC TGATCATAAT
AATTATATTT CTTCCCATGC TCCGACTACC CGCGCCACAA TTAGTCTTAG CCCCCCTCCT
CCTACCAGCC AAACAACCCA GCCAATCTAT TCACCTGCAA GAACTCCTGT TATCAGAGAA
GTTAACAATC CTGCAAACGG GCAAAAAACC GCTGTAATTC TTGGTAGTTT GCTGGTGAGT
AGTTTGATTG TTGCAGTAGG AATATCCAAC CGTCAGCCAC AACCTTCAGC CCCAGTCGCT
ACTAACTCCA CAGTCACAAG AGAAACCCAG ACTCCCACTG TTGTACTTAC AAATTCCCCA
GTTGCTAAGG AAGTTTCCCC AACACCAGTA ATTACACCAA CTCCTTCTAC AGAACAGCAA
CTCATTTCTA ATCCCTTACC AAAAACTAAT TCTGCACAGG TATCAACACC ACCACCAGTG
GATAAACCCG TAATTCAAGA TACTCCCACT CCCACAGTCA GATCAACACC CGAAGTAGAA
ATACAACAAC AGCAATCTCA GCCAGCTGTA GTTCCCACAG CCGAGGTATT TTCAGATTCC
CAGAAAAAGC CAGGTAAGCA AAGAAAAGAA TTACCTGAAC AGTTAGCCAC CAACATCAGG
CAAACTGTAC CAGTATTTCC CACAGGGACA TCCAGAAATA GTGTAGAAGC AGCACTGGGG
AAACCAAAAA AAGATTTAAG GGGACTATGG TCGAATACCC GTGCTATCAC TTATAAAGTA
GTACCAAATC AAATTGATTT AGGCTACTTA TTTGACCGTG ACACTGGTAG AATCAGGCAA
ACCGAAGCGG CTTTTGCTTC ATCAGTAGAT AATCAAGTTA TGCAAACAAC CTTAAATGGT
TTATTAGCTG GACAAGCCAC AGCAGAAATT AAACAAGGAC TCCAAAAAAT TCAACAGCGT
CAGATAGATA ATTTTAAATT TACGAAGGGT TCTGTGAAAG GTCAAATAGT GCGGCAAAAT
TGTGATTTCA TCTACATCAG TATTTGGGAT GCAGATTTAC ATGATTTTGT GAATCCGTCA
GACGGTAAAC AATGTTAA
 
Protein sequence
MTTKLLNNRY QVIQLLGAGG FGETSLAEDT HLPSRRRFLI KELKPINNDP ETSQIIQQRF 
EREAAILENL GETSDQIPKL YAYFPENGKF YLVQEWIEGQ TLTNIIQSKG KLKETIVREI
LLSLLPVLDY VHSKGIIHRD IKPDNIILRS QDNKPVLIDF GAVKETIRTV INPSGNPLQS
IVIGTPGYMP SEQAIGRPVY ATDIYSLGLT AIYLLTGKQP QDLETHPQTG QVLWQQYAAG
ISPEMVQILT QAIEPRPSDR YTTASKMLYA LKSGHNTDHN NYISSHAPTT RATISLSPPP
PTSQTTQPIY SPARTPVIRE VNNPANGQKT AVILGSLLVS SLIVAVGISN RQPQPSAPVA
TNSTVTRETQ TPTVVLTNSP VAKEVSPTPV ITPTPSTEQQ LISNPLPKTN SAQVSTPPPV
DKPVIQDTPT PTVRSTPEVE IQQQQSQPAV VPTAEVFSDS QKKPGKQRKE LPEQLATNIR
QTVPVFPTGT SRNSVEAALG KPKKDLRGLW SNTRAITYKV VPNQIDLGYL FDRDTGRIRQ
TEAAFASSVD NQVMQTTLNG LLAGQATAEI KQGLQKIQQR QIDNFKFTKG SVKGQIVRQN
CDFIYISIWD ADLHDFVNPS DGKQC