Gene Aazo_4944 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAazo_4944 
Symbol 
ID9342750 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism name'Nostoc azollae' 0708 
KingdomBacteria 
Replicon accessionNC_014248 
Strand
Start bp5057873 
End bp5059792 
Gene Length1920 bp 
Protein Length639 aa 
Translation table11 
GC content39% 
IMG OID 
Productserine/threonine protein kinase 
Protein accessionYP_003723200 
Protein GI298493023 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.438728 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATAGGCA AACTACTAGA CCATCGGTAC CAAGTCATAC GCATTTTGGC AACGGGAGGA 
TTTGGTCAAA CTTATATTGC CCACGATACT AAACGACCTG GTAATCCCAT TTGCGTCGTC
AAACACCTCA AATCAGCTAG CACAGATCCC AAAATTTTTG AAACTGCTAA ACGTCTATTC
AATAACGAAG CGGAAACTTT AGAAAAGCTA GGTAATCATG ACCAAATACC CAGGCTACTC
GCTTATTTTG ACGAAAATGA AGAATTTTTC TTAGTCCAAG AATATATTGA TGGACATACC
CTAAGTGAAG AACTAATCCC CGGCGAACCT TGGAGTGAAA GCCAAGTCAT GCAAATGTTG
CTAGAAGTTC TGGGTATTTT AGAATTTGTC CATCAACAAG GTGTAATTCA CCGGGATATT
AAACCAGACA ATATCATTCG TCGGTCTTGT GATTATAGAT TGGTGTTGCT AGACTTTGGA
GCAGTCAAAC AATTGAGATC GCCTTTAGTC ACAGTTGCCG CACGGACTAC TGCTACTGTA
GCTATTGGTA CACCTAGCTA TATGCCTACA GAACAGGGAC AAGGTAAACC CCGTCCTAAT
AGTGATATTT ATTCCCTGGG GATTATTGCT ATTCAAACTT TAACCGGAGT ACCAGCCAAG
CAATTACAAG AAGATTCTAA TACCGGTGAA ATCCTCTGGC AGCATTTTCT TCCTGTTAAC
TACCACTTGG CAGAAATCTT AAGTAAGATG GTGCGTTATC ACTTCAAAGA CCGCTACCAA
ACAGCTACAG AAGCACTGCA AGCGTGTAGA GATTTTCTAA ATCTTACCTC TGGATATTCC
TTATCTTCAG AAGCTCCGAA ACAACTTAGC TACAAAGCCC ACAAATCTCC ATCTTTCCCA
CCCCCATCTC AAGTTGTTTC AGAGCACACT GTCGCTGTGG CCCCTACTAA TCCTGTAATT
AACAATGCAG TTAATAAACC TCTTACAGAA TCTAGTAAAC CTGATCCATT ACCTTTAATC
ATTGCCATCT TATTAGCAGG TAGTGCAGCG GCTTTGGTCA CAAATTTATA TCCCAATGTG
AAAAATATAG CTGCTAATTG GACAGCAAAA AGTAGCACGA AAAACTGCTT GGCTGTAGTC
ACAGCTAATT CTAATATCCG TTCTGAACCG AGTGCTATCA ATTCTGATAA TATTTTAAAA
ATAGTTGGTG ATGCTAGTGA ATTTGATATT ACTGGTAAGC GCACAAAACG TGGTTGGATA
GAACTGAAAC TAAAATCTGG ACGTTTGGCT TGGGCGCATT CAGATGTGAT TTCTAATAAT
GATCAATGGA TTTCTTGTTT AAGGGATCAA GGAATTTCCA TTCAAACCGT AGATGATCAG
CCTTTAATAG CATCTCGATC AACTCCTACA CCACAAGCAA AACCAATTAC TGTATTTACT
TCCATTACTT CCAAAGCGGA AATATCTGAT CAATCAACTT CTGAAACTGA GAGAAAGCCA
GCTAATGAGG ACAAACAGAA AATAGTAGGA CGTGCCAGAC AGAAGTTTGA GTCAGGTGAT
GTAAATGGTG CGATCACAAT TCTCAAATCA GTTCCAGCTA ATGCTGCTTC AGGAATCAAA
GAAACTCTTG AAATTATGAA TCAATGGCAT AAAGATTGGG AAAAAGCCGA AGCACTTGCT
AATGAAATTA ACCGAGCCAT TGATAACGGC CAATGGGATA AAGTCCTAGC TTATCGAGAT
CATCCAGAAA AGTTACCCAA TATTAAATAC TGGCAAGATA AATTAGACCC AATGTTTAAA
CAAGCAGCGC AAAATGCTGC TAAACAAGTT CTTCCCAAGG AAGATAATAA AATTTATGAA
AAGACAACAA CCACTACAAA AGCCCCTAGT GATACAGACA AGCCTGAAAG CACCTTTTAA
 
Protein sequence
MIGKLLDHRY QVIRILATGG FGQTYIAHDT KRPGNPICVV KHLKSASTDP KIFETAKRLF 
NNEAETLEKL GNHDQIPRLL AYFDENEEFF LVQEYIDGHT LSEELIPGEP WSESQVMQML
LEVLGILEFV HQQGVIHRDI KPDNIIRRSC DYRLVLLDFG AVKQLRSPLV TVAARTTATV
AIGTPSYMPT EQGQGKPRPN SDIYSLGIIA IQTLTGVPAK QLQEDSNTGE ILWQHFLPVN
YHLAEILSKM VRYHFKDRYQ TATEALQACR DFLNLTSGYS LSSEAPKQLS YKAHKSPSFP
PPSQVVSEHT VAVAPTNPVI NNAVNKPLTE SSKPDPLPLI IAILLAGSAA ALVTNLYPNV
KNIAANWTAK SSTKNCLAVV TANSNIRSEP SAINSDNILK IVGDASEFDI TGKRTKRGWI
ELKLKSGRLA WAHSDVISNN DQWISCLRDQ GISIQTVDDQ PLIASRSTPT PQAKPITVFT
SITSKAEISD QSTSETERKP ANEDKQKIVG RARQKFESGD VNGAITILKS VPANAASGIK
ETLEIMNQWH KDWEKAEALA NEINRAIDNG QWDKVLAYRD HPEKLPNIKY WQDKLDPMFK
QAAQNAAKQV LPKEDNKIYE KTTTTTKAPS DTDKPESTF