Gene Aazo_4022 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAazo_4022 
Symbol 
ID9341826 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism name'Nostoc azollae' 0708 
KingdomBacteria 
Replicon accessionNC_014248 
Strand
Start bp4081966 
End bp4083894 
Gene Length1929 bp 
Protein Length642 aa 
Translation table11 
GC content42% 
IMG OID 
Productprotein serine/threonine phosphatase 
Protein accessionYP_003722616 
Protein GI298492439 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTGATTT GCCCTCACTG TCAATTTGAA AACCCCAATG ACAACAAGTT TTGTCAAAAT 
TGTGGCACGT CTTTAACTCA TAGGGCTTGT TATAAATGTG GATATGAAGT TTCTGTAGAT
ACACAAAACT GTCAAAACTG TGGTGCAGAA TGCGGCACAA TTTGGTGGGC AATTATTACC
CAAGAAACAC CTGTTATAGT TTCAGAATTT GGGCATGCAC TAACACAAAT GCAGGGAAAT
ATAAACTCTC AGCCACAATC TGCGGTAGGC TCATTTTTAG ATCGAGAACA ACGCTATCAG
TTACTGGCAC CATTACCCAG TCGAGAGGTT ATGGCGACTA ATACTGAACT TAAAGTTAGG
GTTTTAGATT GCCAACCCTA TCAAATATCA CCTATTGAGG CGATGCTAGG AAATCAACCA
GATGGATTGA TTGTGCCATC CCTCAGTGTG ATTGGAATTC CTAATTTAGC GAAAGCTTAT
GTGGTATTAC AACCAGAAAC TCCATCCGTA ATACCACTAA TCCACGATGC ATGGCAGCAA
GATAATATAC AGGTGATTTT GATTGAAGAC CGTTCTCATT GGCAGTTGTT ATTGGATGCG
TGGCAAGATC AGACAACCCA TCCTTTACAA ATTCTGCACT GGTGTTATCA GATGATTCAT
CTTTGGGGAG TGCTGGAACG GGTGAATTGT CGCCAAAGTT TGTTGGAATT ATCTAATTTG
CGACTGGATG AAGATCAAAC ATTGGGATTA CAACGGTTGC ATCTGGAAAT ACCTGGGAGT
AATTTATCAG AACATGCGGA GCAACCCTTA ACCATCAAGG CGTTGGGACG AGTTTGGCAG
ACTTTATGTA AACAGTCCCA ACGCACTCAA ATTGGTTCGG TTGTAAAGAT GTTGGAGGAT
TTAGAAATTG GTAAGATTGA AAATTTGGAA GAATTGCGAT CGCGATTGGA AGAAATAGCC
ACGGAACTAG AACCACCTGC CAATCATGAT TTTAGTTCTA CAGAGGCAAA ATATCCTTCG
GCATCCACCA TATTGCAAGA TGATGATGAT GGTGAAGAAA TCGATAAAAA CGAAGATGCA
CCAACAATTA TATTGTCAAT GCAGTTAAGG AGTGTGGAAA ATACAGGACG CACCGACGTT
GGTCGTCAAA GAAATCACAA TGAAGACTAT TTTGGCATTA ACAGCAAGCA GCAAAAACTG
GAATTGCCCA GAAGTCGGAA TTTGCAAGCA CGAGGTTTGT ATATTTTGTG TGATGGTATG
GGAGGACACG CTGCTGGTGA AGTAGCCAGT GAGTTAGCTG TAAGTACTCT GCAACAATAC
TTTCAAGAAA ACTGGGTGAC AGCAGAACTC CCAACCGCAG AAAGTATCCA AAAAGCAGTT
TTTTTGGCTA ATGAGACGAT CTACAATCTC AATCAACAAG GACTGCGTTC TGGTGTGGGG
CGCATGGGTA CTACCTTGGT GATGTTATTA ATGCATAATA CCAGAGCCGC AGTTGCTCAT
GTTGGTGATA GTCGTCTTTA TCGTTGCACA GGCCAACGGG GACTAGAACA AATAACTGTA
GATCACGAAG TTGGCCAGCG GGAAATAGCC AAAGGTGTAG AATCTGCTAT TGCTTATGCT
CGTCCAGATG CCTACCAACT CACCCAAGCT ATTGGACCAC GAGATGCAAA TTTCGTCCAT
CCAGAGGTAG ATTTTTTTGA AATTAGTGAA GATACTTTGC TGCTGTTAGC TTCGGACGGT
TTATCAGATA ATGATTTAGT AGAAAATAAT TGGCAGACTC ATCTAGAACC TCTGCTTAGT
TCTGGCGCTA ATCTAGAATC AGGTCTTAAC AACTTAATTG ATTTAGCGAA CGAATATAAT
GGCCACGACA ACATTACAGG TATACTGATA CGGGTAAAGG TTTCTCCAAA TCAGCAAAGT
TGGCATTAA
 
Protein sequence
MLICPHCQFE NPNDNKFCQN CGTSLTHRAC YKCGYEVSVD TQNCQNCGAE CGTIWWAIIT 
QETPVIVSEF GHALTQMQGN INSQPQSAVG SFLDREQRYQ LLAPLPSREV MATNTELKVR
VLDCQPYQIS PIEAMLGNQP DGLIVPSLSV IGIPNLAKAY VVLQPETPSV IPLIHDAWQQ
DNIQVILIED RSHWQLLLDA WQDQTTHPLQ ILHWCYQMIH LWGVLERVNC RQSLLELSNL
RLDEDQTLGL QRLHLEIPGS NLSEHAEQPL TIKALGRVWQ TLCKQSQRTQ IGSVVKMLED
LEIGKIENLE ELRSRLEEIA TELEPPANHD FSSTEAKYPS ASTILQDDDD GEEIDKNEDA
PTIILSMQLR SVENTGRTDV GRQRNHNEDY FGINSKQQKL ELPRSRNLQA RGLYILCDGM
GGHAAGEVAS ELAVSTLQQY FQENWVTAEL PTAESIQKAV FLANETIYNL NQQGLRSGVG
RMGTTLVMLL MHNTRAAVAH VGDSRLYRCT GQRGLEQITV DHEVGQREIA KGVESAIAYA
RPDAYQLTQA IGPRDANFVH PEVDFFEISE DTLLLLASDG LSDNDLVENN WQTHLEPLLS
SGANLESGLN NLIDLANEYN GHDNITGILI RVKVSPNQQS WH