Gene Aazo_2839 details


Gene Information

Locus tag: Aazo_2839
Symbol:
ID: 9340639
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: 'Nostoc azollae' 0708
Kingdom: Bacteria
Replicon accession: NC_014248
Strand:
Start bp: 2919898
End bp: 2921571
Gene Length: 1674 bp
Protein Length: 557 aa
Translation table: 11
GC content: 41%
IMG OID:
Product: GAF(s) sensor(s)-containing protein serine phosphatase
Protein accession: YP_003721805
Protein GI: 298491628
COG category:
COG ID:
TIGRFAM ID:
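
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming 1-based inclusive coordinates and a CDS that ends in a single stop codon which is not counted in the protein length; the function names are hypothetical and not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


# Values taken from the record above.
length_bp = gene_length(2919898, 2921571)
print(length_bp)                           # 1674
print(expected_protein_length(length_bp))  # 557
```

Both results agree with the Gene Length and Protein Length fields of the record.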


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACTGAAA CAGAGGTAGA AAAGCTCAAA CTCATGGTGG TAGATGACGA GCCAGATAAC 
TTAGATTTAC TCTACCGCAC CTTTTGGCGC ACTTTTAAAG TATACAAAGC TAATGATGCC
CATGAAGCTT TGGCTATTTT AGATCAAGTG GGAGAAATGG CTGTGATTAT CTCTGACCAA
AGAATGCCAG AGATGAATGG CACAGAATTG TTCAGTCGCA CAGTAGAACG TTTTCCCGAT
ACAATTCGGA TTTTACTGAC AGGTTTTACC GATGTTGAAG ATTTGGTAGA TGCAATCAAC
TCAGGTCAAG TATTCAAATA CATCACAAAA CCTTGGAAAC CTGACCAACT CAAAGCATTA
GTCGAGCAAG GAAAGGATAC ATACAGGCTA GTAAAAAAAC GTACAAAAGA ACTACGTCAT
GCCCTCCGAA GAGAGTCTTT ATTTAACGCC GTCACAACCA CAATTCGGGA GTCTTTGGAC
TATAACAGCA TCTTGCAAAA GATAGTAGTT ACCATTGGAC AAACATTTAC AGCTACAAGT
TGTGTACTGA GATTGGTAGA GGGTGATCGC TTGACACCAA ACCAGTTTTT CTATCATGAT
CCCAAATTCT CAGATATCAC ACTCCCCTTT GATTCCAACC CTTTAATTGA AGAAGTCCTC
TACACCCAAA AATATCAATT AACCCAAGAT ACACATCATG ACAATTCTTC TCACAGTCTA
GTAGTACCAT TTACCTACCA ACAGCATCTA CTAGCTGTCA TGACCCTCTA TAAATGGGGA
AGTGAAAATA TTTGGCAAGA TGAAGATATC CAAATGATTA CAGGTGTAGC AGAACAAGCA
GCCTTAGCTC TCTCCCAAGC AAAACTCTAC CAAAGCCTGC AAGAAAAACA ACAACAAATC
CGCGTAGAAT TGCAAGTAGC CCGCCAAATT CAAAATAATC TACTCCGTCA AACCCTACCA
GAAATTGACG GTGTAAAAAT ACAAGCCTGC TGTTACCCTG CACGAGAAGT AGGAGGAGAT
TTTTTTGAAG TCTTTGTTCA TCCCAAAGGT GACTTATGGT TAGCAGTCGG TGATGTTTCC
GGTAAAGGAG TACCCGCCGC TCTATTTATG GCCAGTGCTA TTTCCCTATT GCGTCGGGAA
TTATCTCAAG AATCACCAGC CGAGCCCAAT ATAGTCATGC AGAATCTCAA CTATGCTCTA
GCTGATGACT TAATGAGTAA CAATTATTTT ATTACTCTTG TTATTGCAAC TTACAACAAA
GTCACCAAAG AACTTGTTTA TGCTAATGCT GGACACATCT ATCCCTTATT ATGGTCACAT
CAAGCAACTT TAGTTACTCA ACCTTATTAC CTCAAAGAAC GTAGCATCCC GCTAGGGATT
TTACCAAAAT GGCACGCCAA CCCTGGTCGA TTTACTCTTG CACCCGGAGA CACATTACTA
CTAGCCAGTG ATGGCATTAC AGAAGCAAAA GTATCAAAAA AACCTCATTT ACATAAAGAT
AACAATGGTA TCTGTCAGCA AACACACAGT TCCATGCTCA ACCAAGAAGG ACTTTGGCAA
CTTATACAAG AGGAACCTCA ACCATTGTCT CTGGAAAATT TGCTAAATCG TATTCAAACA
GACAACCATG TCCAAGAAGA TGACCAAACT ATACTCTCCC TGGAGGTTCT GTAA
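
The gene sequence is listed in blocks of 10 bases separated by spaces, so it has to be joined before it can be measured. The sketch below shows one way to strip the block formatting and compute a GC percentage such as the one reported in the GC content field. The helper names `clean_sequence` and `gc_percent` are hypothetical, and only the first line of the listing is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a blocked listing and uppercase it."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence listing, used here as sample input only.
FIRST_LINE = "ATGACTGAAA CAGAGGTAGA AAAGCTCAAA CTCATGGTGG TAGATGACGA GCCAGATAAC"

seq = clean_sequence(FIRST_LINE)
print(len(seq))  # 60 bases per full line
print(round(gc_percent(seq), 1))
```

Passing the full listing to `clean_sequence` instead of one line would give the sequence whose length and GC percentage can be compared with the Gene Length and GC content fields.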
 
Protein sequence
MTETEVEKLK LMVVDDEPDN LDLLYRTFWR TFKVYKANDA HEALAILDQV GEMAVIISDQ 
RMPEMNGTEL FSRTVERFPD TIRILLTGFT DVEDLVDAIN SGQVFKYITK PWKPDQLKAL
VEQGKDTYRL VKKRTKELRH ALRRESLFNA VTTTIRESLD YNSILQKIVV TIGQTFTATS
CVLRLVEGDR LTPNQFFYHD PKFSDITLPF DSNPLIEEVL YTQKYQLTQD THHDNSSHSL
VVPFTYQQHL LAVMTLYKWG SENIWQDEDI QMITGVAEQA ALALSQAKLY QSLQEKQQQI
RVELQVARQI QNNLLRQTLP EIDGVKIQAC CYPAREVGGD FFEVFVHPKG DLWLAVGDVS
GKGVPAALFM ASAISLLRRE LSQESPAEPN IVMQNLNYAL ADDLMSNNYF ITLVIATYNK
VTKELVYANA GHIYPLLWSH QATLVTQPYY LKERSIPLGI LPKWHANPGR FTLAPGDTLL
LASDGITEAK VSKKPHLHKD NNGICQQTHS SMLNQEGLWQ LIQEEPQPLS LENLLNRIQT
DNHVQEDDQT ILSLEVL
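
The protein sequence corresponds to a translation of the gene sequence under translation table 11. As a minimal sketch, the code below builds a codon table from the standard compact amino acid string (table 11 assigns sense codons the same amino acids as the standard code and differs in its permitted start codons) and translates the first 30 bases of the gene. The names `CODON_TABLE` and `translate` are hypothetical, and alternative start codons are not handled.

```python
from itertools import product

# Codons in T, C, A, G order paired with the standard compact amino acid string.
_BASES = "TCAG"
_AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(_BASES, repeat=3), _AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a nucleotide sequence in frame 1, stopping at the first stop codon."""
    seq = "".join(seq.split()).upper()  # drop block spacing and line breaks
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks an unrecognized codon
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three 10-base blocks of the gene sequence listing.
print(translate("ATGACTGAAA CAGAGGTAGA AAAGCTCAAA"))  # MTETEVEKLK
```

The output matches the first block of the protein sequence listing.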