Gene Aazo_4925 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAazo_4925 
Symbol 
ID9342732 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism name'Nostoc azollae' 0708 
KingdomBacteria 
Replicon accessionNC_014248 
Strand
Start bp5040709 
End bp5042928 
Gene Length2220 bp 
Protein Length739 aa 
Translation table11 
GC content38% 
IMG OID 
Productcapsular exopolysaccharide family protein 
Protein accessionYP_003723183 
Protein GI298493006 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.501733 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACAAGC AGAATGCTAG TAGAATCTCA ACTTTTAATG AAAATGGTAG GCCAATAATT 
ACGCCTGAAA TATTGCCATA TCAGAACTTT TCCTACTTGG AATCTGAAGA AGATGAAGGC
AGTCTAAAAA ACATCTTAGC TGTTGTTCAA AGACGGGTCT TAATAATTGC TGGCGTTGTA
TCTGTAGTCA TGGCTACTGT TATTTATTCA ACGGTCAACG AAGAAACCAT TTATCAAGGC
AATTTTCAGA TTTTAGTTGA ACCTGTTAAT AGCGATAGTA CATTAGGCCA AATACCACTG
CCTGATTCTA CTATCCCTAA ATCTAATCTA GATTACGAAA GTCAAATTCA GGTTCTCAGA
AGCCAGGAAT TGATGAAAGA TATTTTGCCA AATTTACAAT CTTCCTATCC TGATATTACC
TATAAATCAC TGCTCAAAAA TTTAACTATC CGACGTTTGG GTTCAACCAA AGTTATAGAA
ATAAGCTATA AAAACCGAGA TCGCAAGAAA ATTGAAATTG TATTAAATAC ACTTTCTAAC
TTCTATCTAG AATACAGTTT AGAAAAACGT AAAACCAAGT TAAATCAGGG AGTTCAATTC
GTAGAAAAAC AATTACCTGC TATCAGAAAT CGGGTAGCTC AATTGCAACA ACAATTGCAA
ATATTCCGAC AAAGATACGA CTTTCTGGAT CCAGAAAACC AGTCTGGTCT AGTTGCAACA
CAACTCCAAC AATTGGAAGA GCAGCGACTA GGAATTGATC AACAGTTAGC TACAGCACGA
GCCAGTTATG TAGGCTTATC AACACCACAA GGACAACAGG CAACCTTAAA TCAAGCGCCC
ATATATAGTC AATTAGTATC TCAATTGAGG CAGTTGGAAA CTCAGCTATC TGTAGAATTA
GCACGTTTTC AACCGGATAG TCCATCCATA TCAGTATTAG AAGAAAAAAG AGAAAATATT
TTGCCTCTCA TAGAAGACGA GCAAAAGCGG TATATAGGGT TAAAATTCGC TGAAGCAATT
ACTCTGATTC AGAAACTAGA GGTACAAAGT CAAGAACTAG CTAAAGTAGA AAACCAAACA
AAAATCAAAC TTGAGCAATT ACCAATTTTA GCCAGACAAT ACAGTGAAAT TCAGCGGAAT
TTGCAACTAG CAAATGAAAG TCTCAATCGG TTTTTAGCTA CTCGTGAAAG CCTGGTAATT
CAAGTAGCTC AAACAGAATT GCCGTGGGAA TTAATACAAC CACCAAATCA AGGGGAATTA
CCTATATTAC CAAATATACC CCGGAATTTA CTAATGGGAT TATTTAGCAG TTTAGCTTTG
GGAATTGCTA TTGGCTTCCT ATTGGAAAAA CTAGATAATA CGTATCATGA TATTGGCAGT
CTGAAGGAAA AAATAAAATT GCCCTTCCTG GGAACTCTTC CCATCGACAG AAGCGTTGCA
GGTTATCAAT CTTCTTACCT CAATTTCACT TCTGGTTCAC AATCTGAATC CCGTAAGGGC
TCTGAAGTAA ATGGTTGGTT ATCTAATCTT TTCCGTCGTC AGAGTAAAGT CTATAACTAT
TATGGACAAG GGCTATTTTG GGAATCATTA CAGGTTCTTT ATGCCAATAT TCAACTGCTA
AATTCTGATC AACCAATTCG TTCTTTAACT ATTTCTTCCA CTATGCCAGG AGATGGGAAA
ACCACAGTTT CTTTCCATCT TGCACAAATA GCAGCAGCCT TGGGGAAACG AGTCTTACTT
GTGGATGGTG ACTTACGACG CGCCCAGGTT CATAAATTAT CAAATTTGCA GAATTTGTCA
GGTTTAAGTA ATGTTCTTAC TTCCAATATG CCAGTTGAAC AGGTGATTCA ACAATTACCT
GAGATGAGTT CATTATCTGT AATTACGGCT GGTTCAGTCC CACCAGATCC AGCCAGATTG
CTGGCATCAG ATAAGATGAA ACAACTGATG GAATACTTCA ATGAGAATTT TGATTTGGTA
ATTTATGATG CTCCTCCCAT GCTGGGGTTA GTTGATGCCA GACTACTAGC ACCTCAGACT
GATGGTATGC TGCTGGTGGT GAGGATAGAC AAAACAGATA AGTCGGCTAT GATGCAACTT
CAGGATAGCT TGATAAACTC TCCCATCAAT GTCTTAGGTG TGGTTGCTAA CGGGGATAAG
CAAAAACTTA CTAGTTACAA TTACTACTAT AGTGCTGGTA GGGAAGCTAG ACAACCTTAA
 
Protein sequence
MNKQNASRIS TFNENGRPII TPEILPYQNF SYLESEEDEG SLKNILAVVQ RRVLIIAGVV 
SVVMATVIYS TVNEETIYQG NFQILVEPVN SDSTLGQIPL PDSTIPKSNL DYESQIQVLR
SQELMKDILP NLQSSYPDIT YKSLLKNLTI RRLGSTKVIE ISYKNRDRKK IEIVLNTLSN
FYLEYSLEKR KTKLNQGVQF VEKQLPAIRN RVAQLQQQLQ IFRQRYDFLD PENQSGLVAT
QLQQLEEQRL GIDQQLATAR ASYVGLSTPQ GQQATLNQAP IYSQLVSQLR QLETQLSVEL
ARFQPDSPSI SVLEEKRENI LPLIEDEQKR YIGLKFAEAI TLIQKLEVQS QELAKVENQT
KIKLEQLPIL ARQYSEIQRN LQLANESLNR FLATRESLVI QVAQTELPWE LIQPPNQGEL
PILPNIPRNL LMGLFSSLAL GIAIGFLLEK LDNTYHDIGS LKEKIKLPFL GTLPIDRSVA
GYQSSYLNFT SGSQSESRKG SEVNGWLSNL FRRQSKVYNY YGQGLFWESL QVLYANIQLL
NSDQPIRSLT ISSTMPGDGK TTVSFHLAQI AAALGKRVLL VDGDLRRAQV HKLSNLQNLS
GLSNVLTSNM PVEQVIQQLP EMSSLSVITA GSVPPDPARL LASDKMKQLM EYFNENFDLV
IYDAPPMLGL VDARLLAPQT DGMLLVVRID KTDKSAMMQL QDSLINSPIN VLGVVANGDK
QKLTSYNYYY SAGREARQP