Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Aazo_4925 |
Symbol | |
ID | 9342732 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | 'Nostoc azollae' 0708 |
Kingdom | Bacteria |
Replicon accession | NC_014248 |
Strand | + |
Start bp | 5040709 |
End bp | 5042928 |
Gene Length | 2220 bp |
Protein Length | 739 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | |
Product | capsular exopolysaccharide family protein |
Protein accession | YP_003723183 |
Protein GI | 298493006 |
COG category | |
COG ID | |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.501733 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACAAGC AGAATGCTAG TAGAATCTCA ACTTTTAATG AAAATGGTAG GCCAATAATT ACGCCTGAAA TATTGCCATA TCAGAACTTT TCCTACTTGG AATCTGAAGA AGATGAAGGC AGTCTAAAAA ACATCTTAGC TGTTGTTCAA AGACGGGTCT TAATAATTGC TGGCGTTGTA TCTGTAGTCA TGGCTACTGT TATTTATTCA ACGGTCAACG AAGAAACCAT TTATCAAGGC AATTTTCAGA TTTTAGTTGA ACCTGTTAAT AGCGATAGTA CATTAGGCCA AATACCACTG CCTGATTCTA CTATCCCTAA ATCTAATCTA GATTACGAAA GTCAAATTCA GGTTCTCAGA AGCCAGGAAT TGATGAAAGA TATTTTGCCA AATTTACAAT CTTCCTATCC TGATATTACC TATAAATCAC TGCTCAAAAA TTTAACTATC CGACGTTTGG GTTCAACCAA AGTTATAGAA ATAAGCTATA AAAACCGAGA TCGCAAGAAA ATTGAAATTG TATTAAATAC ACTTTCTAAC TTCTATCTAG AATACAGTTT AGAAAAACGT AAAACCAAGT TAAATCAGGG AGTTCAATTC GTAGAAAAAC AATTACCTGC TATCAGAAAT CGGGTAGCTC AATTGCAACA ACAATTGCAA ATATTCCGAC AAAGATACGA CTTTCTGGAT CCAGAAAACC AGTCTGGTCT AGTTGCAACA CAACTCCAAC AATTGGAAGA GCAGCGACTA GGAATTGATC AACAGTTAGC TACAGCACGA GCCAGTTATG TAGGCTTATC AACACCACAA GGACAACAGG CAACCTTAAA TCAAGCGCCC ATATATAGTC AATTAGTATC TCAATTGAGG CAGTTGGAAA CTCAGCTATC TGTAGAATTA GCACGTTTTC AACCGGATAG TCCATCCATA TCAGTATTAG AAGAAAAAAG AGAAAATATT TTGCCTCTCA TAGAAGACGA GCAAAAGCGG TATATAGGGT TAAAATTCGC TGAAGCAATT ACTCTGATTC AGAAACTAGA GGTACAAAGT CAAGAACTAG CTAAAGTAGA AAACCAAACA AAAATCAAAC TTGAGCAATT ACCAATTTTA GCCAGACAAT ACAGTGAAAT TCAGCGGAAT TTGCAACTAG CAAATGAAAG TCTCAATCGG TTTTTAGCTA CTCGTGAAAG CCTGGTAATT CAAGTAGCTC AAACAGAATT GCCGTGGGAA TTAATACAAC CACCAAATCA AGGGGAATTA CCTATATTAC CAAATATACC CCGGAATTTA CTAATGGGAT TATTTAGCAG TTTAGCTTTG GGAATTGCTA TTGGCTTCCT ATTGGAAAAA CTAGATAATA CGTATCATGA TATTGGCAGT CTGAAGGAAA AAATAAAATT GCCCTTCCTG GGAACTCTTC CCATCGACAG AAGCGTTGCA GGTTATCAAT CTTCTTACCT CAATTTCACT TCTGGTTCAC AATCTGAATC CCGTAAGGGC TCTGAAGTAA ATGGTTGGTT ATCTAATCTT TTCCGTCGTC AGAGTAAAGT CTATAACTAT TATGGACAAG GGCTATTTTG GGAATCATTA CAGGTTCTTT ATGCCAATAT TCAACTGCTA AATTCTGATC AACCAATTCG TTCTTTAACT ATTTCTTCCA CTATGCCAGG AGATGGGAAA ACCACAGTTT CTTTCCATCT TGCACAAATA GCAGCAGCCT TGGGGAAACG AGTCTTACTT GTGGATGGTG ACTTACGACG CGCCCAGGTT CATAAATTAT CAAATTTGCA GAATTTGTCA GGTTTAAGTA ATGTTCTTAC TTCCAATATG CCAGTTGAAC AGGTGATTCA ACAATTACCT GAGATGAGTT CATTATCTGT AATTACGGCT GGTTCAGTCC CACCAGATCC AGCCAGATTG CTGGCATCAG ATAAGATGAA ACAACTGATG GAATACTTCA ATGAGAATTT TGATTTGGTA ATTTATGATG CTCCTCCCAT GCTGGGGTTA GTTGATGCCA GACTACTAGC ACCTCAGACT GATGGTATGC TGCTGGTGGT GAGGATAGAC AAAACAGATA AGTCGGCTAT GATGCAACTT CAGGATAGCT TGATAAACTC TCCCATCAAT GTCTTAGGTG TGGTTGCTAA CGGGGATAAG CAAAAACTTA CTAGTTACAA TTACTACTAT AGTGCTGGTA GGGAAGCTAG ACAACCTTAA
|
Protein sequence | MNKQNASRIS TFNENGRPII TPEILPYQNF SYLESEEDEG SLKNILAVVQ RRVLIIAGVV SVVMATVIYS TVNEETIYQG NFQILVEPVN SDSTLGQIPL PDSTIPKSNL DYESQIQVLR SQELMKDILP NLQSSYPDIT YKSLLKNLTI RRLGSTKVIE ISYKNRDRKK IEIVLNTLSN FYLEYSLEKR KTKLNQGVQF VEKQLPAIRN RVAQLQQQLQ IFRQRYDFLD PENQSGLVAT QLQQLEEQRL GIDQQLATAR ASYVGLSTPQ GQQATLNQAP IYSQLVSQLR QLETQLSVEL ARFQPDSPSI SVLEEKRENI LPLIEDEQKR YIGLKFAEAI TLIQKLEVQS QELAKVENQT KIKLEQLPIL ARQYSEIQRN LQLANESLNR FLATRESLVI QVAQTELPWE LIQPPNQGEL PILPNIPRNL LMGLFSSLAL GIAIGFLLEK LDNTYHDIGS LKEKIKLPFL GTLPIDRSVA GYQSSYLNFT SGSQSESRKG SEVNGWLSNL FRRQSKVYNY YGQGLFWESL QVLYANIQLL NSDQPIRSLT ISSTMPGDGK TTVSFHLAQI AAALGKRVLL VDGDLRRAQV HKLSNLQNLS GLSNVLTSNM PVEQVIQQLP EMSSLSVITA GSVPPDPARL LASDKMKQLM EYFNENFDLV IYDAPPMLGL VDARLLAPQT DGMLLVVRID KTDKSAMMQL QDSLINSPIN VLGVVANGDK QKLTSYNYYY SAGREARQP
|
| |