Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Aazo_3125 |
Symbol | |
ID | 9340928 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | 'Nostoc azollae' 0708 |
Kingdom | Bacteria |
Replicon accession | NC_014248 |
Strand | - |
Start bp | 3217855 |
End bp | 3220086 |
Gene Length | 2232 bp |
Protein Length | 743 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | |
Product | capsular exopolysaccharide family protein |
Protein accession | YP_003721986 |
Protein GI | 298491809 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.858566 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGTTCAAA CTAGTCTAAA TCCCCATATA AGTTCAGCAG CTGATACAGA AACAGGTTAT GGACAAATGT TTACTGTATT TGTCAGAAGA TTTCCCTGGT TTTTATTAGT ATTCCTGAGT TCTACTGCTC TTGCCTGCAT CATTACTTTA AAGACAAAGC CCAGTTTCAA AAGTACGATG CAACTGTTAG TAGAACCTAA CTATCAAGGT AAAAAACAAG GATCAGATAC AGCAGAAAGT CAGTTTACAG ACTCTAACGT TGTCATAGAC ACGGCAACAC AGCTTAATTT GATGCAAAGT TCTGGACTGA TTCAAAAAGC AGTTAATAAA CTCAAGTTAG AAGATCCAAG TATTACTGTA GAGGAAGTAA AGAGATCCTT AGTCTTAACC CAAATCAGGA CTAAAGAGGA TAATGTTGTC ACCAAAATTT TCCAAGTTGA CTATACCGAT GGCAATCCAG AAAGAACACA AAAAGTTCTA ACTGCTATTC GACGAGTCTA TGTGGATTAT AACAAACAAC AACAGGATTT ACGGTTACAA AAAGGTCTGC AAGTTATTAG AGAGCAGTTG CGAAAAGCTA GTGACGAAGT GAATGCTTCT GAAACCAATC TCCAACGATT TCGCAGAAAC CAAAATTTAA TTGATCCAGA ATTACAAGCC AAAGCGATTG AAGAATCTTT AACTTCTATT CAGAGAGAAA GACAAACAAC TCGTTCTCAA TATCAAGAAG CTTTAGCAAA GCAAAAGTCT TTGCAGCAAC AACTTAATCG TTCTCCTCAA AATGCATTGG TTTCTTCTCG TTTGAGTCAG TCTATACGCT ATCAGGGCTT ACTGAATGAA ATTCAGAAAA CAGAACTAGC TTTAGCACAG GAACGCTTAC GTTTTACGGA TGATACTCCC AACGTACAGA AGCTAACTGA ACAACTCCAG AGTCAAAAGG AACTTTTGCA AAAAGAAGTC AGCAGAACTT TAGGTAGACA ATCCACCAGT GTATTCAGTT TTGGAGAAAA TCTTCTCGAA CAAGGACAAC TTGGGGAAAT TGACCTAAAT TTGACTGGGG AATTAGTAGA AAATCAGACT AACATAGTTG CATTAAGTGC GCGTGATCAA ACTTTAGCAG CAAAAGAGAA TGAACTGCGA TCACAACTCA AACGTTTTCC CTCTCTCTTA GCTTATTACA ACCGTATCCT TCCCCAATTA CAATTTAGTC GAGAAAGAAT GGAGCAATTG TTGAGAGCAG AACAGCAGTT GCGGCAGGAA CTATCCAAAG GTGGATTTAA TTGGGAAGTA GTAGAAGAAC CACAACTAGG TATAAAACTA GGTCCCAATC TCCAGCAGAA CCTCTTATTA GGTGCCGTAG TAGGATTAAT GTTAGGTGGT ATTGCCGCCT TTATTCGAGA AACATCTGAT GACTCTGTAC ATACTACTGC TGAATTAGAA AAACAAATTA CCCTACCATT ATTGGGCACA ACTCCCAAAT TGCCACCAGC AAAACCCAAA GAGTCAATCA TCAAATTACC ATTTGGTAAA CCAGAAGTTC TTGCCCCTTG GACAATTCAA GTTTTACAAT CACCACCCCG TTGGGAATCA CTGGATCTCA TTTATAAAAA CATAGAACTC CTCAATAGCG TCACTGATTT AAAATCATTG ATGGTGACAT CAGCATTACC AGATGATGGT AAATCAGCTT TGACATTAGG TTTAGCTATG AGTGCAGCGA GATTACATAA AAAGGTACTG CTCATAGATG CCAACCTTCG AGACCCCAGC TTACACAAAC AACTTAACCT TCCTAATGAA CAAGGTTTAT CCACCCTATT AGCCAGTGAT GCTACCCTTC CTAATCAGAT TGGGCTTCAA TACTCAGGTT CATCTTATAT CGACATTTTA ACTGCTGGCC CCATACCTGT AGATCCTGCT CATCTTTTGA GTTCTCCACG GATGATGGAA TTGATGGCTG CATTTGAAGA AAACTATGAT TTAGTCCTCA TAGATGCTCC CTCAGTTATC GGTATGGTAG ATGCTATACT CACAGCCTCA TCTTGTCGGA GTGTAGTCAT GGTAGCAAGC ATTGGTAAAG TAACACGCAA TAATTTAGCT CAAGCTACAG CAATGTTGAG CAAGTTAAAT CTCATTGGAG TTGTAGCCAA TGGAGTATCT AACTCTGATA GCACTTATGT ACCTTATGCC AAACAATCAC AATTAATATT ACAAGAAGTT ATGGAAAAAT AG
|
Protein sequence | MVQTSLNPHI SSAADTETGY GQMFTVFVRR FPWFLLVFLS STALACIITL KTKPSFKSTM QLLVEPNYQG KKQGSDTAES QFTDSNVVID TATQLNLMQS SGLIQKAVNK LKLEDPSITV EEVKRSLVLT QIRTKEDNVV TKIFQVDYTD GNPERTQKVL TAIRRVYVDY NKQQQDLRLQ KGLQVIREQL RKASDEVNAS ETNLQRFRRN QNLIDPELQA KAIEESLTSI QRERQTTRSQ YQEALAKQKS LQQQLNRSPQ NALVSSRLSQ SIRYQGLLNE IQKTELALAQ ERLRFTDDTP NVQKLTEQLQ SQKELLQKEV SRTLGRQSTS VFSFGENLLE QGQLGEIDLN LTGELVENQT NIVALSARDQ TLAAKENELR SQLKRFPSLL AYYNRILPQL QFSRERMEQL LRAEQQLRQE LSKGGFNWEV VEEPQLGIKL GPNLQQNLLL GAVVGLMLGG IAAFIRETSD DSVHTTAELE KQITLPLLGT TPKLPPAKPK ESIIKLPFGK PEVLAPWTIQ VLQSPPRWES LDLIYKNIEL LNSVTDLKSL MVTSALPDDG KSALTLGLAM SAARLHKKVL LIDANLRDPS LHKQLNLPNE QGLSTLLASD ATLPNQIGLQ YSGSSYIDIL TAGPIPVDPA HLLSSPRMME LMAAFEENYD LVLIDAPSVI GMVDAILTAS SCRSVVMVAS IGKVTRNNLA QATAMLSKLN LIGVVANGVS NSDSTYVPYA KQSQLILQEV MEK
|
| |