Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Aazo_1999 |
Symbol | |
ID | 9339792 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | 'Nostoc azollae' 0708 |
Kingdom | Bacteria |
Replicon accession | NC_014248 |
Strand | - |
Start bp | 2077922 |
End bp | 2080399 |
Gene Length | 2478 bp |
Protein Length | 825 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | |
Product | MutS2 family protein |
Protein accession | YP_003721191 |
Protein GI | 298491014 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGATCCAGT CTGAAACCTT AGAACTACTA GAATGGCATC GCCTCTGCCA GCACCTTTCC ACCTTTGCGG CAACTAAGTT AGGGGCGATA GTTGCCCGTG CCTTGCCGAT ACCGACAACT CTAGAGGAAA GTAAAGAGTT GTTAGCACAA ACCAAAGAAG TCTATCAACT GGAAAGCCTG CTGACAAAAG GGTTATCCTT TGAGGGCATT CAAGATATTG GTGATTCCCT AGAACGTGCA GAACTACAAG GGATTTTGTC TGGTGAGGAA CTTTTGGCTA TCGCTACCAC CCTCGCTGGT GCAAGAAATT TACGTCGGTT AATTGACAAT CAGGAAGATA TACCAATTTT CGCTGAGTTA GTTGCCGAAT TACGAACTTA CCCGGAATTA GAGCAGGAAA TTCACCGTTG TATTGATGAA CGAGCACAAG TTACAGACCG CGCTAGTCAA AAATTAGGAG AGATTCGGGA ATATTTGCGG AAATCACGTG GTCAAATTAC CCAAAAACTG CACAATATCA TTCAAGCAAA ATCGGGAGCA TTACAAGAAC CTATTATCAC TCAACGGGGT AGTCGCTATG TTATCCCCGT AAAAGCACCA CAAAAAGATG CAGTTCCTGG TATTGTTCAC GATACATCCA CTAGTGGTGC AACTTTGTAT ATAGAACCTA ATAGCATTGT GTCAATGGGC AACCAACTGC GCCAAGCTAT TAGAAGAGAA CAGGCCGAAG AAGAAGCCAT TCGTCGCAGT TTAACAGAGC AAGTCGCCGC AGTTAAACCC GATTTAGAAA AGTTATTAGC AATTGTCACT ACTTTAGATA TAGCTACAGC TAAAGCTAGG TATAGTTTCT GGATAGGTGC TAATCCACCC CGGTTTGTAA ATCGTCAAGA ACAGCAAATA ATTACCCTGC GGCAATTACG CCATCCCCTG TTAGTTTGGC AACAACACCA TGAACAGGGA CATCCAGTAA TTCCTGTTGA TTTATTAATT AGTCCCCACA TAAAAGTCGT CACAATTACT GGACCCAATA CTGGTGGTAA AACTGTAACG TTAAGAACCT TGGGATTAGC AGCTTTAATG GCCAAAGTGG GCTTATTTGT TCCCGCCCGT GAACCAGTCG AGATTCCTTG GTTTGACCAG GTTTTAGCCG ATATTGGGGA TGAACAATCT TTACAACAAA GTTTATCAAC ATTTTCCGGG CATATTCGCC GTATTAGTCG GATTCTAAAT GCTTTAGGCA CTGGGGATCG GGGAGTCGGG ACTAGGGACT GGGAGATGGG GGAGATGAGA GGAGATGGGG GAGATGGGGA AGGAATTTTC CTAATGCCCA ATGCCCAATG CACCATGCCC AACTCCCTAG TCTTACTTGA TGAAGTGGGT GCGGGTACTG ATCCGGTAGA AGGTAGTGCA TTAGCGATCG CTCTTTTACA ATATCTCGCT GATCATGCCC AATTAACCAT TGCTACCACG CACTTTGGGG AATTGAAAGC TTTGAAATAT GAGGATATTC GCTTTGAAAA CGCCTCAGTA GAATTTAATG ACGCTACCCT TTCACCAACC TATCGGTTAT TGTGGGGTAT TCCTGGACGT TCCAACGCCT TAGCCATTGC CTTGCGTCTG GGATTAAAAC CAGAAGTGGT AGAAGCTGCA AAAAGTCAAG TAGGAGAAGC CACGGACGAA GTTAATCAAG TAATTGCAGG GTTAGAAGCA CAGCGGCGCA GTCAGGAAAC CAAAGCGGCG GAAGCGCAAG AATTATTACG TCAAGCCGAA AAATTATACA AAGAAGTTTC CCAAAAAGCC ACAGCTTTAC AAGAACGAGA AAAAGATTTG CGTGCTTCCC AAGAAGTAGC AGTACAGCAA GCAATCATCC AAGCTAAAGG AGAAATTGCC GAAGTTATTC GCCGTTTGCA ACAAGGAAAA CCCACCGCCC AACATGCCCA GGAAGCAACT AGTAAGTTAA GTGAAATTGC GGAGAGATAT CAACCCACTC CACCACCCAA ACCCAAACCA GGGTTTATGC CCAAAGTAGG AGATCGCATC CGTATTCGTA AACTAGGGCA AACCGCGGAA GTGTTAACAG CCCCTAATAC AGATGGAGAA TTCAGTGTGC GCTTTGGCAT CATGAAAATG ATGGTGCAAT TACAAGACAT AGAGTCCTTA GAAGGACAGA AACCAGAACC CATTGCTAAA CCGAAACCAG CACCAGCCGT TACCACACCA CCAGCACCAG CTTTAGCTAT TCGTACCTCG AGAAATACCG TTGATTTGCG CGGAAAAAGG GTAGTTGATG CCGAATATAT CTTAGAAAAA GCGATTTCCG AGGCTGATGG ACCTTTATGG ATTATTCATG GATATGGTAC AGGTAAGTTA AAGCAAGGAG TTCACGCTTT TTTGCATCAA CATCCCAGAG TCAGCCACCA CGAACCCGCA GAACAAGCAG ATGGCGGGAC TGGTGTCACA ATTGCTCATG TGGAATAG
|
Protein sequence | MIQSETLELL EWHRLCQHLS TFAATKLGAI VARALPIPTT LEESKELLAQ TKEVYQLESL LTKGLSFEGI QDIGDSLERA ELQGILSGEE LLAIATTLAG ARNLRRLIDN QEDIPIFAEL VAELRTYPEL EQEIHRCIDE RAQVTDRASQ KLGEIREYLR KSRGQITQKL HNIIQAKSGA LQEPIITQRG SRYVIPVKAP QKDAVPGIVH DTSTSGATLY IEPNSIVSMG NQLRQAIRRE QAEEEAIRRS LTEQVAAVKP DLEKLLAIVT TLDIATAKAR YSFWIGANPP RFVNRQEQQI ITLRQLRHPL LVWQQHHEQG HPVIPVDLLI SPHIKVVTIT GPNTGGKTVT LRTLGLAALM AKVGLFVPAR EPVEIPWFDQ VLADIGDEQS LQQSLSTFSG HIRRISRILN ALGTGDRGVG TRDWEMGEMR GDGGDGEGIF LMPNAQCTMP NSLVLLDEVG AGTDPVEGSA LAIALLQYLA DHAQLTIATT HFGELKALKY EDIRFENASV EFNDATLSPT YRLLWGIPGR SNALAIALRL GLKPEVVEAA KSQVGEATDE VNQVIAGLEA QRRSQETKAA EAQELLRQAE KLYKEVSQKA TALQEREKDL RASQEVAVQQ AIIQAKGEIA EVIRRLQQGK PTAQHAQEAT SKLSEIAERY QPTPPPKPKP GFMPKVGDRI RIRKLGQTAE VLTAPNTDGE FSVRFGIMKM MVQLQDIESL EGQKPEPIAK PKPAPAVTTP PAPALAIRTS RNTVDLRGKR VVDAEYILEK AISEADGPLW IIHGYGTGKL KQGVHAFLHQ HPRVSHHEPA EQADGGTGVT IAHVE
|
| |