Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Aazo_1519 |
Symbol | |
ID | 9339311 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | 'Nostoc azollae' 0708 |
Kingdom | Bacteria |
Replicon accession | NC_014248 |
Strand | + |
Start bp | 1591619 |
End bp | 1594183 |
Gene Length | 2565 bp |
Protein Length | 854 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | |
Product | DNA mismatch repair protein MutS |
Protein accession | YP_003720844 |
Protein GI | 298490667 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.00861946 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGACTC CGACACCAGA ACACAATCAA CCAAATACAC CTCTTAGGGA TACGAAACTG GTAGAAGACC GCAGCAAGCT GAGTAAGATG TATCAGCACT ATGTAGAAAT GAAGGATAAA TATCCTCATG CGTTGCTACT ATATCGCGTC GGTGATTTTT TTGAAACATT TTTCCAAGAC GCTGTAACCG TATCCAGAGA ATTAGAATTA GTTTTAACCA GTAAACATGG TGGTGAGCTT GGTCGTATTG CTATGACTGG TGTACCTCAT CACGCTTGGG AACGCTACAC AACTCTACTC GTAGAAAAAG GTTACGCTGT GGTGATTTGT GACCAAGTTG AAGATTCATC GGAAGCAGTG GGTTTGGTAA AACGCGAAGT AACCCGCATC CTTACCCCTG GGACTTTGTT GGAAGAAGGA ATGCTACAAA CAAGTCGCAA TAATTATTTG GCGGCTGTGG TAATTGCTGT CAATCATTGG GGGTTGGCTT ATGCAGATAT ATCAACTGGA GAATTTCTCA CCTGTGAAGG TAGTGATTTA GAACACTTGA CCCAAGAGTT AATGCGATTG CAACCATCGG AAGTGTTGTT TCCTACCAAC GCCCCCGATT TAGGTACTTT ACTGCGTCCA GGGGAAACTT CGCCATCTCT TCCCCAATGT TTACCACCTA CATTTTGTTA CAGTTTGCGA TCGCAACTTC CCTTTTCTCA ATCCGAAGCT AGAAGTAAAT TATTGCAGAA ATTCAAACTG CGATCGCTCG AAGGCTTAGG TTGTGAACAT CTTCCCCTCG CAGTTCGCGC TGCTGGTGGA CTTTTGGAAT ATATCGAAGA TACCCAAAAA CAAAACCCCG TTCCTCTGCA ACTATTACAC ACCTACAGCC TAACCGATTA TCTTATCGTT GACCACCAAA CCCGACGTAA CTTAGAAATT ACCCAAACCG TGCGCGATGG CACATTTCAC GGTTCTTTGT TATGGGCTTT AAACCGCACT ACTACCGCCA TGGGTGGACG AGCCTTAAGA AGATGGTTGT TACAACCGCT ACTTGATATT AAAGGCATTC GAGCGCGACA ATATACCATT CAAGAATTAT GTGAAAATAC TCCTTTACGT CAGGATTTAC GGAGATTATT ACGGAAGATC TATGATTTAG AACGTTTAAC AGGTCGTGCA GGTTCAGGAA CAGCCAATGC ACGGGATTTA ATGGCTTTAG CTGATTCTTT CTCTAGTTTA CCTGAATTAT CTCGTATAGT AGAAGATGCG CGCTCGCCAT TCTTGAAAGC TTTGCAAAAA GTACCACCTG TACTAGAGGA ACTAGCAGAA AGGTTACAGG CTCACATCGT AGAATCACCA CCAATCCATC TCAAAGAAGG TAGTTTGATT CGCCCCGGGA TCAACCCGCT TTTAGATGAA AGAAAAGCTA CTGTAGAAGC GGATCAAAAA TGGATTGCAA ATCTAGAAGT TGATGAAAGA GCGAGAACAA GAATCCCGAC TTTAAAGGTA GGATTTAATA AAACCTTTGG CTACTATATT AGTATTTCTC GTTCCAAATC TGACCAAGTA CCTGATAATT ACATCCGTAA GCAAACTCTG ACGAACGAGG AACGATACAT TACTCCAGAC CTGAAAGAAC GAGAAGCGCG GATTCTCACA GCGCGAGATG ACTTAAATCA GTTGGAATAT GAGATTTTCG CTGCTTTGCG GGATGAAGTC GGTTCTCACG CGGAAACCAT TCGTAATATT TCCCATGCGG TAGCTGCTGC TGATGTATTA TGCGGATTAG CTGAGTTAGC GGTACATCAA GGTTACTGTT GTCCGGAAAT GGTGGAAGGA TGGGAAATTG AGGTTATTGA TGGTCGTCAT CCAGTAGTGG AACAATCTTT ACCAGCGGGG TTTTTTGTTC CTAACTCGAC AACATTGGGG ACTGGGAATG AACCTACTAA TCACCAATCA CCTGATTTAG TCATTCTCAC AGGGCCGAAT GCGAGTGGCA AGAGTTGTTA TTTACGTCAA GTGGGGTTAA TTCAGTTGAT GGCGCAGGTT GGTAGTTTTG TGCCAGCGCG TTCTGCTAAG TTGGGAGTGT GCGATCGCAT TTTTACCCGT GTAGGTGCTG TAGACGATTT AGCAACAGGT CAATCTACAT TTATGGTAGA AATGAATGAA ACTGCAAATA TTCTCAATCA TGCAACTGTT AAATCATTAG TTTTATTAGA TGAAATTGGC CGTGGAACAG CAACATTTGA TGGTCTTTCA ATAGCTTGGG CTGTAGCAGA ATATTTAGCA GTAGAGATTA AATCTCGGAC AATTTTTGCA ACTCACTATC ATGAATTAAA TGAATTAGCG AGTATTGTTT CTAATGTGGC TAATTATCAA GTTACCGTGA AAGAATTACC TGATCAAATT ATCTTTTTAC ATCAAGTTCA GCCAGGTGGT GCTGATAAAT CCTATGGTAT TGAAGCAGGA AGATTAGCAG GTTTACCCAC GGTGGTAATC AAGCGTGCAA AACAAGTAAT GGGACAAATT GAAAAACATA GTAAAATTGC TGTGGGTTTG CGTGAAGGAC TGTAA
|
Protein sequence | MTTPTPEHNQ PNTPLRDTKL VEDRSKLSKM YQHYVEMKDK YPHALLLYRV GDFFETFFQD AVTVSRELEL VLTSKHGGEL GRIAMTGVPH HAWERYTTLL VEKGYAVVIC DQVEDSSEAV GLVKREVTRI LTPGTLLEEG MLQTSRNNYL AAVVIAVNHW GLAYADISTG EFLTCEGSDL EHLTQELMRL QPSEVLFPTN APDLGTLLRP GETSPSLPQC LPPTFCYSLR SQLPFSQSEA RSKLLQKFKL RSLEGLGCEH LPLAVRAAGG LLEYIEDTQK QNPVPLQLLH TYSLTDYLIV DHQTRRNLEI TQTVRDGTFH GSLLWALNRT TTAMGGRALR RWLLQPLLDI KGIRARQYTI QELCENTPLR QDLRRLLRKI YDLERLTGRA GSGTANARDL MALADSFSSL PELSRIVEDA RSPFLKALQK VPPVLEELAE RLQAHIVESP PIHLKEGSLI RPGINPLLDE RKATVEADQK WIANLEVDER ARTRIPTLKV GFNKTFGYYI SISRSKSDQV PDNYIRKQTL TNEERYITPD LKEREARILT ARDDLNQLEY EIFAALRDEV GSHAETIRNI SHAVAAADVL CGLAELAVHQ GYCCPEMVEG WEIEVIDGRH PVVEQSLPAG FFVPNSTTLG TGNEPTNHQS PDLVILTGPN ASGKSCYLRQ VGLIQLMAQV GSFVPARSAK LGVCDRIFTR VGAVDDLATG QSTFMVEMNE TANILNHATV KSLVLLDEIG RGTATFDGLS IAWAVAEYLA VEIKSRTIFA THYHELNELA SIVSNVANYQ VTVKELPDQI IFLHQVQPGG ADKSYGIEAG RLAGLPTVVI KRAKQVMGQI EKHSKIAVGL REGL
|
| |