Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_45661 |
Symbol | MSH4 |
ID | 7200416 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011675 |
Strand | + |
Start bp | 849572 |
End bp | 853326 |
Gene Length | 3755 bp |
Protein Length | 1191 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | muts-like protein 4 |
Protein accession | XP_002179729 |
Protein GI | 219117886 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAGTCCC AACCTCACAA GAATCGAAGC AAGCGCAAGA GTAAGAGCAG TACACCATCC GCGAATGGAT CTGACCCGGC ACCGTTGGAT CTCGCCTCCA AAGTCTCCTT AACCGGGGGG GCTTCCGGAA CGGCTCTAGA AGTCTCACTA TCACACGATG ATGCTGCGTC ATTAGCTTGC AAAAGTCTGG GATCGGGATT GTACGGAACT CGCGCAACGC AAAGCCTTGC CTCCTCCCTC CACCGCAAGC GCCTCTTTCC GTCCGCCCAG GCGTCCTCAC GATCCCAACG AACGGTACCT GGTTCGGTAG CGAGTCGGAA GTCTTCCCAG TATTCGCAAA AATCCCGATT TTCTCGAGCC TCGGGTACCC CAGCTCACCA ACGAGCGCGG TATCACGGAA TGCAGACGCA GACCAAGGCC ACAGCGCACA TCGTGTGCGC TGTGGCGGAA AACTTGGCCC GGGAAACCTG CGTTGCTTCC CTGGACGCCG GCGCACCAAC CGTACTACAC GCTACGAAAC AAGGCAACGG ACAAACCTAC GCAGAGACCT TGGCGTACTT GGAACTGCTG CAACCGGACG AAGTTTTGTT GAACGAAGGT CGCCAAACGT CACAATTGGC CCGGAAAATC CTCGAACTGT ACAGTGTCAC AACGACTACC GCAAACGCTG ACAACGGAAG TGCCGTCCTA CGAGACAAAA CCAACGGCCG GAACATTGCT CAGCGACGTC GGCAACGAAA TTTATGGAAA ACCGGCACAA TGGCCAGCAT CAACGAAGAC AAGCACTACA ATCAAAGCAG CGTTGAAGAG AGCGACGAAT ACGGCGACAG CGTAAGACAT ACACGAAAGC AGGTCATGGT CAAGTTTATT TCGCGTGTCT GCTTTGATCA AACAAAGGGA GCCGAGCTTT TGCGTGTTGT AGCGCGCGAA GAGACGTACG ACGCCAAGCT TGTGGAGGAC TACATACTAT TGTCATCGGC CAATGCCGTC TTGCGCTATA CACAGCAACA TTTGGGAGCA TGCTTGACCC GAAACAGCTT AAATTTGCAC ATTAATGCTG GAGGAAATCA TCGCATGGCT ATCGATCGTT CAACTCTCTT ACAACTCGAG TTGCTCATGA ATGCGAAAAC CGGAAAGTTG AAGGATTCGC TGGTGGGTTC CATTGATTGT ACCAAAACAA CAGTAGGAAG TCGGTTGTTG CGCACCAATC TCATGTCGCC ACCAACACAA ATTGCCACCA TTCACGCCCG ACAAGAACTT GTTGACACGT TCCTTGGCAA TGAAGCTTTC TTTTACGATG TGATGGAGCA TTTGATGAAT TTGCCAGATG TCGACCGAAT GTTGAACCAT ATTGCCCTGG TACCGCGCCT CGATTGTAAA GACATGGGCA TGGATGGCCA TCAACGACAG CGTCCCGCCG TTTCTCAGCG GTTAGCAAGC AAGGGAATCT CTGCATTGGT GGCTATCAAA TCGACACTAA AAGCCTTGCC AGCCCTTGTA CAGATATTGA AGAATCAGCT CGAAAGCACC ACCGGAGCTA TGAGGCAAAC CGAGAAAGAA ATCTCCACCG TACAGCAGAT CAATTCTCCA AATGATGAGG ACGAAAATAC TACAATTGTG ACAGATAGGT CGAGTCTATT GATCGGTTTG GGTGGTTGTA ACTCATCGAG CCTTCATTCA GCACACACGA TAGAGAGTCG GAGATACTCC AGTCATTTGC TACGTGCCAT TATCTTTTCA TTGAATCAGC CAGCTTTTAA CGAAGTCCAC AAAGCTATTC TGGATGTTTT CACCGAAAGT ACCGCTTACA CTCGGAACCC TAACGCAATG CGCCATCAAG AATGCTTTGC GCTGAGGTGT GAGTCTGACG GAATGATGGG AATTCTACGT AAGGCATTCT TGGCTAATGT TGATGATATT TACCGAAAGG CAGACGAATA TGCAGAAGTA TACGGAATGC AGGTAAAAGT CAAGTACACG GCTAGCCGCG GCTACTTTTT GGCAGTACCA TCGGATATTG GTACTGATCT CCCACTTGTT TTTACACAGC CCACTCTTTT GGGTCGCCAC ATACACTGCA CGACAGAGGA GATTGCAAGC TTTAATACAA GAGCCCAAGA CAACGTCCAA GATATCTTGC TCATGACACA CGATAAAATT CAGGAAGTGC TCAATATTGG TCGTCAATAT TTCGACGCTT TTGCTGCATT GTCTGATGCA ATTGCCTTGC TGGACTTGTG TCACGGTTTT GCCGATCACG TCACGCTCAG CGAGTCGCCT TGGTGCCGAC CTGTGCTTTC TGAAAAGGCA ATCTGTTCCG AAGAAGGATC TACTGATTCG GAATGTACAA TGATGATTCG AAGCGGACGA TACGCTATCG CGATAGAGGG CCATGGTTTA GAATCAGCAG ATGGTTCAAG CGGATATATT CCGAACGATA CGTTTGCCTC AGACGCAAAG CCATTTACGC TGATAACTGG CATCAATGGC AGCGGAAAGA GTACATATCT CAAACAGATC GCAATCTGCA CTGTTCTAGC GCACTGTGGT AGCTATGTTC CTGCCGAACA AGCGTGTATT CCAATCCGGG ATCTAATATG TTCTCGCATC GGCAACACAG ACGACCAAGA GCACAATATC TCAACTTTCA TGTTGGAAAT GAAAGAGACT GCCTTTATTT GCAATCATGC TACCGAAAGA TCCCTCATTC TTATTGATGA GCTCGGGCGT GCCACTAGCA ACGAAGATGG CGTTGCCATC GCTTGGTCAA TTGCTGAATA TCTGTTAAAG AAAGGAGCGA TGACTTTTTT TGCCACTCAT TACCCTCAAC TCTGTCGCCT GGGAGATGTC TATTTGAAAG TACAGAATGT CCATTTGGAG GCATCAGTGA GCAACGGTGA AAGATCGCAG ATCTATTACA CCCATCGGGT TGTGTCTGGA ACTTGTGCCG TCTCAACAGA TTACGGGGTT GAGCTGGCAA GCGTTTGCGG CTGGCCACAA GAAGTCGTAA CAGCAGCTAA GACAATTCAC AAAGATGTGG AATCATTGCT GCCTGACGAA TCAATTTGCA ACTCTGAACA AGCCAATCAT TATCCGTTTG CTGAAGCGAT GCTAGCTATC CGCACCATCG CATCACAGAT TAAAGGATAC GTTGCCCACA ATAAAGCTCA GCCATATGAA AGTATTCGCC GAGAGCTTGA TGAGCTCCAC CGTAGCTGCG TCAAATACAG CCACAAGGAT CTTGCCGAGC TAATTGAAAG GATGCTTATC AGTAGTCCCT CACATACACA GCAAGATTCT ATTGGGATCA TTCCTTCGCT TCCCGTAAGG GCACCAAAAG CTGCCCTCAA AGACCGTAAG ATCCAAAATG CAAATATGAT CTTCACATCC GATCGCAACG GAAGCGGCAC TTTTGATCTT CCCCTTGCCT CCACGCCTGC CAATGGCAAC CTTGAAAAAC TTGGCGAAAC AGAGGATAAC GACAATTCCA GCTTGAGCTC TTCGTCGACA AGCTCTGATT CTTCAAGCAG CAATTCGTCA GCATCGTCAG TTGCTGCGTT TGAGGGTTCA CTTTGAGAAA TGAGACACCG TTTCGAACGG GGCCAAGTTG AATTGGGAAC GGATACTATA TCTCGCTGTG TTTTGTTTTC TTACGTTGAC TGCAAAACCA CTGATAATAG ATAGCTTTGT CGTAAATTCC TGACTTAGCA TTCTGTATAC ACATCCTCCA GCACGCAAAA CTTGGAATAA AAAGC
|
Protein sequence | MESQPHKNRS KRKSKSSTPS ANGSDPAPLD LASKVSLTGG ASGTALEVSL SHDDAASLAC KSLGSGLYGT RATQSLASSL HRKRLFPSAQ ASSRSQRTVP GSVASRKSSQ YSQKSRFSRA SGTPAHQRAR YHGMQTQTKA TAHIVCAVAE NLARETCVAS LDAGAPTVLH ATKQGNGQTY AETLAYLELL QPDEVLLNEG RQTSQLARKI LELYSVTTTT ANADNGSAVL RDKTNGRNIA QRRRQRNLWK TGTMASINED KHYNQSSVEE SDEYGDSVRH TRKQVMVKFI SRVCFDQTKG AELLRVVARE ETYDAKLVED YILLSSANAV LRYTQQHLGA CLTRNSLNLH INAGGNHRMA IDRSTLLQLE LLMNAKTGKL KDSLVGSIDC TKTTVGSRLL RTNLMSPPTQ IATIHARQEL VDTFLGNEAF FYDVMEHLMN LPDVDRMLNH IALVPRLDCK DMGMDGHQRQ RPAVSQRLAS KGISALVAIK STLKALPALV QILKNQLEST TGAMRQTEKE ISTVQQINSP NDEDENTTIV TDRSSLLIGL GGCNSSSLHS AHTIESRRYS SHLLRAIIFS LNQPAFNEVH KAILDVFTES TAYTRNPNAM RHQECFALRC ESDGMMGILR KAFLANVDDI YRKADEYAEV YGMQVKVKYT ASRGYFLAVP SDIGTDLPLV FTQPTLLGRH IHCTTEEIAS FNTRAQDNVQ DILLMTHDKI QEVLNIGRQY FDAFAALSDA IALLDLCHGF ADHVTLSESP WCRPVLSEKA ICSEEGSTDS ECTMMIRSGR YAIAIEGHGL ESADGSSGYI PNDTFASDAK PFTLITGING SGKSTYLKQI AICTVLAHCG SYVPAEQACI PIRDLICSRI GNTDDQEHNI STFMLEMKET AFICNHATER SLILIDELGR ATSNEDGVAI AWSIAEYLLK KGAMTFFATH YPQLCRLGDV YLKVQNVHLE ASVSNGERSQ IYYTHRVVSG TCAVSTDYGV ELASVCGWPQ EVVTAAKTIH KDVESLLPDE SICNSEQANH YPFAEAMLAI RTIASQIKGY VAHNKAQPYE SIRRELDELH RSCVKYSHKD LAELIERMLI SSPSHTQQDS IGIIPSLPVR APKAALKDRK IQNANMIFTS DRNGSGTFDL PLASTPANGN LEKLGETEDN DNSSLSSSST SSDSSSSNSS ASSVAAFEGS L
|
| |