Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_54331 |
Symbol | Mlh1 |
ID | 7199485 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011673 |
Strand | + |
Start bp | 784434 |
End bp | 786998 |
Gene Length | 2565 bp |
Protein Length | 695 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | mutl-like protein 1 |
Protein accession | XP_002178844 |
Protein GI | 219116098 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.719113 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CACGTTGCAT TAAAAAGCTA CATCATTGCA TTGCGTCGGA GAAATAATGG AAGACTCAGC ATTGACCCGT TCCGGAGAAA TCCAGATTCT ACCGCAGGAA GTGGTGGATA AAATTGCCGC TGGCGAAGTC GTCCAGCGGC CCGTATCCGT CGTCAAGGAG CTGGTGGAAA ACGCATTGGA TGCGGGCGCT ACAGAAGTCA TTGTTACCGT TGAGAAAGGA GGATTGGCGA AAATAACAAT AGCAGACAAT GGCGGGGGAA TCCGTCCACA GGATCTCCCA TTGGCGGCAA CACGACATGC CACGAGTAAA TTGCGCACCA CGGAGGATTT TGCGCACCTA TGTAGTTTCG GATTTCGTGG AGAAGCCTTG GCCTCAGTCA GTATGGTGTC ACATCTTTGC ATTACCAGTC GCGTTCCGGA AGTAAAGGTC GGCTACAAGC TCGCGTATCG TGGCGGAAAG CCGCTACAGT CACCGAAGCC TACAGCACGG AAACCGGGCA CCACGGTGTT GGTAGAAGAT TTATTTTTCA ATCTGCCTCA TCGGAAAGTT CTGCGACCTG CGGATGAGTA CAACAAAATT TTGACGGTTC TGCAGCATTA CTCAATCCTA TATGCAGAAC AGGGTATAGG ACTTGTTTGC CAGAAATCCG GTAAGAAAAG CACGACGGAT TTGAACACAA GCAATGCCGT CGCAGTATTG CGCTCGGCAT TAGATGCTGG TCAGGCAAAC GACGACGCTT TGCGAAAGCT GCAACTGCGA GCTACTCAGC AGGTGATAGC TCAGGTTTTT GGATCCCAGC TGATTTCTCA TTTGCAAGGC TTTGATTGTT TGCGGTTAAA AGAGGAAGGG TCCGAAGAAG ACCGCTCAGT ATTTGTCTGT AGGGGATTGA TTAGTACCAC AACTATTCAT CCGCTTAGCA AGAGGGACGC AATTAGTGCT GTTTATCAAT AGCCGTCTGG TGGAATGCAA TGGACTAAAG CGAGTGATGG AGGATATCTA TAGCGAATAT ACGAAGATCA AGCCATTTCT TTATTTACGT CTTGACGTGC CACCTGATAC AGTGGATGTT AACGTTCATC CCACAAAGAA AGAAGTCGCC TTGCTATACC TCGACGAAAT CTGCAAACAT ATCTCGTCTC AACTCAGACA AACGCTGTCA CGAGCCGGAC AAACCTTTGA ACAAGAAGAC TTGTCTGTCC AATCAAGGTT ATCCAACCCC TACAAAAGAA AAGTCTCTGC TATCTGTACA GACAATGCCC CTTCCGGGAT GCACTTGCTT GCTTCACAGC AACCGGGTAA GAAATCCGCG GCATGCAAAC TCATTAGAAC TGATCAATCA ACCCAGGTCG GTGCTCTGGA ACCTTACTTG GTACAGAAAT CTCAAAGTGA AACACCGCTT TCAGATAAAA CGTATCAGAA TGAAACTCCA TCGTCAACGT CCTCTTCGCA GCATTCGTCC GAATCTCTCC TCGACACAAG TCAATTGTCG ACGAGGGCCG TGCAACCCCA CAAACTCGAC TGCCCCCTCG CCAAACCATC CCCCGATCTT GATCTGACCC AGCCGGGGAT TTTCGCTGCT ATATCACAAC AATGCATATG TAGAGAAGGT TCTACCATCG CCGATGCCTC CCTCGTTCAG TTGCCGCGCA TGGCGATATC GCGGCCGAAA AGGATCTTGG CGACGCCTTG CAGATACAGC TCCATCCGCT CACTGCGAAA GCGCGTTCGT AAGCGCTCCA CGTCACGTCT GGAAAAACGA CTTCGGACGT CGTGTTGGGT CGGCGTTGTT AGCCGACAGC GCTCGCTCGT ACAAGTCGGG GAAGACTTGG TACTCATGAA TCACCTTGAA TTTTCCCGAC AAATGTTCTA TCAACTAGCC CTTGATCGAT TCGGCGGAGG AATGAACTTG GCTGAGTTGG GAGAAGGCGG GCAAGGAGCC GTCGATATTC AAGTGATCAT TGCGCAAGCC CTTCAGTTGG AGGAAAAGGT TCGTTCAGAA GAAGGACGAC ACGAGCTGCA GCAGACGAGG GGATTGCTTA CGACGAGTGA AACAAACTCG GCTTTGGCTG ATCAAGCGGC AACTTGCTTA ATGGACAACA GTGAGATGCT CGAAGAATAT TTCTCCATTG CTATTGAGAA AGATGACCTA GGGCGCATCA TGCTTAAGGG GCTCCCCGTA CTACTGGAAG GACATTGCCC ACAGCCACAC GGTCTTGCCT TGTTTCTGTT ACGATTGGCC ACTGAAGTGG ATTGGTCGGA AGAACGGCTT TGTTTTCACG GTGTGTGTCG AGAACTCGGG GCATACTATT CACAACTGCC ATCCGACAAT GAAGCTCTCG AGTCCTTTAT TCGGCATACA CTGTTTCCGG CAATCTCGAC CCTCACAGTT CCTCCCACAG TGTTGGAAGA AGAAGGCTGC TTTCAATCAG TGACCAAATT GTCAAAATTG TTTCGCGTTT TTGAGCGTTG CTAATGCAAG CAAATGCTTT GACCATAAAA ACAACGGCGT CCTCATAACT TTGGGTAAAA TAACTACCCG ACTAATAACA TTCTCTGAAT TTCTC
|
Protein sequence | MEDSALTRSG EIQILPQEVV DKIAAGEVVQ RPVSVVKELV ENALDAGATE VIVTVEKGGL AKITIADNGG GIRPQDLPLA ATRHATSKLR TTEDFAHLCS FGFRGEALAS VSMVSHLCIT SRVPEVKVGY KLAYRGGKPL QSPKPTARKP GTTVLVEDLF FNLPHRKVLR PADEYNKILT VLQHYSILYA EQGIGLVCQK SGKKSTTDLN TSNAVAVLRS ALDAGQANDD ALRKLQLRAT QQVIAQVFGS QLISHLQGFD SRGTQLVLFI NSRLVECNGL KRVMEDIYSE YTKIKPFLYL RLDVPPDTVD VNVHPTKKEV ALLYLDEICK HISSQLRQTL SRAGQTFEQE DLSVQSRLSN PYKRKVSAIC TDNAPSGMHL LASQQPGKKS AACKLIRTDQ STQVGALEPY LVQKSQSETP LSDKTYQNET PSSTSSSQHS SESLLDTSQF SIRSLRKRVR KRSTSRLEKR LRTSCWVGVV SRQRSLVQVG EDLVLMNHLE FSRQMFYQLA LDRFGGGMNL AELGEGGQGA VDIQVIIAQA LQLEEKTRGL LTTSETNSAL ADQAATCLMD NSEMLEEYFS IAIEKDDLGR IMLKGLPVLL EGHCPQPHGL ALFLLRLATE VDWSEERLCF HGVCRELGAY YSQLPSDNEA LESFIRHTLF PAISTLTVPP TVLEEEGCFQ SVTKLSKLFR VFERC
|
| |