Gene PHATRDRAFT_54331 details


Gene Information

Locus tag: PHATRDRAFT_54331
Symbol: Mlh1
ID: 7199485
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011673
Strand:
Start bp: 784434
End bp: 786998
Gene Length: 2565 bp
Protein Length: 695 aa
Translation table:
GC content: 49%
IMG OID:
Product: mutl-like protein 1
Protein accession: XP_002178844
Protein GI: 219116098
COG category:
COG ID:
TIGRFAM ID:
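
The gene length listed above follows from the start and end coordinates if they are read as 1-based and inclusive. The short Python sketch below checks that arithmetic. The function name `gene_length` is hypothetical and chosen for illustration only. The comparison with the protein length assumes one stop codon is counted in the coding region, which this record does not state.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


# Values copied from the Gene Information table.
length_bp = gene_length(784434, 786998)
print(length_bp)  # 2565

# Assumption: the coding region is 695 codons plus one stop codon.
coding_bp = (695 + 1) * 3
print(coding_bp)              # 2088
print(length_bp - coding_bp)  # 477
```

Under those assumptions, 477 bp of the listed gene span would not code for protein. That is consistent with the gene being marked as spliced, but it is an interpretation of the numbers, not a statement made by the record.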


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.719113
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
CACGTTGCAT TAAAAAGCTA CATCATTGCA TTGCGTCGGA GAAATAATGG AAGACTCAGC 
ATTGACCCGT TCCGGAGAAA TCCAGATTCT ACCGCAGGAA GTGGTGGATA AAATTGCCGC
TGGCGAAGTC GTCCAGCGGC CCGTATCCGT CGTCAAGGAG CTGGTGGAAA ACGCATTGGA
TGCGGGCGCT ACAGAAGTCA TTGTTACCGT TGAGAAAGGA GGATTGGCGA AAATAACAAT
AGCAGACAAT GGCGGGGGAA TCCGTCCACA GGATCTCCCA TTGGCGGCAA CACGACATGC
CACGAGTAAA TTGCGCACCA CGGAGGATTT TGCGCACCTA TGTAGTTTCG GATTTCGTGG
AGAAGCCTTG GCCTCAGTCA GTATGGTGTC ACATCTTTGC ATTACCAGTC GCGTTCCGGA
AGTAAAGGTC GGCTACAAGC TCGCGTATCG TGGCGGAAAG CCGCTACAGT CACCGAAGCC
TACAGCACGG AAACCGGGCA CCACGGTGTT GGTAGAAGAT TTATTTTTCA ATCTGCCTCA
TCGGAAAGTT CTGCGACCTG CGGATGAGTA CAACAAAATT TTGACGGTTC TGCAGCATTA
CTCAATCCTA TATGCAGAAC AGGGTATAGG ACTTGTTTGC CAGAAATCCG GTAAGAAAAG
CACGACGGAT TTGAACACAA GCAATGCCGT CGCAGTATTG CGCTCGGCAT TAGATGCTGG
TCAGGCAAAC GACGACGCTT TGCGAAAGCT GCAACTGCGA GCTACTCAGC AGGTGATAGC
TCAGGTTTTT GGATCCCAGC TGATTTCTCA TTTGCAAGGC TTTGATTGTT TGCGGTTAAA
AGAGGAAGGG TCCGAAGAAG ACCGCTCAGT ATTTGTCTGT AGGGGATTGA TTAGTACCAC
AACTATTCAT CCGCTTAGCA AGAGGGACGC AATTAGTGCT GTTTATCAAT AGCCGTCTGG
TGGAATGCAA TGGACTAAAG CGAGTGATGG AGGATATCTA TAGCGAATAT ACGAAGATCA
AGCCATTTCT TTATTTACGT CTTGACGTGC CACCTGATAC AGTGGATGTT AACGTTCATC
CCACAAAGAA AGAAGTCGCC TTGCTATACC TCGACGAAAT CTGCAAACAT ATCTCGTCTC
AACTCAGACA AACGCTGTCA CGAGCCGGAC AAACCTTTGA ACAAGAAGAC TTGTCTGTCC
AATCAAGGTT ATCCAACCCC TACAAAAGAA AAGTCTCTGC TATCTGTACA GACAATGCCC
CTTCCGGGAT GCACTTGCTT GCTTCACAGC AACCGGGTAA GAAATCCGCG GCATGCAAAC
TCATTAGAAC TGATCAATCA ACCCAGGTCG GTGCTCTGGA ACCTTACTTG GTACAGAAAT
CTCAAAGTGA AACACCGCTT TCAGATAAAA CGTATCAGAA TGAAACTCCA TCGTCAACGT
CCTCTTCGCA GCATTCGTCC GAATCTCTCC TCGACACAAG TCAATTGTCG ACGAGGGCCG
TGCAACCCCA CAAACTCGAC TGCCCCCTCG CCAAACCATC CCCCGATCTT GATCTGACCC
AGCCGGGGAT TTTCGCTGCT ATATCACAAC AATGCATATG TAGAGAAGGT TCTACCATCG
CCGATGCCTC CCTCGTTCAG TTGCCGCGCA TGGCGATATC GCGGCCGAAA AGGATCTTGG
CGACGCCTTG CAGATACAGC TCCATCCGCT CACTGCGAAA GCGCGTTCGT AAGCGCTCCA
CGTCACGTCT GGAAAAACGA CTTCGGACGT CGTGTTGGGT CGGCGTTGTT AGCCGACAGC
GCTCGCTCGT ACAAGTCGGG GAAGACTTGG TACTCATGAA TCACCTTGAA TTTTCCCGAC
AAATGTTCTA TCAACTAGCC CTTGATCGAT TCGGCGGAGG AATGAACTTG GCTGAGTTGG
GAGAAGGCGG GCAAGGAGCC GTCGATATTC AAGTGATCAT TGCGCAAGCC CTTCAGTTGG
AGGAAAAGGT TCGTTCAGAA GAAGGACGAC ACGAGCTGCA GCAGACGAGG GGATTGCTTA
CGACGAGTGA AACAAACTCG GCTTTGGCTG ATCAAGCGGC AACTTGCTTA ATGGACAACA
GTGAGATGCT CGAAGAATAT TTCTCCATTG CTATTGAGAA AGATGACCTA GGGCGCATCA
TGCTTAAGGG GCTCCCCGTA CTACTGGAAG GACATTGCCC ACAGCCACAC GGTCTTGCCT
TGTTTCTGTT ACGATTGGCC ACTGAAGTGG ATTGGTCGGA AGAACGGCTT TGTTTTCACG
GTGTGTGTCG AGAACTCGGG GCATACTATT CACAACTGCC ATCCGACAAT GAAGCTCTCG
AGTCCTTTAT TCGGCATACA CTGTTTCCGG CAATCTCGAC CCTCACAGTT CCTCCCACAG
TGTTGGAAGA AGAAGGCTGC TTTCAATCAG TGACCAAATT GTCAAAATTG TTTCGCGTTT
TTGAGCGTTG CTAATGCAAG CAAATGCTTT GACCATAAAA ACAACGGCGT CCTCATAACT
TTGGGTAAAA TAACTACCCG ACTAATAACA TTCTCTGAAT TTCTC
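
The sequence above is printed in blocks of ten bases, so the spaces and line breaks have to be removed before its length or GC content can be compared with the values in the Gene Information table. A minimal Python sketch of that cleanup follows. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the demo uses only the first two lines of the gene sequence, so its GC value is not expected to match the figure reported for the full gene.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a sequence printed in blocks."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in an already cleaned sequence."""
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First two lines of the gene sequence, used only as a short demo.
demo = """CACGTTGCAT TAAAAAGCTA CATCATTGCA TTGCGTCGGA GAAATAATGG AAGACTCAGC
ATTGACCCGT TCCGGAGAAA TCCAGATTCT ACCGCAGGAA GTGGTGGATA AAATTGCCGC"""

seq = clean_sequence(demo)
print(len(seq))                        # 120
print(round(gc_fraction(seq) * 100))   # GC percentage of the demo only
```

Running the same two functions on the complete gene sequence should give a length that can be checked against the listed gene length.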
 
Protein sequence
MEDSALTRSG EIQILPQEVV DKIAAGEVVQ RPVSVVKELV ENALDAGATE VIVTVEKGGL 
AKITIADNGG GIRPQDLPLA ATRHATSKLR TTEDFAHLCS FGFRGEALAS VSMVSHLCIT
SRVPEVKVGY KLAYRGGKPL QSPKPTARKP GTTVLVEDLF FNLPHRKVLR PADEYNKILT
VLQHYSILYA EQGIGLVCQK SGKKSTTDLN TSNAVAVLRS ALDAGQANDD ALRKLQLRAT
QQVIAQVFGS QLISHLQGFD SRGTQLVLFI NSRLVECNGL KRVMEDIYSE YTKIKPFLYL
RLDVPPDTVD VNVHPTKKEV ALLYLDEICK HISSQLRQTL SRAGQTFEQE DLSVQSRLSN
PYKRKVSAIC TDNAPSGMHL LASQQPGKKS AACKLIRTDQ STQVGALEPY LVQKSQSETP
LSDKTYQNET PSSTSSSQHS SESLLDTSQF SIRSLRKRVR KRSTSRLEKR LRTSCWVGVV
SRQRSLVQVG EDLVLMNHLE FSRQMFYQLA LDRFGGGMNL AELGEGGQGA VDIQVIIAQA
LQLEEKTRGL LTTSETNSAL ADQAATCLMD NSEMLEEYFS IAIEKDDLGR IMLKGLPVLL
EGHCPQPHGL ALFLLRLATE VDWSEERLCF HGVCRELGAY YSQLPSDNEA LESFIRHTLF
PAISTLTVPP TVLEEEGCFQ SVTKLSKLFR VFERC