Gene PHATRDRAFT_45661 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_45661 
SymbolMSH4 
ID7200416 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011675 
Strand
Start bp849572 
End bp853326 
Gene Length3755 bp 
Protein Length1191 aa 
Translation table 
GC content48% 
IMG OID 
Productmuts-like protein 4 
Protein accessionXP_002179729 
Protein GI219117886 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAGTCCC AACCTCACAA GAATCGAAGC AAGCGCAAGA GTAAGAGCAG TACACCATCC 
GCGAATGGAT CTGACCCGGC ACCGTTGGAT CTCGCCTCCA AAGTCTCCTT AACCGGGGGG
GCTTCCGGAA CGGCTCTAGA AGTCTCACTA TCACACGATG ATGCTGCGTC ATTAGCTTGC
AAAAGTCTGG GATCGGGATT GTACGGAACT CGCGCAACGC AAAGCCTTGC CTCCTCCCTC
CACCGCAAGC GCCTCTTTCC GTCCGCCCAG GCGTCCTCAC GATCCCAACG AACGGTACCT
GGTTCGGTAG CGAGTCGGAA GTCTTCCCAG TATTCGCAAA AATCCCGATT TTCTCGAGCC
TCGGGTACCC CAGCTCACCA ACGAGCGCGG TATCACGGAA TGCAGACGCA GACCAAGGCC
ACAGCGCACA TCGTGTGCGC TGTGGCGGAA AACTTGGCCC GGGAAACCTG CGTTGCTTCC
CTGGACGCCG GCGCACCAAC CGTACTACAC GCTACGAAAC AAGGCAACGG ACAAACCTAC
GCAGAGACCT TGGCGTACTT GGAACTGCTG CAACCGGACG AAGTTTTGTT GAACGAAGGT
CGCCAAACGT CACAATTGGC CCGGAAAATC CTCGAACTGT ACAGTGTCAC AACGACTACC
GCAAACGCTG ACAACGGAAG TGCCGTCCTA CGAGACAAAA CCAACGGCCG GAACATTGCT
CAGCGACGTC GGCAACGAAA TTTATGGAAA ACCGGCACAA TGGCCAGCAT CAACGAAGAC
AAGCACTACA ATCAAAGCAG CGTTGAAGAG AGCGACGAAT ACGGCGACAG CGTAAGACAT
ACACGAAAGC AGGTCATGGT CAAGTTTATT TCGCGTGTCT GCTTTGATCA AACAAAGGGA
GCCGAGCTTT TGCGTGTTGT AGCGCGCGAA GAGACGTACG ACGCCAAGCT TGTGGAGGAC
TACATACTAT TGTCATCGGC CAATGCCGTC TTGCGCTATA CACAGCAACA TTTGGGAGCA
TGCTTGACCC GAAACAGCTT AAATTTGCAC ATTAATGCTG GAGGAAATCA TCGCATGGCT
ATCGATCGTT CAACTCTCTT ACAACTCGAG TTGCTCATGA ATGCGAAAAC CGGAAAGTTG
AAGGATTCGC TGGTGGGTTC CATTGATTGT ACCAAAACAA CAGTAGGAAG TCGGTTGTTG
CGCACCAATC TCATGTCGCC ACCAACACAA ATTGCCACCA TTCACGCCCG ACAAGAACTT
GTTGACACGT TCCTTGGCAA TGAAGCTTTC TTTTACGATG TGATGGAGCA TTTGATGAAT
TTGCCAGATG TCGACCGAAT GTTGAACCAT ATTGCCCTGG TACCGCGCCT CGATTGTAAA
GACATGGGCA TGGATGGCCA TCAACGACAG CGTCCCGCCG TTTCTCAGCG GTTAGCAAGC
AAGGGAATCT CTGCATTGGT GGCTATCAAA TCGACACTAA AAGCCTTGCC AGCCCTTGTA
CAGATATTGA AGAATCAGCT CGAAAGCACC ACCGGAGCTA TGAGGCAAAC CGAGAAAGAA
ATCTCCACCG TACAGCAGAT CAATTCTCCA AATGATGAGG ACGAAAATAC TACAATTGTG
ACAGATAGGT CGAGTCTATT GATCGGTTTG GGTGGTTGTA ACTCATCGAG CCTTCATTCA
GCACACACGA TAGAGAGTCG GAGATACTCC AGTCATTTGC TACGTGCCAT TATCTTTTCA
TTGAATCAGC CAGCTTTTAA CGAAGTCCAC AAAGCTATTC TGGATGTTTT CACCGAAAGT
ACCGCTTACA CTCGGAACCC TAACGCAATG CGCCATCAAG AATGCTTTGC GCTGAGGTGT
GAGTCTGACG GAATGATGGG AATTCTACGT AAGGCATTCT TGGCTAATGT TGATGATATT
TACCGAAAGG CAGACGAATA TGCAGAAGTA TACGGAATGC AGGTAAAAGT CAAGTACACG
GCTAGCCGCG GCTACTTTTT GGCAGTACCA TCGGATATTG GTACTGATCT CCCACTTGTT
TTTACACAGC CCACTCTTTT GGGTCGCCAC ATACACTGCA CGACAGAGGA GATTGCAAGC
TTTAATACAA GAGCCCAAGA CAACGTCCAA GATATCTTGC TCATGACACA CGATAAAATT
CAGGAAGTGC TCAATATTGG TCGTCAATAT TTCGACGCTT TTGCTGCATT GTCTGATGCA
ATTGCCTTGC TGGACTTGTG TCACGGTTTT GCCGATCACG TCACGCTCAG CGAGTCGCCT
TGGTGCCGAC CTGTGCTTTC TGAAAAGGCA ATCTGTTCCG AAGAAGGATC TACTGATTCG
GAATGTACAA TGATGATTCG AAGCGGACGA TACGCTATCG CGATAGAGGG CCATGGTTTA
GAATCAGCAG ATGGTTCAAG CGGATATATT CCGAACGATA CGTTTGCCTC AGACGCAAAG
CCATTTACGC TGATAACTGG CATCAATGGC AGCGGAAAGA GTACATATCT CAAACAGATC
GCAATCTGCA CTGTTCTAGC GCACTGTGGT AGCTATGTTC CTGCCGAACA AGCGTGTATT
CCAATCCGGG ATCTAATATG TTCTCGCATC GGCAACACAG ACGACCAAGA GCACAATATC
TCAACTTTCA TGTTGGAAAT GAAAGAGACT GCCTTTATTT GCAATCATGC TACCGAAAGA
TCCCTCATTC TTATTGATGA GCTCGGGCGT GCCACTAGCA ACGAAGATGG CGTTGCCATC
GCTTGGTCAA TTGCTGAATA TCTGTTAAAG AAAGGAGCGA TGACTTTTTT TGCCACTCAT
TACCCTCAAC TCTGTCGCCT GGGAGATGTC TATTTGAAAG TACAGAATGT CCATTTGGAG
GCATCAGTGA GCAACGGTGA AAGATCGCAG ATCTATTACA CCCATCGGGT TGTGTCTGGA
ACTTGTGCCG TCTCAACAGA TTACGGGGTT GAGCTGGCAA GCGTTTGCGG CTGGCCACAA
GAAGTCGTAA CAGCAGCTAA GACAATTCAC AAAGATGTGG AATCATTGCT GCCTGACGAA
TCAATTTGCA ACTCTGAACA AGCCAATCAT TATCCGTTTG CTGAAGCGAT GCTAGCTATC
CGCACCATCG CATCACAGAT TAAAGGATAC GTTGCCCACA ATAAAGCTCA GCCATATGAA
AGTATTCGCC GAGAGCTTGA TGAGCTCCAC CGTAGCTGCG TCAAATACAG CCACAAGGAT
CTTGCCGAGC TAATTGAAAG GATGCTTATC AGTAGTCCCT CACATACACA GCAAGATTCT
ATTGGGATCA TTCCTTCGCT TCCCGTAAGG GCACCAAAAG CTGCCCTCAA AGACCGTAAG
ATCCAAAATG CAAATATGAT CTTCACATCC GATCGCAACG GAAGCGGCAC TTTTGATCTT
CCCCTTGCCT CCACGCCTGC CAATGGCAAC CTTGAAAAAC TTGGCGAAAC AGAGGATAAC
GACAATTCCA GCTTGAGCTC TTCGTCGACA AGCTCTGATT CTTCAAGCAG CAATTCGTCA
GCATCGTCAG TTGCTGCGTT TGAGGGTTCA CTTTGAGAAA TGAGACACCG TTTCGAACGG
GGCCAAGTTG AATTGGGAAC GGATACTATA TCTCGCTGTG TTTTGTTTTC TTACGTTGAC
TGCAAAACCA CTGATAATAG ATAGCTTTGT CGTAAATTCC TGACTTAGCA TTCTGTATAC
ACATCCTCCA GCACGCAAAA CTTGGAATAA AAAGC
 
Protein sequence
MESQPHKNRS KRKSKSSTPS ANGSDPAPLD LASKVSLTGG ASGTALEVSL SHDDAASLAC 
KSLGSGLYGT RATQSLASSL HRKRLFPSAQ ASSRSQRTVP GSVASRKSSQ YSQKSRFSRA
SGTPAHQRAR YHGMQTQTKA TAHIVCAVAE NLARETCVAS LDAGAPTVLH ATKQGNGQTY
AETLAYLELL QPDEVLLNEG RQTSQLARKI LELYSVTTTT ANADNGSAVL RDKTNGRNIA
QRRRQRNLWK TGTMASINED KHYNQSSVEE SDEYGDSVRH TRKQVMVKFI SRVCFDQTKG
AELLRVVARE ETYDAKLVED YILLSSANAV LRYTQQHLGA CLTRNSLNLH INAGGNHRMA
IDRSTLLQLE LLMNAKTGKL KDSLVGSIDC TKTTVGSRLL RTNLMSPPTQ IATIHARQEL
VDTFLGNEAF FYDVMEHLMN LPDVDRMLNH IALVPRLDCK DMGMDGHQRQ RPAVSQRLAS
KGISALVAIK STLKALPALV QILKNQLEST TGAMRQTEKE ISTVQQINSP NDEDENTTIV
TDRSSLLIGL GGCNSSSLHS AHTIESRRYS SHLLRAIIFS LNQPAFNEVH KAILDVFTES
TAYTRNPNAM RHQECFALRC ESDGMMGILR KAFLANVDDI YRKADEYAEV YGMQVKVKYT
ASRGYFLAVP SDIGTDLPLV FTQPTLLGRH IHCTTEEIAS FNTRAQDNVQ DILLMTHDKI
QEVLNIGRQY FDAFAALSDA IALLDLCHGF ADHVTLSESP WCRPVLSEKA ICSEEGSTDS
ECTMMIRSGR YAIAIEGHGL ESADGSSGYI PNDTFASDAK PFTLITGING SGKSTYLKQI
AICTVLAHCG SYVPAEQACI PIRDLICSRI GNTDDQEHNI STFMLEMKET AFICNHATER
SLILIDELGR ATSNEDGVAI AWSIAEYLLK KGAMTFFATH YPQLCRLGDV YLKVQNVHLE
ASVSNGERSQ IYYTHRVVSG TCAVSTDYGV ELASVCGWPQ EVVTAAKTIH KDVESLLPDE
SICNSEQANH YPFAEAMLAI RTIASQIKGY VAHNKAQPYE SIRRELDELH RSCVKYSHKD
LAELIERMLI SSPSHTQQDS IGIIPSLPVR APKAALKDRK IQNANMIFTS DRNGSGTFDL
PLASTPANGN LEKLGETEDN DNSSLSSSST SSDSSSSNSS ASSVAAFEGS L