Gene PHATRDRAFT_52173 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_52173 
SymbolMSH5 
ID7202038 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011680 
Strand
Start bp547987 
End bp549271 
Gene Length1285 bp 
Protein Length376 aa 
Translation table 
GC content48% 
IMG OID 
Productmuts-like protein 5 
Protein accessionXP_002181399 
Protein GI219122117 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAGAACG GCGCCTTGCC TGATGACTTC GAATACGTAT TTTCGGAATC GCTCTCATAC 
TTCAAGAGTG CGGAGATGCG TCAACTTGAT CAAAATATTG GTGACCTCGA TGCTTTCATC
AAAGATGCGG AAACATTGAT TGTTGCAGAA TTGGAGGACG AAATCCTGGA CCACGAATCG
GAGCTCCGTG AGACCTTCAT CGCCCTGGCC GAGCTGGATT GCATACTCTC CTTCGCTGGT
GCTGCTGCAG ACTTAGATTT CGTGCGCCCG CGCGTTGTCT CGGCGTCTGA ACAGTGCGTT
GAGATAAGCC GGGGCCGCCA TCCTTTGCAA GAAATTGTTC TCGACACAGC ATTTGTACCC
AACGATACCA CTATGAACAC GACAAGTCGT GTAACCGTAA TCACAGGCCC AAACTTTAGT
GGCAAGAGTT GCTTTGCTCG CCAAGGTCAG TTCTTGAGTG AGGGTTTAAT TGCGATTGCC
AAAAAAAATT CTCGGCGTTG TCTGACTATT CTATTCCTTG TACCTTGTAA GTCGGAGTAC
TCGTCTATAT GGCACACATT GGCTGTTTTC TCCCCTGCGA TGAAGCACGT ATTTCGCTGA
CAGATCAAAT TTTCACACAA TTCAGCTCTA CTGAAACATG TGCTGTCCCT CAAAGCAGCT
TCCAACTTGA TCTCAGCCGT ATGGGAGCTA TTCTCCGTCG AGCGAGTCAG CATTCGCTGG
TTTTGATTGA CGAATTTGGG AAAGGTAAGG TTCGATTTCG AAGCACATAA AAGAAATTCA
GTGGAAGCTG ACATGCTTAT TATCCGCACA AGGTACAAGC CCGGCATCAG GAATCTCCCT
ACTCACGGCG GCTCTGCAAA AGTTGGTATC TAATCGCTCG AAAGTAATCT GCACGACTCA
TTTCCTAGAG ATCTTTTCAA TAGGATTGCT TGTGGACTCT GAGAATGGAA TTTCCGCGAT
GCATATGACT GTGCATGTCC CTGAGACGGC CAATGATAGT GCTGTCCCAC TATTCCGAAT
GGAGCACGGG ATCGCAAATT CGTCCGCTGG ACTCGTTTGC GCGAAAATGG CTGGCGTAAA
AAAAGCCATC GTCGATCGCG CCTACGAGAT AATTAAGGCA ATCAAGAAAC GTCAAAAGGT
CCATCCGCTT GCTGAACTTT TGCGCAATGA TATACACATG ACTCTCGACT CGAAGCATGC
CATCAGATCC TTCGTCAGCA CGAGTTGGAG GGATGCTAGC GACGATCAAA TTGATGCATT
CTTTTCTATA ACTGAAAGGA TGTAG
 
Protein sequence
MENGALPDDF EYVFSESLSY FKSAEMRQLD QNIGDLDAFI KDAETLIVAE LEDEILDHES 
ELRETFIALA ELDCILSFAG AAADLDFVRP RVVSASEQCV EISRGRHPLQ EIVLDTAFVP
NDTTMNTTSR VTVITGPNFS GKSCFARQVG VLVYMAHIGC FLPCDEARIS LTDQIFTQFS
STETCAVPQS SFQLDLSRMG AILRRASQHS LVLIDEFGKG TSPASGISLL TAALQKLVSN
RSKVICTTHF LEIFSIGLLV DSENGISAMH MTVHVPETAN DSAVPLFRME HGIANSSAGL
VCAKMAGVKK AIVDRAYEII KAIKKRQKVH PLAELLRNDI HMTLDSKHAI RSFVSTSWRD
ASDDQIDAFF SITERM