Gene PHATRDRAFT_51412 details

Gene Information

Locus tag: PHATRDRAFT_51412
Symbol: MCM4
ID: 7196534
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011669
Strand:
Start bp: 1459687
End bp: 1462139
Gene Length: 2453 bp
Protein Length: 791 aa
Translation table:
GC content: 57%
IMG OID:
Product: predicted protein
Protein accession: XP_002176792
Protein GI: 219110080
COG category:
COG ID:
TIGRFAM ID:
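
The reported gene length follows from the start and end coordinates. A minimal Python sketch of that arithmetic, assuming the coordinates are 1-based and inclusive (an assumption, though it agrees with the values listed in this record); the function name is hypothetical:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature from 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


# Coordinates taken from the record above.
print(gene_length(1459687, 1462139))  # 2453
```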


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCACCGC AACGGGACGA TCCTTCGGTG GCTCCACTGG ACGACGATGG TTTGTTGGGA 
CAGGAAACGG CACACATTCG GGGAACGGAC GTGCACGTTC CTACGGCGGC AGCGACCTTT
ACCAGCTTTC TCCGTAGCTT TCAATCGTTA CAGATCTCCC AACGTGTGCA GGAGCGAGCC
GCGAGGGTAG AGGCCGACGA CGACGATTCC GTTGGGGACG AAGTGGACGT CGACGGGGAC
GATCCGCCAC CGTTGTACCG TACGCGACTC GAATCGCTCC TCACTCGCGG TGTCACCCAG
GGATCGACTA CCACCAGTCT CGATATTGAT GCCATGCATC TGTACTACCA CAACGCAGCC
TGTCAGCGAC TCTATCATCA GATTGTGGCC TATCCGATGG AGCTGGTCCC CCTCATGGAC
TTGTGCGTCC AGCGCGAACT AGAGCACCTC GCCAACACGC TACCCGACGT TGTGGATCCG
GACACTCTGC CGCGGGCGCA GGTTCGACCT TTCAACCTCA AGCTCGTCTC CAACCTGCGG
TGTCTCGATC CCGTCGCTAT GGATACGCTC CTTTCCGTCA AGGGCATGAT TGTGCGCTCG
TCGCCCATCA TTCCGGATCT CAAAATTGCC CACTTTGGTT GTTGCGTCTG CGGACATGTT
GTACAGGTAG CCATTGATCG CGGCAAGATT GCCGAACCTA CCGCACGCTG TCCCCAGTGC
AACACCGCCG CATCCTACCA GCTGGTACAC AATCGCTGCG TCTTTGCCGA CAAGCAGCTC
GTGCGGTTGC AGGAAACGCC CGACGAGGTG CCGGCCGGAC AAACACCCGC ATCCGTTACC
TGCTTCAGTT TCGATGATCT CGTCGATGCC GTGCAACCCG GTGATAAGGT CGAAGTCACC
GGTGTGCTCC GCGCCCAACC ACTCCGCGTA CACCCCAAAA TATCCAAACT CAAGACCGTT
TACAAAACGT ACCTGGACGT CATTCATTTC CGGACCATTG CTGGTATAGA TAACGACCAG
ACCAAGCACG ATTTGGCCGC CACGGCGGGA CACCAGTCCA ACCGTTCTCG GTGGTCCGAC
GATCGTGTCC GACAGCTACG AACGCTCGCC CGGGACCCAC TAATATACGA AAAACTGACC
GCCTCCCTCG CACCGTCCAT CTGGGAACTC GACAACGTCA AGAAGGGCAT TCTCTGTATG
CTCTTTGGGG GCAACCACGT ACGGGTCAAA CAACAACAGC AACAGACCAC GGAAGAACAA
AATGCCCCCG ACGCTTGGTT GGACGAGGAA GAGGAGGGTA CCGGCGCTAC CTCCAAACTC
AACAAACGTG GTGACGTGAA CATTCTGTTG TGCGGCGATC CCGGAACCTC CAAGTCCCAA
TTACTCTCCT ACGTACATAA ATTGACCACC CGGGGGGTCT ACACCTCTGG TAAGGGCTCT
TCCGCGGTTG GTTTGACGGC GTCGGTCGTG CGCGATCCGG AAACCCGGGA TTTGGTGCTC
GAAAGCGGAG CACTCGTGCT TAGCGATCAG GGTATTTGTT GCATTGACGA ATTTGACAAA
ATGACCGACA CGACACGCAG TGTACTACAC GAAGCCATGG AACAACAGAC GGTGTCGATC
GCCAAGGCGG GAATCCTGGC CACGTTACAC GCCCGGACCA GTGTACTGGC GTCCGCGAAT
CCCACGGAAT CGCGGTACAA TCCCAACCGC TCGGTGGTGG ACAACATTCA GTTACCACCC
ACGCTTTTGT CACGGTTTGA TTTGATCTAT TTGATTCTGG ATAGTCCAAA CATGGAACAG
GACCGTCGAC TGGCCCAGCA CTTGGTCGGA CTGTACTACG AGACTCCCAA CGTTGTCCAA
CCTCCGTTGG ACCAGGCTTT ATTACGTGAC TACATTGCGT ACGCTCGCGA CAACATCCAC
CCCGAAATTT CCGACGAGGC CGCCGACGAG TTGGTATCGA GTTATTTGAC CATGAGGAAT
CCGCCCGGTG GTGGGGCCGC GGCAGCAGGT ACCCGCACCA TTTCCGCCAC ACCCCGGCAA
CTCGAGTCGC TGATTCGGCT TTCCGAAGGC CTTGCCAAAA TGCGCTACAG TAGTATCGTG
TCCCGGGCCG ATACGTTGGA AGCGGTACGA CTCATGAAGG TCGCGACACA AGCGGCGGCC
ACGGATCCGC GAACGGGACG TATCGACATG GACATGATTA CGACGGGAAA GTCGTCGGCC
GATCGACAAC TCGAAGAACA AATTACCTTG ACGCTACAAG AACTGTTCGC GGAACGTCGG
GGGACCCGCA TGGCGGTTCG TGATGTCACC AAGCAGCTCG CGGAAATCAC CAACGCAACG
ATTCCGCATG ACGAAGTAGT CAAAGCCCTG CGTCAAATGG AAGCGGACGG CGTGATCCAG
TACCAAGAGA GGGCGCAAAC TATTTTTGTT CGTACCGGAA TTGTGCGATC GTG
 
Protein sequence
MPPQRDDPSV APLDDDGLLG QETAHIRGTD VHVPTAAATF TSFLRSFQSL QISQRVQERA 
ARVEADDDDS VGDEVDVDGD DPPPLYRTRL ESLLTRGVTQ GSTTTSLDID AMHLYYHNAA
CQRLYHQIVA YPMELVPLMD LCVQRELEHL ANTLPDVVDP DTLPRAQVRP FNLKLVSNLR
CLDPVAMDTL LSVKGMIVRS SPIIPDLKIA HFGCCVCGHV VQVAIDRGKI AEPTARCPQC
NTAASYQLVH NRCVFADKQL VRLQETPDEV PAGQTPASVT CFSFDDLVDA VQPGDKVEVT
GVLRAQPLRV HPKISKLKTV YKTYLDVIHF RTIAGIDNDQ TKHDLAATAG HQSNRSRWSD
DRVRQLRTLA RDPLIYEKLT ASLAPSIWEL DNVKKGILCM LFGGNHGTGA TSKLNKRGDV
NILLCGDPGT SKSQLLSYVH KLTTRGVYTS GKGSSAVGLT ASVVRDPETR DLVLESGALV
LSDQGICCID EFDKMTDTTR SVLHEAMEQQ TVSIAKAGIL ATLHARTSVL ASANPTESRY
NPNRSVVDNI QLPPTLLSRF DLIYLILDSP NMEQDRRLAQ HLVGLYYETP NVVQPPLDQA
LLRDYIAYAR DNIHPEISDE AADELVSSYL TMRNPPGGGA AAAGTRTISA TPRQLESLIR
LSEGLAKMRY SSIVSRADTL EAVRLMKVAT QAAATDPRTG RIDMDMITTG KSSADRQLEE
QITLTLQELF AERRGTRMAV RDVTKQLAEI TNATIPHDEV VKALRQMEAD GVIQYQERAQ
TIFVRTGIVR S
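
The sequences above are laid out in space-separated blocks of ten characters. The following is a minimal Python sketch of how such a listing could be joined into one string and compared with the length and GC content reported in the record. The helper names are hypothetical, and the sample input is only the first line of the gene sequence, not the full record.

```python
def join_blocks(listing: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence listing."""
    return "".join(listing.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence (0.0 if empty)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence listing only (six blocks of ten bases).
first_line = "ATGCCACCGC AACGGGACGA TCCTTCGGTG GCTCCACTGG ACGACGATGG TTTGTTGGGA"
seq = join_blocks(first_line)
print(len(seq))  # 60
print(round(gc_fraction(seq), 2))
```

Applied to the complete gene listing, the joined length would be expected to match the reported 2453 bp, and `join_blocks` on the protein listing would be expected to give 791 residues.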