Gene PHATR_18622 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATR_18622 
SymbolMCM2 
ID7203986 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011671 
Strand
Start bp708929 
End bp711850 
Gene Length2922 bp 
Protein Length808 aa 
Translation table 
GC content56% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002186401 
Protein GI219113635 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTCGACGACC CTGACGAGCG TAGTGTGGAC GACGTCGTCG CCGAGAATGA CGACGACGAC 
GTAGACTACG TGACGGGACA AGCCGCCGCC GAAGAAGAGG AAGAAGACGG GGAAGACTTG
TTGGAGCACG CCGAACGTGA TTACCAAGCT ATTCCAGCTT TGGATACGTA CGGAGCCGAA
GGTTTGGACG ATCGCGATTA CGAAAATCTT ACTGTCGATG CCCGTCGAGC GGTGGAACAA
CAGTTGGCCC GACGGGATCG GGACAAGGCC GGAAGGAACG ATGGCTTCTA CACTCTACTC
GACGATATGG CGGACGAAGA CGAAGAAGCG CGACAAGCCC GTCGCGGAGC CTTTGATCGT
CGTCAATTGC ACAAGGACGG ACGGGGCGAC GAAGACGACG ACGACGCCGC GGATACTCGG
GGACCCGCGG CGGACGACGC CGACGATATT ACGGAAGGTG ATTTGGACCG TGACGATCAA
GTCAATCTAG AAGCCTTTGA CGTGCCGCTG CGGGAGTGGA TTGCGCAAGA ACAGACGCGC
CGCGAAATAC AACGCAAGTT TCGGGTCTTT CTGCGACACT ACACGGGACC CTCGGCCGTT
CCGGAATCCC GGCGCCGCCG CGGGAACGGA CTCTACGAAC AAAAGATCCG TACCATGTGC
GCCTCCAACA AATCCACATT ACAGGTCTCC TACATTCATC TCATGGACGC CGAACCCATT
CTGGCGTACT GGTTGGCGGA TGCTCCCAAG GACATGCTCC TCGTACTCAA CGAAGCTGCC
ACCCGCCACA CTCTCATGCT CTTCCCCTCC TACAACGCCA TCAAGTCCGA AATTCACGTG
CGTATTTCGG AGGTGCCCAT TCTGGATAGT CTCCGGGATT TGCGGCGGTC GCATCTAGAT
TGTCTCGTCA AGGTGCACGG TGTCGTTACC CGGCGCTCCT CCGTCTATCC CCAACTGCAA
ATGGCCTACT ACACCTGCCT TTCCTGCAAG GCCATTCAGG GGCCCTTCCG TACTGAAGGC
GTCGGAGCCA ACTTGGCCAA CGTCCATACC CCTAGCGAAT GCGTGCAGTG CGAAGTTTCC
GCCTTTCGTC TGCACCCCAC CATGTCCTCC TACCGCAACA TCCAACGTGT CAATCTACAA
GAGACACCCG GATCGGTTCC ACCCGGCCGC GTCCCCCGCA CCAAAGAAGT CCTCGTCGCC
GATGACCTTA TTGACGTCGC TCGACCCGGG GAAGAAATCG AAGTCACCGG TGTGTACGAA
CACACCTTTG ACTCCTCACT GACGCTCAAA TCCGGTTTTC CCGTCTTTTC AACTTTTCTG
CACGCCAATC ACGTTCTCAA ACGCGAAGAC GCCTCCAGCG CCTCCAATTT GAGTGAACAA
GACATTCGCG ATATTCTGCA GCTCGCCCGG GATCCCAACA TTGGGGCCCG CATCGTTCAG
TCCATCGCCC CGAGTATCTA CGGCCACGAC AATTGCAAAA TGGCCCTCGC CATGAGTTTG
TTCGGTGGCG TCGCCAAGAA CATCAACGAC AAACATCGTA TTCGTGGCGA CGTGAACGTG
CTCTTGTTGG GCGACCCCGG GACGGCCAAG TCGCAGCTCC TCAAGTACGC CGAACAGACC
GCACCCCGGG CCGTTTACTC TACCGGGAAG GGTGCGTCGG CCGTGGGATT GACCGCTAGT
GTGCATAAGG ATCCGATTAC GAGGGAATGG ACGCTCGAGG GTGGGGCATT GGTGCTCGCC
GACAAGGGCG TCTGCCTCAT TGACGAATTC GACAAAATGA ACGAACAGGA TCGCACGTCA
ATCCATGAAG CCATGGAACA ACAGAGTATC TCCATTTCTA AAGCCGGCAT CGTCACCAGT
TTGCAGGCGC GGTGTTCCGT CATTGCGGCG GCCAACCCGA TCGGTGGTCG TTACGACAGT
AGCAATACTT TAGCGGATAA CGTGGAGTTG ACGGACCCGA TTCTGCAGCG ATTCGACTGC
CTTTGTGTAT TGCAGGATGT GGTGGATCCG GTCGCCGATG AACGGCTCGC TCAGTTCGTC
ACTAGTAGTC ACATGCGGTC CGTACCCACG CGGGAATACG TGCCGAACGA AAGCGACCTA
GCCGACAACA ACGCGGAACG CCCCGGTCTC ATTCGGCAAG ATCTGTTGCG CAAGTATATT
CAGTACGCCC GCTTCAACGT ACGGCCCATT CTGCGTGGCA ACGCGCTGGA CCAGGAAAAA
GTGTCGTCGC TGTACGTGGC GCTGCGTCGA GAGTCCGCCG CATCGGGTGG CGTGCCCATT
GCGGTGCGCC ACGTGGAATC CATTATGCGC ATGTCAGAAG CTCACGCCAA AATGCACCTG
CGTGACTACG TTCGGGACGA TGATATGGAC GCCAGTATCC GCATGATGCT GGAGAGCTTT
ATCATGGCGC AAAAGTTTAG CGTCCAACGT GCGCTCCGAC GGTCGTTCGC CAAGTTTATT
ACGTCCGGAG AAGACCGGGC TTACCTGCTC CTGCACATTT TGCAGGACAT GTTCCGCAAG
GAACAAATGT ACCAGGTCAT CCGTTTGCGA CAACGCAATC AGACCGAGGA CGATCTTGAA
ACGCTAGACG TGCCGCTGGA CGAGCTGGAA GCCAGGGCGC GGGAGCGACG GATCTACGAC
GTTTCCGAGT TCTGCCGAAG CGAAGCCTTT ACCGAAGCGG GCTACGTCTT GGACGAACGT
CGTCGGGTTG TTTCCCGTAA TTTGGTTGTA TGATTGGAAA GCTACGAGTA GAATGATTAT
GTGTGCTCAC GAGCTGATTT AGTGTTCAAC AATGAAAAAA GTAAATCATC GTCCCTTGAA
TGCTTTTGCA GCGATCGGGA CGAATGTCAC ACATCAGTCA ATCCGAAAGG GAACATTGCG
TTCTATAGGC TGCACCATAT AGTCTGAAAT ATTACCAAGC GC
 
Protein sequence
MADEDEEARQ ARRGAFDRRQ LHKDGRGDED DDDAADTRGP AADDADDITE GDLDRDDQVN 
LEAFDVPLRE WIAQEQTRRE IQRKFRVFLR HYTGPSAVPE SRRRRGNGLY EQKIRTMCAS
NKSTLQVSYI HLMDAEPILA YWLADAPKDM LLVLNEAATR HTLMLFPSYN AIKSEIHVRI
SEVPILDSLR DLRRSHLDCL VKVHGVVTRR SSVYPQLQMA YYTCLSCKAI QGPFRTEGVG
ANLANVHTPS ECVQCEVSAF RLHPTMSSYR NIQRVNLQET PGSVPPGRVP RTKEVLVADD
LIDVARPGEE IEVTGVYEHT FDSSLTLKSG FPVFSTFLHA NHVLKREDAS SASNLSEQDI
RDILQLARDP NIGARIVQSI APSIYGHDNC KMALAMSLFG GVAKNINDKH RIRGDVNVLL
LGDPGTAKSQ LLKYAEQTAP RAVYSTGKGA SAVGLTASVH KDPITREWTL EGGALVLADK
GVCLIDEFDK MNEQDRTSIH EAMEQQSISI SKAGIVTSLQ ARCSVIAAAN PIGGRYDSSN
TLADNVELTD PILQRFDCLC VLQDVVDPVA DERLAQFVTS SHMRSVPTRE YVPNESDLAD
NNAERPGLIR QDLLRKYIQY ARFNVRPILR GNALDQEKVS SLYVALRRES AASGGVPIAV
RHVESIMRMS EAHAKMHLRD YVRDDDMDAS IRMMLESFIM AQKFSVQRAL RRSFAKFITS
GEDRAYLLLH ILQDMFRKEQ MYQVIRLRQR NQTEDDLETL DVPLDELEAR ARERRIYDVS
EFCRSEAFTE AGYVLDERRR VVSRNLVV