Gene PHATRDRAFT_11490 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_11490 
SymbolMCM5 
ID7199856 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011673 
Strand
Start bp40227 
End bp42558 
Gene Length2332 bp 
Protein Length667 aa 
Translation table 
GC content53% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002178693 
Protein GI219115796 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CTGCAGGTAG ATCTGGCGCA CATTGGAGAG TACGATGCTT CCATGTTGGG ATACCTCCTT 
AATCAGCCCG CCCACATTCT CCCGGCATTG GAAAACGCCG CGTCGGATGC ACTCAAGTCG
CTTCTCTACG AGCGCAAATA TCCTGCCAGT GCCGACGATC TCGACAAGCT CGACGAAGAC
ATGCTCGACG CGACCCAGGA AACGCAAGAG TTGGACCCCA CGGAGAATTC CGTCAGTGAA
GCTCTGCTCG CTGGTGTGCG TATACAGATT CTGCTCCAGG GAAGCTTGCA ACCGACACCC
TTGCGATCCA TTCAATCCCA GCACATGAAT CGCTTACTCA AGTGTCCCGG AATTGTCATA
TCAACGTCTC CCGTCAAGAA CCGTGCCACG ACGTTGAAAG TGCGCTGTTC CCGTTGTCTC
GATTCCCAAA CGGTGTACGC TACCGAAGGG CCCTTCGGAT CGCTGACTTT GCCCACCACG
TGTCGAGGGC CCAGTCCGCA CGAATGCGGT CGCTTTCCCT ACTCGGTCGT TCCGGACGAA
TCCATCTTTG TCGATCAGCA AACACTCAAA CTACAAGAAG CCCCGGAACG GGTGCCGACT
GGGGAAATGC CGCGCTCGGT ATTGCTGGCC GTCGAACGCT CCAACGTCGA TCAGGCAGCA
CCCGGAACAA GAGTATCGGT GCTTTGCGTA CCGACTCTAT TCACCTCGGG GAAAGACGGA
ACCCACAAGT CCGTCTACTT GCGTGTCCTA GGGATGACCA AGGATAACGA TGCGCACGGA
GAAGCCGTCA CCTACACACC AGCCGAAGAA GAGGCATTTC GGACCTTGTC GCGTCGGCCC
GACGTATACG AAATTCTGCA GCGTTCCATT GCACCAAACA TTTCTGGTTC TTACACGGTC
GATATCAAGA AAGCGCTCTG CTGTCAGCTT CTGGGTGGAT CGCGAAAGAA ATTGCCCGAC
GGGGTGCGCC TGCGCGGTGA CATTAACGTC CTCCTCTTGG GCGATCCCTC GATGGCGAAA
TCGCAGTTTC TCAAATTTAT TTCCAAGGTA GCACCGGTGG GTATCTATAC CTCTGGAAAA
GGCTCTTCGG CCGCCGGGTT GACTGCTTCC GTGGTACGAG ACGCCAAGGG GGAGTTTTAT
CTCGAAGGTG GTGCCATGGT CCTGGCCGAC GGTGGTATCG TTTGCATTGA CGAATTCGAC
AAGATGCGTC CAGCAGATCG TGTTGCTATC CACGAAGCCA TGGAACAACA AACAATTTCC
GTAGCCAAGG CAGGTATTAC GACGGTTTTA AATTCACGCT CCTCCGTACT AGCTGCGGCC
AATCCTGTCT TTGGTCGGTA CGATGATTTC AAGTCAGCCT CCGAAAACAT AGACCTCATG
ACAACCATTT TGAGTCGTTT TGATGTCATA TTTCTGGTAC GAGATATTCG TGAAGAGGAA
CGTGATCGAC TGATTTGCCA GCACGTAATG GGGATCCATA TAGGCGCATC CAATCGGTCG
GACGGTGGTT TGGGGCATGT CAGGGGCGGG GGTGCCGACG ACGGTGGTGC TTTGTCCTAC
ATGGCTGGTT CCAGCGGAAT ATCGGATCCT AACGAAGAGT CGGTCAATAG GTACGTTGAA
ACAAAACTCT CATCCAGTGC CCTTTGCACC TCCACATCTC ACAAAAATTT TCCTTGGTCA
AAGTTCCTCA AACGCTACCC CCGAAGCTAT CGCCGAAAAT GTTCTGCGCG TGGCTACCAC
CGGCGAAGGA GAGCTAGATG TACCAGCAAT GAAGAAATAC ATTCAATACT GCAAGGCTCG
TTGCTCGCCG CGTCTGTCGG AAGAAGCGGG TGAAGTCTTG ACGAGTTCCT ACGTAAAGAT
TAGGGATGAT GTGCGTCGCC GGGCGATTGC CTCGTCCGGA AGATCGGATG GAAGAGACGG
GGATACGCAA TCGGCCATTC CAATCACTGT ACGGCAGCTG GAAGCTTTAG TGCGTTTGTC
TGAAAGCTTG GCGAAGATGC GACTGGACCC GCAAGTGCGG TCTGAAGACG TAACGGAGGC
GTTGCGTCTA TTTAAAGTAA GCACGATGGC AGCAAATGCG GTCGACCAAA ATTTAGGGGA
AACATCATAT GCGTCGGTAT CTGCCCCCAA CCGTGAAGAA ATGGAACGGA CAGAGGCCTT
TTTGCGTAGC CGTTTGAACG TTGGGAGCAT GGTCAACAAG CAAAGGCTGG TGGAAGAAGG
ATCTGGTCAA GGATTCAATG CAATCTTGAT CGCACGGGCA TTATCCATTA TGGCTAGCCG
TGGTGAAGTA TTGGAAAGAA ATCAAGGTCG TTTGCTGAAG CGAGTCAAAT AA
 
Protein sequence
LQVDLAHIGE YDASMLGYLL NQPAHILPAL ENAASDALKS LLYERKYPAT LLAGVRIQIL 
LQGSLQPTPL RSIQSQHMNR LLKCPGIVIS TSPVKNRATT LKVRCSRCLD SQTVYATEGP
FGSLTLPTTC RGPSPHECGR FPYSVVPDES IFVDQQTLKL QEAPERVPTG EMPRSVLLAV
ERSNVDQAAP GTRVSVLCVP TLFTSGKDGT HKSVYLRVLG MTKDNDAHGE AVTYTPAEEE
AFRTLSRRPD VYEILQRSIA PNISGSYTVD IKKALCCQLL GGSRKKLPDG VRLRGDINVL
LLGDPSMAKS QFLKFISKVA PVGIYTSGKG SSAAGLTASV VRDAKGEFYL EGGAMVLADG
GIVCIDEFDK MRPADRVAIH EAMEQQTISV AKAGITTVLN SRSSVLAAAN PVFGRYDDFK
SASENIDLMT TILSRFDVIF LVRDIREEER DRLICQHVMG IHIGASNRSD GGLGHLDVPA
MKKYIQYCKA RCSPRLSEEA GEVLTSSYVK IRDDVRRRAI ASSGRSDGRD GDTQSAIPIT
VRQLEALVRL SESLAKMRLD PQVRSEDVTE ALRLFKVSTM AANAVDQNLG ETSYASVSAP
NREEMERTEA FLRSRLNVGS MVNKQRLVEE GSGQGFNAIL IARALSIMAS RGEVLERNQG
RLLKRVK