Gene PHATRDRAFT_31147 details


Gene Information

Locus tag: PHATRDRAFT_31147
Symbol:
ID: 7199156
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011697
Strand:
Start bp: 203228
End bp: 205163
Gene Length: 1936 bp
Protein Length: 559 aa
Translation table:
GC content: 54%
IMG OID:
Product: predicted protein
Protein accession: XP_002185340
Protein GI: 219130370
COG category:
COG ID:
TIGRFAM ID:
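
The coordinates and the gene length above are consistent with 1-based, inclusive positions (205163 - 203228 + 1 = 1936). The following is a minimal Python sketch of that arithmetic and of a GC-content calculation over a sequence grouped in blocks of ten bases. The function names inclusive_length and gc_fraction are hypothetical helpers introduced for illustration, not part of any database tooling, and the exact method the record uses to derive its GC content figure is not stated here.

```python
def inclusive_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates.

    Assumption: the record's Start bp / End bp use this convention,
    which matches the reported Gene Length.
    """
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def gc_fraction(sequence: str) -> float:
    """Fraction of G and C bases in a sequence.

    Whitespace (spaces and newlines used for 10-base grouping) is ignored.
    """
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("sequence is empty")
    return sum(base in "GC" for base in bases) / len(bases)


# Coordinates taken from the record above.
print(inclusive_length(203228, 205163))  # 1936
```

Passing the full gene sequence from the Sequence section to gc_fraction and multiplying by 100 gives a percentage that can be compared with the GC content reported above.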


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0700452
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GACTGCAAGA GCGGCTACCA GAAAAATTTC CAGACGATTC TCCAGCCTCT CTCTGCTCTC 
TCAAGGAAGC ATATTCTTTC AGTGGTTCCA TCATGCTTTC GAACGGCTCG CACCAGAACG
TTTCCGCCTC TGGCGATACG TCAGTACCCG GCCAAGGTTG GTCCAACGGA GCCTTGGCAC
CGCAACTTTT TCCCGTCACG CAGCAAGGCC GACTGCAGTT GTACGGGTCA CAACGTCGCT
GGATTGGCCT TCAATCGGGC TGGACTCGGA TCGGGGCGGG ACAGACGGAA CTTGACGCTG
TTCACTCGCG TTCGCAAGGT ACGGGAATCC TGGATAGGAA CGACGCGGAA CAAGCCCAAG
CCGCACGTAG CTTGATTGCG CAGTTGTGTG AGACCTTCTA CAGACAGGGA TGGGCGACAG
GAACAGGTGG GGGTGTTTCT ATTCGAGTGG GAGGTCCATC GCAGAATCGC CCTTGGAGAG
TGTTTGTGGC CCCCTCGGGG ATTCAAAAGG AAGATATGAT TGGTGACGAC GTCTTTGAAC
TGGATATGGA TCGGAAAGTT ATCGTTCCCC CGAGGACGCC GAATCTAAGA CAGTCGGCCT
GCACCCCGCT CTGGTACGTG GTCTACAAGT ATAGACCAAC CGCAACTTGC GTCATTCACA
CTCATTCAAT GCACGCGCAA ATGGCTACCT TGTTGGATCC GACCGAAACC GCTCAAACTC
TTAACGTTAC CCACCTGGAA ATGCTCAAAG GCGTCGGCAA CCACGCCTAC GACGACGTTC
TCGAGATCCC CATCATCGAC AATCGTCCCT CGGAAGACCA ACTGGCCACG CAGCTGCAAG
CCGCCATTCA GGCGTACCCT AAGAGCAACG CGGTACTCGT GCGTCGTCAC GGTCTCTACG
TTTGGGGCGA TAGCTGGGAG CAGGCCAAAA CGCAATGCGA AAGTTTTGAT TATCTCTTCC
AATCGGCCGT GCAAATGAAA GCCATGGGCA TTGATTCCGG ACTTAAACCA TTGCAAGGGA
CGTATCGCGA AGGCGAGGAC AAGGAGGACC TCGTCGAAAA GACTGTCGAC GAGCCTCCCC
TCAAAAAACT CAAGACGACT GGGTTTCACG GGCTCAAGGC CGCCGACAAC CACCGCGACG
TCGTCGCCAA CGCGGTACCA ATTCTGCCCC GCGATGCGAA GATTTTACTA CTGGACATTG
AAGGATGTAC CACAAGTATT TCCTTTGTCA AGGACCGACT GTTTCCGTAC GTCCGGGAGC
GTTTGGACTC TTATCTGAAA GGGCACGTGG CCGCAAGCGA CAAATATCAG CAGTTGGCTA
AAGCGTTGGC CGGCGAAGCG GATGCCCACA GCGACTCGCC TGTTGCGGGT ACGATTCGAC
AAGACGTCGC TGGGATGGTA CGATACATGA TGGATCGAGA CTTCAAATCT GCTACACTCA
AAGCGCTTCA GGGGGACATT TGGAAGACTG GATACGCTCG CGGTGAGCTG AAGGGACACA
TATACAGCGA CTTTGTTCCT ACTTGTCAAT GGATGCAACG ACACGGCGTC CGTGTCTACA
TTTATTCTTC TGGGTCGGTG GCTGCTCAAA AGCTTTTGTT TGGCAACTCG ACCGAAGGCG
ACTTGTTGCC GTATTTGTCC GGGCACTTTG ACATTCCCAC AGCTGGTCCT AAAAAGGAAG
CAGGGTCGTA CACAGCCATT GCTCAAACGC TCCAAGTCGC ACCTTCCGCC ATTGTGTTTT
GCAGTGACGC AGAAGCCGAG CTCGTTGCCG CACGGGAAGC GGGCATTGGT TATCCTGTCA
TGAGTGTTCG GCCCGGCAAT GTTCCGCTAT CGGCCGAGGG ACGAGAGCTT CCAGCAATCT
ACTCGCTTCT GCAACTTTGT GGAGAGTGAA TATACAATGA TTCTGTCTAG CTATGTATCA
GACAATCACT TTTTTG
 
Protein sequence
MLSNGSHQNV SASGDTSVPG QGWSNGALAP QLFPVTQQGR LQLYGSQRRW IGLQSGWTRI 
GAGQTELDAV HSRSQGTGIL DRNDAEQAQA ARSLIAQLCE TFYRQGWATG TGGGVSIRVG
GPSQNRPWRV FVAPSGIQKE DMIGDDVFEL DMDRKVIVPP RTPNLRQSAC TPLWYVVYKY
RPTATCVIHT HSMHAQMATL LDPTETAQTL NVTHLEMLKG VGNHAYDDVL EIPIIDNRPS
EDQLATQLQA AIQAYPKSNA VLVRRHGLYV WGDSWEQAKT QCESFDYLFQ SAVQMKAMGI
DSGLKPLQGT YREGEDKEDL VEKTILLLDI EGCTTSISFV KDRLFPYVRE RLDSYLKGHV
AASDKYQQLA KALAGEADAH SDSPVAGTIR QDVAGMVRYM MDRDFKSATL KALQGDIWKT
GYARGELKGH IYSDFVPTCQ WMQRHGVRVY IYSSGSVAAQ KLLFGNSTEG DLLPYLSGHF
DIPTAGPKKE AGSYTAIAQT LQVAPSAIVF CSDAEAELVA AREAGIGYPV MSVRPGNVPL
SAEGRELPAI YSLLQLCGE