Gene PHATRDRAFT_47112 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_47112 
Symbol 
ID7202026 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011680 
Strand
Start bp478623 
End bp480595 
Gene Length1973 bp 
Protein Length540 aa 
Translation table 
GC content46% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002181386 
Protein GI219122089 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.0949637 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTTTGCTTCG ATGTGATTAG TGGAAGATAG CAAGATGGGC CATGCTGAGC CATTCTAGCT 
CCTTTAGCAG TAGGCGACTT GCTTCTCAAA TCGAAAAGCG ACCCATGATG GAATCGGTGA
GGAGAAGAAG GTCGTCGCAG CTCAGTATGG AACGAAAACG CGACAAATTG GAAGATCCCA
ACAAGAAACA AACGTTTACA AAATATCGAT GGTCAAGATG TTTCTGGTCC AAAATCATTC
ATACTCTCCC AAGGAGGAAT TGCGCACGCA AGGGTGGCAC CATCATGCTG TTGCTGATTC
TGCTGCATGT GTTATCGCGG GATTATCTTT GGGGCGGCCC ATTTGACATC AGAGGCGGAC
TCTTCGGTGC TTGGGCTGAC GCTGGCTTGC GGCTCTTATT CTCATCGAAA GCAGCTACGC
CGATTGCAAA AGTAGACTAT CCACATGTTG TATTGCGATC CAGTCGCAAA GAGATCCCCG
GTTACAGTCT TTTTTTAGAA GAGCTGCTGT CGAGAGCAGC AGGAGAGAAT ATCGATTGGG
ACGCTTCAGA AGTACGCAAC CACGACAATC CACGCCATGC GCATTCGGTG GTGAACTTTT
TTGAAGGTGG AGATTACGAA TCATCTTCTA ATGCTTCCTC GGATTCTGTT ATTGACGTTT
TGGCGAAACA GAACTGTCAA CCACGTTGGC TTTGTCAGAG ATGCTTAAAC GCAGCCCAAT
ATGGGTCGCT GACTCAATGT CGTCAATTGT GCCCAGAGTG CCTTGAAGAT ACACTCTGCC
AGCCTTCTTT GGTTCGCAAT CCACCCTTTT CTATCCTCAT GCAACGTCCA GTGACATCAA
CGATTCCACG TATAGTTCAT ATGATGTGGC ACGAGCCACT GGACAGTTTG AAGTATCCGG
AGCTCGTCCG GATTCAAAAC GGATGGCGGA GTACCGACTT TTCCTTTCGA TTTTACACGC
CAGATACCGC ACGTCGGTAT ATTCAAAAGA GCTACCCGTT GCGTATCATA GAGGTGTATG
ATTCTATCCA GTCGTTGTCA ATGCAGATTA ATTGCGCGCG AGTTCTGATT CTTTTGAGGG
AAGGGGGCGT GTTTGCAAAT GGTAAGCCTT GGTGGTGGCT ACTTTTGAAT GCCCCTGTGC
TTGTTCTCAA AAATCGAAAT TAATTTTTCC AAGTGGATTT ACTCTTGGAA GTCAATCTAG
AGGTTCTCTT GGTTTCAGGC GTGTCCTTTT TTGCTGCGCG AGAAGACGAT ATGGAACACT
GCCTTTGGAC TGGATTGGTT GGAGCAAGTC CTGGTCATGT GATTTTGGTT AAGGAAGCGG
AAGAATTTTT AACACGCCTG TCTACAAAAG GAAGCTATTT TGACATCGAC CGTAGCTTAT
GTACGGCGCT GGGACCGAGT GCCGAGCTGT GGAAAGCACG TGTGTATGTC GATGAAACAA
CTATCGATTC CTGCGCGCTT GGCCGAGCCG TTCACACTGC TCTTGGAGAG CGGAATTCAG
TGATGCATTT TTCATTGGGA AAACTTCAGA TTCCTTTAAG CAATAACAGA CTCTATGAAG
GAGATGCCCT TATACTTCTC GTAAGTCTAG ACAAGATCAT CGGCAAACAC AGATACTTTG
ATTTTTTTAA CGGATCTTGT CCCTCTTTTC TTACAGATGA GTAAATCTGA CACTGGTGCC
ACTCGAATTT CTGACATCGA GCGAAATATA TTGATTGCCT CCACATCTAT GGTGGGACTC
TCGAAAGAGA GCCTGTACGA GCGTATACAT CACGCCACAT TGCGGAACTC GAGAGCGGAA
ACCAGCACTG TGAACCTAGG TATGGGAGTA ACCAAACAAA TTAGATTTAT TGATAAACGT
TAACTTTCCG GATGCAAAGA GTGATCCTGA ACTGATTTGT ACACGTGTTG TCTATTTATC
CGCGTGCCCT TTCTACTTTG TAGCCCATGA AGGGGACTGT CCTGGAACCT TAG
 
Protein sequence
MLSHSSSFSS RRLASQIEKR PMMESVRRRR SSQLSMERKR DKLEDPNKKQ TFTKYRWSRC 
FWSKIIHTLP RRNCARKGGT IMLLLILLHV LSRDYLWGGP FDIRGGLFGA WADAGLRLLF
SSKAATPIAK VDYPHVVLRS SRKEIPGYSL FLEELLSRAA GENIDWDASE VRNHDNPRHA
HSVVNFFEGG DYESSSNASS DSVIDVLAKQ NCQPRWLCQR CLNAAQYGSL TQCRQLCPEC
LEDTLCQPSL VRNPPFSILM QRPVTSTIPR IVHMMWHEPL DSLKYPELVR IQNGWRSTDF
SFRFYTPDTA RRYIQKSYPL RIIEVYDSIQ SLSMQINCAR VLILLREGGV FANEVLLVSG
VSFFAAREDD MEHCLWTGLV GASPGHVILV KEAEEFLTRL STKGSYFDID RSLCTALGPS
AELWKARVYV DETTIDSCAL GRAVHTALGE RNSVMHFSLG KLQIPLSNNR LYEGDALILL
MSKSDTGATR ISDIERNILI ASTSMVGLSK ESLYERIHHA TLRNSRAETS TPMKGTVLEP