Gene PHATRDRAFT_47561 details

Gene Information

Locus tag: PHATRDRAFT_47561
Symbol:
ID: 7202627
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011682
Strand:
Start bp: 116465
End bp: 118179
Gene Length: 1715 bp
Protein Length: 521 aa
Translation table:
GC content: 47%
IMG OID:
Product: predicted protein
Protein accession: XP_002181848
Protein GI: 219123056
COG category:
COG ID:
TIGRFAM ID:
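The gene length listed above is consistent with the start and end coordinates if these are read as 1-based and inclusive. That convention is an assumption here, not something the record states. A minimal sketch of the arithmetic follows; the function name `span_length` is hypothetical and used only for illustration.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based, inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    # Both endpoints are counted, hence the +1.
    return end_bp - start_bp + 1


# Start bp and End bp as listed in the record.
gene_length = span_length(116465, 118179)
print(gene_length)  # 1715
```

The result matches the listed gene length of 1715 bp.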


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ACGCTCATAC ACAAGGCTTC CGGTTATACA TAAGGCTATA TCGTACCAAG ACAGCAGAAC 
CAATGAAATT CTCTATTTCT TCTCTCGTTG TCACAATTGG TTCCTTAGGG AACCTGCCTC
TTGCCATTGC ACAAGGCGTC AGGTATGTCG AATAGGAAAG GTGCCATCGT CACCGTCACC
AAAGAGGACG CAGAAACCTT ACAGATTCAT AATTGCCTTC ACCATTCAGT GCAACTACTC
CTGTGGTCCG ATATGTGGAT CCAATGAGCG ACATGTCTTT ACTCCTTCCG TCGCTAAGCG
CGCGACACGG TGATGAGCCC AGCCTCAACA TGGATTTTTG GCTTCCAGCT TCCCGGGCTC
TGTCTTCCTC TCTCGCCACC GATATTCCCG ACCAAACATA TGCCACCCTT CCGTCGAAAG
CTTCTAGCAC TGCGCAAGTT GCTGTATCAA CGATGCTCCA AAGCAATGTA CCAAGCGACA
TCCCCAGTAG CCAGCCTTCC TATATACCAA GCTCTTTGGA AGATGGCCCG CCTAGTGATT
CCCCAAGTCT CACGCCATCT CTACAATTGG CTTCATCATT TTCGGACGTA CCGAGCAATG
TTCCTAGTAA CCAGCCGTCA GTCACCTCGA GTTTGCCAGG GGCCAGCGAG GTGTCAGTGA
CTCAATCTGT GACGCTTGCA TTGGGGTCCA ATACAATTTT GGATGACGCA TCCATTGACA
TTTTCGAAAG AGTATGTGCT TTTTCGTTTC TACCCATGTA TCTTTCCACA ATCTACGAAG
CTGAGTATAA AAGCATTCGC TGTAGTGTAT TGGATCAAAA CTTGGTAGAT GAGTCTTCTA
AACGACGTTT GCTAGACGAA GATTATACGT TGGGAGAAAA ACATTCAACC TTATCCTTGC
TCCTTCGCGT CTCGAGTTTG GTATATCTAC GTTCCGGCGT CGAATTCGGA GATATAGTGC
AGCAAACCTT CACTACCCAC GTGGACACCT TTCTGAGTCT CTTGTTTGAT ACTTTACCTT
TTTTTGCACC GAAATCCAGC TCTGGTAGTG GTAACTCACA GGCAATCACC GGAGGACAAA
CAGAGGTCCA AAACCAGGAA GCTAACCCAT CCCCAATCAT CATATCCGTC GCTGCAGTTA
TGGGAGGTGC GATTCTAGCA GCTATTGCCG CCTTCTTTGT GCTAAATAGT CGCAGAAACG
CAATTTTGAA TAGAGAAATG CCAGACGGAA CTGATGTATC CATTCCCATC GACTATTTTG
AATCGAGTGA CGAAGATCTG GAAAGTGCCC CATATGATGT CACAGACATC TCATATTCAA
CAATGGGAAT GAATATGATG CCTCCATCCC CTCTTGGAAT AGATTCGATC CCACGAGCCC
TGAACTCTGT CTCGATGATA CATTTTTCCG AGGATCTCTC GACCACCTCA ATCGATCCTG
AAAGTGGAAT CACACCTTCT TCGACGACAT TAAGCCCAGG TCCCTTGATT CCAGCCTATT
GGGAAAGCTA CGAATCCAAA ATGATATGGA AAATTCGAAA TTCTTCATCC CACAGTCTTT
CCAGTCAAAT GTCGACAGAT ATACAGTCCG TAAACGGCGC TTTCGAGAGT TCTCCTAGCC
TTCTCACTGT CAATTCACGC CAAGAAGATG TAGGAACTAC GTACAGCGAC GGAGTCAAAG
ATCAATCGTC GACGACAGAC ACTTACGAAA AGTGA
 
Protein sequence
MKFSISSLVV TIGSLGNLPL AIAQGVSATT PVVRYVDPMS DMSLLLPSLS ARHGDEPSLN 
MDFWLPASRA LSSSLATDIP DQTYATLPSK ASSTAQVAVS TMLQSNVPSD IPSSQPSYIP
SSLEDGPPSD SPSLTPSLQL ASSFSDVPSN VPSNQPSVTS SLPGASEVSV TQSVTLALGS
NTILDDASID IFERVCAFSF LPMYLSTIYE AEYKSIRCSV LDQNLVDESS KRRLLDEDYT
LGEKHSTLSL LLRVSSLVYL RSGVEFGDIV QQTFTTHVDT FLSLLFDTLP FFAPKSSSGS
GNSQAITGGQ TEVQNQEANP SPIIISVAAV MGGAILAAIA AFFVLNSRRN AILNREMPDG
TDVSIPIDYF ESSDEDLESA PYDVTDISYS TMGMNMMPPS PLGIDSIPRA LNSVSMIHFS
EDLSTTSIDP ESGITPSSTT LSPGPLIPAY WESYESKMIW KIRNSSSHSL SSQMSTDIQS
VNGAFESSPS LLTVNSRQED VGTTYSDGVK DQSSTTDTYE K
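The sequences above are printed in space-separated blocks of ten characters. To check them against the listed lengths and GC content, the blocks first have to be joined. The sketch below shows one way to do that; the helper names `clean_sequence` and `gc_percent` are hypothetical, and the sample is only the first line of the gene sequence, not the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Join a sequence printed in whitespace-separated blocks into one string."""
    return "".join(text.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string (0.0 for empty input)."""
    if not dna:
        return 0.0
    gc = sum(1 for base in dna if base in "GC")
    return 100.0 * gc / len(dna)


# First line of the gene sequence, used only as a small sample.
sample = "ACGCTCATAC ACAAGGCTTC CGGTTATACA TAAGGCTATA TCGTACCAAG ACAGCAGAAC"
dna = clean_sequence(sample)
print(len(dna))  # 60
print(round(gc_percent(dna), 1))
```

If the record is internally consistent, the full gene sequence should give a length of 1715 and a GC percentage near the listed 47%, and the protein sequence should give a length of 521.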