Gene PHATRDRAFT_40573 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_40573 
Symbol 
ID7198364 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011692 
Strand
Start bp307914 
End bp309941 
Gene Length2028 bp 
Protein Length675 aa 
Translation table 
GC content57% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002184599 
Protein GI219128814 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.338329 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCTTCAA CCCCGTCCAC ATCCACGACG ACGTCCACGC CATCATCGCG CTCCAAGGGT 
CTCCCCAAGT CACCCAAGCG GCGTCTACAA CAACCTCCAC TCATTACTAC ATCCCACTTT
CTTTTGGCTC ACGTCTTTTG CATCGCACTC TATCTTGTAC CCATTCTTAC TACCAGCAAT
GCCAATACTC ACACCAACAC CAAGTACGGT ATCAACCCAC CGCATACGCT GGCACCCGTT
CTCGATGAAA TACATATCGT CTCCCCGGAC AATGCCGACG TCAACGACCC CAACGCTACT
CTACGGAACA TTTTCACCAA CGATTATTGG GGTCGTCCCA TGCAAGCACC CAACAGTCAC
AAATCCTGGC GACCCTTGTC CATTCTCTCC TTCCGCTACC TCCAAGGTGG ACACGTCGAT
CAGTGCCAGT GGTGGTGGTG GTTACGGTAC TGCACCGGGT TCTCGTTGCC GCCTCTCCTG
GCGCATCGAC TCGTCAACGT TGTCACCCAC GCCTGTCTCG CCGAAATGGT CGGGATCCTC
GCGGCGCAAC TCGTACCCTC GCCGGACGCA CACTTTCGTC GTCTTTTGCG ACTCGTGGCC
AAAATCGCCT TTGGCCTACA CCCGACTCAC GTGGAAGTCA CGGCCAACGC TGCCAATCGA
CCCCACCTAC TCGCACTCCT CGCCAGTTTG GCCGCACTCG ACGCCGGTAG CAGTGTTGCC
TCTAGCACAA CTGTGTGGCC CTGGCTACGC ACTCTTCTTT TCCTCGTGGC CGGATTCTTG
TCCTGCGAAA CCTTTCTCTT CCAAACCGTA CCCATCGTCG TCACCTACAC CGTGCTGGTC
TACGTGCAGC TATACCACAA CGCTCCCACG TCCGGGTCGA GTCGTCGCGT CTACCGTCAA
CGCAACGGGT GGTGGTACCG GCAACTATGG AGTCTCGTAC CGATCGTGAG ACTCCGCGTC
GCTCTAGTTG TGGCCAGTGG AATCCTCTAC TACACGGCCC GATCCGCCCT CGACACCTTG
TCCATTCCCG ACGGACTCAT CCGACCGGCC GAAAATCCTT TTTTTGCCCT CCAAGGTTGG
CATCGCGTCC GCAACTACCT CTACATTGTT GCCGTACACG TCGCCAAGGC TTGGGATTTG
GACGTGCTCG GATTCTCACA CGAGTACGGG TACAATTGCG TGCCGGAAAT TAACGAATGG
ACCGATCGAC GCTTGCTGCT ACCACTCACA ATTGCCGTAC TCTACCTGGC CACGGCCGTC
TTCTTTCTCT TGCAACACGC CCGTCGTCGC CAAGTCTGGT CGATTCCCTT CCTCCTCTTT
GTCGTGCACG TTTCCTGGAT GGTCACGCTC TTTCCCGTCG CCGGGATTGT CAAAGTCGGC
ACCTTTGTGG CGGACCGCAT CGTGGTGGCG AGCTCCGTCT CGACCAGTAT CGTGCTCGCC
TACGTGGCGA CCCGCTGGAT GACGGCGCCC CGGTCCCGCA CGGCCGTCAC ACGCCGCGTG
ACCCTCCTCG CCCTCACGGT CGGTGTCTTT CAAACCCACC GTGTTTACGT CCGTACCACG
CAATGGATGG ATTCCTACCC TCTCCTCACG TCTAGTCTCG TTACCTGTCC ACGGTTCGCC
AAGGGACATT TGGAACTGTC CAAAATATAT TCCGGACTCT ATCCGGAACG CTTCAATTTA
ACCACGGCAC GGTGGCACTT GGCGCGGGTC GAAGATATTG ATCCCACTTT TTGTGACGTG
CACCAGCAAG TAGCCCACGT GGCCATTCAG GAACGGCGGT ACGAAGAATT CGAAGAGCGC
CTGGTCCAAG CCTTGCTCTG TCCGTTTACC CTGGGCGGTG CCACGGATCT GTGGCAACGA
TACTGGAAAA TTACGTTAAA TTCGCAACAG AATCCGTCCG ACGTTGTTGC CGCGGCCGAA
CAACGCTACC AGACCTACAT GAAACGCATC CAGGTGGCCA TTCAACAAGA ACAAGAAAAC
GAGCCGGTCC CCGTATCTAC CTCACCAATC GTGGGATGGC AAAAATGA
 
Protein sequence
MSSTPSTSTT TSTPSSRSKG LPKSPKRRLQ QPPLITTSHF LLAHVFCIAL YLVPILTTSN 
ANTHTNTKYG INPPHTLAPV LDEIHIVSPD NADVNDPNAT LRNIFTNDYW GRPMQAPNSH
KSWRPLSILS FRYLQGGHVD QCQWWWWLRY CTGFSLPPLL AHRLVNVVTH ACLAEMVGIL
AAQLVPSPDA HFRRLLRLVA KIAFGLHPTH VEVTANAANR PHLLALLASL AALDAGSSVA
SSTTVWPWLR TLLFLVAGFL SCETFLFQTV PIVVTYTVLV YVQLYHNAPT SGSSRRVYRQ
RNGWWYRQLW SLVPIVRLRV ALVVASGILY YTARSALDTL SIPDGLIRPA ENPFFALQGW
HRVRNYLYIV AVHVAKAWDL DVLGFSHEYG YNCVPEINEW TDRRLLLPLT IAVLYLATAV
FFLLQHARRR QVWSIPFLLF VVHVSWMVTL FPVAGIVKVG TFVADRIVVA SSVSTSIVLA
YVATRWMTAP RSRTAVTRRV TLLALTVGVF QTHRVYVRTT QWMDSYPLLT SSLVTCPRFA
KGHLELSKIY SGLYPERFNL TTARWHLARV EDIDPTFCDV HQQVAHVAIQ ERRYEEFEER
LVQALLCPFT LGGATDLWQR YWKITLNSQQ NPSDVVAAAE QRYQTYMKRI QVAIQQEQEN
EPVPVSTSPI VGWQK