Gene PHATRDRAFT_50616 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_50616 
Symbol 
ID7199449 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011700 
Strand
Start bp98575 
End bp101751 
Gene Length3177 bp 
Protein Length1015 aa 
Translation table 
GC content47% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002185584 
Protein GI219130885 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00719688 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCGATTC CAACGATTCT CACCGTGGAA GAACCAGAAC GGGATCTGTA CGGCCGCGTG 
AAGGAAAAAG CCGCAATCTC CGTCGCTAAC GATCAGTGCA TGGATTTGGC CACAGCAATG
CGAGGAAGCG AGAAGTCTCA CGTAATTCTC GTACATGGTC CATCAGGTTT GGGCAAAACT
GCTTTGTTAC AGCACGTCTT CGAGGAACGG ATAAGGCAAG AGGGCGGCTT CTTCATATAC
GGAAAGTTCG ACTCAGTCCC GTCGGCGAGC CCTGGCCAGC CATTGTGGAC GCTTTCAGCG
ATTTGTGTGT ACAGTGTCTT GCTTCCGAAA GACGATCGAA AATACGGAAG GCGCTTTGCG
AGGAACTAGA ATCAGACGTC AATCAACTTT CACATTTAAT TCCCAACTTC AAGGCGCTCG
TTGACGGCTT TCTTGAATCG GATGACGACA CTGAGGGAAA TGGAGATTGG AATGACATCC
GAGCATCATT CAGGCGAAAC GAATTTGGGT TTACCCGCAT AAAACTCCTC TTTCAACGAT
TCCTCAGAGC AATTTGTCAC CCCATCTCCT CCCGAATAGT TTTGTTGCTT GACGACGTGC
AATGGGCAGA CCAATTGAGT TTGCTTCTGA TAGAGGCGCT CTTATCCGAA GGCGAGCCTG
TAAGCGGGTT TCTGTTAGCT ATCACATCTG ATGAACCAGG ACTTTTTCGA CGAGCTCAGG
TCAATAGAGA TCTCGTAAAT GCTACCAAAA TTCGTATTCG CAACCTTTCC AAGAATGATT
GGATCGAGAT GACAAAGCAA ACCTTGAAGA AAGCGAAAAG CTACATTGCA GCCGAAGAAT
TGGATATTCT CTTCAAAAAG ACTATGGGCA ACCCTTTCTT GACCGTTCTT TGTGTAAAGG
CAATAGACGA GGGAGGCTTG CTTAAGGAAT TAGAAATTGA AGCTGACAAC GGTGTAAAAG
CTGGAATTTT GTCAGTGATT AAGGGCCGCC TGAGCCGATT GGCAAAGCCA GCTCAAGATG
CTTTGTTTTT GGGAGCCTGC TTTGGACTGA GGTTTTGCAT GGATTGGGTG GCTCCTCTTG
TCCCGACATA TGGATCAACA CGAAACGCTT CTCTGCAACC GGACGTCATG CCTTTAAATT
GGGGCTCGGG ACGTTTGGAC GACTCAGAGC ATTTGTCTGA AAGTGAACTC AAGGAGACAT
TGAACGAAGC TATCGTTAAC GGCCTGGTCA CTAAACGATG CGGTATCCCT TGGTACGAAT
TCACTCATCA TTCTATCCGC GATGCGGCCT ATGACCTTTT CAGGAATAGT TCGTGTAAGA
GAGAACGAAT ACATTTGCTA ATAGGCGAAC ACACGCTTCG AAAGGTGAGG TTTTCCTCTA
GCTCCGAGGA CGATACTATT CTATGGACAG CAGTGGATCA TCTGAATCTA GGTTCTTCTC
AGCTGCATGA GGAGCGCAAG TTGATTGAAC TTGCTCGTCT CAATGGAAGG GCAGCAGAAA
AATCAATGCT AAAGTTAGCG TTCTTCTCGG CTGCCCAGTA TGCGTCAGCT GGTTTGGAAA
AGATAGGACG AGTTGGAGGA TGGCAAACAG ACTTTGCTGT GACCTGGGAG TTGAGCACGC
ATTTATGTCG AATGTACAGT TGCCTCGGTG AGCATGAAGC TTGTAAGAGA GTCGCCAACG
ATGTGGTGTC CCGCAGTTCG TCCATTTTTG AAAAGCTTGG TGCTTTCGAA GCTGTCATGG
AGTCTTCTCG AGTTGAAGGG AATGTAGAGG AAGCATTCAA CATAGGGTTT GACGTCCTCA
GAAGCTTGAA TGAACCTTTT CCCAACAGAG TTAGCAAAGC GCTTTTAATT TGGGAAATTG
TCAAGACAAA ACGTATCTTG ACCCGGAAAA CATTAAAGGA TTTGTCAGGC CTACCAAAAA
TGGCTGACGA AAGAGCATGT GCTACGGTTC GGTTTCTAAA ATTACTATCC CTCACCTTCT
TTGCCATGGG AAACTATTTC AGCTACTTTG TGGCATCGTT GAGAATCGTC AGACTGAGCA
CAAAGTACGG TGTTGCTCGG GAATCTCCAA AAGCATTTGT GGTCTTTGGG AATATTCTAT
CCCAAACAAG TCGACACTTC AACGAAGCTA GCCGATACAT GTCCGTTGCG CTGTCTTTAG
GTGAAAAGGC GGGCAAATCT GGAAGAGCGC AATCTCTTGC CGTTGGAAGT TGGATCTTGA
CGCCTTTGCA AGGCACCGTC TCGGAAGCAG TTGGTCAAGC TCTGTATGGA TATCGGCTCG
CAATGGAGTG TGGCGAGGTT GTGTGCGCGT GTACCTCAGT TCTTTCTTAT TGTGGCCTCT
ATTTTTGGAG TGGACTTCCT ATACCACCCC TAATGAAAGA TCTTCCAACT TTCCTGACCA
TGCTAAGCGA ATATAAGCAG GTACGGCATT TCAAATTCTA GCTCTGCATG GGCATCCGAC
TGTTTTCTCA CCCTTTTCTT TCAGACCGTG CACGAAGTAG GACTCTCCTC ACTGATGTAT
TTCATTCAGA CATCAACAGC GGAGACCAAT CCGACTGATT TATATGACGA TACCGTATGG
ATGGTCAGAT GTCAGAATGC AGGAGCCGCT ATTCAAGTCA ATACCATTTG TCTATACCGC
ATAATATATG CTTATTACAT GCAAGATCTC AGTTCAATAC GAGCTTCTCA ACTAGATGCG
CACCGAGCTG TAAAGGTACA ATTGTCAAGA ACTTTGCAAG TTGTCGTGTG CTGGCTTTTT
GTGGGGCTCT CCGATTTCTT CTTAGCACAG TCTGGGTGTG GCATTGAGTT TCAACGAAGT
GGCCAAAAGA TTTTACGCAT GATGCGTCGG TTGGTCGTCA AAGGGGACAG CAAATGCGAG
CATATGTTCA TGTTCCTCAG GGCAGAAAAG TACAAACTTG TATCAAAACA GAGCAATGAA
GTTCTAAAAT CCTACGACGA GGCCATTTCT GGAGCCGGAC AGGCTGGATT TTTCAACCAT
GCGGCCTTGG CGAACGAGCG CGCCGCACTA TACTGTTTAG CGCGTGGAAA GGAAAAGAAA
GCGGCTCAAT ATTTCCAAGA AGCCTGGCAA GGGTACCTGA ACTGGGGAGC CCATTCTAAA
GTTGACCAGC TTGGGGGGTG GTACTCAGCA TACATACAAC AAAGTTCTAG AAAATAG
 
Protein sequence
MAIPTILTVE EPERDLYGRV KEKAAISVAN DQCMDLATAM RGSEKSHVIL VHARLRGTDK 
ARGRLLHIRK VRLSPVGEPW PAIVDAFSDL CVQCLASERR SKIRKALCEE LESDVNQLSH
LIPNFKALVD GFLESDDDTE GNGDWNDIRA SFRRNEFGFT RIKLLFQRFL RAICHPISSR
IVLLLDDVQW ADQLSLLLIE ALLSEGEPVS GFLLAITSDE PGLFRRAQVN RDLVNATKIR
IRNLSKNDWI EMTKQTLKKA KSYIAAEELD ILFKKTMGNP FLTVLCVKAI DEGGLLKELE
IEADNGVKAG ILSVIKGRLS RLAKPAQDAL FLGACFGLRF CMDWVAPLVP TYGSTRNASL
QPDVMPLNWG SGRLDDSEHL SESELKETLN EAIVNGLVTK RCGIPWYEFT HHSIRDAAYD
LFRNSSCEHT LRKVRFSSSS EDDTILWTAV DHLNLGSSQL HEERKLIELA RLNGRAAEKS
MLKLAFFSAA QYASAGLEKI GRVGGWQTDF AVTWELSTHL CRMYSCLGEH EACKRVANDV
VSRSSSIFEK LGAFEAVMES SRVEGNVEEA FNIGFDVLRS LNEPFPNRVS KALLIWEIVK
TKRILTRKTL KDLSGLPKMA DERACATVRF LKLLSLTFFA MGNYFSYFVA SLRIVRLSTK
YGVARESPKA FVVFGNILSQ TSRHFNEASR YMSVALSLGE KAGKSGRAQS LAVGSWILTP
LQGTVSEAVG QALYGYRLAM ECGEVVCACT SVLSYCGLYF WSGLPIPPLM KDLPTFLTML
SEYKQTVHEV GLSSLMYFIQ TSTAETNPTD LYDDTVWMVR CQNAGAAIQV NTICLYRIIY
AYYMQDLSSI RASQLDAHRA VKVQLSRTLQ VVVCWLFVGL SDFFLAQSGC GIEFQRSGQK
ILRMMRRLVV KGDSKCEHMF MFLRAEKYKL VSKQSNEVLK SYDEAISGAG QAGFFNHAAL
ANERAALYCL ARGKEKKAAQ YFQEAWQGYL NWGAHSKVDQ LGGWYSAYIQ QSSRK