Gene PHATRDRAFT_37580 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_37580 
Symbol 
ID7202428 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011681 
Strand
Start bp467472 
End bp470546 
Gene Length3075 bp 
Protein Length982 aa 
Translation table 
GC content62% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002181563 
Protein GI219122461 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.252616 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCCCCGT CTGCCGATTT CACCATCTCC GACTTTCCTC ACAAAGTCCT CAATCCAATC 
GCCACCAATA CCATCGCACC CTCCTATGCG TCGCTTCTCC TCGCCCAACG CCAGCTCAGC
GCCAATGCAT CCGCCATCCC CAGCCTCAAT GGCGGTGGCG CCCATGGCCA CATGGCCCTG
ACGCTCACCG CCGCCGCGTA CGCCGAACTG TCCGACGTCC CCTTCGTCAT TCCCGTTGCT
CCCCCGGCCG ACCCCGAACC GGGCACCACG CAACCTCAAA TCACGGAAAA TAATCGGCTC
CACAAACGCG CTGTGGCCAT CCACAGCCTC TACGTGGCAG TCAATAATGC CCTCCGTCGC
CAGCTCCTCG ACGCCGTTCC TCGCGTCTAC GTCCGGGACT TGGAGCACCC CCAGTTCGCG
TACAGCAAAG TCACCTGCCT TGACCTGCTG GACCATCTCT GGCGCAACTT CGGCACCATC
TCCGCTTCTG ATTTAAAAAA CAACATCCAG TCGATGTACA CCCCCTGGAA CCCAGCCGAC
CCGATTGAAA CCATCTTTCA CCGCCTGACC GACGCCATTG CGTATTCGAC GGCAGGCCGC
GACCCCATCT CCGAAGCCGC TGCCGTTCGC GCCGGCTACG ACGTTCTCGA GCATTCCGGC
TTGTTCCCTC GTGCCTGCGA AACTTGGCGC ACTGCCTCGC CCGACACGCA TACGCTTGCC
AACCTCCGTA CTCTCTTTAA GGTTGCCGAC ACGGACCGCA AGCGCACCGT CACCACGGGC
GCCCTTGGCT ACGCCAATGC CCTTTGTGCC CCTTCCTCTG CCCCCCCTTC GATTGTGTCC
GACACCCTCA GTCTTCCCTT TTCTGCTCTC TCTGTGTCGC ATTCCTCTGC CGCCACCACG
GAGAAAACAT ATTGCTGGAC CCATGGCTCC AGCAATAACC GTCGGCACAC AAGCGCCACC
TGCAAGAACA AGGCTCCTGG GCACCGCGAC GACGCCACTG CTGCCAATCC ACTTGGTGGG
TCAACCAAAA TTTGGACTGC CCCCAAACCC CCTGAATAGG TCAGAGGGAC GGCTACACCG
ACACTTAACA CTTGTAATAA CGATCTAATC AATCATATTA CTAGTCTTAA TTTGTCTGTA
GTCCCCTCCC CGCCTAGTAT TACAACCTCG GCCATTGCCG ACACCGGGTG CACAGGACAC
TACATTACCG TGTCCTGCCC CCACTTCAAC CAACAGCCAG CCTCCTCTCC ACTCTCTGTC
CCCGTTCCCA ACGGCGCTAC CCTCCGTTCC AGCCACACGG CCACTCTCGA CCTCCCTGGT
TTTTCCCCTG CCGCTTGCCA AGCTCACATC TTTCCCGGCC TTGCTTCACA CCCCCTCATT
TCCATCGGCC AGCTCAGCGA CGACGGCTGC ACCGCCACCT TCTCCGCCAC CCGACTCGAC
ATCCACCGGG ACACCACCCT GCTTCTTACA GGCGCTCGAG CCCCCACCAC CGGCCTCTGG
CACCTCGACC TGACCCCAGC CAAGACCGCC AATGCCCTCC TTCCCGACAC CTCCCTGGCC
GACCGCATCG CTTTTGTCCA TGCGTCCCTT TTCTCCCCTT CTCTCTCCAC TTGGTGCACC
GCTCTCGATG CCGGGCGCCT CCCAACCTTC CCCGACATCA CGTCCAAACA AGTGCGCAAG
TATCCTCCCC GCTCGATGGC AACCATCAAA GGCCACTTAG ACCAACAACG CGCAAATCTT
CGCTCCACTA AGCCCTCCCC CGTTCCGCTG GTGGCCTCAC CCAACCCTCT CCACGAATCC
CCGCTTGACT TCTGCCCGGC TCCGACCACT CCTCCCGCTG GCCGCACTCA CCATGTCTTC
GCCGCACACC AACGAGTCAC CGGCCAAATC TACACAGATC AACCGGGCCG TTTTCTCACT
CCCTCGAGTG CAGGCCACAC GGATATGCTC GTGTTGTACG ACTACGACAG TAATGCAATT
CACGTTGAAC TAATGAAGAG CAAGTCCGGT GCCGAGATCC TGGCCGCCTA CCAGCGCGCC
CACTCCCTCT TCACCCACCA TGGCCTCCAG CCGCAGCTCC AGCGTCTGGA CAACGAGGCA
TCCACCGCTC TCCAGTCATT CATGACCGCC CACCAAGTTG ACTTCCAATT GGCGCCACCC
CATTTACACC GTCGCAACGC CGCCGAACGC GCCATACGTA CCTTCAAGAA CCACTTCATA
GCCGGTCTCT GCAGCACGAA CCCGGATTTT CCGCTTCATC TTTGGGATCG CCTCATTCCC
CACGCTCTGC TTAGTCTCAA TCTCCTCCGC GGCTCCCGCA TCAACCCCAC CCTCTCAGCC
CACGCCCAAC TCCACGGCGC GTTCGATTAC AACCGCACCC CGCTTGCCCC CCCCGGTACT
CGCGTCCTCG TGCACGAAAA ACCCGCCGTC CGAGAAACTT GGGCGCCCCA TGCTGTTGAA
GGCTGGTATC TTGGCCCGGC CATGAACCAT TACCGCTGCC ATCGCGTTTG GATCACCGAG
ACACGTGCTG AACGCGTTGC TGACACGCTG GCATGGTTCC CCAGCAAGAT TCCCATGCCC
ACCGCCTCTT CCACGGACCG CGCCCTGGCC GCCGCCCGTG ACTTAGTGTG TGCCCTCCGG
AATCCCGCTC CTGCTTCACC GTTTACGCCC CTCGACGCCA ACCAGCACCA GGCCCTCACC
CAACTCGCAG AACTCTTTGA GTCCGTTGCT GTCCCGGCCT CTCCCATCGC CGCACCCGCT
CGAGCGCCCC CGGTCCCCGC CCCTGTCCCA GCACTTACCC CAGCACAGGT CCGCTTTGCC
GTTCCCATCG TCACAGCCGA GCACGCCCCC GCACTTCCGA GGGTGCCCAC CCTTGCGCCG
CCACCTCCGA GGGTGCCCCC CACGGCCACC TATCACTCTC GAACCCGAAA TCCCGGCCGC
CGCCGTCGCA AAGCACGCAA GCCCCCGCCA ACCCCAACCC TAGTTCCGGC TCATCCACAC
AACACCCGCA CCCGACCCTT TCTTGCCCCG CCTCCGCCAA CGCAGTCGTC GACCCCGCAA
CCGGCGCCTC TTTAG
 
Protein sequence
MSPSADFTIS DFPHKVLNPI ATNTIAPSYA SLLLAQRQLS ANASAIPSLN GGGAHGHMAL 
TLTAAAYAEL SDVPFVIPVA PPADPEPGTT QPQITENNRL HKRAVAIHSL YVAVNNALRR
QLLDAVPRVY VRDLEHPQFA YSKVTCLDLL DHLWRNFGTI SASDLKNNIQ SMYTPWNPAD
PIETIFHRLT DAIAYSTAGR DPISEAAAVR AGYDVLEHSG LFPRACETWR TASPDTHTLA
NLRTLFKVAD TDRKRTVTTG ALGYANALCA PSSAPPSIVS DTLSLPFSAL SVSHSSAATT
EKTYCWTHGS SNNRRHTSAT CKNKAPGHRD DATAANPLVP SPPSITTSAI ADTGCTGHYI
TVSCPHFNQQ PASSPLSVPV PNGATLRSSH TATLDLPGFS PAACQAHIFP GLASHPLISI
GQLSDDGCTA TFSATRLDIH RDTTLLLTGA RAPTTGLWHL DLTPAKTANA LLPDTSLADR
IAFVHASLFS PSLSTWCTAL DAGRLPTFPD ITSKQVRKYP PRSMATIKGH LDQQRANLRS
TKPSPVPLVA SPNPLHESPL DFCPAPTTPP AGRTHHVFAA HQRVTGQIYT DQPGRFLTPS
SAGHTDMLVL YDYDSNAIHV ELMKSKSGAE ILAAYQRAHS LFTHHGLQPQ LQRLDNEAST
ALQSFMTAHQ VDFQLAPPHL HRRNAAERAI RTFKNHFIAG LCSTNPDFPL HLWDRLIPHA
LLSLNLLRGS RINPTLSAHA QLHGAFDYNR TPLAPPGTRV LVHEKPAVRE TWAPHAVEGW
YLGPAMNHYR CHRVWITETR AERVADTLAW FPSKIPMPTA SSTDRALAAA RDLVCALRNP
APASPFTPLD ANQHQALTQL AELFESVAVP ASPIAAPARA PPVPAPVPAL TPAQVRFAVP
IVTAEHAPAL PRVPTLAPPP PRVPPTATYH SRTRNPGRRR RKARKPPPTP TLVPAHPHNT
RTRPFLAPPP PTQSSTPQPA PL