Gene PHATRDRAFT_37731 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_37731 
Symbol 
ID7202607 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011681 
Strand
Start bp821451 
End bp825974 
Gene Length4524 bp 
Protein Length1373 aa 
Translation table 
GC content58% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002181632 
Protein GI219122605 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGAACG ACCGGATTCG GTCCGTTGAG ACACGCTCGC CACAGTGGGC CCTTACGGTA 
AGTGGCGAGC GACGAGCCGG GAGTGGCGAG TACCGAGTGG CTACCGGTCC CGCAGGAAAA
TGTCCACGTA CAATGCGGGG ACGGGGAGTT TCTACGAACC ACTACCAAAA CTACTGCAAC
ACTCTCTACT ACTACTAGCT ACTACTGCTA CTACTAGAGT AGTACTACGA GTATACAATT
CGACCGCATT CGAGGTCAGT CGGTCTTGTC TTTTATTCGT CACAGCGATC GAATCGCATC
CACGGTCCCT TTGGCGTCTG TGTTACATTT ACTCCCTCTT CCGGTAGTGG ACACACAGAA
AGACAATTCA ATCGCTCACG AAGGGAATTC CTTTCTCCAC ACGCAACACA CATACACAAC
ACACAGCACA CACTGCGCCA ATGCGGCGAC GTTTGCAAGT CGACGTGGCG GTGGGGTCGA
CCGATCCTTC CGTGCCGTCG TCACCTACCG CACCGTCCCC CACGACGACT CCTCGTCGTC
GCTCTCCGTA TCAAAGTTTT CCTCTCCCCG CGTATTTACC CTTGTGTCCC GGTGGTGTCG
CCTATCCTGC CGTGGGGACC GGCACAGACC CTCGCGACTG GCACGTTCTC GGGTCCCCCG
TCGGTACCAG CAGTAGCACC AGTACTAGTA GTACTAGTCC CAGTGGCAAC CAAAGTCCGT
CTCCTGCCGT TCCCATCAGT ACTACCAATA CCAGCGCCAG TACTAGCACG ACGAGTCCGA
ATGCTGCGAC GGCGATCCTC CCCCGGGCTA CACTACAACA ACTCGCCGCA CGTGCCAAGA
CCAACGACGA TCAACTCCGA CAGTACGCCT TGTTGTACGC GTCGCACGAG GCCAACACGG
CAAGCTCCCT GTCCAACACT GCAACAACAA CAACGACACA CGTCTTGCCC AGCCTCGCGA
TACTCACCGA AGCCGTCCTG GGGCGCCTGA CGGGGGTCGC ACCCTCTCAC GTCAAAACGG
CGGTAGATAT ACTCCTCGGC CCCTTTTTTC AAGATCCCAA CGATCCCCCG TGCCCGCAAT
CCGCCGACGA CGTTGGGAAC TGCAAACCTT CGGCCGTTGC TACCGACAGC CGTATCGGTA
CCGGTATTGG TACCAGTGAC GGCAGCAGCC GTACCCGTAC AACCAGCTCT AGTACTACCC
ACCACCACCA CAGCAACAAT CACCACCACA ACCACAGTAC CATCACCACG GCGCAACGAT
TGCAAACGCT CTTGGACCAG GCCGAACGCG AGCACCAACA ACAGCTCCAC CAGGACAAAG
AACCCTCACC CACCACTACT CGGCGAACCC GGCCCTGTGG ATACGTCTTT CAGCGAGGAG
ATATTGCCTG GAACTGTAGA ACCTGCCAGA CCGACCCCAC CTGCGTCATC TGTGACGCGT
GCTTCCGGGA CAGCAACCAC GAAGGTCACG AGGTGTACTT CCACCGTACC ACACCGGGAG
GGTGTTGCGA TTGTGGGGAC ACCGAAGCCT GGAACATTGC CGGTTGTTGC GAACGACACC
GTCCTCCACC AGCGCTACTT TCCGACGACG CCCCGCATCA CCCGGACGAT CCTTTTGAAG
CCGTCCGCGC CAGCCGCCGC GGATACACAC TGGCCCACGA AACCCTCACC CTCGAACCCA
CCGCTTTGCC GCCGCGTCTT ACTGCCGCGT TGGCCGTCGT GGTCGGAGCC GCCGTACACG
CCCTACTCGA TGCCGTCAAC GGCGCCGGGA TTGGCGCCGA TCCCGTACAG TGGAAACGAC
AATGGGCCGA TGAGGCGGCC AAAATTGCCA ACGGTGTCGT GCACCCCGAA GATTACGCCC
TGGTCCAACC CAAAGACCGT CAGGGCAATG ACGACGACGC GGCGGCGGTC CCCAAAACAT
CGACCCACCC GTTTCTCTGC AAGCTTTTTG GGAGGAGGCC CCTGCCACCA CTACTGCATT
TCCCCACGGC TTTCATTTGC AACTACGACT CCACAATGAC GACGTGCACA CGTTCGATGA
AGTCATTGAG GCTCTACACA AACCCCGGTC ACTGCGACGC CACTCCGAAG ACCGACAATC
TCTCGTGGCC CTCCGTGATC ACGCTACGGA AATGACGCAC CACGTCGACG CTGACGGACA
AGTCACGGTC AAAACATTTA CTTCCTTTCC GGCCGCCCTC CAGGGATATC AAAGTCTCAA
ACGACGGGGC CTGCACTGCG CCGTGGTTAG TTCCGTGCAA GTCCAAGCCG AACACCGTGC
CCGTGCCTTG GCGTCGTGGC TGTCCGAAAT TTCTGCCGCC CATCCCGCCG CGGCCGCCCT
GGTCGTCCAC GCCTTGGTAC AAGTAGGCGA AGGCGATGAA ACCCTGGCGG ACTTTAGTGT
GTGGCCCCGA GCGCGTTCGA TTCCACCCTG GGCCGCCACG GACGCGTCCT CGGAGGAACA
GGCCTGTCTG CGGCGCTTCG CCGCCTTTCC ACCCCATTTA CCTTCGAGTT ACGTCACCCG
CGAACAAGCC GAACTCCTCC ACGGCATTGC CCTGACCGTG CAAGTGGCGG ATTTTGTTCA
CGTCACGGGC GCAGATCCGC ATTTTTACGG ACGCGTGCCT TACCGACTGT GTGCCGATCG
GTACAAGAAA TCTCCCCACG CCCTGTGGGG GACCCTCCCG CAGTGCTACG TCGACCCTAC
ACCACCACAC GCCAAACACC CCCTCCTGCA GCGATTGGCG ACCGTGACTA CTGACGGTAC
GGAGAATACA ACGAATCGCG CGGACTGCTG GAAGGATGTG CCGAATGCGT TGACGGAAAC
GGTGTACGTG GTCGATACCG ATTTGCGGAA ACAGCAAGAG GCGGACCGGA TCACGTCGAC
GGTCTTTCCA CATCGCTTGC CTGGCTTGCA ACTGGTCAGT GGTGTTGGCA CGATTCGTCT
GGATCATCTC GACGCACGTC GACCCCCGTT GCCCAGTCCA ATGGATTGGC GGCATCTTTT
GGCTACGTCA TCTTTTCGGG CCCCCGCGTC GACAATTCTC TGGTTGCTCT TGCTCGATCC
CTATCCGACG AAACAAGTTC GCGGTGCCAT TCACGCGCTT ATACTGTCGC TGCTCACGGA
CGCCCGCTTC AAATCACGGG TTGCTGGCGC TCTTGGTGTC GCGTACCGAC CGTTGAGTAC
TCTGTTTTGC GCCGGCGTCG GGACGGAGGC GGACTCGCCT CTGCATTTTA CGGTGCAAAT
TTTTACGGCA GGCAGTTTGG TGCGGGCTCT AGGGAGCGGA CCCGCGACTG AGGCCTTGCT
GATTTCCGAC GACCCGAATC GGGCAGGGCA CAGCGAGGCA TCGATTGGTG TTTTCACATC
ACCCATCGCG CACACGATCG TCCGATGCAT CCACACGAAC TTGCTGGGTG CCACGAAAGA
GGTCAATATG ATTTTGAACC ACACGACTTC CGGGACGGAT GATGCGGAAG AAGATCCCGT
CTTCCAGCCG TCGAACGATA GTTTGTTGCC GGCCTTGACG TACGTGGCGG GAGAACATCC
GCTCATGACA CCCTTACCAG CGGCCCCGGA CGACGGCTTT TTGGATTCCC GGTCGACCCG
TCACAAGCGG CTGCCGCATA TGCTGCGCGA TTTAGAATAC GTCATTGAGA CACCTGGTAC
CGCAATCCGA CTTCTCCTAC CACGGCGTTT CCCGGTATAC CAAGGGCCAC CTTTGTCCAT
GCGAGGAGAA GACGTCTTGG CATTTCCCGT CGTCTTTTCC CGCATGCTGC GGCTGGCTCA
AGGAATGGAT CCACAAAAGC GTAAGATTTC GGGAGGGCAT GTCGAGTACG AACACATACG
ATGGCTGGAA GCGTTTGGGT TGAGTCTTAA CTTTGCCGGC GCGCGCGACG CCTTGTCGGA
AAGCCCGCCG CGAAGCAGCA GCGTCGCACT TGGTGCAGAT TATATGGAAA ATGTTATGGG
CGTGCGTGAA GCTCTCGGGA ACATTGGTGC ATCCTTGCTA CGTGAAATAA AGCTTTGGCT
GTATCGCGAA GGTATGCTCG AGACAGGTCT TCCGCTACCA CCCGGGGGCG CCCACGGAGC
GACGGATATG GCCCAGGTGG AGTCACTACA GCGGAGCACT CTACACGTAT CGGGATCGCA
GAGCACGTCC AATGATGCTT TATCCAATAG CAATGCCGGT GCTGTTGCTC TGGCTTGTGC
CACGGGCGTG AAAATGACAG AGGCCCAATT AAGTCTCATT GAGAACGTGT TGAAGGCCGA
AGCGGTGGAG CGCTTCCACT CCAAACAGGG ACAAGTGTTG ACGCCCAAGT CTATGGGTCC
AGTGATGGGC GACTGGCTGC GCGTTCCGCA TTCACCACTC GCGGGAGATT CTCTTTCGTT
TCACATTCCC TTGCATCGAG CTTTGGCGGA AAGCATTCGA TGCGTGTGCG CCCTTTCTGT
TTCCGAGGCA TCGAGAAAAT CGGAACCATC CGGATGGTGG AAGCTTCCAG TCCTTGACGG
AATTCCTACT AGTGTGTCCT CTGA
 
Protein sequence
MKNDRIRSVE TRSPQWALTS STTSIQFDRI RAHTAPMRRR LQVDVAVGST DPSVPSSPTA 
PSPTTTPRRR SPYQSFPLPA YLPLCPGGVA YPAVGTGTDP RDWHVLGSPV GTSSSTSTSS
TSPSGNQSPS PAVPISTTNT SASTSTTSPN AATAILPRAT LQQLAARAKT NDDQLRQYAL
LYASHEANTA SSLSNTATTT TTHVLPSLAI LTEAVLGRLT GVAPSHVKTA VDILLGPFFQ
DPNDPPCPQS ADDVGNCKPS AVATDSRIGT GIGTSDGSSR TRTTSSSTTH HHHSNNHHHN
HSTITTAQRL QTLLDQAERE HQQQLHQDKE PSPTTTRRTR PCGYVFQRGD IAWNCRTCQT
DPTCVICDAC FRDSNHEGHE VYFHRTTPGG CCDCGDTEAW NIAGCCERHR PPPALLSDDA
PHHPDDPFEA VRASRRGYTL AHETLTLEPT ALPPRLTAAL AVVVGAAVHA LLDAVNGAGI
GADPVQWKRQ WADEAAKIAN GVVHPEDYAL VQPKDRQGND DDAAAEAPAT TTAFPHGFHL
QLRLHNDDVH TFDEVIEALH KPRSLRRHSE DRQSLVALRD HATEMTHHVD ADGQVTVKTF
TSFPAALQGY QSLKRRGLHC AVVSSVQVQA EHRARALASW LSEISAAHPA AAALVVHALV
QVGEGDETLA DFSVWPRARS IPPWAATDAS SEEQACLRRF AAFPPHLPSS YVTREQAELL
HGIALTVQVA DFVHVTGADP HFYGRVPYRL CADRYKKSPH ALWGTLPQCY VDPTPPHAKH
PLLQRLATVT TDGTENTTNR ADCWKDVPNA LTETVYVVDT DLRKQQEADR ITSTVFPHRL
PGLQLVSGVG TIRLDHLDAR RPPLPSPMDW RHLLATSSFR APASTILWLL LLDPYPTKQV
RGAIHALILS LLTDARFKSR VAGALGVAYR PLSTLFCAGV GTEADSPLHF TVQIFTAGSL
VRALGSGPAT EALLISDDPN RAGHSEASIG VFTSPIAHTI VRCIHTNLLG ATKEVNMILN
HTTSGTDDAE EDPVFQPSND SLLPALTYVA GEHPLMTPLP AAPDDGFLDS RSTRHKRLPH
MLRDLEYVIE TPGTAIRLLL PRRFPVYQGP PLSMRGEDVL AFPVVFSRML RLAQGMDPQK
RKISGGHVEY EHIRWLEAFG LSLNFAGARD ALSESPPRSS SVALGADYME NVMGVREALG
NIGASLLREI KLWLYREGML ETGLPLPPGG AHGATDMAQV ESLQRSTLHV SGSQSTSNDA
LSNSNAGAVA LACATGVKMT EAQLSLIENV LKAEAVERFH SKQGQVLTPK SMGPVMGDWL
RVPHSPLAGD SLSFHIPLHR ALAESIRCVC ALSVSEASRK SEPSGWWKLP CVL