Gene PHATRDRAFT_47999 details

Gene Information

Locus tag: PHATRDRAFT_47999
Symbol:
ID: 7202999
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011683
Strand:
Start bp: 658524
End bp: 661904
Gene Length: 3381 bp
Protein Length: 955 aa
Translation table:
GC content: 51%
IMG OID:
Product: predicted protein
Protein accession: XP_002182439
Protein GI: 219124287
COG category:
COG ID:
TIGRFAM ID:
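
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (the reported gene length is consistent with that) and that the coding sequence ends in one stop codon. The function and constant names are illustrative and not part of the record. Under these assumptions, the gap between the gene span and the coding length would be non-coding sequence (introns and any untranslated flanking bases), which fits a gene marked as spliced.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


# Values copied from the Gene Information fields above.
START_BP = 658524
END_BP = 661904
PROTEIN_AA = 955

gene_len = span_length(START_BP, END_BP)
# One codon per residue plus one stop codon (assumption: stop codon present).
coding_nt = PROTEIN_AA * 3 + 3
# Bases in the gene span not accounted for by coding codons.
noncoding_nt = gene_len - coding_nt

print(gene_len)      # 3381
print(coding_nt)     # 2868
print(noncoding_nt)  # 513
```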


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
CTCGAGGACT CTGGGAAACG GTTGTCACAC ACCTTCTCTC TCTCTCTCTT GCAATACTTG 
TTTTAGCTCT TTACCCGTTG TGATCACAAA TTAAAACCCG TTCGTCATGT CTACTACGAA
TGCGGAGGAC GATACACCCG TCTTGGAATG GCCGATGGAA AAGGTTCGCG ATACGTTCAA
GAACTATTTC GTGGAACAGC ACGGACACGT CTTTTGGCCT TCGTCGCCGT GCGTCCCCGT
CGATGATCCC ACTCTACTCT TTACCAACGC CGGAATGAAT CAGTACAAGC CTCTATTCTT
AGGTACGTAC GTTCGCACGA TACGATACGA GATGATGCGA TCCCTACAAA ACATGGTTCC
CGATGGAGCA GGCGTCGATG GAACCAAAAG CCTTTCCAGA GAACGTTCGA ATCGTCGTGT
CTTTCTCCCG TGAGGCCAAT GCGACCTTTT GTATCCAATA TCTGCTAACG ACGTTCTCGT
ATTTACACAG GAACCTGTGA TCCCAATCTG GAAATGTCCA AACTGACCCG CGCCGTCAAT
TCGCAAAAAT GCATTCGTGC TGGAGGCAAA CACAACGATC TCGACGACGT CGGGAAGGAC
GTCTACCATC ACACCTTTTT CGAAATGCTC GGCAACTGGT CCTTTGGTGA TTACTTCAAA
GCTGGTGCCA TTGATATGGC CTGGCAGTGT TTGACCGTCA CCTTTGGACT GGACCCGGAA
CGACTATACG CCACTTACTT TGCCGGAGAC GAACTCACCC CGGTCGACGA AGAAGCACGT
CAACTGTGGT TGCGATACCT CCCCGACGAT CGCGTACTCC CCTTTGACGC CAGGGACAAT
TTCTGGGAAA TGGGTGCCAC CGGACCCTGC GGACCTTGTA CGGTACGTGG CTAGCGGTGT
GCACGGTATA CCGACGGACC TTTGTTTTTG TCGATACATT GGCACTCACG TGTCTTTTTT
TGTTCCGTCG TTGGTTGTCT AGGAAATTCA TTACGATCGA ATCGGTGGAC GCGACGCCTC
CAAACTCGTC AACGCCGATT TGCCGGATGT GATGGAAATA TGGAATGTTG TCTTTATTCA
ATACAATCGT GAAGCCGATG GTTCCCTCCG TCCACTCCCG GCACAGCACG TGGACACCGG
GATGGGATTC GAACGCCTGA CGTCAATTCT CCAGAATGTC GATTCCAATT ATGATACCGA
TATCTTCATT CCACTCTTTA CCGCAATTCA GAACATTACG GGCGCCCGTC CGTACGCCAA
GAAGGTCGGC AAGGAGGATC CGGAATACAT TGACATGGCC TACCGTGTGG TGGCCGATCA
CATTCGAACC CTATGTTTCG CCATTACCGA CGGTGCCGTT CCCAGCAATG ATGGTCGTGG
TTACGTGCTC CGACGCGTCT TGCGTCGTGC GGTGCGGTAC GGCCGTCAAA ATCTCGGAGC
CGAACTCGGG TTTTTTGCCA AACTCGTTCC AGCCTTTGTT GACGTTATGG GATCGGCGTT
CCCGGAAGTG GTGGAAAAGC AAGAATACGT CACGGGAATC ATCCAGGAGG AAGAAGAATC
GTTTTCGCGA ACGCTCGATA AGGGCTTGCA AAAGTTTAAC GAATTGGCCG AAAAGGTGGG
AGCGGACCAG ATCTTTTCCG GTGCCGACGC GCATTTCTTG TACACTTCCA TGGGATTTCC
CGTCGACTTG ACTGAACTCA TGGCGGAAGA GAAGGGCATG ACACTTAATA AGGAAGAGTT
CGAAGCCAAA ATGCAGGAAG AGCACGATAT TTCGCAAGCG GCACATTTGG CAAAAATGGC
GGGTGGCTCC GGAAAGGATA TGCGTCTGGT CGCCGAGCAG ACATCTTATC TCGTCGGCCA
GAATATTAGT GCCACGGACG ACGCGGCAAA GTATGTGTGG CACGAGGAGC TGGCTGACTG
TGTGGTCAAG GCATGCTTTA TTGGTCGCAA CGAGACGGAA GACATGATTG GATTCGTCGG
TAGTATTTCT CCAGAAAGCA GTGCCGTTGG TATCGTCCTG GACAAGTCGA GCTTCTACGC
CGAAGCGGGT GGACAGGTTT ACGATGTTGG TACCTTGACT TCATCGACTG GTGCCGTTGT
CAAAATTACG AACGTGCAGG CGTATGGACA GTTCGTCCTA CATCTTGGTG AGGTCGCATC
CGGAACGCTG TCGGTTGGCG ATACTGTCAA ATGCAGCGTT GACTACGTCC GTCGTGCCCC
AATCGCTTCC AATCATACAA TGACGCATGT CCTTAATCAC GCGTTGCGAG AAGTCCTTAT
CAAACGACCA GAAAAGGAAT CGGGCAAGAC ATCCACTTTG ACCGTCGACC AAAAGGGATC
GCTGGTGGAC GAAACAAAAC TGCGTTTTGA TTTCTCGTGG AGCGGCCAGT TGACGCCGGA
ACAGTTGGCG GAAGTCGAAA AACTTTGCAT GGATCGCATC GTAAATGCAG TTCCCGTGGA
TGCGTACGTT GCACCGTTGG GCGATGCGCA GCAGATTAGT TCGCTGCGTG CTGTTTTTGG
TGAAAAATAC CCCGATCCCG TACGAGTGGT TGCCGTTTCG GATCATGCCG TTCCAGAGAT
GCTTGCCAAC CCACAAGACA GCCAGTGGAA TGAATATAGT GTCGAATTTT GCGGAGGAAC
GCACTTGACG AACACCAAAG AAGCGGAAGC CTTTGTTTTA TTGGCCGAAG AAGGAATTGC
TAAGGGAATT CGGCGCATTA CGGCCATAAC TACCGGTGAG GCTAAGAAAG CAATCGCTCT
TGCCAACGAA TTTGAGAGCA AACTAACGGC TGCAGAAGTA GTTCAAGGAG ACGATTTGGA
AAGCACTGTA AAACAACTTT CTGCCGAATT GGATGGATTA GATATTGCCG CCGTGAAGAA
GATGCAGTTC CGTGAACAAC TCGCCACCAT GACAAAACAG GTCTTGGCGT ACAAAAAGCA
GAAGCTTGCG GGTATGGCGG ACGAGATTGT CGACAAGGCT GTGTCTGTCG CTGCCGAAAC
AGGCGGCAGT AAAGTCGTCA TGCGTTTCGA CTTTGGTGTG GAGGGCAAGG TTGCCAAGTC
GGTTATGACC GCCTTCGGCA AACAAGTCAA GGATAAGGCT TTGCTGTTGG TTACGGCAGA
CCCCGAAGCT GATCGCTTCA TGGTTATTGC GGGCGCACCT AAGGCAATGA AAGACTTGAA
TTGTAAAGCA TGGATTGAGG CGGCAACTGA CGGCCTAGAC GCCAAGGGCG GAGGCAAGCC
CGACAGCGCT CAATATCAAG TATCCGGAGT AGAGGCAGTT GACACTGTTT TGGAGAAGGC
CAGAAAATTT TAAAAACGGA TGATGATAGG TCGTAGTATT TCGTTAACAT TGTGATGCCA
TATTAAGACG CCACTTTATT G
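
The listing above is laid out in space-separated blocks of ten bases. A minimal sketch of how such a listing could be joined into one string and its length and GC fraction computed is given below. The helper names `parse_blocks` and `gc_fraction` are hypothetical, and only the first row of the listing is used as sample input; pasting the whole listing into `sample` would allow comparison with the reported gene length and GC content.

```python
def parse_blocks(text: str) -> str:
    """Join whitespace-separated sequence blocks into one uppercase string."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases among all bases in seq."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First row of the gene sequence listing, used here as a small sample.
sample = "CTCGAGGACT CTGGGAAACG GTTGTCACAC ACCTTCTCTC TCTCTCTCTT GCAATACTTG"

seq = parse_blocks(sample)
print(len(seq))          # 60
print(gc_fraction(seq))  # 0.5
```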
 
Protein sequence
MSTTNAEDDT PVLEWPMEKV RDTFKNYFVE QHGHVFWPSS PCVPVDDPTL LFTNAGMNQY 
KPLFLGTCDP NLEMSKLTRA VNSQKCIRAG GKHNDLDDVG KDVYHHTFFE MLGNWSFGDY
FKAGAIDMAW QCLTVTFGLD PERLYATYFA GDELTPVDEE ARQLWLRYLP DDRVLPFDAR
DNFWEMGATG PCGPCTEIHY DRIGGRDASK LVNADLPDVM EIWNVVFIQY NREADGSLRP
LPAQHVDTGM GFERLTSILQ NVDSNYDTDI FIPLFTAIQN ITGARPYAKK VGKEDPEYID
MAYRVVADHI RTLCFAITDG AVPSNDGRGY VLRRVLRRAV RYGRQNLGAE LGFFAKLVPA
FVDVMGSAFP EVVEKQEYVT GIIQEEEESF SRTLDKGLQK FNELAEKVGA DQIFSGADAH
FLYTSMGFPV DLTELMAEEK GMTLNKEEFE AKMQEEHDIS QAAHLAKMAG GSGKDMRLVA
EQTSYLVGQN ISATDDAAKY VWHEELADCV VKACFIGRNE TEDMIGFVGS ISPESSAVGI
VLDKSSFYAE AGGQVYDVGT LTSSTGAVVK ITNVQAYGQF VLHLGEVASG TLSVGDTVKC
SVDYVRRAPI ASNHTMTHVL NHALREVLIK RPEKESGKTS TLTVDQKGSL VDETKLRFDF
SWSGQLTPEQ LAEVEKLCMD RIVNAVPVDA YVAPLGDAQQ ISSLRAVFGE KYPDPVRVVA
VSDHAVPEML ANPQDSQWNE YSVEFCGGTH LTNTKEAEAF VLLAEEGIAK GIRRITAITT
GEAKKAIALA NEFESKLTAA EVVQGDDLES TVKQLSAELD GLDIAAVKKM QFREQLATMT
KQVLAYKKQK LAGGSKVVMR FDFGVEGKVA KSVMTAFGKQ VKDKALLLVT ADPEADRFMV
IAGAPKAMKD LNCKAWIEAA TDGLDAKGGG KPDSAQYQVS GVEAVDTVLE KARKF