Gene PHATRDRAFT_50052 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_50052 
Symbol 
ID7198744 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011694 
Strand
Start bp254801 
End bp258318 
Gene Length3518 bp 
Protein Length932 aa 
Translation table 
GC content48% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002184930 
Protein GI219129509 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0332007 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CGAAACAGTT CACAAATCCT TGTAGCTTCT CAATCGATAG TTTTGCTCTA CCGTACGTGA 
CATGAGACGA ATACCAACCT ACGTTCTCCC TTGGCTGTGG CTTTGTTCCG TTCTCCAGTC
GCCTATTCCC ACAGTATCAT TCGTTCCGCC CTTCAATCAT GCTTCGTCAT CATCGGCATC
ATCATCCCAC CTGCATGCAC GCTATACTCC ACCAGTTTCA CAAGTGGTAA TGCCGCCACC
ACCACCTCCA CCACCGCCCA TTCCAGAACC ACCCCAACTC GATGGAATCG ACCGGATCTT
ACAGAATCTT GCGCAGAAGT TGGATTCTTC GCTCCCCGAT TGGAAGAACT TGCATTCGCC
GGTTGGAAAG GATGTACAGG GGCCTTTGAT TGAGTCTGTT GTCTCAATCC AACGTCAGCT
GGGACAACTG GAGTCCGATG CGGCCTCACA GATTCATCTC GTTTCCATAA AGTTCCAGAG
CACCGTGCTC CAACAGATTC CCCAATTAGA ACCCGTTCTT CAGAAAGTTA CAGCTTTTTT
AGCACCGTTA GTGCAAGATC CAACAACCCA ATTGATCGTA TCTGCCTTGG TGTCCTACAC
GATTGTGTCC AAGTTGCTAG CCATGACCTT ACCACCTCCA CCATTGAAGC CGTATCCCAG
TGGACGCTAC GACCCCGTGG CCTCCCGGGC CTACTTTGAC CAACGATTGC CGCTGGTGAT
TGCCCGGAGT TTTTCGATTC TAGTGCAAAG TCTGCAATTC GCTGCTGCGC TACTCCAGGA
CAAAATGCAG TAAGTGAATT ACGCATCGTT GTTGTGATTG AGTGCCGTCC TTTCTGTTTA
TGTGTTTGTG TGTGGAGCCG TTATCGATCC TGTATTCGAA TGTTTCTTTT TCCCTTGGAC
AATTGTGTCA AACTCATACG GAACGATAAT TTTGCTACCG TATAGGAACA AGCTGGTACA
AAATGAATTT CAACGCGGTG AGCAACTGGC GGTTCTACTC TCACGCCTTG GACCAACCTT
CATCAAAGTC GGCCAGTCTT TATCGATTCG TACCGATCTA CTCTCTCCGG CGTACGTCCG
CGGCCTGGCA TCCTTGCAGG ACCAGGTGCC GGCCTTTGAC ACAGCTATTG CCAAACAAAT
TTTGGAAGTG GAATGGCAAC GTCCCGTATC TGACGTGATT GTTGGAGAGT TAACGTCGCA
ACCTATTGCC GCCGCTTCGC TTGGTCAAGT GTACAAGGCC ACACTGAAAT CGACAGGACT
GGATGTAGCT ATCAAAGTAC AGCGACCCAA CATAAATGAA CAAATTGCGT TGGATATGCA
CTTGCTGCGC GAAGCAGCGC CCGTTCTGAA GAGACTGTTT AATTTGAACT CCGATACAGT
AGGAACAGTG GACGCCTGGG GCGCGGGGTT TGTTGACGAA TTGGATTACA TTCAAGAAGC
TCGCAACGGT GCGTTTTTCT CGGAACGCAT TCGTCAGACG CCTCTGAGAG ATGTGGTGCT
GGCCCCCGCT ATCGTGGAAG ATTTTACAAC CGGATCCATC CTCGTGACGG AATGGATTGA
CGGGGAACGA CTGGACAAGA GCGAAAAGGG AGACGTGACG GTATTGTGCA GCATTGCCAT
GAACACGTAT TTAACCATGT TGCTAGAGCT AGGACTACTC CATTGTGATC CGCACCCAGG
CAATCTCTTG CGCACTCCGG ACGGAAAACT ATGGTACGTG GCTGGTCTTT TCGTCGTTTG
CATGCACTCA TTGCGAGCAG GTCTCTCACT AACTTTATCC TTCATTGCAG TGTTTTGGAT
TGGGGCATGG TAACAGCTAT CGACAAAGAC TTGCAGTTGA CTTTGATTGA ACACATGGCG
CATTTGACGT CGGCAGATTA TGGGGAGATT CCTCGTGACT TGCTGCTACT GGGATTTATT
CCGTCCGACA AGGCACATTT GATCGACGAC AGCGGTGTTG TTGACGTCCT GGCGGACATT
TACGGAGCCT GGACCAAGGG TGGTGGCGCA GCAGCGATCA ATGTGAATGA TGTTGTCAAC
CAGCTACAGG ATTTGACGTC CAAGAAAGGA AACCTCTTCC AGATCCCGCC CTACTTCGCG
TACATTGCTA AGAGTTTTTC TGTATTGGAA GGCATCGGAT TAAGCAACGA GCCGAACTAC
TCTATCATCA ACGAATGCGT TCCTTACGTA TCCAAAAGGC TTTTGACAGA CAAAGAAAAG
ATGGGGCCGG CACTTTCGAC ATTTATTTTT GGACCAGCGA AATCAAATGC AGATCGCATT
GTAGATTATC GTCGTGTGGA ACAGCTCGTA GAAGGCTTTG GTGAATACAC AACCTCAGCT
TCTGGTTCTC TTTTGGGGAA GCAGAACATG TCCAATACAG AAATACTGGA AGATGTCGCG
GACGAGGTGC TCGATTTGGT GTTGACAGAA GAAGAGACCC CTTTGCAAGA AATCTTGTTG
GAGCAACTAG CAAAGATCAT CACAGCTAGC AGTCGATCTA TCTGGACACA AATCCGGGAG
CGCTCTGGTT ATCTACCTTC TGGCCGTACT GTGCTTGGTA CAATCGTCGA CCCTTTTGGC
CTTTTCCGAA CTAGTCCTCT CGTGCGGATG AACGAGCTTG ACGAACGCAC TGTAGAGACA
ACTCAGAAAC TCATCGCTTT GGCGCAAAAG CAGATTCAAA AGTCCGACAA CCCGGCTTTT
GATCTTTCTA AGCTTTCTCG AGAAGAGGCA CTACAATTCT CCTCTATTCT GGTGCGAAAA
GTTTGGGTCC GGAGGGGTGG TGTCGTGCAA ACTAGCAATC GCTTCGCTCG AAAGCTGCTG
CAATTAACAG CCGAGAAGCT TGAAGCAGGC GAGCGTGATA CTCGTACCTT GCCGATCCGC
ACAAACTTGA CAAGGACGGA ACCCCTTATT GAAGCTGAGA AGTCATTCAC AGAGCGATCA
TCAATCGAAT TAGCTGATCA TCATCCCACG CCAGTAAAGG CAGAGAATCC TCGTCTCATA
GCAGCACGAC GGCGCCTTGA CACTCTCAAA GCTGAAGATG GCAATGGTGT CATTGTAACT
GAAGCTGCGG AGATTTCTAA TGTAGATATC TAGTGTAAGG ATACTAACGT TTTATGAACA
GACTTACCGT TAATGCGCCA AAATTTGCTG AGTCGCAACG CTCTTGCGCC AATAAAGTAA
GGCCAACAAA GAGCGAGCCT TCCGACAGAC ACTTGACGCC TAGATTTGAC GACCAATTCA
GATTATCGGA TTCACAGTCA AGACAAATTT TGGCATTGTT GATGGTGCGG CAATCAGTCA
GTGGTTCAGA CAAAGCCGTT CGGAATGATT TATGCTCATT TTATGATCCA TAGAAGCCCG
TGTCAACAAA AACCGAATTC TTCTGATTGA AAACTTGCTA CTCAAGGGTT TGAGGAGTCG
TAGCAATGTA CGTAGCAACA TGCGGGCATC GACATTCAAT CAAATACGTC ACTGTCACCT
TGAATAGGAA CAAACTTTGA CACACTTCCC AGATGTTA
 
Protein sequence
MRRIPTYVLP WLWLCSVLQS PIPTVSFVPP FNHASSSSAS SSHLHARYTP PVSQVVMPPP 
PPPPPPIPEP PQLDGIDRIL QNLAQKLDSS LPDWKNLHSP VGKDVQGPLI ESVVSIQRQL
GQLESDAASQ IHLVSIKFQS TVLQQIPQLE PVLQKVTAFL APLVQDPTTQ LIVSALVSYT
IVSKLLAMTL PPPPLKPYPS GRYDPVASRA YFDQRLPLVI ARSFSILVQS LQFAAALLQD
KMQNKLVQNE FQRGEQLAVL LSRLGPTFIK VGQSLSIRTD LLSPAYVRGL ASLQDQVPAF
DTAIAKQILE VEWQRPVSDV IVGELTSQPI AAASLGQVYK ATLKSTGLDV AIKVQRPNIN
EQIALDMHLL REAAPVLKRL FNLNSDTVGT VDAWGAGFVD ELDYIQEARN GAFFSERIRQ
TPLRDVVLAP AIVEDFTTGS ILVTEWIDGE RLDKSEKGDV TVLCSIAMNT YLTMLLELGL
LHCDPHPGNL LRTPDGKLCV LDWGMVTAID KDLQLTLIEH MAHLTSADYG EIPRDLLLLG
FIPSDKAHLI DDSGVVDVLA DIYGAWTKGG GAAAINVNDV VNQLQDLTSK KGNLFQIPPY
FAYIAKSFSV LEGIGLSNEP NYSIINECVP YVSKRLLTDK EKMGPALSTF IFGPAKSNAD
RIVDYRRVEQ LVEGFGEYTT SASGSLLGKQ NMSNTEILED VADEVLDLVL TEEETPLQEI
LLEQLAKIIT ASSRSIWTQI RERSGYLPSG RTVLGTIVDP FGLFRTSPLV RMNELDERTV
ETTQKLIALA QKQIQKSDNP AFDLSKLSRE EALQFSSILV RKVWVRRGGV VQTSNRFARK
LLQLTAEKLE AGERDTRTLP IRTNLTRTEP LIEAEKSFTE RSSIELADHH PTPVKAENPR
LIAARRRLDT LKAEDGNGVI VTEAAEISNV DI