Gene PHATRDRAFT_46231 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_46231 
Symbol 
ID7201192 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011677 
Strand
Start bp673476 
End bp675412 
Gene Length1937 bp 
Protein Length557 aa 
Translation table 
GC content48% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002180484 
Protein GI219119447 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CATGTGCTAT GCCCCAACGA GAACGTATAT AAGCCTCAAC GACTTTTCCA AGACTTTAGT 
TGGCAACCCC TTCCGTTGAG AACGACTTCC GTGTCCCGCT TCGCACAACG ATGGAAAGTT
CGTGATCTCG CCACTAGAAT ATGAAAAAAA GAACGAGCGC AACAATAAAA AGGGAATAGT
AAAGCAAACA AAGCAAAAAG AGAAGTTGCA ATGAGTGTAA GTGCGAAGGA AAGTTCCGGA
AAGTGGACTG TAGGCGACCC GAAACTTGAT AGCGAAAGCC AACGGGGCAG CAGCAACGGT
CTTCAAGAAA AGTATGAAGC GCATGGTTCA CATCCACGAG GTCTTTTGAT GGCAAGTACA
TCCGAAGATT CAAGCCCGCC AGCAGAAATC TTACAGCAAC TTCAGAAAGT TTCCTGCTCG
GTGTCGGTGT CGTTATCGTC GTCGCCTTCT ATCCCACTGG AAAAATTTCC AAATCAAACT
CAAGAGCAAA CGATCTTCCA AGAAGAGGAG CTCACAGTCA GTGCGTGTGG CGATCGGCAT
AACGGCTTCA TAAATACAGA TATTCAAAAA GACCCAATTA CTTTACAGTC GGAAGATTCT
GTTTCGCAGC CAGCGAAATC GAAAAAAAAA CGAGAGAAAT CCATTGAGAG ACAGTCAATC
TATCCGGACA TCTCTTGTTC GCCCCTGCCT ACTGGAGCTC CGGCACGTGT TGCCGGGCCT
CGCCGACACC GCCGAACGAA AAGTGACACT TCCAGCAACC TGACTACTAT TATGGAGAGG
CCGCTTTCTC CACTGGAAAG CATGAGGGTT CGAACCTCAT CGCCCGACTC GGTGCTGACA
GCAATTCCAG AGCATGGACG GTGTCTGTCG CCGACAAGTG CCGTAAGTGT CATGCCCATT
TTCCAGCCAA CGTCGCCGAA CTTGGAGACC TCAGCAATAT TGGCTCCCCT GATCCTCCCC
CCACTCGCAC CTAGTAGTGA ACATTCAAAT GCTTACAGCA GCGGTTCTGC GTGCTATACA
AAAAGCAGAA ATGGATCCTT AGGATCCAGG CCTCGAGTTG TCAATCGACA CCGACGAGTT
AAATCTGCAG GGACCCATCA GCTTGTGGCG TTGGACACAG AAAGCTTGCT GATTTCCCAA
CTCCAAGATT TGCACGCTCG TCACGGAACT CAGCACGCCA AGCTTTCCTT GACATACAAC
GTCCTTGGGA ACGTCTACTT CCGACAGCAA CGCTTCGAAG CAGCAATCGA AACTTACCGG
AAAGCCATTG TAGCTGCACA GGAGAAAGAC AAGGTGCCAT TGGCCGACTC GTATAGTAAT
CTTGGTACCG TGTATTGGTC GACAGGAAAT GTGGATCAAG CCATTGAGTG TTTGCAACAA
GGATTGAGAC TCCGACTGCA ACAGGGCAAT GTCTTAGCGA TTGCTACCAT CCACTACCAA
CTCGGTTTGG TCTATACCCT GCGGGGCACG TTCGACGAGG CTTTGCATCA ATTTATATCG
TGTATCACAG AACGAGGGGG AAAAAGTGGC CAAGAACTCG AAATAGCCCG TTCGTATGAT
GCGATGGGCA AAGTCTATAC GTTACGCGGC AACTTCGAGG AAGCTCTTGA GTCCTATGAT
CATGCGATTC GCCTCAAAGA GAGGAGGGGA GCTTCAACGG TCTCCACCTT AGAAGAAGTT
GGCCGTGTTC AACACACTAC GGGAGATTTA CAAGCTTCGT TATCAACTTT CCAAAGTGTT
TTTGATATGC TTAAACAAGA AGTTGTTGAA ATGGGACTGA GCTCGTCCAA GCAAGCCCAA
ATGAGCGATA CTCGCAGCAT TATTTCGGGA ATTTCCAAAG AACTCGAGGA ATTGTCTGTT
GGACGAGGCC ATTTTAATAA ATAGCACGTC TGGACAAAGG ATATCGAATC GAGGTTTAAA
GGAAGTGTTT GCCTTTT
 
Protein sequence
MSVSAKESSG KWTVGDPKLD SESQRGSSNG LQEKYEAHGS HPRGLLMAST SEDSSPPAEI 
LQQLQKVSCS VSVSLSSSPS IPLEKFPNQT QEQTIFQEEE LTVSACGDRH NGFINTDIQK
DPITLQSEDS VSQPAKSKKK REKSIERQSI YPDISCSPLP TGAPARVAGP RRHRRTKSDT
SSNLTTIMER PLSPLESMRV RTSSPDSVLT AIPEHGRCLS PTSAVSVMPI FQPTSPNLET
SAILAPLILP PLAPSSEHSN AYSSGSACYT KSRNGSLGSR PRVVNRHRRV KSAGTHQLVA
LDTESLLISQ LQDLHARHGT QHAKLSLTYN VLGNVYFRQQ RFEAAIETYR KAIVAAQEKD
KVPLADSYSN LGTVYWSTGN VDQAIECLQQ GLRLRLQQGN VLAIATIHYQ LGLVYTLRGT
FDEALHQFIS CITERGGKSG QELEIARSYD AMGKVYTLRG NFEEALESYD HAIRLKERRG
ASTVSTLEEV GRVQHTTGDL QASLSTFQSV FDMLKQEVVE MGLSSSKQAQ MSDTRSIISG
ISKELEELSV GRGHFNK