Gene PHATRDRAFT_42824 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_42824 
Symbol 
ID7196428 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011669 
Strand
Start bp1241180 
End bp1243129 
Gene Length1950 bp 
Protein Length460 aa 
Translation table 
GC content46% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002177246 
Protein GI219110989 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.138741 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CTCACAGCGC TTGTTCTACT AGTAGAAATA TATTGGCTTA CCCGCAGTTT CTGACAGGCC 
GATACAAACT GTTTCGCGAA AAGCTTGAGA CGAGCACCGT AAAGTAACTC ATCAGCAGTC
ATGGCATCTG TTTCCGGACC TTCTGTTTCT GGAAAGCTTA CCTTCCCTGG ACCGATAGGG
GCGAAGCGTC GGCGAATGCG AAGTGGAGGT AATGCATCGA CAGTAACGAA CAACTCTCCA
GGGAAAACCG ACGGCTCGGA AAGCGAGCCT AAAACTTCGG GTAAATGTAA AGGGGGTACT
ACGATTCCAA TTTTTCTGAA AAGTAAGTAT CACGTGGCAA CCAAAAAATA TGATTGTATT
CCGGCCAAGT ACGCTTCGCT CGCGGTTTTG TCCATGCAAC GAATGTGAAG TTTTGAATGA
AATTCCATAT TACTTTTTAG ATTTTACCCG TGATTCGCTA CTCTGATTGT TACCAACCAT
TCTAACGCAT CTTTCTTTTC CAGAGACGTA CAAGATGATC GACAGTTGCG ATCCATCGAT
TGCGTCATGG TGAGTTCCAC TCGTAATTTC TTGTCTGTGC TCTGAGATAC AAATTAAGGT
TTTTGTTCGA TTGTGTTCCA CGCATTGTCT ATCTAGTACG ATAACCTGTT GATTCTCTCG
TTAAGGCAAA TGGTGAAGTT AGCCTTAGTG TACTTTCAAA TCCGCGAAAT TTTTGATCGG
GAAAAATCTG TCTTGACCTC ACCATTTTAT AATTTGTACT TTGATCCATT AGGACAGAGG
AGGGCGATAT GTTTATTGTT AAGGATCCTG ATGTTTTCGC AACGCAAGTG ATCCCGCAAT
ACTTTGACCA CAACAAATTT TCAAGTTTTG CTCGCCAGCT CAACTTTTAT GGGTTCCGCA
AAATGCAATC GAAACCAATC AGAAACAGCG ATTTTGACAC AGGTACCGCT AAGCACGTCA
CTTTTTATAA CGAAAACTTT AAGCGTGGTC GATGTGATCT GCTGAAGAAG ATTCAGCGCT
CGACTCGTGG AGGGGGGAAC ACGACGGGGC AAGACAGTCA TCGTGATGTC CAAAATCTTC
GTGATCAAGT TGCAATGCTC GAACAAAAGA TGGACGAAAT GAGCAGTCAG GTAGAAGATC
GCGTACGTCG CCTCGAACTG GAAATGTTGG CTCGCCTGGA GCAGATGATG CTTGCTATGC
AGCAGCAACA GACCACACAG TTGCACCTTC AGACTGCGAC TTCAGTTGGT TCAAACAGCG
GATCTGGTAG TTCCAACGGT ACTGGCAATC ATATGCCTGC GCCTTCCAGC AATCAATTGA
GCTGGGATAA TAACGGGCTA TCCTTTCCTC GCGGAAACTC TATCAACTCC AATGTGAGCT
CAGTAACCTT CCAGCAACCG CGGCAACAGC AGCAGCCACT CCAACAAATG ATTCATCAGT
CACATCAGTT AAACCAGTTG GACAACACCG GAATGGCTCC TCCGACCTTA CCGCCTCATC
CTAAGCAGAA GCAACTCCCA ATGAATGGAT TTCCTGGGAA CATGGCCACT CCTCCGGAAC
GAATGAATTC TCTACGTGGA ATTTCCACCT TGTCGCGTGG TCTTTCCGGA TTGTCGCGTG
GTGCGTCGAT TGAATCTAGT GCTTCTGCTG TATTGATGCG CAACTCTTGG GAGGATAAGT
TTTTTTCGAT GCTCATGCTA GATAGCGAGC AAAATGGGAG TAGTTCGAAC CCTCACGACT
CTAACGTGAT GCCAACTCCT CTTGCTCCCG GTATTTCGTC TAACGCAAAT GCACAAGCAC
CTGTCACTGT TTCAGATAGA TCGGCTGATA TCAATGGACA GCTCTCGGCG CACCACGAGA
ACAACGACGA TGATCTTAGT TCTGTATCGA CTTCAGACAT GCCATGAGGG GCAAGAAACT
TGTTTACATA AACCATATGA CTTATTTTAC
 
Protein sequence
MASVSGPSVS GKLTFPGPIG AKRRRMRSGG NASTVTNNSP GKTDGSESEP KTSGKCKGGT 
TIPIFLKKTY KMIDSCDPSI ASWTEEGDMF IVKDPDVFAT QVIPQYFDHN KFSSFARQLN
FYGFRKMQSK PIRNSDFDTG TAKHVTFYNE NFKRGRCDLL KKIQRSTRGG GNTTGQDSHR
DVQNLRDQVA MLEQKMDEMS SQVEDRVRRL ELEMLARLEQ MMLAMQQQQT TQLHLQTATS
VGSNSGSGSS NGTGNHMPAP SSNQLSWDNN GLSFPRGNSI NSNVSSVTFQ QPRQQQQPLQ
QMIHQSHQLN QLDNTGMAPP TLPPHPKQKQ LPMNGFPGNM ATPPERMNSL RGISTLSRGL
SGLSRGASIE SSASAVLMRN SWEDKFFSML MLDSEQNGSS SNPHDSNVMP TPLAPGISSN
ANAQAPVTVS DRSADINGQL SAHHENNDDD LSSVSTSDMP