Gene PHATRDRAFT_50885 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_50885 
Symbol 
ID7200500 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011675 
Strand
Start bp70759 
End bp72923 
Gene Length2165 bp 
Protein Length602 aa 
Translation table 
GC content50% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002179573 
Protein GI219117560 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GGATGCGATG AGTATTGTGG AACGACACAT CCATCACTCC TCGTGGCGTG ATGACAACAG 
TGAACGAACA AGAAGCCGCG GACCTTCCGC TGTAAATAAA GCAGCACAAG CCGGACTTTC
CTTCGAGTCG TTGATTCTGG TTGTCAACCA AAAGTCTGCG GAAAACTAGA CTAGCTTGTT
CTTCACATTC CGCTGGTAGC ATGCCACTCT TTCGTTCATC GTCTCTTATC ATGGCTGCCA
TTCTGAGTAG TTGCTGTTGG ATGCGGTCCT GTCGTGCGTT TGTTGGTCCT GCTTCTAGCA
TTAGTACCAG TGCGGCGATC CAAAGACTGA CTACGCCTTT GTCTCGTCAC GGCCCGCTCT
TTTCGACGGC GTCCAAGAAG GAGGCCGAGG CTCCCAGCGA TGTAGTCGTT GCCGAAGCCT
TGACGTATTC CATGGAAGAC GTCGTCGGCT TGTGCAAACG GCGGGGAATC ATTTTCCCAT
CGTCCGAAAT ATATAACGGT TACGCCGGAT TCTATGATTA CGGCCCGCTC GGCAGCGAAC
TCAAAAAGAA CGTCAAGGAT GCCTGGTGGA AGAATTTCGT CATGATGCGC GAGGACGTCG
TTGGTGTCGA CTCCTCCATT ATTCACAATC CGGAGACCTG GAAATCGAGT GGTACGAGAA
CACTACAGAA AACGGAAAGG AATGTCTGCC TATTGTTGCG CGCCACTCAT GAATTTGTCT
TGTCATGTTT CACGACTCGC GGTAGGCCAC GTCGACGGCT TTTCCGACCC CATGGTGGAT
TGTAAGGAAA CCAAACTCCG GTACCGGGCC GACCAGCTAT TCTACGCCCC CGTCATGGTG
AGTGGCGAAG CAGAAGTTCT CGGATACTTG TGTGTCCAGG AAGCTAATGA AGCGGACATG
GCCAAGGAGG CGAAAAAACA AGCCAAGGCG CTGCTCAAAT CGTTGGACCG CAAAGGCGAA
ACCGTCCAAG AGCCGTTCGA CTTTCGTGAA GTAGTGGAAG CGACCGAAGC CGAAATGGCC
CAAATACCGT CTCCTGGCTC CGGTAAACCA ACACTCACCA TGCCACGAGC CTTCAACCTC
ATGTTCCAAA CGCAAGTGGG GGCCTTGTCC GACGCGGCAT CCGTGGCTTA CTTGCGACCG
GAAACGGCTC AAGGCATCTT TCTGAATTTC AAAAACGTAC TGACCACCTC GCGACAGAAA
ATACCTTTTG GAATTGCCCA GATTGGTAAG GCCTTTCGCA ACGAAATCAC CCCCCGAAAT
TTCATTTTTC GGTCACGCGA ATTTGAACAA ATGGAAGTCG AATACTTTAT CCCACCCGGC
GATGAAGTTT GGCCGGCGTT TCATCAGCAA TGGATAGATG ATTCAAGGGC GTTCCTGCTT
TCCATTGGAT TGCAGGAAGA ATTGCTCGGA TGGGATGTGC ACGAGGGGGA CAAGTTAGCG
CATTATGCAC AAGCCTGTAC CGATATCACT TTTAGATTTC CGTTCGGTGA ACAAGAACTT
ATGGGAATTG CCGCGCGTGG CAATTACGAT TTATCGCAAC ACTCGGAGGG ATCCGGCAAG
AGTAAGTTAC AAAGCGTGGT CGTATTTATT GGCAGCCTGC TCCGGCGTAG GAACGCTCGA
GGCGTTTCGG GCTTTCTTAA ACTCATATGC CATTTGCTTC AATTCTGTAG GTCTGGAATA
CTACGACGAA CAGACCAAAG AAAAATATAT TCCGCATTGC ATTGAACCGT CGCTCGGTGT
CGATCGTCTA ATGCTGGCCT TGATTTGCTC GGCGTACGCA GAAGACGAAG TAGGCGGAGA
GAAACGTAGT TTGCTCAAGT TTGACCCAAA GATTGCACCC ATCAAGGTTG CCGTCTTGCC
TTTGTTGAAA AACAAGGAAG AGCTAGTGTC GGTTGCCCGG GACTTATTCG ATAAACTTCG
TCGTAGGTGG AACTGTCAAT ATGATGCTGC GGGTGCTATT GGACGGCGGT ACCGGCGAGC
GGATGAAGTC GGTACACCTT ACTGCGTTAC GATTGACTTT GATACAATTG AAACGGATAA
CGCCGTCACA ATTCGCGATA GGGATACGAC GGATCAAGTC CGAATTCCGC TTAAAGATGT
GATTTCGTAC TTGAGCGAAC GTATCGACGG ATACTAAAGA TAAAAAACGC TTGTCTTATT
GCTTC
 
Protein sequence
MPLFRSSSLI MAAILSSCCW MRSCRAFVGP ASSISTSAAI QRLTTPLSRH GPLFSTASKK 
EAEAPSDVVV AEALTYSMED VVGLCKRRGI IFPSSEIYNG YAGFYDYGPL GSELKKNVKD
AWWKNFVMMR EDVVGVDSSI IHNPETWKSS GHVDGFSDPM VDCKETKLRY RADQLFYAPV
MVSGEAEVLG YLCVQEANEA DMAKEAKKQA KALLKSLDRK GETVQEPFDF REVVEATEAE
MAQIPSPGSG KPTLTMPRAF NLMFQTQVGA LSDAASVAYL RPETAQGIFL NFKNVLTTSR
QKIPFGIAQI GKAFRNEITP RNFIFRSREF EQMEVEYFIP PGDEVWPAFH QQWIDDSRAF
LLSIGLQEEL LGWDVHEGDK LAHYAQACTD ITFRFPFGEQ ELMGIAARGN YDLSQHSEGS
GKTCSGVGTL EAFRAFLNSY AICFNSVGLE YYDEQTKEKY IPHCIEPSLG VDRLMLALIC
SAYAEDEVGG EKRSLLKFDP KIAPIKVAVL PLLKNKEELV SVARDLFDKL RRRWNCQYDA
AGAIGRRYRR ADEVGTPYCV TIDFDTIETD NAVTIRDRDT TDQVRIPLKD VISYLSERID
GY