Gene PHATRDRAFT_45988 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_45988 
Symbol 
ID7201053 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011676 
Strand
Start bp874798 
End bp878063 
Gene Length3266 bp 
Protein Length693 aa 
Translation table 
GC content50% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002180338 
Protein GI219119143 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.284744 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CTAACGGTGG CACCGGAAAT GCGCGTCCTC TCTTATTGTA GAGACCGCCT CATTCGGAAG 
CCAGCTGAAC CAATTTTTGG CACGAGCGAG CCTTATAACA CGAAACGAAG CATGTCGAGC
TGCTTCAATC TTGCGCTATG TGTGCTTGCA GCGGTGTGTC TTCCGTGGTC CGCAACGGCC
CATTTCAACT GCGGAACACG AGATCCTTCT CCTTTTGAGC AGCGCTTGGA TCAAGTCCGC
ATCAACCATT TCAAACAGTC CACGGAAGGC CGGCGATTGA TTACAGATTC TTGCGAAGAA
CTTTGCGTCC AGTGCGTGGA AATTGACGTA TATTTCCATT TGAGCGCCGT TCCTGCTCCA
TCGGCCGATG ATAGTGATCG GTTCTTTTTC CCGCATCCCC TCGAATCGGT TGATCGCTTT
GCGGAAAGTG ATACGACACT GACTATCGAA GACTTTGCCT CATTGCAAGG TATTTACAGC
CTGATTGATG ACAATATGCG GGTTCTCAAC GAGCGGTACG CGGAGTCTCC ATTCACATTC
ACTTGGAGAA ACTCTGATCC TGCGAGTGCC AGTGTTTCCG TCAATACGGA TTTGGTGGAC
TTTGTCGTCG ACACCATGTT TGACGAGAAT GGAGTTGCGT CTGAACTGCA TACCGGGGAT
GCCAGTGTAC TGAATGTCTA CTTGACGTAC AGACAATGTG CTATCTCAAA CCAGCTCGAC
CCTGATACCG GCGAGCCGCT GCTCTCGTGT GGCCTCCTCG GCATTGCTGT TTTCCCAAGT
TTTCAGCAAT CCAACCGAAA CGCCGATGGT GTGTACGTTA ACTACAGCAC GCTTACCGGT
GGAGGGTATG TATGTCAACG AGTCTGTAAT TTTGCTCTTC TGCGTGCTGA ATGATTGCTC
GCCACAAATT ACCGTACTAA CAGCAAAATA CTTGCATGCT TTCTCTCTGC TTTCAAGCTT
TCCAAACAAC GACGCTGGGT TGACACTTGT TCATGAAGTT GGGCACTGGC TTGGACTGTA
TCACACGTTT CAGAATAGTG CTGCCCAGGA GGGAACCGAC CCGTGCTCAC CGCAGAACGG
GAACGACTTC GTCGCCGACA CACCTACACA GTTGGCATCG TCCCAGGACC TATACAACTG
CTCATTGAGC TTTTATGAGG GTGAAGAGAT CCCGGACTCG TGCCCCAACT TGGCTGGAAG
CGACCCTGTG TTCAACTACA TGAACTATGT CTCGAACGAA GAGTGCTGGC CCCCTGGTGT
TGGGGAGTTC ACGTGTGGAC AGTACGAGCG CATGTACATG CAGTGGCTAC TGTACCGCAG
ATCCGACGAG CCTTGTCAAG ACAACGAAAT GGAGATTGAG ATTTCGATGG AGATCAACCG
AAGGTTTACT AGTGAAAACG CATTCTACCT GACCTACGTG GATACGGGCG AAGTGTTGCT
GAACTCGACT CGTGATTTTG AGGCCCTTGG ACCTCCTTTC CAGACCGAAG TGTTGTCCGA
TTTCTGTGCT CCCGTGGGAC AATACTCGTT CGTGCTTGTG GATGCCGCTC GGGACGGATT
TTTGGACGGC GGTTTTCTCG AAGTGTCCGT GAACGGGGAG TTGGTCGGAA GTGTATCGGG
CAACTTTGGA GAGTCGGCCA CGATTGACTT TGGCACACCA GATGGAGATA GCGGTGCCAA
TTTTGTTGGG GGCTCCAACA GTGATGCTAG GCGTCGTTGT TCGACTCCTG CGATCGTTTT
CTTCTGCGGG ATAATATACT TGTTCGTTTA AGGTTTCTTA ATGTAGCATC TGTCATCGGA
AAAATCTTCT TGTTCATTGC TAACAGTAAG CGCCTTGTTT TTTTGCTTCG GTCGAGATCT
AGATCCTGGA GTTCATTTGT GGCCACGGAT CAACTGCTTG CCACTAAACA AGTTTACAAT
TGGCATTTCA ATTTGGCGAC ATTAAAACAG TTGTTTTTTT GCTTCGGTCG AGATCTAGAT
CCTGGAGTTC ATTTGTGGCC ACGGATCAAC TGCTTGCCAC TAAACAAGTT TACAATTGGC
ATTTCAATTT GGCGACATTA AAACAGTTTT TACTTCTTTC ACTCTCTAGA TCGCGAGCTT
TCCTAAACAT TCAGCAGCGT ACAGCTAGCG GAAAAGTCTT GAGCGCCTCG TCACTGTTGC
TTTCTTACTT TTGGTCCTCA CATCAAAAGT TTTGTTCAAC ATTTCCATAA TTGACAGTGA
GGTGATAGCT TTTGAAGTCT AACTGTAAGT GTGAGCGGCA ATGGGAAATC GATTAACAGG
AACCATGAGG GTCTACCTCT CTCTCTGGAT AGGTCAAGAA AGGACAGGAG CATCCAGGTC
TTTCTCTGAA GCGTACAGAG ATGAACAAAT TGAATTTTTA GGTTTGTGAA TGAGTCCCAT
TTCCTCGCAG ACAGCAAAGA GGGCTTTTCT TGGTGTCGCA CCAAAAAAAT TTCTTCCCCC
TGCTCTGTCA ACCAGGATCG ATGAGAGCTG CTATGGGTGG AATAGAATAG CCTAAGTTCA
GAGAGTCATG TGTTGTCCGA TTTCTGTGCT CCCGTGGGAC AATACTCGTT CGTGCTTGTG
GATGCCGCTC GGGACGGATT TTTGGACGGC GGTTTTCTCG AAGTGTCCGT GAACGGGGAG
TTGGTCGGAA GTGTATCGGG CAACTTTGGA GAGTCGGCCA CGATTGACTT TGGCAGTTCT
GCGGGAGGTC CGGGCCCCGT CGGCTTCCCA AGCAGCATAA CGAACAGCCC GGTGGAGGCA
AATGCGCCTG TGTCCAGCCC AGCGGACGGG GATACCTTTG TTCCCACACC CGTCGGTAAT
CCATCTTTTG TCAGCAAAGC GGCGTACCGC GGTTGGGCTG CGGCTCTTGG TGTGGCCCTG
GCGGGAACGG GCTGGACGGT TGATTTGTGA CACTAAATCG AAACCGTATC ACTGTCTTGT
TTCGTATTTT GTGCATGAGG CTGTTTTTAT TCGTCCGCCG AGACCGAGAG CCAGAGCATA
GCGTCGATGG ACACAACAAA CAAACATTGC GGTGGCGACG AAGTTTTTCA GTGTAAACTA
TGTAATCTTT GTATGACTTG ACAATCTTTG CTTTGGAAAG CATGATGCCT GATGTGTTCA
CAAGAGAGAC TTCGACCAGG GCCCGTATTC CCAAACCCGA GACTTTATTT CGAATAGGCG
ATGTCTCGAC GTAACGGCAC CATGGAGCAT TCTCGACAAC TTTCGTACTC CTGTAGGATC
GTTCGACTGT TCCCAAGTTC GGGGTT
 
Protein sequence
MRVLSYCRDR LIRKPAEPIF GTSEPYNTKR SMSSCFNLAL CVLAAVCLPW SATAHFNCGT 
RDPSPFEQRL DQVRINHFKQ STEGRRLITD SCEELCVQCV EIDVYFHLSA VPAPSADDSD
RFFFPHPLES VDRFAESDTT LTIEDFASLQ GIYSLIDDNM RVLNERYAES PFTFTWRNSD
PASASVSVNT DLVDFVVDTM FDENGVASEL HTGDASVLNV YLTYRQCAIS NQLDPDTGEP
LLSCGLLGIA VFPSFQQSNR NADGVYVNYS TLTGGGFPNN DAGLTLVHEV GHWLGLYHTF
QNSAAQEGTD PCSPQNGNDF VADTPTQLAS SQDLYNCSLS FYEGEEIPDS CPNLAGSDPV
FNYMNYVSNE ECWPPGVGEF TCGQYERMYM QWLLYRRSDE PCQDNEMEIE ISMEINRRFT
SENAFYLTYV DTGEVLLNST RDFEALGPPF QTEVLSDFCA PVGQYSFVLV DAARDGFLDG
GFLEVSVNGE LVGSVSGNFG ESATIDFGTP DGDSGANFVG GSNSDARRRC STPAIVFFCG
IIYLSRAFLN IQQRTASGKN SLSSESHVLS DFCAPVGQYS FVLVDAARDG FLDGGFLEVS
VNGELVGSVS GNFGESATID FGSSAGGPGP VGFPSSITNS PVEANAPVSS PADGDTFVPT
PVGNPSFVSK AAYRGWAAAL GVALAGTGWT VDL