Gene PHATRDRAFT_18067 details


Gene Information

Locus tag: PHATRDRAFT_18067
Symbol:
ID: 7197400
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011670
Strand:
Start bp: 229139
End bp: 232949
Gene Length: 3811 bp
Protein Length: 1091 aa
Translation table:
GC content: 48%
IMG OID:
Product: predicted protein
Protein accession: XP_002177888
Protein GI: 219112273
COG category:
COG ID:
TIGRFAM ID:
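
The gene length above is consistent with treating the start and end coordinates as inclusive positions: 232949 - 229139 + 1 = 3811 bp. A minimal Python sketch of that check follows; the function name `span_length` is hypothetical and is not part of any database API, and the inclusive-coordinate convention is an assumption inferred from the values in this record.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Return the length in bp of a feature, assuming inclusive coordinates.

    Assumption: both start_bp and end_bp are counted as part of the feature,
    which matches the Start bp, End bp and Gene Length values in this record.
    """
    if end_bp < start_bp:
        raise ValueError("end_bp must be greater than or equal to start_bp")
    return end_bp - start_bp + 1


# Coordinates taken from the record above.
print(span_length(229139, 232949))  # 3811
```

The same check is a quick way to spot records where the coordinates and the reported length disagree.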


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.437528
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTTCTTGCAT CGCCACAACA AATTGCTTTA AAAGCAACCC ATACGGTTCC CCTCCGTCAT 
TCATCGTACA CGATCGAGCT TTGGATCGTA CCCTTTTTCC TCTGTATACC CTCACTGCCA
GTCAGAAGCC AATAGACTTT ATTCGAAGTC CTCTAAATCC TGCCATAAAT TATCTCACCT
CCGACGCTTC ACTTGAGGTG TGACGACCGT TGTGATTCAT ACTTGGTGAC CGACCTACGT
CATTGAAACA TTTTCTTAAA AATTGCGTTG ATTCAAGAAT GGACGCGGCT CAGTTAGTTC
AGGTGGAAGC CTTGTGCGAA ACGCTCTACA CGGGTGTGGC TGCAAATTTT GATTCCGAAA
CGGTCACTCG TAGCGAGGCA CAGCAACGGT TGCTGAGTCT ACAGTCGAAT GCTGAGTACA
TTCCGCAATG TCAATACATT CTGGATAATT CCAAATCACA GTATGCCCGT CTGGTAGCAT
CGAACAGCCT CATTGAACTT GTAACGATAC ATTGGAATTC CTTCACCGTC CCACAACGTA
TAGATATCCG GAACTATGTC TTAGGCTATC TGGCCAACAA CGGACCTTCT CTGCAAGACT
TCCTGGTATT GTCGCTGATA AAACTGGTTT GCCGCATCAC CAAATTGGGA TGGTTCGACG
ACTCGGCGCA TAGGGAACTA GCCGACGATG TGACCAAGTT TCTTCAAGCA ACTGTAGATC
ACTGCATTCT TGGCCTCAAA ATTCTCAATC AACTTGTCGA CGAGCTGAAT ATTCCCACCA
GTGGTCGAAC CTTGACGCAA CACCGCAAAA CGTCCGTCTC TTTCCGGGAT GTCTGCTTGC
TCAAGGTCTT TCAATTGGGA TTGACCACAC TGAAGCAGTT ACAAACTGGT GCAATAAGTA
CGTTGCTTGC AAGACATTCT TTTTTTGTTC AGCTAAAGCA GATCTCATCT CTGTTTCTTA
CCACTTTTTT GCTTATGTTT ACCTTAGGCG CTGATCCTCG ACAGCAAGCA ATATTAGGCG
AACAGGCGCT ATCGCTGACG GTACGGTGTC TCAACTTTGA CTTCATTGGA ACAAACCCGG
ACGAATCGAC CGAAGACGTA GGCACAATTC AAGCACCTAC CTCGTGGCGA CCACTTCTAT
CCGACCCGGC TACCACAGAA TTGTTGTTTG AATTCTACGC TAACACGGAG CCCCCTCAAT
CATCTAAAGC TATGGAGGCA TTGATACTTT TGTCCTCAGT GCGTCGATCT CTCTTTCCAA
CAGACAAGGA CCGGGCCGTT TTTTTGGGCC GCCTCATCAC TGGCATTCGC GAAATGATGT
CCAATCGTAC CGGTCTACAA CATCAAGACA ACTACCATCA ATTCTGTCGT CTTCTCGGGC
GCTTGAAAGC GAACTACCAA CTCAGTGAGC TTGTCAAGGC GGATGGCTAC CTCGAGTGGT
TGGAGTTGGC TGCGACATTT ACTGTGCAAT CGGTACAAAA CTGGCAGTAC AGCACCAATT
CGATTCACTA TCTATTGGCA TTGTGGGCAC GTTTAGTCTC AGCAGTACCC TATATTCGTC
CTGAAACGGG GGCGCGTGGA CACGTCGCTC ACTTGGAAAA GCAGGTCCTT CTAGTTGCCG
AAACTTACAT TGATAGTATG TTAGGTTCGT CGGAAACAGT TATACGCAGC GACGGTGCGC
TTGAAGATCC GTTGGACGAT GATGGGTCTT TGAAAGAACA GTTGGATCGG CTCCCAATCA
TTTGCCGCTT TCAATACGGT ACAGTTGCCA ACTTGATTCT TAACAAGTTC GACCCGTTGC
TGAACTCATA TCAAGACATT GTGGCTAAGC TAGGGTCAAG CGCGACCAAC TCAGCCCCGC
CTGATGTGAT GCTGCGCGTC GAGATCATTG AAGGGCAGCT CACGTGGTTG GTTTACATCG
TAGGGGCGAT CGTTGGTGGC CATTCGTGGT CGTCTACCCA CATGGCAGAT GGTGAAGAGA
CTATCGATGC CAGTCTCAGT AGGCGCGTGT TGCAGTTGGC GCAAGGAATG GAATATAGAC
TGACATCGTC AAACGGGGTT GGACGTGCCA ATGCTCGCTT AGAGCATGCC CTTCTATGCT
ACTTTCAGAA TTTTCGTCGC GTATACATGT TTATGTGGGA TCAAATGGCC GGGGCAAACT
CTTCAAGCCC TATTGACACA AACGGCACGC TATCTGTAGT TGCAATGATG ACATCTAAGC
TGGACTCAGG CGGGACTTCG ACGAAGCAGA AAATCTATCT GCGTATGTTT GAGCACTTGG
GCATGGGTGA TCATACAGCG GTTGCCAACC TGATTGTCAC CAAGATTGGT AATAACTTAA
AATTCTGGCC CGAAGACCAA GATCTCATTG GAAAGACCTT GGACTTGCTA CACGACATGG
CTCAAGGGTA TAGTAGTTCA AAACTATTGC TCACCCTGGA GACAGTCCGA TTCCTAGCTC
ACCATCACAC CGAAGAGCAC TTCCCCTTCT TGTCGATGCC TGGGAATTCT CGCCAGCGGA
CGACTTTCCA CGCAACTTTG ACACGACTAC TACTGTCGCC TTCTGGAGAA GAAAAGTTGG
GTTTGACGTT CGAACAATTT CTAGAACCTA TTGTGGTGAA GCTGACACGC TTGGAAGGGC
TTTCACCATC CGATCTGCGA CAAGAGCAGT GTAGGCAACC TTTAATAGGA GTCTTTCGAG
ACCTTCGTGG CATTGGTGCA AGTCTCCACA ATCGCAAAAC TTACAGTGCG CTTTTTGACA
TTATGCACCC GCATCATTTG CCATTGTTGT CCAAAGTTGC TGACGTGTGG TTTGACCAGT
CGGATGTCAC GGTCAGCTTA CTTCGCTTTC TGCAAGAGTT TTGTCATAAC AAGGCCAATC
GTGTCAACTT TGATCAAAGC AGTCCAAACG GTATTCTCCT TTTTCGCACC GTCAGCGATG
TGGTGTGCGC GTACGGAAGT CGCATCTTGT CCTTACCGCC TCCAGTGGCG AACGATCCAG
AAGTTTACAA AAAGCGCTTT AAGGGCCTTG CTTTAGCACT GAACGTGCTT AATTCCGCTC
TCGGCGGCAA CTACGTTTGT TTCGGCGTTT TTGAATTATA TAACGACCGG GCTCTTGAGA
ATTCGTTGGA TGTAGCGTTG CGCTTATGCT TGACGATTCC ACTTGAAGAA ATCAATGCCT
ATCCGAAGGT AAGCAAGGCG TACTATGGGT TCATCGAAAT TCTGTTTCGA AACCACCGAA
GAACTGCTTT TGCCATGGAT ACGAATATAT TTATGCAGAT CATGGCTTCG GTGCATGATG
GATTGCAATC GACTGACGCC ACAATCAGTG CCTGCTGTGC AAACACCATC GACCATATGG
CATCATTCTA CTTCACGAAC CAAGGAAAGG ACAAGCTAGA AATGCGAAAC CTTAGCAAGG
TATGTGATGT TTGCAAACCT TGTCTTCTCC AGCTAGCTAG CGTTTCTAAG ATTTTTCTTC
GATTTTTTTA CAGCACTTGG CAGCCCAGCC CAATCTGTTT TCCAGTCTAA CGATGACACT
GTTTAACCTG TTGCTTTACG GTCCACCCCA ACATCATTGG GCAGTGATGC GTCCGATGCT
CAGTTTGATG CTTGCCAGTG AATCAGGCTT TGCTGCATAC AAAGATCATC TTCTCAGTAC
CCAAGCTCCT GAAAACCAGG CAAAATTGAA CGAAGCTCTG AACAAACTCC TAGCAGATGT
CAGTCGAAGT CTTGATAATG CCAACCGTGA TCGATTTACT CAAAAGCTGA CTGCTTTTCG
AGTTGCAGCC AGGAGTTTTT TGACGCTGTA G
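
The gene sequence above is laid out in blocks of 10 bases, 60 bases per line, so the spaces and line breaks have to be removed before the sequence can be measured or compared with the Gene Length and GC content fields. The Python sketch below shows one way to do that. The helper names `clean_sequence` and `gc_percent` are hypothetical, and the GC calculation assumes the sequence contains only the unambiguous bases A, C, G and T.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a cleaned sequence.

    Assumption: seq contains only A, C, G and T (no ambiguity codes).
    """
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence above: six blocks of 10 bases.
first_line = "TTTCTTGCAT CGCCACAACA AATTGCTTTA AAAGCAACCC ATACGGTTCC CCTCCGTCAT"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(round(gc_percent(seq), 1))
```

Applied to the full sequence, `len(clean_sequence(...))` should match the Gene Length field, and `gc_percent` should land close to the reported GC content, allowing for rounding in the record.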
 
Protein sequence
MDAAQLVQVE ALCETLYTGV AANFDSETVT RSEAQQRLLS LQSNAEYIPQ CQYILDNSKS 
QYARLVASNS LIELVTIHWN SFTVPQRIDI RNYVLGYLAN NGPSLQDFLV LSLIKLVCRI
TKLGWFDDSA HRELADDVTK FLQATVDHCI LGLKILNQLV DELNIPTSGR TLTQHRKTSV
SFRDVCLLKV FQLGLTTLKQ LQTGAITILG EQALSLTVRC LNFDFIGTNP DESTEDVGTI
QAPTSWRPLL SDPATTELLF EFYANTEPPQ SSKAMEALIL LSSVRRSLFP TDKDRAVFLG
RLITGIREMM SNRTGLQHQD NYHQFCRLLG RLKANYQLSE LVKADGYLEW LELAATFTVQ
SVQNWQYSTN SIHYLLALWA RLVSAVPYIR PETGARGHVA HLEKQVLLVA ETYIDSMLGS
SETVIRSDGA LEDPLDDDGS LKEQLDRLPI ICRFQYGTVA NLILNKFDPL LNSYQDIVAK
LGSSATNSAP PDVMLRVEII EGQLTWLVYI VGAIVGGHSW SSTHMADGEE TIDASLSRRV
LQLAQGMEYR LTSSNGVGRA NARLEHALLC YFQNFRRLDS GGTSTKQKIY LRMFEHLGMG
DHTAVANLIV TKIGNNLKFW PEDQDLIGKT LDLLHDMAQG YSSSKLLLTL ETVRFLAHHH
TEEHFPFLSM PGNSRQRTTF HATLTRLLLS PSGEEKLGLT FEQFLEPIVV KLTRLEGLSP
SDLRQEQCRQ PLIGVFRDLR GIGASLHNRK TYSALFDIMH PHHLPLLSKV ADVWFDQSDV
TVSLLRFLQE FCHNKANRVN FDQSSPNGIL LFRTVSDVVC AYGSRILSLP PPVANDPEVY
KKRFKGLALA LNVLNSALGG NYVCFGVFEL YNDRALENSL DVALRLCLTI PLEEINAYPK
VSKAYYGFIE ILFRNHRRTA FAMDTNIFMQ IMASVHDGLQ STDATISACC ANTIDHMASF
YFTNQGKDKL EMRNLSKVYF SSIFLQHLAA QPNLFSSLTM TLFNLLLYGP PQHHWAVMRP
MLSLMLASES GFAAYKDHLL STQAPENQAK LNEALNKLLA DVSRSLDNAN RDRFTQKLTA
FRVAARSFLT L