Gene PHATRDRAFT_42442 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_42442 
Symbol 
ID7196645 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011669 
Strand
Start bp67387 
End bp70840 
Gene Length3454 bp 
Protein Length980 aa 
Translation table 
GC content55% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002177010 
Protein GI219110517 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.285973 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CGTTGCAATC GCGGCATCCC CCGGCATTTG TGTCGTTCTT GTCGCCTCAT TGCCGCCTTG 
CATTCCCTTG CTTGAGTTTC TTCTTTCCGT ACTTTTTCCG TATTGAAAAC TGCCTTTCAC
CATGGCTTCT CCCTCTTCCG GCGGTAGCCT TTCGTACGTT CCAACGCCGG TAGAACGGGT
ACGTGTGCAG TGTGTGTATG TGTATGTGTT TGTGTGTGTC TCTATCGTGT ATGGGATGGA
CTGAAGGGTG CGTTAGTCCT GTCGGAAGCG AGCGCTACGA CACCATTGCA ACAACAAGCA
ACGAGTTCAA GCTTGTCACA GTGTGACTGT CACTTGGACG GGGAACTTTG CCTTTTGAAT
TGGACTGACT GTGCCTGTGA CTGTGTGCAT ACAGCCCTAC TTTGAAGGAC TCTTTGCGGC
AGCCGATACC CAAGGCGGAG GGCAGATTGG TGGAGCCCAA GCTGTCCCGT TCTTTCAGCG
TTCGCAGCTT CCCACCGAAG CTCTCCGCAA CATTTGGACG ATAGCCGATC AACCCCCCAC
CAATGCTTTG GATCACCGTA AGTTTGCCGT CGCGATTCGG CTCATTCAGC TTTTGCAGAA
CGGAAAGCAA GGCGAAGGAC CCACTCTACA GGCTCCACAG GGTGTGGATT TGCGTCCCGT
GTACTTTGAA GGAATCAGTG GTGTTTCCGT CCCCCTCCCG TCGATGGAAC AACAGCAACA
TCCACACCAA CCGCAGCCAC AGATTCCTCC CGTGCAGCAA CAGCCACATC CACAGCAACA
ACAACAGTAT CCTCAACAGC AGCACGCCCA CCACACTCCG CCCCGCCCTC CGTCGTCAGC
ATCGCAGTAC GCTCCACCCG TGCAGCAACA ACAGCAGCCG CCGCGCCCTC CTCCCTCCAC
GAGTATGGCC TTGACGCCAC AAGATCCGTA CACGCTCCCA CCCAATGAAC AAGCCCGCTA
CGAGTCCATC TTTGCCGAAT ACACGCAACC GGACGGATTT GTTCACGGCA AGGAAGCCGT
CGCGCTATTT TCCAAGTCGG GCCTTCCCCA AACACAGTTG GCAAGCATCT GGAACATGGT
CGATACACCC GTGGATAATA AACTCGACAA GGTGGAATTC GCGATCGCCA TGCATTTAAT
TGTTTGCATT TCCAAAAAGA ACCTACCAAT GCCACCCTCG TTGCCGCTTT CGCTCAAACA
GCTTAAATCG CAGGCCCCGC CTCCGACCTC GGTGCAAACC CAGCTTCCCA CAGTTGGTGC
AACTTCCTCC CAAGGATTGC CCTATCAACA AGAGCACCAA CAACACCAGC CGCAGCAGCA
GCAGATTCAT CGTACCATGA CCAACGATGC CGGTTCGGTG GCTTCCGTTC CACCGCCTCC
GGTTATCTCC GGGGGTCCCC CACGGTCGAT TCAGTTGCAG CCCCAATCCC AGCAGCCTCC
GCCCAGTGGG TTGTCGGTTT CCGCGTCTCT CCAGGGTCCA CCGCCACTGC CAGCTCGTGG
TGAGGGTGCG CTGAGTATCT CGGACGCGTT TGAAGGATTA TCCGTGGACG GAGCAGCGGG
TTCGTCGTTT TTGCCCCAAA CACTCGCCCC CGCCTCCTTT GGTGCACCGA ACAACCTCGG
CACGACGGCA TCCTTCGACA ACGCCAGTCA CGCCAGCAAT ACTGGTGCAG TCAGTGACGT
CGGTGGGGGC ATTCCCAGCC CGGGACGCAA CGCCGCCTCG TCTTTTGCCA TGGGACCTCC
GGCGATTGTG ACGGCCACCA GTCCGGCGCC AGCGCCCAAA ACAACCCAAC AGCTCGCGTC
TAGCTACAGC ATGGGTGACT CAACGCAAGA ACTGGAAAAG CTCAAGGACG TTTTGCAAAA
GTTGCAGGCG GAGAATATTG CTCTCAAAGC ACAGCTGGGC ACGATGACGG GCGACGAGAA
GGATGTCCTA AAGCAATTGG GTGCGACGGT CGCCGAAATA TCCACTCTCT CCAATGAATT
GACTACCGTA CGTGCACAAG TGCTAGCATC CAAGTCCCGC TTGGTGGAAG CAACTGCGGA
ATTGCAGGCA GCCAAGGAAA AGAAAAGGTA AGGAGATTGT CTAAATGGGG GAAACACAAT
GAAGGAAGGG TAATCATCGA CTTACCAACA CATTTTGTTT TCTCTTTAGT GTCGTCAAGG
ATCTGATTTC GGAAGCCAGC GAAACGAAGA GTGCTATTCA GCAGGCTCAT ACAGGTGTAG
AAGAAGCAAT TGAAATGGCG AAGGCCCCAC CTCCGGCAGC AAACGGATTT GACGGCGACT
TGTTTGATTT CGGCGGAGCA GCTCCCGCCC CATCGGGTCC TGTCGCCCAG GACAGTTCCT
ACGCTTCGAA TGCGGAGTCC ATGCACCCGA ACCCAATCAC GGAGCCGCCG GCGTATCAAA
ACAACCAGGT ACTAAAAACA GTTGCGTCAA ACGACTCCGA GTATGGACAA TTGAAGGAAG
CGGTCCTGTC AACAGACACG TTCAATTCAA GCTACGGAGA AGCATCGAAA GCAGGGTTGT
CGAACTACGC CTCTCACTAC GGTCAGCTGG AAACAGTGAC ATCGTACGAA TCCAACCAGT
CGGATGGAGG TCCGGGTCAC AACCGCACCG CTTCCGCAGC TTCGTTGGGT TTCGACAGTA
GCATGGTAAT GGGCGGCGCA CCGCTGGACT ACTCCACGGG CTCTACTTTA GCCGGACCGC
CCCCGCCAAC CGATCGTTAT CAAGGCAAAA ACGCAGATGA CCACAATTCG ACGCCGTCGA
TTGGAGATGT CAACGAATTG AGGCGCAGAG CCAAAGAGGC AGAGGACGTC GCACGAGATG
CGGAAGAGTC GCGTCAGCAA GTGGCAGCTC AAGTCGAAGA GCTGCGTCGT GTGGCCGATG
AGGCGGAAGC GGAAGCTCGC AAACATTTGG CCGGTGGAGA CGGCAAGAAG AAGAAAGTCG
GTATGCTGGG TCGGGGTAAA AAGCGAGATG CGGTACGTAA GCTTCCTGGG TCCATTGTAC
CGGCCAATGA TTCTTAACGA ATCTTGGTTG AAAAAGACTT ACCTTTATTG CTTTTCGTAC
ACAGAAAGAA GGAGAGCGGC TCGCACTGGA GGCAAAAACC AAGAAGGATA CGTTTCTCCA
GGCACAGTCG CAAGCCAATG ATGCCCAGGC TTTGGCACTG GACACAAAGC GCGAAGCCGA
TCGTTTGCGG CAGCAAGCCG AGGAGGCCGA AATCAACGCT GCGTCGGCCG CTTCTATGCA
GCATAGCCAG CCGGTCGCTC CTTCTCAGCA GCCATCCAAT GGATACCCAG CACCAGCTGC
TACTCCAGCG TACGGAACAG GAATGGGACA GCAACCCCAA TACGGGGGTA CATCTTTTGG
CGGACAATAC AATCCCAACG TCATGGGTAG TGGTGGCGTT GAGATTCCCA CACCGAGCGG
AGGCGAAGAT CCATACTCAA ACCCTTTCGG CTAA
 
Protein sequence
MASPSSGGSL SYVPTPVERP YFEGLFAAAD TQGGGQIGGA QAVPFFQRSQ LPTEALRNIW 
TIADQPPTNA LDHRKFAVAI RLIQLLQNGK QGEGPTLQAP QGVDLRPVYF EGISGVSVPL
PSMEQQQHPH QPQPQIPPVQ QQPHPQQQQQ YPQQQHAHHT PPRPPSSASQ YAPPVQQQQQ
PPRPPPSTSM ALTPQDPYTL PPNEQARYES IFAEYTQPDG FVHGKEAVAL FSKSGLPQTQ
LASIWNMVDT PVDNKLDKVE FAIAMHLIVC ISKKNLPMPP SLPLSLKQLK SQAPPPTSVQ
TQLPTVGATS SQGLPYQQEH QQHQPQQQQI HRTMTNDAGS VASVPPPPVI SGGPPRSIQL
QPQSQQPPPS GLSVSASLQG PPPLPARGEG ALSISDAFEG LSVDGAAGSS FLPQTLAPAS
FGAPNNLGTT ASFDNASHAS NTGAVSDVGG GIPSPGRNAA SSFAMGPPAI VTATSPAPAP
KTTQQLASSY SMGDSTQELE KLKDVLQKLQ AENIALKAQL GTMTGDEKDV LKQLGATVAE
ISTLSNELTT VRAQVLASKS RLVEATAELQ AAKEKKSVVK DLISEASETK SAIQQAHTGV
EEAIEMAKAP PPAANGFDGD LFDFGGAAPA PSGPVAQDSS YASNAESMHP NPITEPPAYQ
NNQVLKTVAS NDSEYGQLKE AVLSTDTFNS SYGEASKAGL SNYASHYGQL ETVTSYESNQ
SDGGPGHNRT ASAASLGFDS SMVMGGAPLD YSTGSTLAGP PPPTDRYQGK NADDHNSTPS
IGDVNELRRR AKEAEDVARD AEESRQQVAA QVEELRRVAD EAEAEARKHL AGGDGKKKKV
GMLGRGKKRD AKEGERLALE AKTKKDTFLQ AQSQANDAQA LALDTKREAD RLRQQAEEAE
INAASAASMQ HSQPVAPSQQ PSNGYPAPAA TPAYGTGMGQ QPQYGGTSFG GQYNPNVMGS
GGVEIPTPSG GEDPYSNPFG