Gene PHATRDRAFT_47258 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_47258 
Symbol 
ID7202349 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011681 
Strand
Start bp101936 
End bp105334 
Gene Length3399 bp 
Protein Length969 aa 
Translation table 
GC content53% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002181660 
Protein GI219122662 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
AGTCTCCTCG GGAGAACGCT CGTCCCCGCA CCCGTCACGA CTCTTGGGTG GCATTCCCCT 
CCCTCCCCCC TCACTAGTAG GAGCACGAAA TACCCAAGCG TGCCTAGAGT CGTCCGCGTT
TCTCCTCGTC ATCGTTGTCG CGTGGGTTGT CCACTAGTGA TTTGTTCTGT TGCGTGTCGT
TCGTTCCGGG CCGCACCCAA AGGACGTGAT GGGGTTTGAG CTCACGCAAG TCCTATCGGA
CGACAAGCAC TACAGTGTGT TTGTGTGCGG GTTCTGTCAG AATTTGGTCG ACCTCGATGC
GGTCGTGACC GCTCCGTGTT CCCACGCGTC GTGCCGGCAG TGCTGGCACG TGTGGGTGGA
ACAACACTGT CGGCAACACC GGCCGGCGGA ATGTCTCTGT CCCACGTGCC GCGTCAACGT
GACGCTGCCC GACCCAGTGA CACCAGGGAC GACGGCGCTC CGGATACACG GGGTCAACCT
GGCTCTGCAA CCACTCGCAC AAACGCAGCC TTTGGCGTTT CGAACACTCC AACAAGTGCA
AGTCGCGTGT TTGCACGGCA ACCGTTGCGA TTGGAGTGGA GACTACGGGG ACGTGTCGGC
ACACGCGGAA GGTCACGCGG CGGAGGAAAC AAGATCGGCC GACACGACGG TCGCACACAC
ACCCGGACGG CCTTCGTTGA GGGACTCGCA GAGTCATTCC ATGTCCGCCT TGTTCTGTCG
CCAAAAGGTA CAGGAGAGGA GGAACACAAA TCCACGACCG GTGGTGGCTC GGACGCACGC
GCCGGCTACG CCGCAGCGCA AGAGCCGGAG CTTGCGCGGA CTGCTCGCGG GGGAACAGCT
AGTAGACCCC ACCGCGTGGA TGGGTTCGAC GGAATCCGCG ACCAACGCCT CCGCGGAAGA
CAGTATGCGG ATGGTACCAA ATGGGCACAA TGAGCTTCGA TCACCATCGC CGGTGGACAC
GGATGTATCC AATACTCCCG ACGATCCGGT TACTCCCGGA AAGGCATCGA GCTGTGGTGA
CCACGGAATG GGTGAAAGGT GGGCCACCAA CGAGGCCGAC CATGGTCTCC TCAATACTCC
ACCGCGAGAA GATGTGGCGG ATCGGGTACC CGCCTCGCCT CAACCAGAAT CTCCCCGCTG
GACCACCAAC GGAATCAACG ACTCCATTTC GACGAATCCT TCGCCGGCCA AAGCACCCAT
TACTCCCAAA CGAGACGAAG CGCCGAATCT CGTGAAACGC ACCAAGTCCG AGGAGAAGAC
CGATTTCAAG CGGAAATTGA ATCGAAGGCA ATCTCACGAC GAACAAAACG ACGACCTGGA
AGTTTCTTAC AGTCAATCGA CGGACTGGAA TATGTCCATC AACAGCCTTG GTGCCAACGT
TTCTGCTCTG ACCAACGATT CTGTATCGCC AATGGAAACC GTGAACGAAA TCGACGAAAT
CGACTTCAAG CAGCTTTTGG ACTCTGCAGG GGTGGCGGAC GGGGCCCTTT TTTCTCCAGA
GCGGGTAAAG AAAATGATCG ACAAGGCAGA GAAACTGAAG AAACAAGCCA ACGCCAAATT
CAACAAGGGA GATTTAGTCA ATTCTCGTGA GCTGTATACG GACGGAATAA AAATCATGCG
AAAGATTCCC ATGGAGTCGG AACAACACAA AGAACTAGTT TCACAGATGT ACTCCAATCG
AGCAGTTACT TACTTTCGAG AAAAGCGTTT CGACAGTTGT GCGTTGGACT GTGACAAAGC
GATCGAACTA CTGCCAACCT ACGAGAAGTC ATGGATTCGA AAGTGGAGGG CACTAATGGC
TCTAGGTGAC TTTGAAGCGG CCTACAATTG TCTAGAAACA GGGTCGAGAG TTGTCCCGGA
CTCTCGTCGT ATTCAAGCAG AACTGACCAA GACTCAAGGC GAAAAGGAAT TGCTCTTCGA
GGCAAAACAG GCACTCGATA TTGGGGACTT TCAGAGGAGT AAAGACATTT TGAAGCCGCA
CGCAAGGACG TCAGACAACA TAGGACTCTT GTTTCTCGCG GCCAAAGCCG ATGTAGGCCT
TGGCAACGTG GAGTCTGCCT TGGAAAAGAT CAACAAGGCA CTGAGGTTCA ACCCTACTCA
TTCGGATGGA CTGGAATTGC GAGGGCACAC TCTGTTTCTT TCGGGAGATA CTGAAAAAGG
CGTCCACATT TTGCAAGAAA CATTTAACCG AGACAAACAG AATGGCAATC TGGAGACTGA
ATTAAATCGA TGCCAGAATA CCCATGTCGC CATTACCAAA GGCCGCGCTT CTGTAAAGCG
TGGCCGCTAT GCCGAGGCTG CCGATTTTTT TTCCGCTGCT ATTAAGGAAA CTGGACTGAT
ACCAACTCGT TGTCCATTGT TCGAGATGCT TCGAACAGAA AGAGCAGAAG CGTGGCTTTT
GTCAAAAAAG TACCTCGAAG CACTCAAAGA CTGCCAAGAA GTTATTTTGA TTCAACGAGA
AAACGCTACT GCATGGACCG TTCGTGCGGA AGTTCTAGCT GCGTTAGGGA AGCCAGAGGA
AGCTAGACGG GAGTTGTTAA AAATCAAACG GACCTGGGGA GCAGAGAACC CGACGATTGA
GGAAGGCTAC AGACGTGTAG ACTTTGAGCT TCGTGTAACG ACGGCAGATA AGAACCTTAC
GGAGTTTATA CATCAGCTTG AGATGGGTAA CACCAATGTT ATGTCCATCA CAGCAAATAT
GGACTGCGAG TCAGAGAGCA AAGCTGATCC TCGAGCACCA ACGTCGAGAT CCGATCATCG
ATCGGCGCGG AGTCGTAGCA AGGTTCGAAG CCAAAGTGAC GGCCATCGAA GTAGCAGTCG
GGCACGAGGT GACAGTCACC AGAAAGATCG CCGAGGCGAG AGACGATTCA GCACGGGTCG
ATCCTCCCCT TTCCACGACC GAAAGGCAAG GGAGAGGCGA AGATCTGCTG GATCGGGCGG
TAAGGACGAT GAACGACATC ACAATGGCCA GTCACGGGCG GCTGTGAAAA TTTGCGACAG
TGCAAAGGTC TCTCCAGAGG GAAAGGTCGC CAAGAGTTCA AAAGAGATAC TGGAGCGCAC
TCAAAGGGAA TTGAACGAAG AACGGAAGAA CAGATCGCGA TCGCAAAGTA TCAAGTGAAC
ATGACGCAGC GGCTTATCCT TTGCCCTGCC CAGACCGATC CCGAAGTATC AAGCGAAAAA
AATGTAACTA ACTGACTTAC GCTGTCGCTT GACGCGTCTC TACCACAAGG TTTATTTCTA
TACCAGAAGA TTCCTCCCAA TCCTTTTGCC TGTTTCGTAA GCAGGAACAG CGGTTTATAA
ATCCCGCTTG GTTGAGCTCA CTGTCGGAAA GAAGCGTGCA GCTTAGTGTC TCCGCCATGG
GTTTCGGCAA TTAATAGTAA GCAAGGTAGT TTCAATGTC
 
Protein sequence
MGFELTQVLS DDKHYSVFVC GFCQNLVDLD AVVTAPCSHA SCRQCWHVWV EQHCRQHRPA 
ECLCPTCRVN VTLPDPVTPG TTALRIHGVN LALQPLAQTQ PLAFRTLQQV QVACLHGNRC
DWSGDYGDVS AHAEGHAAEE TRSADTTVAH TPGRPSLRDS QSHSMSALFC RQKVQERRNT
NPRPVVARTH APATPQRKSR SLRGLLAGEQ LVDPTAWMGS TESATNASAE DSMRMVPNGH
NELRSPSPVD TDVSNTPDDP VTPGKASSCG DHGMGERWAT NEADHGLLNT PPREDVADRV
PASPQPESPR WTTNGINDSI STNPSPAKAP ITPKRDEAPN LVKRTKSEEK TDFKRKLNRR
QSHDEQNDDL EVSYSQSTDW NMSINSLGAN VSALTNDSVS PMETVNEIDE IDFKQLLDSA
GVADGALFSP ERVKKMIDKA EKLKKQANAK FNKGDLVNSR ELYTDGIKIM RKIPMESEQH
KELVSQMYSN RAVTYFREKR FDSCALDCDK AIELLPTYEK SWIRKWRALM ALGDFEAAYN
CLETGSRVVP DSRRIQAELT KTQGEKELLF EAKQALDIGD FQRSKDILKP HARTSDNIGL
LFLAAKADVG LGNVESALEK INKALRFNPT HSDGLELRGH TLFLSGDTEK GVHILQETFN
RDKQNGNLET ELNRCQNTHV AITKGRASVK RGRYAEAADF FSAAIKETGL IPTRCPLFEM
LRTERAEAWL LSKKYLEALK DCQEVILIQR ENATAWTVRA EVLAALGKPE EARRELLKIK
RTWGAENPTI EEGYRRVDFE LRVTTADKNL TEFIHQLEMG NTNVMSITAN MDCESESKAD
PRAPTSRSDH RSARSRSKVR SQSDGHRSSS RARGDSHQKD RRGERRFSTG RSSPFHDRKA
RERRRSAGSG GKDDERHHNG QSRAAVKICD SAKVSPEGKV AKSSKEILER TQRELNEERK
NRSRSQSIK