Gene PHATRDRAFT_41259 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_41259 
Symbol 
ID7199068 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011696 
Strand
Start bp320116 
End bp323333 
Gene Length3218 bp 
Protein Length915 aa 
Translation table 
GC content48% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002185258 
Protein GI219130198 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.68146 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCTCTG GGAAGCCTGC ACTGAAATTG GACCGACCAC CAGCACTTCA AAGCAAAGCA 
ACTTTTTACT TATTTTCTCG GCGTACTGGC AATTTTAGCT TCGACCAACT TACCAATTCA
TCGCAGGCCA AGCGAAACGG ACGGCTCCAG AGTAACAAGC CATCGACCCT GCGAGGATTC
CCTGACAGGT TTGACCTTAC TTTTGGCACT CTCTGCCTAT ACCGTCAATC TAGTCGAAAC
TTGACGGTCC TCTCGAATGT AATAGCGATA GTTGCCGAGA GGAACTCTTG GGCAAAATCA
AAATCTGTTG ATTTTTCGCG TCAGTCGGCT GGTTACCGTT CCCCGTCTGA TCTCGACCGA
AGCTCCACGC TCGATACGTC AAAAACCGGT ACGAACCACA AAGTATCGTT AGTTTCGTTA
ACTACCAAAC AACGTCACGA ATAACCATGA TCGGCAAGCT TTACTGGTAC TTCCTCTCTG
TCTGTCTGTA TCATTATCAT TCACAGTCAA AAAGTTGACA GTGAATCTGA TTCTCAGCAT
ACGAAACTGC TGTGCAGCCA TTATATCCTT TCCATGTCTA TTTTGCAAAT TCGTTCTGTA
ATACCAGTGA CGTTATCGTT GTTCTCTTCT TTACGGAACG ATCTGAATTT GCGATAGAGT
GTTGTTTTAC GGAACGATCT GAATTTGCGA TAGAGTGTTG TCGACAGTTG TGATGAGGAA
GAAGAAAACC AATGATAAGG CAATCTTGGG TGACTGCGAG GCCAATACGG AAGAAATAAA
TCAGCTTTTC GTTGGACTGT ACCGTGTTTG CGGCTTTTCC AATGGTGAAT CTCTGGAATC
TCGTGAAAGC TCACGAAAGC AATTCGTCGA GTCTATGCAG GTCTTGTGTT CGGCGCTTGG
ACAACTCCAT GACGAGCGAA AATCGGGTGA ACTCGGAATA CAGAGCATCT TGCCCGCTCT
CGAGCAAGCC GCGTCTCGAG TACGGCTTGT CGCCCACGAT TTCATGGCTT ACTTGACACG
AAGAGTTTGC GAGTTGGATT TTCCAATGAA AAAAATCCTG TCATCGCTTC GTGACAAGCC
GGCTTTTACT ACTTTGCTTA TCCTGTCAAG GTCCTGGAGA ACGTGTCTCA ACCATCAAGT
GCAACAGGAA AAGACTGTGG CATTGAAAGC GGTCTCATTA CTTGATCAAT TGTATTCGTC
TTCCAGACTG AAGCATCCTA CAAACGACTT GCTAGCGTCC TTACAAAATT TAGTTAACAT
TTTGACACGC ATGGACTTCA TCGCGCTTCC ACACGCTAGG ATTGGGCTTC AGTTGCTGGG
TTCTCCCAAT AACATACAAA GAGTTCACTC GTCAGATTTA TCGCTGCTTG CTGTGGACTA
CGACATGTCC CGGTCGACAG AGTATAACAA GGTGCCGGCG CATTGCACCG TCGACGTACC
CATAACATGG CAGGGGATCG AAAATTTTAT AGAAAGGGAC GCTGGAAATT TCCCTCCTTT
GACAAGCTTT CTACTGGTGG GTCCGGAAGG GAGCGGAAAA ACCCATATCT GCGATGTGTT
AGAAAAGAGC TGCGTACGCT CGTCAATAGC AGGTAAGTAA GGATGGTTAA AATGGAACGA
ATACCAATCT ATTTCTGCGC TAACATTTCG GTCAACGATC CGTAGTCCTC CGTCCTCGGC
TTCCTCTGGA CATATTAGGT CAATCAGTTG GCGAGATGGA AGATGTCCTC GTCGCTCTCG
TTGATTCAGC AAAAAGCGGG AGACAATCAT GTTTTGCGCT CATTCTGGAT GACGCTGATT
TTCTCATTGC CACTGGCGAA AGTGGGATTG GCGAGGGCGG AGAGCGATTC TCAGGGCGTC
ATCATATACA GTCGAGATCC CAATCTACAT TTTTCGCTCT GTTAGATAGT TTTCGCAGTG
ATGCAATATC TTGTAGCCGA CTAATCCTGA TCTGCACGTC CAAAATGGAT CAGGATTGGA
CCGCGGGCCG ATTTGACCGC AAATACCACA TCTTGCCACC AAATGAACAC GAAAGGAGAC
TCTTTATCTG CTCAAATCTT GGTCTTCATA CACCAGTGAA ATGTAGCTTA ACGATACTTT
TGGAGGATAT GGTAGAAGGG ACAGTAGGAC GAACATACTC GGAAATTGCA CTCTACTGCA
GGCAAGCTGC GATTGATCAT GCTTCATCTG AAACGGTCGC CGAAGCGGAG ACCCTTCTGC
ATTTCTTGAA ACGTCGTCTC CAGTCAATCA CTCCTGAGTC CTTGCGTAGT GGCGTGCTGG
ATGAATTTGT GGATATGCGA GTTTGGACAG CCCGTGATTT GGGTAGTATG GAGACACTGG
ACGATTCTGA GTCCTCCTAC CATCTCCCTT TGTTCGGTTC AAGTGCGGAG CAGGCATGGA
AGGATCTTCA ATCAACTGTA ATCATTCCTT TATGCCGAGC AAGAGAGCTG GAAGACTTGA
GGAATCCTTG CGGGTTTTTT TCTCCGCGAA TATTTGTTGG CGGCATGCTG TTGGCAGGTC
TTCCAGGAAC AGGCAAGAGT TCGTTAGCTT TTCACACCGC AAAGATCGCC GCCCGACTGC
TCCCGACTGT CAAATTTTTG GAAGTGAGCT GCACGTCTCT TATTCATAAA GAAGTCGGTG
GATCGGAGCG TGCCCTTCAC CACTTGTTGG TATGCGCTCG CAAGGCTGCG CCCTGCATTC
TACTGATGGA CAGTATCGAA ACAATTGCGG CTGTTCGTGG AAATGATGCA ACGACGGAAG
GCACGATGGA TCGCTTGCTT TCAACACTTC TAGTCGAGCT AGACGGCGTG CAGGAACATG
GGCAATCATC TGTTTCCTCA CCTGCAGGTA TTGCTGTCAT TGGCATAACG CACAATTCGG
ATTGGATTGA CCCTGCATTG CTGCGGCCTG GGCGTCTAGA CAAGATTGCT ACTCTGGATT
TACCCGACTA TCAAATTCGA TATGGCATTG CAGCTAGGGA CCTGAAAAGC GGGATAGCGG
TTCCTGCTAA TCTTAACCTC CTGAATGTAA TTGCTGCAAA AACGCATGGG ATGAGTGGGG
CGAGCGTCGC TGCCGTTTGC AGTGACTTAA AATTGGCATT TGCTCTGGGT AGCAACGTGT
GTCAATCTGC GCTTGCGGAA ATAATACGTT CGCGACGGTA AGGATCGGTG CACTATGTAC
TCTCTTTTGT AGAGCTTAGC CTGTATGTAC ATACTTAA
 
Protein sequence
MSSGKPALKL DRPPALQSKA TFYLFSRRTG NFSFDQLTNS SQAKRNGRLQ SNKPSTLRGF 
PDRFDLTFGT LCLYRQSSRN LTVLSNVIAI VAERNSWAKS KSVDFSRQSA GYRSPSDLDR
SSTLDTSKTG TNHKVSVLST VVMRKKKTND KAILGDCEAN TEEINQLFVG LYRVCGFSNG
ESLESRESSR KQFVESMQVL CSALGQLHDE RKSGELGIQS ILPALEQAAS RVRLVAHDFM
AYLTRRVCEL DFPMKKILSS LRDKPAFTTL LILSRSWRTC LNHQVQQEKT VALKAVSLLD
QLYSSSRLKH PTNDLLASLQ NLVNILTRMD FIALPHARIG LQLLGSPNNI QRVHSSDLSL
LAVDYDMSRS TEYNKVPAHC TVDVPITWQG IENFIERDAG NFPPLTSFLL VGPEGSGKTH
ICDVLEKSCV RSSIAVLRPR LPLDILGQSV GEMEDVLVAL VDSAKSGRQS CFALILDDAD
FLIATGESGI GEGGERFSGR HHIQSRSQST FFALLDSFRS DAISCSRLIL ICTSKMDQDW
TAGRFDRKYH ILPPNEHERR LFICSNLGLH TPVKCSLTIL LEDMVEGTVG RTYSEIALYC
RQAAIDHASS ETVAEAETLL HFLKRRLQSI TPESLRSGVL DEFVDMRVWT ARDLGSMETL
DDSESSYHLP LFGSSAEQAW KDLQSTVIIP LCRARELEDL RNPCGFFSPR IFVGGMLLAG
LPGTGKSSLA FHTAKIAARL LPTVKFLEVS CTSLIHKEVG GSERALHHLL VCARKAAPCI
LLMDSIETIA AVRGNDATTE GTMDRLLSTL LVELDGVQEH GQSSVSSPAG IAVIGITHNS
DWIDPALLRP GRLDKIATLD LPDYQIRYGI AARDLKSGIA VPANLNLLNV IAAKTHGMSG
ASVAAVCKLS LYVHT