Gene PHATRDRAFT_44991 details


Gene Information

Locus tag: PHATRDRAFT_44991
Symbol:
ID: 7199664
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011673
Strand:
Start bp: 914982
End bp: 917208
Gene Length: 2227 bp
Protein Length: 587 aa
Translation table:
GC content: 45%
IMG OID:
Product: predicted protein
Protein accession: XP_002178876
Protein GI: 219116162
COG category:
COG ID:
TIGRFAM ID:
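
The Gene Length value is consistent with the Start bp and End bp values if the coordinates are read as 1-based and inclusive. That reading is an assumption, since the record does not state its coordinate convention. A minimal sketch of the arithmetic, using a hypothetical helper named `gene_length`:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Return feature length, ASSUMING 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    # Both endpoints are counted, hence the +1.
    return end_bp - start_bp + 1


# Coordinates taken from the record above.
length = gene_length(914982, 917208)
print(length)  # 2227
```

Under a 0-based half-open convention the same coordinates would give 2226, which would not match the reported 2227 bp.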


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
AGACCTCCTA AATTGCTTAC AGGAGATGAT TACTGCCACC GTTACACCAT ATATAATACA 
CTTGAATCGT CCAGCCAAAA AACATAGCGA AGGGCACTCC ACCACGTGAT TGCGTGTAGA
AGAAAATAAT CTGAATGCAG TTTCGAGAAG TCGGAGTAAG CAATAATGAC ATCCTGGATT
GCCATAGAAA CGGGTATATA TCCTTTGGTG CAAAGAGCAT CTCGCACTAG CTCTGTAGAT
GCTCACAAAT ATTTCATTGA GAACACTTCT CACAAAATTT ACGCATAGCC GAAGAAAAAT
ATGATATTTC GCATCACCTT CCTGTTGTTC TTCGCCCTCT CCTTGAACAC TGCTAGTGCA
GCATCGAAGG CCGAGCAAAA GCAAAAATCG TTCTCGGTTT TGGACTTCCT TCAAGATCCT
GTTGCCCAGG AAACCGCCCG TTTTGATGCC TCTCAGCACC AACGCCAGCG TCGAGAGCTA
CAAGAGTCTC CAGCTTGTGG CGACCAGATA ACTACCATTA CAGCGGGTGA GATTAAGCAA
ATTGCCGCTC TGCTACTGGG AAACGTATTG GCAGGCGATT TGGCGGCTTT CGGTGAAGTG
GCTTTAGCGG CAATAGATCA GGTAATTGTA TATGATTTAC AAGCCGTCAA GATCTGCAGT
TCCTGCATCG AAACCAACAT ACTGACCGCG GTTGAACCTT GGAGGAGTGC GGAAGGATTC
GGCTCTTACA ATACATACTG CAACGAAGGT ATTTTTGGTT TTGATGCCCA ACACAGCAGT
GTCATGTTCC TACCGCTTGA CAGAGAGACT GGTTTGCCTG TTAGTGGAAA CCTCCGTGGT
GTGATGGCTA TGCACGGACT CTTGTTGGAA AACAGAGATG CTCCCAGCGA GATATTGCCT
GCCAATTTGA CAGAGTCTCT GGTAGCAACT GGTCCCGAAG CATTCTTTTT TATGATTATT
GCTTTCTTGG AAAGCCTGGT GGCATCGTCT GCGGGAGCAG TTTCTCTGAT GCCAGACTTT
CTAGGCTCAG GCGAGTCAGC TCTCACCCAC AATCGAGTAA GTTGGTGAGA CAGTGACCTC
TCTCTCTGTT GTAATCATCT TCTGAAAAAC TGACTGCCTC TTTTAGACCA TTTCTGTGCC
ACAATTTTAC ATGCAGTCGG CTGCACAGGC ATATTTTGGA GCACAGAGCT ACTTGGACAA
GCATACAAAC GGATGTACAA AAATGTACAA TGAATGGAAT GTTGTAGGAT CCTCCGATGG
CGGCTTTGGA TCTATTCCTG CTGCTAAAGC TCTTCAAGCA TTGGACAACC GTATCCTGAA
TGTATTTGCT GGTGCACCAT ATCTGGATGC CAATGTACAA CTCAGATTTA TTTTTGGTAA
GTAGAAGAAT AGGGTCACAA TTAGCAGTGC TTTCTGAAGC TTTTTTAAAG TGCCACTCAC
AAAACCAATT CCTTGTTTCT CAATTCTAGA CACATATCTT AGTGGCTTCT TCAGTCCGGA
CAAGGATAAC TCCTTCTTGC AGCTGTTTAT TCCACTACTT GGTTTTACGG GCTCAATTGA
AAGTCCTGGA TACCCAAACA CTGGGACGGG GCAGAAGTTT GTTGATGCTT CTCAGCTTGC
TGCAGTTACC AAGTGGATGG CAAATCCTTC TCCTCTTGGA CCAACTGAAC TTGCAACTAT
TGTACCATTC CCGGCACTGG ATATCTTGAA CAAAGACTTA GTCGAAATGT ACCAGACAGC
AATTGTGACA AACGTGACCA ATCCTTGTTT GGAAGGCTTG TTTTCCAACA AAACTGATAA
ATTGTGTGAT GTCATCAATG AAGGGAGCCT TTTTAGCATT CTCCAAAGCT TTGACATTCA
AACTCAGCTT TGCTACAGCC AAGAGGACAC GCTGAATACC CGTGACAACT TCATCCCCGA
GCTTTTTGAG AACTCGCTTG TTTCCGAAGT AACTTCACTT TTGCATGGAA TATTGCCTAT
TACAGGGGAT CACTTAAAAG CCACCTTCCT TTGCAAAATC AACCCCATCA GCTTCTTTGC
TATGAATGGT CCGGCAACAA CCAATCCTCC AGTTTTGACA ACTACCATTG ACGGTGACGA
ATTGGCAAAC TACTTAAAAG CATCCGGTCC TCGCACCATC TCCACAATTA CCAGTTCCGG
CGGTGTGAAG ATGGGATTTT CCATTTTGTC TACTGTGCTA CTTCTGGCCT ACTTCCCTTT
TGTTTGA
 
Protein sequence
MIFRITFLLF FALSLNTASA ASKAEQKQKS FSVLDFLQDP VAQETARFDA SQHQRQRREL 
QESPACGDQI TTITAGEIKQ IAALLLGNVL AGDLAAFGEV ALAAIDQVIV YDLQAVKICS
SCIETNILTA VEPWRSAEGF GSYNTYCNEG IFGFDAQHSS VMFLPLDRET GLPVSGNLRG
VMAMHGLLLE NRDAPSEILP ANLTESLVAT GPEAFFFMII AFLESLVASS AGAVSLMPDF
LGSGESALTH NRTISVPQFY MQSAAQAYFG AQSYLDKHTN GCTKMYNEWN VVGSSDGGFG
SIPAAKALQA LDNRILNVFA GAPYLDANVQ LRFIFDTYLS GFFSPDKDNS FLQLFIPLLG
FTGSIESPGY PNTGTGQKFV DASQLAAVTK WMANPSPLGP TELATIVPFP ALDILNKDLV
EMYQTAIVTN VTNPCLEGLF SNKTDKLCDV INEGSLFSIL QSFDIQTQLC YSQEDTLNTR
DNFIPELFEN SLVSEVTSLL HGILPITGDH LKATFLCKIN PISFFAMNGP ATTNPPVLTT
TIDGDELANY LKASGPRTIS TITSSGGVKM GFSILSTVLL LAYFPFV
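
The sequences above are printed in space-separated blocks of 10 characters, 60 per line. To check a sequence against the reported Gene Length, Protein Length, or GC content, the whitespace first has to be removed. The sketch below shows one way to do that. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the short example string is only the first 30 bases of the gene sequence, not the full record.

```python
def clean_sequence(blocked: str) -> str:
    """Remove all spaces and line breaks from a blocked sequence listing."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C in an already cleaned sequence."""
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First three blocks of the gene sequence, split across two lines
# to mimic the page layout.
example = "AGACCTCCTA AATTGCTTAC\nAGGAGATGAT"
seq = clean_sequence(example)
print(len(seq))          # number of bases after cleaning
print(gc_fraction(seq))  # GC fraction of this 30-base excerpt only
```

Applying `len(clean_sequence(...))` to the full gene and protein listings gives lengths that can be compared directly with the Gene Length and Protein Length fields.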