Gene PHATRDRAFT_39360 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_39360 
Symbol 
ID7195068 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011687 
Strand
Start bp282010 
End bp283467 
Gene Length1458 bp 
Protein Length485 aa 
Translation table 
GC content49% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002183458 
Protein GI219126425 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.00278594 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACCATC CCCGGCTGGA AGAAATTGGA AGTGGAAAGT CTTTGCTTGA TGTAAGCGGT 
ATTCCAATTT CCACAGCCTG TTCTTCGACG CCGTCCGCTT CGCTTTCGGA TGAAGAAAGG
AAAGCGTCAT TCCGCGATCG CTTCGGGCCA GACCCTGAAC CAACAACAGA GAAACGAGCT
TGTAGGCCTT TTTCTGTTAT GCGAGGGCGA TCCGGGCGTT TTGAAAAGCA GGCCCGAGTT
GCTAGTACTG CTGAAGAGTT GGCTATGTGC CTTCCCGATT CGCCGCCAAC TCTTGGAGGC
TCTCCAAAGC TGAAAAAATC CTCAATGTGG AGTACACTCT TTTCTTCTAG TGCCAGACTG
GAAGAATCTT CAGCTCATCG CCCGACGTCC TCTGCGTGGA TTACTGCAAT GCTTCCACAT
CGTAAGGGTG CTAAGTCTTC TCCGTCTGGT AGTAACATTT TGGCTCGATC TTCTACCGCC
TTTTCTAAAA ATGAGGACGA TGTACGTTCC ACTGACCGTG AGTCACTTGT GTTCAAGAAT
CAGCCGCTAC ATCGCAAGGT CATGAAACGT GAAACCTACA CAGTGGAAGA AGACACGCAA
ACGCGACATC CGAAAGCATC ATTTTCCCGT CAGAATGGTT CCTCGGTCGC AAAATCCCCG
CACAGTATTG CCCCGTCGAT CGTAGAACAA GACGCATTCG ATCAATCCAG GTCTTATCTT
GGTTCTCCGC TTTGGACCCC TGATTCTCCG AACTCCGAGA AACACGACCG ATGGACACCG
CCACACAAGA ATACGGGATT GTTACGGAAA GAATTTGGAA TTGAATCTTC CCAAGAGCTC
ACTTGGCTCA AGCACCCTCT TCGAGACTTG CTGGGGAAAT CAAAACAGAG TAACGAGACA
GAGCCGAGGC ATGCCATGGG AAAGAAGCCT TCGCATGACA ACTTTCAAGT ACAAGATGGC
GATGGAGGCG CGTTCGGCCT CCAAGGATCG TCCGTTCTTC TCGGTAACAG GCTTTCACAG
ATGCGTGCAG CGGGCGATCC AATTTCACTG AAAGTTACAT TATCATTGGA TAAGGATCTG
AAAGATGAAG ATCTCAACAA TGTAGGCAAA TTGCTCTCAC AATTAGAGCA ATATCTGTCA
ATTGCACCAA GACAAGGACA CTTTCTCAAT CAGCACTTTT CTACGAAGAA AATATTCAAT
GAAGCCGGAA GTATTGACCG TGAGTCAACG GGAAAACATG ACGCAATGCG CCAGAAATTG
TTCGAGCTGG AAGCTGAAAT TGATCGTACC GCGTTGGAAA GCGCAGCCGC AGCTATGTTG
CTCGAGGGGT GCCAGGATGA GAAGTTATCG GATCACAACT TGATGGTTGC TTCTTCTCCG
GTTGCTGCGT CTGAGAAGGA TTCATCCTCG TTAAGTAGCG ATGCCGATGA ACTTGTCGCC
CAGCCCGCAC TCTTTTAA
 
Protein sequence
MNHPRLEEIG SGKSLLDVSG IPISTACSST PSASLSDEER KASFRDRFGP DPEPTTEKRA 
CRPFSVMRGR SGRFEKQARV ASTAEELAMC LPDSPPTLGG SPKLKKSSMW STLFSSSARL
EESSAHRPTS SAWITAMLPH RKGAKSSPSG SNILARSSTA FSKNEDDVRS TDRESLVFKN
QPLHRKVMKR ETYTVEEDTQ TRHPKASFSR QNGSSVAKSP HSIAPSIVEQ DAFDQSRSYL
GSPLWTPDSP NSEKHDRWTP PHKNTGLLRK EFGIESSQEL TWLKHPLRDL LGKSKQSNET
EPRHAMGKKP SHDNFQVQDG DGGAFGLQGS SVLLGNRLSQ MRAAGDPISL KVTLSLDKDL
KDEDLNNVGK LLSQLEQYLS IAPRQGHFLN QHFSTKKIFN EAGSIDREST GKHDAMRQKL
FELEAEIDRT ALESAAAAML LEGCQDEKLS DHNLMVASSP VAASEKDSSS LSSDADELVA
QPALF