Gene PHATRDRAFT_49574 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_49574 
Symbol 
ID7198191 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011691 
Strand
Start bp86823 
End bp88640 
Gene Length1818 bp 
Protein Length555 aa 
Translation table 
GC content51% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002184390 
Protein GI219128375 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.251941 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CACGAACCGA GGATGCTTTC TACTACCTTT CCTCGACCAA CATCATCCAT TTTACATTCC 
CGTACGAGTT CATTATCAAA GTCGGAAAAA TACTTCTCAC ATTAGCAACC GAAATACGTT
ATGCCGATGG AACCGGCGTT ACCATTACGC CGAGGCTTTG TCAAAAGCTG TTTCTCTGTT
GCCAGAGGGG TTGCGATAGC AGCAGTATGT TTGCTAGTCC TACGCAATTT GCAGATGCAA
CAGAAAAGTC TGCTGATGCA ACAAGAGACC TTCAGCCATT CTTCGTCTTG GTTGGAAACC
TCCTGGTGGG ATTGGGAACG ATCAAGACTT TCAGAACCCA AAGTCCCGAG GGCTGATTCT
TTGATTCATC GGCAGCTTCT CAATCGCTCT GTGTCTTTCT TGACGGCCAA ATCCATGTCC
ACTACGGTAC AACCTGTCGG ACATTATTCA GGAATGTGGT GGATGCCCCG GGACGCGTTC
GTTAATAAGC TTCAAGCATA TCGTATACAA AATCTAGAAG GAGGTTCATG GCAGGCCTTC
GCCGCGGTGG GAATGGACTC GGTTATTTTG ACCTTTGACT GGCTGGACTT TTGCGTCGAG
CACTTGTCTC GCTATTTCAA AATGATCAAC TACAACAAGT TCAACCACGT CTTTGCCAAA
CTCGTGGACC TGCACAAATC CTACATGACT TCAGAGCTTG GGGAGGACGC TAGCGGTCGC
TTTGACGTAC AGGATACGGG CACGGCCATG CGAGAAACCA TTGCCCTGTT CCCACTGTAC
ATTCCCAGTG AACCGGTTTT GCTCACGGGT ATTTCCAGGC CTGAATACAA TACCTCGCTT
TTGTATACAA TTCCCGCAGG TGCTGCCAGA CGCAACGCAC TGGACATGTA TTCCATGGCG
GCCACTTTGT TGTCGTTGTG GCGCGTGGGT GTGGGACGAG TAGTGGTGGC GGGAAATATG
GCCTGTGGTG AAGATGACAC TGACGTCAAC AGCGTTTACC ACGAAGCCCT GGATCTTTTC
TTGTCGAGCA TACCGTCCTC TGGACGTGAC GCCATGGAAA TCAACTACGT CTGCGCAGTC
GACGATTTCG ACCGCCAGGA GGCCAAGAAC ACCTCCAAAT CGTTACTGAT GCCGCGCCTT
GTTATTGACA AACTTCAACG AGCCTTTCGT GGCAATCTGA CGACAACACA AAACCATGCC
TGGCTTGGAG ACGACAGCAA TCGGTGGAAA TACGTGTATT TTTCCGAGCC CGATCTGATT
CTGCACACCC GGCCCCACGC AGTACGAGAA CTCGGGGTGC AGCTCGAGCA AGGCAAACTC
ATTGCCGCCC ACCGCTTCCA GCCTATATCC CATGCTGTGG ATTTTCCAAA TTATCCTCGA
TCACAGGACT TGATACCGGC CGACGCGGAC GACCCAGCCA CAACCGCGTT TATCAACCTA
GATCCATCGG CCGGAGACTC GTGCTGCGAT GCTGGTAACT ATTGGCCGGG AAGAACCGAA
CACAAGAAGT GCAGCTACAT GTGGTTGTAC TGTGGCTATT TGGCAAACGA CGCAGATCCA
GATGTAAACC AAAGCGTATC TTGGAAACGG CACGAACGAC TGTGGCGACA CTTTCCTTTG
GTGAGTTTCA CGAGCGGCTT TCGCTCTCCC ATGGTAAGCG AACATGCACG CATCTGCCGG
CCGCAGCCGG CTTCGGCTGG AGGATGCATG CATGCTTAAA GACTTTGTCT TGATTGGACT
ATCGAATTCA CTTTACTCGA GCTGGACTGT TAATCTCCCA CTTTTGCATA AGCCTACCTA
GCTTTTGTGT ATCTTTCT
 
Protein sequence
MLSTTFPRPT SSILHSRKQP KYVMPMEPAL PLRRGFVKSC FSVARGVAIA AVCLLVLRNL 
QMQQKSLLMQ QETFSHSSSW LETSWWDWER SRLSEPKVPR ADSLIHRQLL NRSVSFLTAK
SMSTTVQPVG HYSGMWWMPR DAFVNKLQAY RIQNLEGGSW QAFAAVGMDS VILTFDWLDF
CVEHLSRYFK MINYNKFNHV FAKLVDLHKS YMTSELGEDA SGRFDVQDTG TAMRETIALF
PLYIPSEPVL LTGISRPEYN TSLLYTIPAG AARRNALDMY SMAATLLSLW RVGVGRVVVA
GNMACGEDDT DVNSVYHEAL DLFLSSIPSS GRDAMEINYV CAVDDFDRQE AKNTSKSLLM
PRLVIDKLQR AFRGNLTTTQ NHAWLGDDSN RWKYVYFSEP DLILHTRPHA VRELGVQLEQ
GKLIAAHRFQ PISHAVDFPN YPRSQDLIPA DADDPATTAF INLDPSAGDS CCDAGNYWPG
RTEHKKCSYM WLYCGYLAND ADPDVNQSVS WKRHERLWRH FPLVSFTSGF RSPMVSEHAR
ICRPQPASAG GCMHA