Gene PHATRDRAFT_36547 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_36547 
Symbol 
ID7201703 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011678 
Strand
Start bp722209 
End bp723627 
Gene Length1419 bp 
Protein Length313 aa 
Translation table 
GC content46% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002180886 
Protein GI219120289 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000000358149 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCGTGGT TTAGTAGCCG TCGCGCCAAG CCAGCCAAAG AGAGCGCGTT CAATACCGAC 
GACAAACCAC AAAATGCTTT TGGAGGATTC GGCGCCGTGG CGCACTTTGA CGACGGTCCC
TTGCGAGACC CTCGGATGGC CAACGACTTT CTCGAAAGGG GAACGAAAAA CGAAAATATT
CATACGCATT CGGACACCAG TAGCGACGGT GACGACGCTG GTTCCGAGGC AGAGTCGGGT
TCTCTCAGCG ACGATTCCTT CGTAGGAGGT CCGCTCATGT ACGAGTCAAC GTGTTCCGAC
GATTCTATTA CTGTAGACAC TGCCGATCCA GATTCTCAGG ATTGGAATCT ACGAAAAGCC
AATAACTTTC TGCAGGATTT TTACGACAAG GAGGACCTGG AACGGGTGTC GCAAGAACAG
CACCGTTTAA GGTCGACTGA CGACGAGAAC GATGCAGAAT CGGCATCGTC TTCCGACCAG
TCTTCGGAAG AAGGCGAGTC CGATTCGGAA ACGCTAGAAA AATCCCACTC TCCTCTGTAT
AAAACATGTA AAGATGCCGA TATTTGTCAC GAAGACCAGG ATCTGGGTAC GGCCAAAATA
GACACTGCTA CTACCTTATC AAGCCACGAT GCAAAAAAAT TTCCGATAGC CCCGGAAATC
TTTGACGAGG GATATGGCGC GGAGCATGAT GGCTATGATG TTGCTCCCAA CGAGTCCCAG
ATACATGAAT GTTCTGAAAC AGAGACGAGT TCGCAGCATG TAGTTGTCCA CGATTTTTCT
ACAGCAAGCG CCTTGAGACC GGCTGTAGCT GAAGCAAGGT CTTTCCGCAA CCTTGACAAC
TCATATANNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNGAAAGAAG CGAGGTCGTC
TGACTGTNCT GTTTCGCTTG ATGCCGGCCG ACGCGAAGGA TCGTCAGATG AAGAAAATTG
ACAGATCTGG CGAAAGAATT GGCCACTTCA CAGTCACTGT CAATTCGAAA CCGAGACTGT
ACCCACGCCT ATCAAATTGC CTACAACATT CTTTTGGCCG AGTGGTTTGA TAGCTAGACA
GTCAGACAGG CCTTAGAATA GTCCATGGAG CGAAAGTCCC TGTTGAAAGG TTATTCGCTT
CCGTTTGCGA AGGTCACCAG TGTGGCGTAT CCGCGGGCAG GCTTGTTTCC TTGTCTTATC
TTCAGACTGT AGCTCTAGTG CTCGTCGAAT CACAGGTCAA CCCAACGAAG TTGTGAAGCC
ATTTCCATGA AGAACCTAAG ACGAGCCAGG TTAATGGATG CTTCTACAGG ACAAATGCAT
GAGTATTGGT CCCGCACGAA AAGGCATAAA GACCAATGA
 
Protein sequence
MSWFSSRRAK PAKESAFNTD DKPQNAFGGF GAVAHFDDGP LRDPRMANDF LERGTKNENI 
HTHSDTSSDG DDAGSEAESG SLSDDSFVGG PLMYESTCSD DSITVDTADP DSQDWNLRKA
NNFLQDFYDK EDLERVSQEQ HRLRSTDDEN DAESASSSDQ SSEEGESDSE TLEKSHSPLY
KTCKDADICH EDQDLGTAKI DTATTLSSHD AKKFPIAPEI FDEGYGAEHD GYDVAPNESQ
IHECSETETS SQHVVVHDFS TASALRPAVA EARSTQRSCE AISMKNLRRA RLMDASTGQM
HEYWSRTKRH KDQ