Gene PHATRDRAFT_47952 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_47952 
Symbol 
ID7203137 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011683 
Strand
Start bp538325 
End bp540069 
Gene Length1745 bp 
Protein Length420 aa 
Translation table 
GC content46% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002182248 
Protein GI219123888 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.870123 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
AAAAGTACTT CCAGATTAAT GTTGCAGTTG GGCAATCCTG ACGTGTGAGT CGCGCGTTTC 
TCCCTTCTTC TGGGCAGAGC AGCTTCCAGT AAAAATTTCC CTTCGAAAAC GTGAAACAGT
GTAGCCGACA GCGGGGTATT GACTACTTGA GACTGTTCAC TCTCCGTTCA CAAAATATTA
CTATCACAAC ACATTCCAAC TGTGCACTCC CAAGTTGCGT CTCTTTACGG AACAAGGTTG
ACTAGTGCTG TGGCATTTGT CATTGCGAAT CAGACAAAAT GTCGTGGTAT TTCGCAAGTC
TACCTTCGGC ATCGGATCGT TGCTTGGACG CCTCAACAAG ATCGACTCTT CATCAACTTC
TGAAAAACGA AGAAGAACTG AGCAGTTTTC AGCGTACTAG AGTATGTCCA GGAGATTGTC
AACGGAGACT TTTGGATAGT CTCATCACTC CTCCTTTCTT CTTTAGAGGC GAGGCACCAT
CTGGTCTCGG GGATTACAGC GACGTTCCGT TTCATCAGCT TCCCAGAACT TTTTTGGGAA
GTGAGCCAGC AATAGCTCCG GCTGAGATGC GCAGCCTGCG TTTTCCACTC CCGGCTAAGC
TCAACGCTAT TCTTTCCAAC CATGAAAATG AGCATATTGT TTCTTGGATG CCGCACGGAC
AAGCATGGCG AATTCATGAT CTTGATCGAT TCCGTGATCA GATTCTACCT GTTTATTTTG
ATTGCGGCTC ATGTGCTGGT AGCATCAATG CATTTTTGCG CCTTTTAAAG CTTTGGGGGT
TCCGTCAGCT CGCGCATGGA CCCGACATTT CCGCATTTTA TAACGAGGTA TGCCTGCTTC
ATAAAGTTTC CAAAAAATGC AATATTTGTG TTATCACAAC AATTGTACTA AGATTTTCTT
CCCTCTTCAA ACAGATGTTT CTTCGAGGCA TGCCGAATCT GCATAGACTG ATGCGCGCTT
TCGACAGCAA TATAAGGCAA TCTCTTTATA CCACCCCTGA GCCCAATCTA TCAGCTTTTC
CCGTGTTGCA ATACAACCAG CCTGCGTCTC GAATAGACCC GACTTCAAAC GCGCTACACT
CGCTCAAAAT GTCCCGAGAT CTTCACTTTC GCTCGCTGGT GCTATCTTCT ATGCTACTTT
CAAAGGGCGC GTTGTGCAAA AAGGGGTCAT CCGTTGTTTC TTCACAAAAG AGCAGAGCGT
CTTTTTGCAG CAATACCAAA GAGGAGACTT CAGACCTTGA TGGTTTAGAT TTAAATACTG
GCTTGCTTGC AGCACAGGAG CTTCACTTGA GCAGCATGGC GAACATGAAT CAAGATCGGC
AAGTTGATAA ATCCCAAGGC CTGGCTTCGG GTTTGGTTCG CATCGGATTC TGTGGTAGGA
AAGATTTTGC CGGTAGCCGT TTAGAGGACC CTGAAAAACT CCAGCCCACC AAGAATGGAA
CAGGGTCAGA ATCAGTCGTA TGCCCCGTCT CGTCCTCTTC AAACCTGATG ACAAGAAAAG
GGCGGAAAAG ATGGCTGCCT TCATTGGGCC CCAAGCAAAC ATTTTCTGGT CCCGATCCGG
GCTTATGCGA GTGGTTTTGT ACCAATCCAG AAGTGTTTGC TTCCTTGGCA GACTCGCCGC
AATCTTGATG AGAAAGGTGG ATGTGAGAAG GAGATACCCC GTGCATCATA ACAGTTGATG
AGATTTTCTT TTTGTTTTCA TTCCATTCAA AAATTGACAG TATTTTAAAG AAACGAATAT
AATGT
 
Protein sequence
MSWYFASLPS ASDRCLDAST RSTLHQLLKN EEELSSFQRT RVCPGDCQRR LLDSLITPPF 
FFRGEAPSGL GDYSDVPFHQ LPRTFLGSEP AIAPAEMRSL RFPLPAKLNA ILSNHENEHI
VSWMPHGQAW RIHDLDRFRD QILPVYFDCG SCAGSINAFL RLLKLWGFRQ LAHGPDISAF
YNEMFLRGMP NLHRLMRAFD SNIRQSLYTT PEPNLSAFPV LQYNQPASRI DPTSNALHSL
KMSRDLHFRS LVLSSMLLSK GALCKKGSSV VSSQKSRASF CSNTKEETSD LDGLDLNTGL
LAAQELHLSS MANMNQDRQV DKSQGLASGL VRIGFCGRKD FAGSRLEDPE KLQPTKNGTG
SESVVCPVSS SSNLMTRKGR KRWLPSLGPK QTFSGPDPGL CEWFCTNPEV FASLADSPQS