Gene PHATRDRAFT_49199 details


Gene Information

Locus tag: PHATRDRAFT_49199
Symbol:
ID: 7195511
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011689
Strand:
Start bp: 199862
End bp: 201237
Gene Length: 1376 bp
Protein Length: 418 aa
Translation table:
GC content: 48%
IMG OID:
Product: predicted protein
Protein accession: XP_002183946
Protein GI: 219127446
COG category:
COG ID:
TIGRFAM ID:
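
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive (the record does not state the convention); the function and variable names are illustrative only.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


# Values copied from the record above.
START_BP, END_BP = 199862, 201237
GENE_LENGTH_BP = 1376
PROTEIN_LENGTH_AA = 418

gene_len = span_length(START_BP, END_BP)

# Nucleotides needed to encode the protein, plus one stop codon.
coding_nt = PROTEIN_LENGTH_AA * 3 + 3

print("span matches listed gene length:", gene_len == GENE_LENGTH_BP)
print("coding region fits inside gene:", coding_nt <= gene_len)
print("bases outside the coding region:", gene_len - coding_nt)
```

Under that assumption the span equals the listed gene length of 1376 bp, and the 418-residue protein accounts for 1257 of those bases including the stop codon.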


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.00518745
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
CAAGAACTCG GCGATGATCA TCAAACTCGT CCTAGTCATC GGCCATGTCG TCTTCTGGGT 
GAGCGCAGTG GCCTCCCCAT CCAGCATTTT GAATTCCGTA AAGACCGCCG AATCTTCTTA
TCAGCAGGAG AGGGAGCTTC AAAGTTTGGA CGAATCCTTG TACAACATGA CTTTCGTAGG
TGTTAGTAAC GCGAACAACA CCATTACGGT GACGTTTGAC AATGGTATAG GCAACGGCGA
CAGCGGAACC AATCCGGATG GTCGCTTCGA CATTTACACA GCCCCAGATG ATGTCGACGT
CACGGCAGAC AAGACCCTAT CATGCTATAC CGAAGGAGGT GAGCTGTTTG ATCATGCAGC
AACTGGTTCG GGCATTTCCG GACAAATAGT ATCGATTGGA GCCAACACGT TTCCAACTCC
GTCAACCTTT GAGTTTACCT TCGATGAGAA TGTCACGGAA TCAAGTCCCT TTTACAATTA
CGATTCCGGT ACGACAACGA AGCAGATTGA ATTCATCTTT TGTGTTAAGT TCACTCTTAC
TCGCGAGATT GAAAATAAGG GTTCCGACCC AATCACTACC AGCACGGTAG ATATTAACTT
TCGCGAAGTA GCGATTGTGG TCACGGTTAC CCTTGATGGC AATCTCAATG CTAACAGCGT
GGATGCATTT AATGTCGCTG CTGCCCCCAT CAATTTCGAC CTTGACAACG AAATCGTATA
TACGGCAAGT GTTGGTCTCT GTGAAACATA CAACCTCGAC GTCAAAACAC CTCAACAGGG
CGATGTCGTT CCTATTTGCA TTATGTCGGA CGACTTCCCG TTGGCTCGAA TCATTTCAGT
ACAAGACCTG ACCTTCACTT CGGATTCTCT GACGCAACAA ATCCGCGTGG ACGGATTAGA
CGCTCCTGGG GCTACTGGTT TGTACGGCCG AGCGAGTGCC GACACCCATT GCGTTACCAA
TGAGTGCATT CAGTACGACG TGATTGTCTA CGCTATTTTC GCGACCAGCG CGGAAATCAA
TTTGAAAATT GATATTACTG GCAGCGTGGT CCTCGCCGTG GGTAACTATA TGACCCGGAA
GTTGCGAACT CGGCTGGAGC CTACTCGTGA GCTGGCCGAA TTGTTTCAAG GGCAGTCTTT
TCGGTCGACA ATTGAGCTGC CACCCTTGCC TTCGAGCGAG TCAGCTGCGT CTACCGCCTC
CGCAAATGTC GTCTGTATGA AGTTTATCCC GTTTGCTATT GCCATGCTGG CACCTTTCCT
AGTTTTCTGA AAAGGTCGGA GCGAACATTG TAAATATGTC TACAAATGGC GAATGCCGCC
TTGCGTTCCA ACGTTAAATC ATATATAGAA AAATTAAGAA AGCATTTTTT TAAAAA
 
Protein sequence
MIIKLVLVIG HVVFWVSAVA SPSSILNSVK TAESSYQQER ELQSLDESLY NMTFVGVSNA 
NNTITVTFDN GIGNGDSGTN PDGRFDIYTA PDDVDVTADK TLSCYTEGGE LFDHAATGSG
ISGQIVSIGA NTFPTPSTFE FTFDENVTES SPFYNYDSGT TTKQIEFIFC VKFTLTREIE
NKGSDPITTS TVDINFREVA IVVTVTLDGN LNANSVDAFN VAAAPINFDL DNEIVYTASV
GLCETYNLDV KTPQQGDVVP ICIMSDDFPL ARIISVQDLT FTSDSLTQQI RVDGLDAPGA
TGLYGRASAD THCVTNECIQ YDVIVYAIFA TSAEINLKID ITGSVVLAVG NYMTRKLRTR
LEPTRELAEL FQGQSFRSTI ELPPLPSSES AASTASANVV CMKFIPFAIA MLAPFLVF
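
As a consistency check between the two sequences above, the following minimal sketch translates the first row of the gene sequence and compares it with the start of the protein sequence. It assumes the standard genetic code (the Translation table field is empty in this record) and that translation begins at the first ATG; the helper names are illustrative and not part of the source database.

```python
from itertools import product

# Standard genetic code in TCAG order (an assumption; the record lists no table).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_from_first_atg(dna: str) -> str:
    """Translate from the first ATG until a stop codon or the last full codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    start = seq.find("ATG")
    if start == -1:
        return ""
    residues = []
    for i in range(start, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks an unrecognized codon
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First row of the gene sequence and first two blocks of the protein sequence,
# both copied from the record.
first_row = "CAAGAACTCG GCGATGATCA TCAAACTCGT CCTAGTCATC GGCCATGTCG TCTTCTGGGT"
protein_start = "MIIKLVLVIGHVVFWVSAVA"

peptide = translate_from_first_atg(first_row)
print(peptide)
print(protein_start.startswith(peptide))
```

For this row the first ATG sits 13 bases in, and the translated fragment MIIKLVLVIGHVVFW agrees with the beginning of the listed protein sequence. The same function can be applied to the full gene sequence once its rows are joined into one string.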