Gene PHATRDRAFT_16944 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_16944 
Symbol 
ID7199259 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011698 
Strand
Start bp99257 
End bp100570 
Gene Length1314 bp 
Protein Length438 aa 
Translation table 
GC content50% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002185426 
Protein GI219130551 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones35 
Plasmid unclonability p-value0.994947 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
AAAATTGCCA AAGGTACCCG GGACTACCTA CCGGAACAAA TGATGATTCG TCAGGAAGCC 
TTCAACATTA TTCGACGCGT TTTCGAATCG CACGGTGCCG TGGAGATTGA CACACCCGTA
TTCGAACTCA AGGATACCTT GACGGGCAAG TACGGCGAAG ACTCCAAACT CATTTACGAT
TTGGCCGATC AAGGTGGGGA GCTCTTGGCC CTGCGGTACG ATCTGACCGT ACCCTTTGCC
CGGTTCTTGT CCCTTAACAG TGTCGGTAAC ATTAAACGTT TCCATATTGG TAAGGTGTAC
CGTCGGGATC AACCCCAGTT GAATCGTGGG CGGTACCGGG AATTCTATCA GTGCGATTTC
GATATAGCCG GAACGTACGG ACGCATGGTG CCGGACTCGG AATGTCTCTG TGTGGCCTGT
GATATTCTCG ACGCCTTGCC CATTGGAGAC TTTGGGATCA AACTCAATCA CCGAAGACTG
CTGGACGCTA TTCTCGATTT GTGTGGCGTA CCAGCCGACA AGTTTCGGAC CATTTGTTCC
GCCGTGGACA AGCTTGATAA AGAAGCATGG TCCGAAGTCA AACGGGAAAT GGTGGAGGAC
AAGGGTCTGC CAGAAAGCGT AGCGGACAAG ATTGGAACCT TTGTGTTAAA CAAGGGACCA
CCTTGGGATA TGTACAAATC CTTGATGGAC GGAAACCGTT TTGGCAACCA CAAGGGTGCC
AACGAAGCCA TGGAAGACTT ACGCATTCTG TTTGAATACC TCGAAGCCAT GGACAAACTC
AAATTTATTT CCTTCGACCT GAGTCTAGCG CGCGGTCTCG ATTACTACAC TGGGGTCATT
TACGAAGCCG TCTGTATGAG CGGTGAAGCG CAAGTCGGCA GTATTGGTGG AGGTGGGCGT
TACGACAATT TGGTTTCCAT GTTTCAGGAA GCCGGCAAGC AGACACCGTG CGTTGGAGTA
AGTGTAGGGA TCGAGCGCGT GTTTACCCTG ATGGAGGCTC GATTGCGCGA GCAGCAAGGG
GGATCTATCA AGCGCGCGAA CGTCAATATT TTGATCGCGG CTGCTGGCGG AACCATGATG
AAGGAAAAGA TGCGCATTGC ACGAATTTTG TGGGACAATA AACTCAGTGC AGAATTTAGT
CAACAAGAGA ACGCGAAACT GAAAAAGGAA TTACAGAATG CTTTGGATCG TGACATACCC
TTTATGGTAA TCGTGGGAGA AGAGGAGCTG GCGGAGAGCA AAGTTACCGT CAAGGATCTG
AAGGCCAAGA CGGAGCACAA GGTGCCGATT GACGAGCTCG TTTCGACTTT GCGT
 
Protein sequence
KIAKGTRDYL PEQMMIRQEA FNIIRRVFES HGAVEIDTPV FELKDTLTGK YGEDSKLIYD 
LADQGGELLA LRYDLTVPFA RFLSLNSVGN IKRFHIGKVY RRDQPQLNRG RYREFYQCDF
DIAGTYGRMV PDSECLCVAC DILDALPIGD FGIKLNHRRL LDAILDLCGV PADKFRTICS
AVDKLDKEAW SEVKREMVED KGLPESVADK IGTFVLNKGP PWDMYKSLMD GNRFGNHKGA
NEAMEDLRIL FEYLEAMDKL KFISFDLSLA RGLDYYTGVI YEAVCMSGEA QVGSIGGGGR
YDNLVSMFQE AGKQTPCVGV SVGIERVFTL MEARLREQQG GSIKRANVNI LIAAAGGTMM
KEKMRIARIL WDNKLSAEFS QQENAKLKKE LQNALDRDIP FMVIVGEEEL AESKVTVKDL
KAKTEHKVPI DELVSTLR