Gene PHATRDRAFT_50034 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_50034 
Symbol 
ID7198730 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011694 
Strand
Start bp210577 
End bp212796 
Gene Length2220 bp 
Protein Length547 aa 
Translation table 
GC content49% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002184916 
Protein GI219129480 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0110404 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CGGTTGGGTT TACGCGTTTT CGTCTTTACG GAGGAGCGCA TTGTTGGAAT CTTGAAAAGC 
CATTCGGTCT CTTCTCTGAA CGCTGTCGAT TCTGTTTCTT TAATCGGATA GGCTGTCTTC
CAAGAAGTTT GCAAAGGTAG CTCAATTGAG GAAAACGGGC GGGCCCTTGA CTTGAACTAG
GTATTCCTTC CAGTCAACAG TGCCATTAGA TCGGAACGAG GAACTGGAAT CACTCTATTC
CTTCCGTAAA TCCCCGAGCA AAGTAAACAA TAGCTACGAC GCTGGTACGA TGAAGTACAT
TGTGGTAACA GGTGGCGTCG TGTCGGGCTT GGGCAAGGGC GTGACGATTT CTAGTATGGG
TCGTTTGCTC CAGGCCAGCG GTCTCCGAGT TACGGCCGTC AAGATTGACC CCTACCTGAA
CGTGGATGCC GGGACAATGA GTCCGTTCGA ACACGGCGAA GTTTTCGTGC TGCGAGACGG
TGGTGAATCT GATCTTGATC TGGGAAACTA CGAGCGCTTT TTGGGCATTG AACTCACCAG
TGATCACAAT TTAACCACGG GCAAGGTCTA CCGCAAGGTG ATTCTCGCGG AACGCCGTGG
TGACTACCTC GGTAAAACAG TGCAAGTGGT GCCGCACATT ACCGATACCA TTCAAGACTG
GCTCGAAAAA GTCGCGTATA TTCCTGTCGA CGGTACTGGC AAAGAAGCCG ACATTTGCTT
GATAGAAGTA GGGGGAACGG TCGGTGACAT TGAGAGTTCC GTCTTTTTGG AAGCCCTCCG
ACAATTTCAG TTCCGGGTCG GGCACGACAA CTTTTGTCTC TGCTTCGTCT CGCTTGTTCC
AGTCTTGAGT GACGAGCAAA AGACCAAGCC GACCCAGCAC GGAGTGCGCG ATTTGCGCTC
ACTCGGACTC AGTCCGTCGA TTATTTTCTG CCGTTCCACG GAACCCTTGC AAGAGCCAAC
CAAGCAAAAG ATATCCAATT TTTGCCACGT GCAAGCTAAG AACGTATTGA GTGTGCATGA
TGTAAACAAT GTTTACTTTG TACCGGGGCT ACTGCAGGAG CAGAACTTAC ACGAGATTTT
GGGCAAGGAA CTTTGCCTCG ATAAGCCATT AAATCCAGAT TTGGGATCAT GGACCACCAT
GGCGCATTCA ATCGAACTAG CATCTCATAC GGTGACGATT GCCTTGATCG GGAAATACAC
TGGGCTGCAG GATTCTTATT TGTCGGTCAT AAAGCCTTGC GTCATGCGGC AATCGCATGC
AACGTACGTC TACAGTTGGA ATGGATTGAA GCGTCGCAGC TCGAAGACGA AAAGGAAGAA
GGATACATTG GGAGTTGGGA CAAACTCAAG GCGTCAGACG GTGTCATTGT TCCTGGAGGG
TTTGGACAAC GTGGCTGGGA AGGCAAAATT TTGGCCGCCA AATACTGCCG TGAGAATAGG
AAGCCAATTC TCGGAGTTTG TCTAGGTTTC CAGGCAATGG TCGTAGAGTA CGCCAGGAAC
GTTTTGGGAA TTGACCAAGC CGATTCAACA GAATTTGAAG AATCCACACC AGAACCTTTT
GTTTTTTTCA TGCCCGAGAT TGACAAGGAG ACTATGGGTG GAACGATGCG ACTCGGGGCG
CGCACCACCA AGTTTACACA CACCCTTGCC GATGGAAGTA TGAGTGTTTC GCAACGCCTC
TACGGAGGAA AAGAAATGGT TTCGGAACGC CACCGACATC GCTACGAAGT CAATCCAGAA
AAGGTTGATG CCGTCCACGA CGGTGGCTTG CGTTTTGTGG GCCGAGATGA GACGGGCGAG
CGAATGGAGA TAGCAGAGTT ACCGCAATCG GAACATCCTT ACTACCTCGG ATGCCAGTTC
CACCCAGAAT TTCTCTCTCG TCCTTTGAAA CCGAGTCCTC CATTCTACGG CTTAATCTTG
GCAGCCACTG GTATGTTGGA GGACCACCTG CAGTCCGTAT TGTAAAACAA CGGAATCTTG
ATTCGTCTAT ACGATCGCGT GGGCTGCCAT GTCTTTGCAA TAAAGCTTAC GATTTCATGA
TTACGACACA GTTGGCAATA AGGATTGCAA AAGGTTAAGG AAATGGAAAA TTAAAATGGT
TACTTCTTTC ACAGTAAGTT CAACATCTCG ACAATTGCAA ATTTGGCAAC AGTTCGGTAT
CAACCATAGA AAAATTGGTT TGAATCACAA AATCTAACCT ATAGTGGAAA TTAGCCATTG
 
Protein sequence
MKYIVVTGGV VSGLGKGVTI SSMGRLLQAS GLRVTAVKID PYLNVDAGTM SPFEHGEVFV 
LRDGGESDLD LGNYERFLGI ELTSDHNLTT GKVYRKVILA ERRGDYLGKT VQVVPHITDT
IQDWLEKVAY IPVDGTGKEA DICLIEVGGT VGDIESSVFL EALRQFQFRV GHDNFCLCFV
SLVPVLSDEQ KTKPTQHGVR DLRSLGLSPS IIFCRSTEPL QEPTKQKISN FCHVQAKNVL
SVHDVNNVYF VPGLLQEQNL HEILGKELCL DKPLNPDLGS WTTMAHSIEL ASHTVTIALI
GKYTGLQDSY LSVIKPCVMR QSHATYLEDE KEEGYIGSWD KLKASDGVIV PGGFGQRGWE
GKILAAKYCR ENRKPILGVC LGFQAMVVEY ARNVLGIDQA DSTEFEESTP EPFVFFMPEI
DKETMGGTMR LGARTTKFTH TLADGSMSVS QRLYGGKEMV SERHRHRYEV NPEKVDAVHD
GGLRFVGRDE TGERMEIAEL PQSEHPYYLG CQFHPEFLSR PLKPSPPFYG LILAATGMLE
DHLQSVL