Gene PHATR_46949 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATR_46949 
Symbol 
ID7204766 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011679 
Strand
Start bp901558 
End bp904135 
Gene Length2578 bp 
Protein Length633 aa 
Translation table 
GC content48% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002185976 
Protein GI219121507 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.779504 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATCTTTCTTT CCCCCAAAGA GGTATCTTTT GCAACGTGTG TCTATTTGCG GTACCCTCGT 
TCGTGCAGGA TTCTGTCAAC AATGTTGTCT CGCGTTCGTC ACGGTAGCTG TCAGCATCTG
TCATCGACGA TCGTTCACTT CCGAGTCCGT TTTCGGCTTT GAAACTTTCA GCATTTCACC
AAAGATCCTC CATTTGTCCC GTACGAAACA CTCTTGGTGG TTACGTTTCT ATCCTTCCAG
TAGAGAAGCG CTTGCTACGC AATCTGTAAT AGCTCCGTTT CCAGGCAGAA CACTTTCGAC
TGCGGGTTGT CGGGATTTAT CGACGGACGG AAGGAACATC CATACACGTA TCGCATTTAC
GTATCAAATA CACGTATCGC ATATACGCAC CCGTCGTCGA CACGAAAAGG ATGAACGCGT
TTACTACGGG ACCGGAGTCA ACTGCTGCCG ACCGCCACAA CAGACGCGCC ACGGGAATTT
CTTCCGAGGC AGAAGAACGC CGGATGGAAA CCTTGGAAAT CATGAACACA CCAGTCGAAA
ACGATGCGGA AGAGGAGAAT GATGATGACG ACGAAGAAGA AGAGGCGATG GCGCCTGTTG
CTGGATGGAG ATTCTGGTGG ATCTTTGCCA GGTTTCCGCT GGGTCTGCTC CTCGTTAACA
CGATCCTTCT CGCTTTAATC ACCCTATACA ATCTACAATA CTCGCCTTCG TATATTGTGG
GAAACTCGAC GCACGCTGTA TTGAACGACG ACTTCGATTC CGACGAGCGA AGGGATTTCG
GGCTTGCGCA GTTCAGTGCG CTCGAACTGC AATCATGGAT TGTCAAATGC GGGATGGTAG
CCCTCTTAGC TTGCCTGGAT GCTATTGTCT TTTACTGGTT CACAGTACGT CTCAAGAAGG
GCATGGAATT GCTTGCGCGA AAGGCTCAGC AAGAAGAGCC TCAAGGACGG CCAGTCGCAA
ATGAGAATCT GAACAGGCCA AGAACGGCAC CGTTGCACAA GTTGGACTTG AACTACAGCG
AACTGGACCT GGTACACGCT CGGGTGACCC AAAAGATAGA CTCGTATCTA TTTAGGGACC
CAGCGCGGCA ACCGACCGTC CCAATCTTGG TCAGCCTGTT GTACCTACTG ACGGCACTCA
CAGTGACTGC TTGTCTGACT GCGCTATCAC TTTTGATATT TTTGTACAGC TCCGAAGGAT
CTTCCATGTG CTTGAAGAGT ATCACATCAC AGTCAGGTGA TTCCGTGGAA TTTGATATTG
ACTCGTTAGA AAATATCCCG CAAGAACTAA AGGAATGGGC AAGTCAGAGA TATTCATATT
CTGATGGATC ATCATACATT CATATGAGTG ACGGAACTAC TTACTTTCGC GGTAGAATGG
CCGAAAAGGA ACATCATGGT TGGGATTCCT ATATGGATAC GGAAACACTA GTTGCGACCA
ACGTAAATGG AAATCTTACT GTGTACAGTC AACTCCACGA ACCGCATAGC TTCGTGAGTA
TTAACGAAGG TTCTGGTGAA ACAATGGAAG GATTCTGTTT CCTCTATACG GAGTTTGTTG
GACACGACGA CGAGGAAATA TATGAATACA CAACGCAAGC AGTTGCGTGT GTATCTTCCA
ATGAAAACAT AAGTCAGGGA TTTCGAAACA CAACTCTTCT AAACAGTGAC GGAGGGGGAT
GGTTAGAATC CGTTGGCAAA GCTCACGATG GACGATACTG GATCAGGCTG CAAGAGGATA
GATTTGTTGA TGAGTGGTCG TCGTATCAAG AGGTTTTGCA TATCATACAG CTGGACCCGC
AGTCGATGAT GTATACTGTA GTTGCAAACT CAACTTCCTT CCCCGATTTT CAACAACCCT
TGAGGAACGA AGGAAGTAGA TGTTTCCGCT GGACTAGCGG CATTGGGTAC GTTGCTGCCG
CAATATCTCT GTTTCTTTCG GCGCTGGTGC TCCTACTATT CATCAAGACA AAGTCGGGAG
CAGGTTGCTT GGCGTTGTCT ATCTTTGCAG TCCGACTTTG GCTGGAAGAA ACTTGGCTAG
ACATGCTGAG TCCAGTATTG CTTGTTTTTA CATTCATATG CTTGTGCACG GCATCACTCA
GCCTTGCCGT CCGAGAGATG GTGCTATGGG GAATATATAG CGTCATTGTG GTGCAATTGG
TTTTAGCTTT TGTGAACCGA GAATTTCCGG TAATGGGGAC TATTGGACTA GGTATGGGCC
TCGCGCTGGA CCATCCTGTA CTTCAGCTGG GTGGGTGGAT CGGAGCGCCT TTATCGGTTT
GTATTCTCTT GTACTATGCG ATAGTTGGCT ATTTCGGCAA CACTTATGGT GGCTACTTCT
ACTATGATCG ACAGTATACT ACTACTCTAC TGATTGTTGC ATGCATCCCT GTAAGTATAA
TTATCGGTTG CGGGATGGTG ACGGCAGGCC TTTACTTTTC AAGGTCTCGT GCAGTCTTGT
TGTTCTATCT GAGGCGCTTC TGGCGATCTC TTCGTGTGAA ACTGCGGAGG CGAAGTCGGC
CTCAAAGCAG CAATAGCGAA ATGGTCTAGT TCACATAAAA TTCTTGTTTG TATTTATG
 
Protein sequence
MNAFTTGPES TAADRHNRRA TGISSEAEER RMETLEIMNT PVENDAEEEN DDDDEEEEAM 
APVAGWRFWW IFARFPLGLL LVNTILLALI TLYNLQYSPS YIVGNSTHAV LNDDFDSDER
RDFGLAQFSA LELQSWIVKC GMVALLACLD AIVFYWFTVR LKKGMELLAR KAQQEEPQGR
PVANENLNRP RTAPLHKLDL NYSELDLVHA RGPSAATDRP NLGQPVVPTD GTHSDCLSDC
AITFDIFVQL RRIFHVLEEY HITVRMAEKE HHGWDSYMDT ETLVATNVNG NLTVYSQLHE
PHSFVSINEG SGETMEGFCF LYTEFVGHDD EEIYEYTTQA VACVSSNENI SQGFRNTTLL
NSDGGGWLES VGKAHDGRYW IRLQEDRFVD EWSSYQEVLH IIQLDPQSMM YTVVANSTSF
PDFQQPLRNE GSRCFRWTSG IGYVAAAISL FLSALVLLLF IKTKSGAGCL ALSIFAVRLW
LEETWLDMLS PVLLVFTFIC LCTASLSLAV REMVLWGIYS VIVVQLVLAF VNREFPVMGT
IGLGMGLALD HPVLQLGGWI GAPLSVCILL YYAIVGYFGN TYGGYFYYDR QYTTTLLIVA
CIPVSCSLVV LSEALLAISS CETAEAKSAS KQQ