Gene PHATRDRAFT_47193 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_47193 
Symbol 
ID7202185 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011680 
Strand
Start bp765866 
End bp768850 
Gene Length2985 bp 
Protein Length709 aa 
Translation table 
GC content50% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002181441 
Protein GI219122204 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0349401 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CTTCTGTACG CGGGGTTGTG GAGGGGTAGG AGGGCGGGCA TCGTTTAGAG TGCTTTTTCG 
GTGAACGGTA ACTGTAAGGT AACCTCGGCG AATTCTCGTC CCCCGTATAG TATCGAACAG
TAAACGCCGT AGCTTGACCT CTCGACGTGT TTCTTTCACC ATCCCGGTCA TCGTAGTTGC
AGTTCCTTCC GTGATCGATT CCCACACACG ACCTCACTAC CAGTCGTTCT TACTCGAAAC
CGTTCAACAT CCACCGACTA TCAACGTAGA AATAGCGGTC CGATTCTTCG TCGTGGGAGA
AAATCATCAA ACGCAAGATC TGGGTTGTGG ATTCCTGGAT TTGCGCTCAA AGTTTTTGCG
TTTCCTCGTA ACGTAGAAAC TTACCGCAAC ACATCACAAT CATGTCCGCT TTCCTCAGGC
ACTGCAGTTC CTGGTTGCAA CTAGTCCTAT GGGTATTGTT ACCTAGCTTG CTTTGGAAAA
CCGACGCCTT TTCCACCCAC AAACCATTCG TGACGATGCC GTTACTGGTG GCAACCGACG
GGTCTCCCGC GTCTCGACAA TTGAGTCCGG CATTACGGCA ACAGTCTCCA CGCCACCAAA
AACTGCCGCT CGGAATTGAC AGTGAGCTAG AAGCACACTC GCGTCGCTTC CAGATGCTCT
CGAGTAGTAC ACGCACGACA CGACGTTCCG CAGTTGCGGG TGAAGGCAGC GAAGAACGAT
TCCGCAACGG CAGCAGGACT CCACCACCGC CGCCGAATGG GGAAAACTTG ATCTCGGCGA
GTGCGGCAGC GGCGGAACGC TTCGCCAAGG CGGCACCTCT CGACGAAATT GCGCCTTCCG
CACCGGGCAG TAAGCTACGC AAACTCAAAG ATCTCATGTG GGTCCGGGAA GCTCTGGAAG
ACTTGACGGC GGCCGAATTC GCTTGCACGG TGGAGGCCTC GCACCACAAA CAGGACGAAA
CTGTGTCTCG GCGACGCAAA CGCGCCGTTG ATTACGAAAA ATTACTCGGA CAGCTCAATC
GCCGGATTCG CGATTTGGGC TGTGAGCCTG TAGAGGCTCC CAAGACGAAC GACGACGAGG
TTACCGACGA AGCTGTCGTT CCCAGTGGAC AAGTGGAACC GGATGTGGGT GCCGGTACGC
TCGTCTACTC ACTCAAACAA CGTCAGGCTT TATTGGATCG TTTGTTTCGT ACGCGTCAAC
TTCTCCTGGA AGTCATTCAA GGCTACGAAC TGGAAATCGA TCCGATGGAT TCCTTTACCA
TCAGTTTGCC TTCGATTCGC GTAGAAATCC CACGAGAAGA AGATCCTTCC TCACCCGGTC
CCAAATTGTA CGTGCGCGAT GATGGTACCG TTGACTGGGA CGGAGCATTA CAAGACCAAG
CTGCCATGAA AAAATTCGGA ACCGCGGTTT GGGCACGCAT CAATGGACGA GATCCTGAAT
CTCTCGACGG TGAACAGCGC AATCCCCAAA ATGCCAATAT TGGCTCATCT GCTGTCATTG
ACGCAGAAAT CGAGTCGGCC ACGGGCGCTG CCGTTGTGGG AGAAGTTCGT GAGGTCGAAA
AGCCAACCGT GACGGCCAAA ATTGAAGAAA CTCCAGAAAT AATTAGGGCT CGTCAACGCT
TGGAGATCCT CACCGCGGAT TTGGCCAAAA TGGAAGCGGA TTACATTGCG CTGATCAGCT
CCGCCATTGC TCCGGGACAA GCTGTGGCCA ACGTCAACCT AGCCAATCTC GAGCCTGCGC
AACGTAGCAG AATTCGTGAA TCCACCGAAG GCATTGATGT TATGAAGGAA AAGGTCTCGT
TTCAAACTCT AGTGTACGAA CTCGAAAGGG TATACACTTA TTTAGTGGGA GAGATGGGTA
ATCCTGCCCA AAATGGGTAC ATTCCATTGC AAGATCGATT GAATGTAGCA GAATTTGGGT
TGCTAGAATC TCAAATTGAT AGCTTCCATC GACAACTGGA CGAGGGTAGC TCATCGCTCG
ATACAGACGT CATGGCGGTC GTCTTGGAGC AAATGATTGA TTTTAAACGA CGATTGGGAA
TCGACTACTT TGTGGCTGGT TTGTCGTTTG ACAGGGACGC GATAAAACGG TATATGAGTG
AATTACTGGA AAAGACCAAG AAAGGTTTAG CCTTTTACGT CAAGGGCGTT CGCCTCTTCT
GGAATGATAT CATATTTTGC TTGAGTCTGA TCAACCGCGC CGCACAAGGG TATACTCTCA
AGCCTCGAGA AGTACGCACA ATTCGGTACG TATGGGTTGT CATTTATGTT CCGTCGCGGA
TATGCTGAGC TTTTTCTCAA ACGTTTTTTT TCGTTTTGGA CAGACGAACC TTCAAGGATT
TTTTTACATT TATTCCGTTT GTGATTATCT TGTTGATTCC GTTGTCGCCC ATCGGCCACG
TTCTTGTCTT TGGTGCTATC CAGCGATTCT ACCCCGACTT TTTCCCCAGT TGCTTTACCG
AGCAACGTCA GAACTTGCTG CAGCTGTACG AGAACGCTGA ATACAAGGAG TTTACAATTG
ATGAAAACTG GAAGGTAAGT CTCCGTTTGT TGCCCTTGGC ACAAGAAAAT GGACAAAAAC
GTTGCCGTCC TTTCGACTTA CACTGGCATT CGTCGGTAGG AAAAAATGTC GCGAATGTCG
GAAGCTGCCG TTTACTTTGG AGCCAACACA TCACGAGCAT TGTTTGAAAA GATGGCCAGC
ATGGTCCGGG GACAGAGTGG TGCAGCTACC GACGCCACTA CCGAGAAGAA TGGAAAAGAG
CAATAACGAC GGAATATGCT TATCTTTTGC GTGTAAATGA ATTTAGTGAA ACAGACGGAC
GAATCCAAAC ATGCTCACAT TCAAGACTTT CAGAGCCACC CAATGACTCA ACTTGCAAAC
TCCGATCCCG ACTACACTAC CTCAGACTCT ACAAGAAAAT GTGGCAGTTT GACTCCAGAC
CAAATCTACA TCGTTGTTTT CACATCGCGA GTTGATTAAA AGCTT
 
Protein sequence
MSAFLRHCSS WLQLVLWVLL PSLLWKTDAF STHKPFVTMP LLVATDGSPA SRQLSPALRQ 
QSPRHQKLPL GIDIAGEGSE ERFRNGSRTP PPPPNGENLI SASAAAAERF AKAAPLDEIA
PSAPGSKLRK LKDLMWVREA LEDLTAAEFA CTVEASHHKQ DETVSRRRKR AVDYEKLLGQ
LNRRIRDLGC EPVEAPKTND DEVTDEAVVP SGQVEPDVGA GTLVYSLKQR QALLDRLFRT
RQLLLEVIQG YELEIDPMDS FTISLPSIRV EIPREEDPSS PGPKLYVRDD GTVDWDGALQ
DQAAMKKFGT AVWARINGRD PESLDGEQRN PQNANIGSSA VIDAEIESAT GAAVVGEVRE
VEKPTVTAKI EETPEIIRAR QRLEILTADL AKMEADYIAL ISSAIAPGQA VANVNLANLE
PAQRSRIRES TEGIDVMKEK VSFQTLVYEL ERVYTYLVGE MGNPAQNGYI PLQDRLNVAE
FGLLESQIDS FHRQLDEGSS SLDTDVMAVV LEQMIDFKRR LGIDYFVAGL SFDRDAIKRY
MSELLEKTKK GLAFYVKGVR LFWNDIIFCL SLINRAAQGY TLKPREVRTI RRTFKDFFTF
IPFVIILLIP LSPIGHVLVF GAIQRFYPDF FPSCFTEQRQ NLLQLYENAE YKEFTIDENW
KEKMSRMSEA AVYFGANTSR ALFEKMASMV RGQSGAATDA TTEKNGKEQ