Gene PHATRDRAFT_45871 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_45871 
Symbol 
ID7200969 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011676 
Strand
Start bp521926 
End bp525550 
Gene Length3625 bp 
Protein Length944 aa 
Translation table 
GC content46% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002180065 
Protein GI219118591 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCGGAGC AAGTTCAGGC AGCCCTTGAC CAAATGGTCG AGCCTCTGTT GGACCTGCAG 
AATCGAGGTG TCTTTTCCCG GGACGAAATT CGCTCCATCG TGGATCGTCG TCGACAATCG
GAATACGCTC TACGCCGCCG AGGAAAACTC CGCAAGGCCG ATTTTATCGG GTACATACAA
GCCGAAACTC AACTGGAAGC CTTGCGGGCT TTGCGTGTGC AAAGAATTAC TCGAGAAGAG
CGTCGAGCGA ATCGAGGCAA AACGTCGCAG GATGATGCTA AGGACAGCAA TACTTCCGGC
ACGAGCAAAA TTGGTGACCG CCACATCGTT CAGCTTATTC ATCTGCTATG GACACGTACG
CTACGAAAAT TCCGCGACGT GAGCTTCTTT CTGGAGTACG CCGAATTTTG TCGATCCCAA
AAATCGTTTG CCAAACTTGC GACTGTATAC GCCCAGGCCT TGGCTTTGCA TCCGAAACAA
ACGGGACTTT GGATAGCTGC TGCCTCGCAC GAATTTTTTC AATCAAGTAC GCCGCATACG
GCTCGAATTC TGTTGCAGCG TGGAATTCGG GTTAACCCCA CATCGCCCGA TTTGTGGTTA
CAGAGCCTGG TTATGGAGCT ACATTTGGTA CAAAAATTAC GGGGTCGTCG CGATATTCTG
CGGGGTGCTG GTCGTGGTGG TGAAGAAGAA GAAGATAAAG AATTCTCAGA GCACAAGATT
GCTCGTTTGC TATACGATAA TGCGATTCAA GCCATTCCCG ATCGTGTCGA GTTTCGTTTA
CAGTGTTTGG ATCAATGTCG CTTGTTTCCG AATACAGAAG ATTTGCAAGC TTACATTCAT
ATTTCAATGG AACGAGATTG TTCGTCTCGC CCCGAAGCGT GGATCGCTCG GGCCATGCAC
GAATGGGAAC GCCATCGAAA GCTAGGAGAG AATGAGAAGA GTAGTATTGG TTTTTTACAG
TCAAACGGAC TTGGCGATGA TTTGAACCAA CGTCAAGAGG ACAACCCGCA GCGAAAAAAG
GCTCGAACGC TTCTCCATCA TGAGCAGGAA GCTAGAGACG ATGTACTGAA GGTGATCAAA
CAGGCTGTGG ATACCCTTTC GTCTCATGAA ATGTATCTGC AAGGAATTCG CTTCCTTCTC
TCATACATGC AGGAATTAGC TGAAGAATTA GAAGAGAGCG ACGATAGCAA AGAAAAGCTA
GAACGAAGTC AAAGGTTTCT TTTTCAGCTC TTCGATCAGG CAAAGACCCT TCATTTCGAA
ACCTCGGACT TAATTCTCGA GCAGGTAGCG TTTTTGGCTC TTGTTGAGTC GGATCAAGAA
GCTAGCCAAT GCATTCAAAA CTTTGTAGAG AACCATCCTA CCAAGGCTTC GATTGATGTC
TGGCGCCGCT ATGCTGCAAT GTCACCCGAT CAAGCCGTTG ATATTCTTGA AGAAGCTGCA
AAGTATATAT CTGTCGAGCA AGAAGCGCAC ATGGTTGTTC TGTTGGAGCT CTTTGGTGCC
AAACTAGCCA TTCCAGACGA GAAGAGCTTG CCGGTCCTCT TTCAAACGAT ACTTTTGCTA
GCTCCTGGTT TTGGGGAGAT GAAGGATGTA GAGGACCCAG TGTATGGAAT AAAGAATATT
TCATGTGCCT GCTTGCAATA TTTGCGATAC GCCTCTAAGC ATCATGGGCT AAACGAAATG
CGGAGAATTT ATCAGGCTGT CTTGTTCGAG TCCTCTTTGG GAACCTTGGA AGGTGGTGAT
CCATCCTTGA TAAACATTTT CTTGGAAGAG GCAATGAAGG CTGAAGAAGA ATTCGGCAGT
AGTAAGGAGA CTCGATGTCA GCTTAGGCGC CTGTGTGACA TCGCGTTGCA GTTGTTTGAA
AACACCGACG AGGAGGAGAA ATATAGACGG CGAAAGGAAA ACATTCTATA CGGGTAAATA
CAAGAGGGGG GTTGAGGAGG CGATTACTTT TGTAGCTATG ATCAAGCTGA CGCTAGTGTT
GCTGTTTGTG TCATAGAAGG CTTCGAACAC CCTCTGATCT CTCTATACAT GGATTGCCCT
GTTCGGTCGG GACGAATATG GTCCAAATAT TCGATGTTGA TGCACCAGAC TTTTCAGCGG
AAGCTTTCGA AGTCGCCTAT AGTAAGTTCC GAGTCGTAAA AATTCGCCAC GCTTACAATA
TGAAGGAACA ACGATCTTTT TCCTGGAAGA ACATTGGTCC AATTTTTGAT AATTTAGACG
CAAGTGACAA AGAGTCTTGG TGTCTCGAAA CAAATGAGGG TGAGCCCTGT TCTCCGGAAT
CTTTCCTGCA GTCCAAGCTT GACACGTCTA GGGCATACTG CAGCTTTTTG ATACAACAAG
ACGAGGATGC TTATAAATCA GCAATGGATT TGCTTCCAAT TTCTGAACTT AGTTGGACAA
ATTGGTCGTA TGAGCCCGCC CTTTGGATGT TTTTCGGCAG AAACCGGAAG GGCAATGATC
CTTTGGAAGG ACGTCCAGAA CATACGGATG CCATTTTACA CGACGGAACT TGGCACTATC
AGTTGTCCGG TGGAAAGGAA TGGTACCTAC GACCAACAAA GGAGCTCTTG AAACATATGG
ATGGCCATTT AACGAAAGCA GAGCGAAGGC TTTGGAGCGA ATCGAGTCGC GTTTGTGTCG
CGTGTGAAGA GGGAAGCATC CTCGTGATCA AGTAAGTCGT GGCTTTTGCT AAATCGCTGG
TTTCGTGTTG TGTATGAAAA CGCATCTGGC TCATGAGTCT ATTATTTAGC ACGAAGCTGT
GGTTTCATAG AACAGTAATT CCATCTCAAA AGCAACCATC CGTATCTTAC GCCAGAGATT
TTCGCTTCGA TTTGAATGCT GCATGCATCC GAGATAGTAA GGGAATGACA AATGTAGACG
GCCTCTACGC GACGAGTGAT ATCGAAGAAG GAACAATCAT TTTTACAGAG AATGATATGC
CTGAATGCGA GCTTCACCGG TCGTCCACGG ATCCAAACTG TGAAGTCGTA GCGTTGGATG
ACGGGTCCAG TGCAGTTGTG TCGATGAAAG CTATAACTGC TGGAGAATTC TTCAGTGTGC
TCGACTCTGA GAGTGAAGAG GAAGGTGAGG TTGATCCTCC TGGATCAACT TAAAAGTAAT
CGGAATGCTT TCTAAAAGCA TCTCGAAAAG TAGCATATCT GTGTCATCAT TGACAGTGGA
TATGGACGTC GCTAAGAATC AACGAATCCT TCTCGGAGAA TAAATTGCCA GGCTTCCTAC
CTTTCACTTG AACATGCTCC CATCGACCCC TAGATCCTAG GCCATCCATT GTCACTACAT
GCTCAATCGT GCCATTACCG TCAAAAACAG ACTGAACAAC GTTGATCTTA CCATTGCCCT
TGATTTGGAA GATAGGATCT TTGCGGGGCT CTCCAGAAGC TATTTTGGGC CGACGAAATG
TAACTCCTTC ACACCATCCA CCGTTGCCCC TCCATACTAT AGTTCCAGAT ATTTCAACAA
CTACGTTTGC TGGGTTGTAT TCATCTCCCA CGATGCGGAT CGGGACATTG ACTGTCACTG
TTCCTTTAAT CCAGTAGTGC CCGTCACCGA GCTCTGTAAC ATATATATAT ATATATTAGA
ATCTGTACTT TTCCGGCCAA AAAAA
 
Protein sequence
MAEQVQAALD QMVEPLLDLQ NRGVFSRDEI RSIVDRRRQS EYALRRRGKL RKADFIGYIQ 
AETQLEALRA LRVQRITREE RRANRGKTSQ DDAKDSNTSG TSKIGDRHIV QLIHLLWTRT
LRKFRDVSFF LEYAEFCRSQ KSFAKLATVY AQALALHPKQ TGLWIAAASH EFFQSSTPHT
ARILLQRGIR VNPTSPDLWL QSLVMELHLV QKLRGRRDIL RGAGRGGEEE EDKEFSEHKI
ARLLYDNAIQ AIPDRVEFRL QCLDQCRLFP NTEDLQAYIH ISMERDCSSR PEAWIARAMH
EWERHRKLGE NEKSSIGFLQ SNGLGDDLNQ RQEDNPQRKK ARTLLHHEQE ARDDVLKVIK
QAVDTLSSHE MYLQGIRFLL SYMQELAEEL EESDDSKEKL ERSQRFLFQL FDQAKTLHFE
TSDLILEQVA FLALVESDQE ASQCIQNFVE NHPTKASIDV WRRYAAMSPD QAVDILEEAA
KYISVEQEAH MVVLLELFGA KLAIPDEKSL PVLFQTILLL APGFGEMKDV EDPVYGIKNI
SCACLQYLRY ASKHHGLNEM RRIYQAVLFE SSLGTLEGGD PSLINIFLEE AMKAEEEFGS
SKETRCQLRR LCDIALQLFE NTDEEEKYRR RKENILYGRL RTPSDLSIHG LPCSVGTNMV
QIFDVDAPDF SAEAFEVAYN ASDKESWCLE TNEGEPCSPE SFLQSKLDTS RAYCSFLIQQ
DEDAYKSAMD LLPISELSWT NWSYEPALWM FFGRNRKGND PLEGRPEHTD AILHDGTWHY
QLSGGKEWYL RPTKELLKHM DGHLTKAERR LWSESSRVCV ACEEGSILVI KTVIPSQKQP
SVSYARDFRF DLNAACIRDS KGMTNVDGLY ATSDIEEGTI IFTENDMPEC ELHRSSTDPN
CEVVALDDGS SAVVSMKAIT AGEFFSVLDS ESEEEGEVDP PGST