Gene PHATRDRAFT_45417 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_45417 
Symbol 
ID7200535 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011675 
Strand
Start bp97302 
End bp100232 
Gene Length2931 bp 
Protein Length976 aa 
Translation table 
GC content48% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002179579 
Protein GI219117572 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAAGAAT TTTCTTCACC CCTCACGAAG CATCCGATCC TGATCGACCT TACACACGAC 
GACATCGAAT CGCCAGCTGC GGAGGAGCAA GTGCATTCGA CGAAATTGCG GGACAGAGTC
TTTCCTGAAC CTAATACACC GGAGGCTTTT GCTTGTTCGA CTACGGATGC GGCGGTATCG
CGATCGGCAG CATCCTCCAG GCAAATTGTC ACGCCACCAG TGGCAGGAAC TCGTACTGTA
TCCAATTCAC AGGACCCAAA TGTTATTGAT CTCGTGAGGG AAGATTCGGA TTCGATTTCT
TACTCTGAAG TGTTGCAAAT CGATAGGAAT GGGGGAAAAG TGAGCTGGTC ATCCGTCGCA
GAAGAGCAGA CGGACAACGT CCGCAGCCAA GATCACCGCG ATCGTCTGTC GCGTACGTGT
CCTCCACGCC GAGAATATCG ACGATCTAAA CAATGGACGA AAGCCAATTT CAATTTGGGT
CAAACCTTGG GAACGCCCAG ACAACGTGAG GGCACGGACC TTGAACTCCA TGATTTCCTC
GGCTTTCGTA ACTCACTTGG CCGTGTGTAC TACAAATCCT TTTCTTTGAA GAATGACCTG
TGCCAGGGAA CCTTCCGTGT TGGTTGTTCG GTGTTGTTAG CAACAGAAGA ATACGAAGAG
CCTACTTGTC ACCGGATCAT ATCCATCTTC CAAGCTACCA GGTCCTACGT GGGACAATGG
TATGGTACTC AAGAGATACA AAAACGCAGC CACTGCTATT TTGAGTTCGA ATCAGCTTCA
GCTGAGCAAA ACTCTGTTTA TGAAATACCG CAGTGGATTC TTCATCCTTA TCTGAAGGTG
GTCAATTGCG TAGCCATAGA TGACTTAGAA AGAGGCAACG AGACCTCCGC AATGCGTCTA
CTTCTAAGAT CGGCATCACG AAATTGGCAT CCTAAGCTGA AAGCCGAAGA GTGTACTTTC
GTGTGTCAGG GTAGCCCAAA GTCCCGCCAA CTTTCGACGA CCGAACGATG GAACCATGAT
TCAAGTGACG ATTCCGATGA ATTCGAGGAC GCCTCTTCCG AACTCGCAAA AGCCGCAGCC
ACGAAAAAAA GAAAAAGTGA AGGTCGTGCC GATTCCTGTC AGCGAACCCC TGTCCAGAAA
CATAAGATTG CGCGTGGGCG AAAAAAGGCT TCGTCCGTGA GATCGTCTCG CCCAAGACAA
AGTTCGGGAG CAAGGAAAAG GCCAGCTTGC CCTATGCAAA AGCAGCGGTG TCATCCTTCG
ACAATTCAAA CATCAGACAC CGAGAAGATG TTCATCGAGA ATAGCACTGG GCACGCTGTT
CTAGGCAATT TATTGGTACA AAAGATCCGG GCTGGGGCTA CTATGAAACG AGATAGCTTG
TTGTCTTATT TTCTGAATCA TCGTTGGCTG CTAGTCGAAA CTCCACTTCA TAATTTTCAA
GGGCTTTTTG ACTGTCGCTT TTCAAAAAAC GGTGAGACTT TTGTTCTGAC TTCCGTGAAG
GCTCAGGCTG ATACAAGCGT GGGCTCGCTC AACAAGATGG GTTCGCTAAA ATGCGCGCAC
TTGATTTACA CCAAAATCTC GGGACCGGGT GTGGCTCCTA TATCTGTTCA AAATCTGCTC
TTGCAATTTG GTGACTTTTC CATTCTACCA GCAAGAAAAG TCGTTGCCCG CCTAGAATTA
CTTCAGAGTC CTTCTTGTAC GTTTACGGTT GGGCAGAAGA AGCACTATGG TATGTTTTGC
CTCCAAGCAA GCGACTTTGT TGTGATGGCC GAAGAAGGAA ATGACGGATG CGGCTTTATT
TCGGAAGAGT TGCTGGCATC GTTGTTTGGG AATAGTAAAG CAGCGAAGCA ACTCCTTGGT
CCACAGGTTC GAGTGGTTGC TCCTCGACTA GGCATTTTTA AAGGCATGTT GATTCGCAAA
CGGATACCAG TGGGTGAGCC TCCGATACAG CTGACACCTT CAATGCGTAA GGTTGGGCCT
TCCCGATACT CCGAAAACGA CATTCGAGCA TTTTTGCTTG TTACAAATCA AGGAAAGCAT
CCCAGTGTGA ATAATGACGC ACTTGGAAAG CTACTTAACC CTTTGCTTGA CAATCCTCCT
CCCTCTTGGA AACAAAACGG TTTTGCCGAG AGAAGTCAGA TGCTTCCTCT CCTCTTGCGT
ACTTTGGGCG TTCCAGCAAT TGTCATGGAA CGCTACCAAA GGGAATATTA TTCCCAGACA
CGTTGCCGCA TACACCACAC ATTTATGCCA GGGTACGCTG ATCCAACGGG TGCGATCCCA
CATGGACATG TGTTTGTGAC AGGAAGCAAA CCGTTTCAGG AGAACCTTCT TTTTGTGACC
AGATCGCCTT GCATCTTCCC AAGCGATGGG CGGTTGCTGC CGAACTTGGT GACGAAGCCA
AACGCTATGG TAATCGATGA TTGGAACTGG CTCAATTCTC TTCCTTTTGG GGCTCTTATA
TTTGCCGACG CTACTCCCGG AATGAAACCC TTGCCAGCAC ATATTGCTAA CGGCGATCTT
GACGGAGATC TATACTTTGT GTGTTGGGAT AGCGAAATCT TAAGGAATGT ACGAGCCGAT
CCTATTGTGG AGGAACCTCT GACCTTAACG GATGGTGAAG TTGCCAGTAC TCCGCAGGCA
AAGATGCCTC CGGAGAATCC CAATTGGTTT GAGGAGGCTC TAGAAATCAT GTGCGATCCT
GCTGAATTGG CCGAAATCTC CGCATTTTAT GGAAAGCTTT TCAATCTCGC CTTGAAAGCT
GCGTTGAACA ACCCGAACAA TTTGCTGTTG CGGGATCCTG ATGCCATGGA CTACGCTACG
GCCTACAATC AAGCGTTGGA CTACCACAAA CATGGTCGTC TTGTTCAACT TCCCAGAAGG
CTCCATTCTT CAATTCCCAC ACGCTTTCAC CAGTATCTTG CGAAAACTTA G
 
Protein sequence
MEEFSSPLTK HPILIDLTHD DIESPAAEEQ VHSTKLRDRV FPEPNTPEAF ACSTTDAAVS 
RSAASSRQIV TPPVAGTRTV SNSQDPNVID LVREDSDSIS YSEVLQIDRN GGKVSWSSVA
EEQTDNVRSQ DHRDRLSRTC PPRREYRRSK QWTKANFNLG QTLGTPRQRE GTDLELHDFL
GFRNSLGRVY YKSFSLKNDL CQGTFRVGCS VLLATEEYEE PTCHRIISIF QATRSYVGQW
YGTQEIQKRS HCYFEFESAS AEQNSVYEIP QWILHPYLKV VNCVAIDDLE RGNETSAMRL
LLRSASRNWH PKLKAEECTF VCQGSPKSRQ LSTTERWNHD SSDDSDEFED ASSELAKAAA
TKKRKSEGRA DSCQRTPVQK HKIARGRKKA SSVRSSRPRQ SSGARKRPAC PMQKQRCHPS
TIQTSDTEKM FIENSTGHAV LGNLLVQKIR AGATMKRDSL LSYFLNHRWL LVETPLHNFQ
GLFDCRFSKN GETFVLTSVK AQADTSVGSL NKMGSLKCAH LIYTKISGPG VAPISVQNLL
LQFGDFSILP ARKVVARLEL LQSPSCTFTV GQKKHYGMFC LQASDFVVMA EEGNDGCGFI
SEELLASLFG NSKAAKQLLG PQVRVVAPRL GIFKGMLIRK RIPVGEPPIQ LTPSMRKVGP
SRYSENDIRA FLLVTNQGKH PSVNNDALGK LLNPLLDNPP PSWKQNGFAE RSQMLPLLLR
TLGVPAIVME RYQREYYSQT RCRIHHTFMP GYADPTGAIP HGHVFVTGSK PFQENLLFVT
RSPCIFPSDG RLLPNLVTKP NAMVIDDWNW LNSLPFGALI FADATPGMKP LPAHIANGDL
DGDLYFVCWD SEILRNVRAD PIVEEPLTLT DGEVASTPQA KMPPENPNWF EEALEIMCDP
AELAEISAFY GKLFNLALKA ALNNPNNLLL RDPDAMDYAT AYNQALDYHK HGRLVQLPRR
LHSSIPTRFH QYLAKT