Gene PHATRDRAFT_40070 details


Gene Information

Locus tag: PHATRDRAFT_40070
Symbol:
ID: 7195888
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011690
Strand:
Start bp: 71755
End bp: 74814
Gene Length: 3060 bp
Protein Length: 996 aa
Translation table:
GC content: 46%
IMG OID:
Product: predicted protein
Protein accession: XP_002184179
Protein GI: 219127932
COG category:
COG ID:
TIGRFAM ID:
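
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: the helper names `span_length` and `coding_length` are hypothetical, the coordinates are assumed to be 1-based and inclusive (which matches the listed gene length), and the coding region is assumed to be a single complete CDS whose stop codon is counted.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode a protein, optionally with a stop codon."""
    codons = protein_aa + (1 if include_stop else 0)
    return 3 * codons


# Values taken from the record above.
gene_len = span_length(71755, 74814)   # genomic span of the gene
cds_len = coding_length(996)           # 996 codons plus one stop codon
non_coding = gene_len - cds_len        # bases in the span that are not coding

print(gene_len, cds_len, non_coding)   # prints: 3060 2991 69
```

Under these assumptions the span is 69 bp longer than the coding sequence, which is consistent with the record marking the gene as spliced.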


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.625757
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTATGTAT ACGTGCCCGC TGGTTTCAGC TCTGGACTTC CGGATATTGG AGCCCTACTT 
GTACGTTCTC ACCCGGGCTC GTCTGTCGTC GTACCACTCT CGTTAACAGA CACGTACATC
TGTCAACAGT CACCACGAAA ACTAGAAATC GGCCGGAAAC TGTATCACAC TGTTGGGAAA
GACGATTCCC GGCTCTCGAT CCAAAGCGAG GACCCGACCA ACCCATTCCC CACCATCACT
GTAAGGATAA GCATGGTAGA AGAAACAAAG AATATCAGTC GAAAGGACCG TCGGCGAGAG
GAGCGAGTCC GCAAGAAACA GAAGCGGCGT CCCCCACCGG TCCCGTCCGA GCTCCTGGAA
GAGGCGCACG TCGAACCTGT CGTCGTGGCA AGTAAGGACA AGAAGAAACG CAAGGGCAAA
GTTCAAGATT TACCCGAACG AAAATTGGCG AAACAAAATG ATGCTTACGG ACACCTGGAG
TCGGGTGTGG CGGCGGCCTT GCGTCGTGAC GACGAAGAAA TCGCTGCACT CGAAGCCAAA
TTGAAATTGT CCAGTAAGGC GGACAAATCT CGGCTTAATA AAGAATACGC CAAACTCGAA
GGATACGGTG ACGATTTTGG GGACTTTTTG GACGATCTGG ATGGCATCAT GCAACGAGTG
ACTCACGGTG AAGATTTTGC GGACGAGGGC GACCTAGAAA GCTATGTACA GATAAGGAAT
GACGTAAAAA CCATCGAGAC CAACAAGTCA CAGAACAGAG CAAAGGCAAA GACAGCGGTT
TATTCCAACT TGGACTGGCA TGTGGCAGCT GCACTCCGTC GCGATGACGA AGAAATAGCT
GACCTCGAAG CTAAGTTAGG CCTCGGAAAC AAAAAGGAAA AGAGCCGACT GAAAAAGGAA
TACGCCAAGC TCGAGGGCTA TGGCGACGAC TTTATCGACT TTTTGGACGA TTTGGATCAT
TTGACGGATC GAGTAGTTAA ACAATCGAAA AAGGAAGATT CAGACGACGA TAACGATACT
AGCCCTGACG ATGACGGCCA ATCAGAAAGC GAAGGTGAAG AAATAATCCC GATGAAAGAA
CCGGCTTTTA ACGATCTCGA CGAGGACGAC AGCGTAATGG ATAGTCTCGA GTCGACTCAA
AGCAGTGACT CCTCGGATCA TGAGGCTTTA GAAGATGATA TGACTAGGGA CAGATCTGAA
TCTCAAGAGA ATTTGCAATC TGATGATATG GACATAGAAC AGGACCATGA ACCCGAACAT
ACGTACCGAC CATCAGCTGG CGAAAACATT TACGGAAAAG AGATTGGCGC AGCAGAGAAT
ATGGAAAAAC CCCGCAAGTA CGTTCCGCCT CACTTAAGAA ATACGCAAGA AGTGGAGGGG
AAAGAGGACA GCGCTGCAAG GCAAGACGCC CTGCGTGAAA TTCAAAGATC TTTGAACAAT
GCATTGAATC GACTTTCCGA CGACGCATTG ATTTCAGTAG CTCAATCGAT TTGTCAGCTG
TATCCGATGC ATCCGACTTC AGATGTGAAT ACAATGATTT GGAATAACTT GCAAAACGCA
TGTATCGCCA GAAGCCACCT GATGACAGGA CTGATTCCTG CTTATGTGGC TGCCATAACC
GGTGTACATA TTGAAAAGGG CGACACCGCT CAACTTGGGG AGTTTTTGAT AGAAAAGACG
GTTTTGGAAA TCTGGAAAAA ATTGGAGGTT ATCCGGTCGG TCAACAGTCA GAATGATAGC
CCCCTAGGAG AAGAGTCTTT GACTGTAAAT AAGGAAACCA GCAATCTTAT ACTGGTTCTC
TGCTACCTCT ACAACTTTGG CGTCGTTCAT TGCTCCCTTA TCTACGATGC TATACGAAAC
TTGATTGAAA GCTTTACCGA AATTGATGTG GAATTACTTC TTCTTATTTT GAGTCACTCT
GGCCGCGCAC TGAGGAGTGA CGACCCGTTG GCCTTAAAAG AAATTGTTTT CCTTGTCCAA
AAACAATATA CTATTGCAAA AAAAAGTAAC ACGAATGCTT CGCGCTTGGA GTACATGGTT
TCGGCAGTCA TTGATTTGAA GAACAATCGA AAGCGGAAAC AAGACGCGTT ACTCGAAGAA
AAGACGACGA AGCTACGGAA ACTCCTCGGT CAAATAAAAT CTAAGGTGGC TCAGAACAAC
GTTGGCTATA AAGCTTCCGA TTCATCGCTC CGGATTGGTC TTAGAGACAT ATTCAATGCA
GAAACCAAGG GGCGTTGGTG GAAAGTTGGA GCATCTTGGG TCGGCCATCT GGTAGGAGAG
AAAAGTAGCG AATCACCAAA CCAAGGGACG ACAAACGAGG TCAAACCTTC GATAGAAGAC
GAAAAATTAT TGAGATTAGC GTCAAAACAT CGCATGAATA GTGATACGCG GCGATCGATC
TTTTGCATAA TCATGAGTTC TGCTGACTGT GAAGATTGTT TTGAAAAACT CGTCAGGGCG
GGAATGCTAA AAAACCGCGT CGAACGAGAT ACTGTACGAG TCCTCATTGA ATGCTGTGGC
AACGAGAAAG CGTACAACAA ATTCTACTCT CATTTAGGAG CAAGGATTTG CGAGTACCAA
TCATCTTGCA AATTTACGAT ACAGCTTGCG TTCTGGGATG TCTTTAAACA GTTTGATGAC
ATGAGTGTGC GCAAAGCTGC TAATCTTGCT AAACTTTTGT TCAGTCTGAT TGTCGACCAT
CACATTTTGA AGCTCAACGT TTTGAAAGCG ATTGATATTT CTTCCCCAGA CGAGCTCTCC
GAGACGGCGC TAGTCTTTAC AACAGTTCTT TTGTCTAGTA TTATGGAAAA ATTTGACGAT
CCATCTCAGG TCCAGCAACT TTTTGAGACG GGAATTTCTC ATAAAAAAGC GATTGCATCC
GACAGTGTGG ACGACATCGA TGGATTTGGA GAAGCCGACG AGAGCGAGGC ATTAAGGGCT
AGCCTCACCA TTTTTTTCAT GCAAGTCCTT AAGGGAAGCC CAAGGTACAA GAAAGGAAGC
AGATATCGTG CGAATCTGAA GGCTGCCATT AAATCATGTG ATGTGGACGA GTTCTTTTAA
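
The gene sequence is printed in space-separated blocks of ten bases, so it needs to be cleaned before any computation. The following is a minimal sketch of how a GC percentage could be derived from such text. The function names `clean_sequence` and `gc_fraction` are hypothetical, and the method the database used to obtain its reported GC content is not stated in the record, so this is only one plausible way to compute it.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(raw: str) -> float:
    """Fraction of bases that are G or C (assumes a plain A/C/G/T sequence)."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First row of the gene sequence, copied from the record above.
first_row = "ATGTATGTAT ACGTGCCCGC TGGTTTCAGC TCTGGACTTC CGGATATTGG AGCCCTACTT"
print(len(clean_sequence(first_row)))      # 60 bases in one row
print(f"{gc_fraction(first_row):.0%}")     # GC percentage of this row only
```

Passing the full 51-row sequence to `gc_fraction` gives a value that can be compared with the GC content field in the gene information table.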
 
Protein sequence
MYVYVPAGFS SGLPDIGALL SPRKLEIGRK LYHTVGKDDS RLSIQSEDPT NPFPTITVRI 
SMVEETKNIS RKDRRREERV RKKQKRRPPP VPSELLEEAH VEPVVVASKD KKKRKGKVQD
LPERKLAKQN DAYGHLESGV AAALRRDDEE IAALEAKLKL SSKADKSRLN KEYAKLEGYG
DDFGDFLDDL DGIMQRVTHG EDFADEGDLE SYVQIRNDVK TIETNKSQNR AKAKTAVYSN
LDWHVAAALR RDDEEIADLE AKLGLGNKKE KSRLKKEYAK LEGYGDDFID FLDDLDHLTD
RVVKQSKKED SDDDNDTSPD DDGQSESEGE EIIPMKEPAF NDLDEDDSVM DSLESTQSSD
SSDHEALEDD MTRDRSESQE NLQSDDMDIE QDHEPEHTYR PSAGENIYGK EIGAAENMEK
PRKYVPPHLR NTQEVEGKED SAARQDALRE IQRSLNNALN RLSDDALISV AQSICQLYPM
HPTSDVNTMI WNNLQNACIA RSHLMTGLIP AYVAAITGVH IEKGDTAQLG EFLIEKTVLE
IWKKLEVIRS VNSQNDSPLG EESLTVNKET SNLILVLCYL YNFGVVHCSL IYDAIRNLIE
SFTEIDVELL LLILSHSGRA LRSDDPLALK EIVFLVQKQY TIAKKSNTNA SRLEYMVSAV
IDLKNNRKRK QDALLEEKTT KLRKLLGQIK SKVAQNNVGY KASDSSLRIG LRDIFNAETK
GRWWKVGASW VGHLVGEKSS ESPNQGTTNE VKPSIEDEKL LRLASKHRMN SDTRRSIFCI
IMSSADCEDC FEKLVRAGML KNRVERDTVR VLIECCGNEK AYNKFYSHLG ARICEYQSSC
KFTIQLAFWD VFKQFDDMSV RKAANLAKLL FSLIVDHHIL KLNVLKAIDI SSPDELSETA
LVFTTVLLSS IMEKFDDPSQ VQQLFETGIS HKKAIASDSV DDIDGFGEAD ESEALRASLT
IFFMQVLKGS PRYKKGSRYR ANLKAAIKSC DVDEFF