Gene PHATRDRAFT_20740 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_20740 
Symbol 
ID7201621 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011678 
Strand
Start bp80273 
End bp83218 
Gene Length2946 bp 
Protein Length837 aa 
Translation table 
GC content49% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002180750 
Protein GI219120004 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.478562 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CCCAAAACAG TCCGTTTCAT TGCTCCCGTT TGCGCCTCCG CCAAAATGCC ACCGTCTTCC 
AAGGCTCGGC TTCCCACCCA AGAAGCGTTA CCGGCGGGAA CAACTCGCGT TTGCCTCAAG
AATTTGCCCC CCTCGTTCTC GTCGCATGAT TTACGAACCT TTGTCCGCGA ACGGTTGTTG
CCGCTCGATC CCCACGTGAA ACTTACGGAC TGTCGAGTGC TGCTCAAGAA GGACGGAAAG
TCTAGACGCA TGGGATTTTG CGGGTTTGCT ACTCCCAGTA CGGCTCAAGT GTGTGTACGG
CAACTGGACA AGGCCTATTG CCGGACCAAC AAACTCGTAG TGGAATTTGC TACGCTCCCC
GCATCGTTGG CCACGACCAC TGCGAACGAT CCGACGCCTG CCGTTGAAAG TTTCAAACAG
GAGAAAAGCT CCGAACCGAT AATAACGGAC AAAGATCGCA AATTGGAAAA GAAAAAAGAG
GAATTCCTGG CGGTAATGGG TGTGGGTAGC AATGCCGAGT CCAAAAACAA ATTCTGGGCC
AACGATGATG GTCATTCGGG CACAAATACA AGCGGCGATC AGATTGCAAC GAAAGCAACT
ACTGGAAACC ACGATGATTC CGAATCCGAT TCCGACAGTG ATGACGCTAC GGATGAAGAC
AATGCAGATC CTCTGGAACG CAAACTTCCA CTACCCGCTA CGGAAGAAAA GAGTTCGTCT
GCTCAAACAT CCGATCTAGA CTTCTTGAAG TCCAAAAAAG TGCAAGTTCA GGATCTGGAC
GATGCAGAAA CCGATAGAAT GAATGAAGGA CCACACGATG ACAGTGAATC TGGCTCGTCG
TCCAGCAACA GTAGCGAAGA TTGCGATATT GTAACCCAGT CCAAAGAGGC CCCGAAAGGA
CAGCCCCAAA TACAGGCAGG CTATACAACA GAAAACAACG ATAGTGTGGG CGATCATCAT
TTATTGGCTG GCGAAGACGA CGCTGAGGCT AAGAACATAG CTGCAAATCG CTTGTTCCTA
CGAAACCTGC CGTTCACAGC AACCGAAGAC GACTTGAAAA CTCATTTTGA AGCCTTTGGT
AGTATAGTGG AATGCCACGT CCCCGCTGAC GATCAGAAAC GGAGCAAGGG CTTTGCATTT
GTGACTTTTG TCAAAGCAAA CGATGCCATA GCCGCGAAAA CTGCTCTGGA CGGCACGGAC
TTTCAAGGTC GTCTTTTGCA CGTCTTACCT GCTCGTCAGG CACCTTCTCT AGGAGACGGC
AATGGCACCA ATCTCACGTT CAAAGAAAAA CAGGAACGAT TGCGGAAGCA ACAAGCTGAG
TCGCAGACTG GATGGTCGGC CTCGTTCGTC CGTGGGGATG CTGTCGTGGA CAATTTGGCT
TCGAGGCTAG GACTACAAAA AGGTGAAATT CTGGCCGTGA AGGATGGACT GTCGTCGGGT
GATGCAGCTG TGCGCTTGGC TTTGGGGGAG ACAGCGGTCA TTGAAGAGAA CCGCGACTAC
TTCCGATTAC ATAATATTGA CATGGATGTT CTTGTATCGG CCACATCCGA CAAAGACGCC
AAGCTGGTTG AGCGAAGTAA GACAATGATA CTCGTAAAGA ACTTGCCTCA CGATACTACT
AAAGAGGACC TTGTCAAGGT ATTCAGTGGA GCTGGGGATA CACCTTCTCG TATTCTCTTG
CCCCCCTCTC GGACAATCGC GGTCGTGGAG TATTCTCACC CGAACGATGC CAAACGTTCT
TTCCGGAAGC TGGCCTACCG AAGATTTAAG AATGTGCCTT TGTACTTGGA GTGGGCACCC
CTGGCTTCCA AGCGAATCGA CAATGGATCC GAAGAAACGA ACGATGAGAA CATAATACAG
ATAGAAAACT CAGAGGATGC CAACCGTGAG ACGGATGACT TGGTGGAAGG TCCCACGCCG
ACAATCTACG TCAAGAATCT AAATTTCCAC ACAACCGAAG ATCAGCTCCG CCAAGTGTTT
TCCAAGCATG TGAAAGATGT TCGCACTGTA CGTATACCCA AGAAGATTGC TCCCGTAAAG
CGTTCTGGAG GCAAATTTGG TACCGAAAAC GAGACAACGA GAGAAATGTC GATGGGATTT
GGTTTTGTTG AATTTGGTTC GAATGAGTCG GCGCGGACAG TTCTGAATAA GCTGCAAGGC
TTCACGGTTG ATGGGCACAT ACTGGAACTC AAACCATCGT CCAAAACTGG CAATCAAGGA
GTGTCATCCA CGGCGGCTAA GAACACTACT TCAAAGAGTA CAAAAGTAAT GGTTCGCAAC
GTGCCGTTTC AAGCAACCCG AAAAGAGCTG CTTCAGCTTT TCGGCTCGTT CGGGCCTCTC
AAAAAGGTTC GGTTGCCGAA AAAATTCGAC GGAAGCCATC GGGGATTCGC GTTCGTGGAA
TACATGGCGG CGAAAGAGGC AGCTGCTGCT ATGCATACTT TATCCGCTAC TCATTTGTAC
GGACGGCATT TGGTTTTGGA ATGGGCTGCT GCCGATGAAG AAGCCGAAAA CTTGGACATT
CTGCGTGCCA AAGCCAAAAG GAATATTGGT CTTGACGCCT TGTCCGCGAG AATGGAAAAC
AAGAGAATTC GTTTTGAGTA GTAAATTGAT GTTCAGTATA TTGATTCAGT CTGCTTGCTT
CGCATCGAAG GGGGCGAATC AACCCACAAT GTAACAGTGC ATTTCCTATA ATCAATCGCA
TCCCACGTTT CTTTCTCGGA TGGGATTGTA TACTGGATAA TATGAGATAT ACTGCCTCCA
ACGAGAAATA TCATGGAAGG ATGAGATTTT TGGCTGGCAA GGTCAAAACT AAACGTATCC
AATTCTCTCC TCTAGATACG AGGACGCCAC CACGACGAAA ATACGTGGTT GTATCGAGCC
AAAGCGTAGT GGCAGCAATA ATATTCCTAT ATAAGTCCTT TGACGTGTAA AATCCATTTG
TCAAAC
 
Protein sequence
MPPSSKARLP TQEALPAGTT RVCLKNLPPS FSSHDLRTFV RERLLPLDPH VKLTDCRVLL 
KKDGKSRRMG FCGFATPSTA QVCVRQLDKA YCRTNKLVVE FATLPASLAT TTANDPTPAV
ESFKQEKSSE PIITDKDRKL EKKKEEFLAV MGVGSNAESK NKFWANDDGH SGTNTSGDQI
ATKATTGNHD DSESDSDSDD ATDEDNADPL ERKLPLPATE EKSSSAQTSD LDFLKSKKVQ
VQDLDDAETD RMNEGPHDDS ESGSSSSNSS EDCDIVTQSK EAPKGQPQIQ AGYTTENNDS
VGDHHLLAGE DDAEAKNIAA NRLFLRNLPF TATEDDLKTH FEAFGSIVEC HVPADDQKRS
KGFAFVTFVK ANDAIAAKTA LDGTDFQGRL LHVLPARQAP SLGDGNGTNL TFKEKQERLR
KQQAESQTGW SASFVRGDAV VDNLASRLGL QKGEILAVKD GLSSGDAAVR LALGETAVIE
ENRDYFRLHN IDMDVLVSAT SDKDAKLVER SKTMILVKNL PHDTTKEDLV KVFSGAGDTP
SRILLPPSRT IAVVEYSHPN DAKRSFRKLA YRRFKNVPLY LEWAPLASKR IDNGSEETND
ENIIQIENSE DANRETDDLV EGPTPTIYVK NLNFHTTEDQ LRQVFSKHVK DVRTVRIPKK
IAPVKQMSMG FGFVEFGSNE SARTVLNKLQ GFTVDGHILE LKPSSKTGNQ GVSSTAAKNT
TSKSTKVMVR NVPFQATRKE LLQLFGSFGP LKKVRLPKKF DGSHRGFAFV EYMAAKEAAA
AMHTLSATHL YGRHLVLEWA AADEEAENLD ILRAKAKRNI GLDALSARME NKRIRFE