Gene PHATRDRAFT_39605 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_39605 
Symbol 
ID7195261 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011688 
Strand
Start bp245918 
End bp249422 
Gene Length3505 bp 
Protein Length1062 aa 
Translation table 
GC content51% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002183575 
Protein GI219126671 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.608294 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAACCAA CCAATACGGA AGAGGAGCGA CAGCGGCCTC CCACGCATCC CAAAAGCACG 
ACGGTAGCGC CCGAGTCGCG GGATGCGGGG ACGAATTACA CGAGCGGCGT CCCGAACGCA
ACCTCGGCGG CGGACCCCTC TTCCAACGCC TCGACGTACC CTCCACCACC GCCTCCACCG
TATCCGTTCG CGGTGGAAGA ACGGGCCGGA CGGGGCGTCC TGTCGTCCGT GGAAAAACGA
CGACAGCACG TTCGGACCGC CTCGGGAGAG CAATGGCACG CGGCTGTGCC CGGCGAACGA
CCGCCTCTCG GAGGCAACGT ACCCTTTCAA CGAATGTACA ACAGTATGGC GACACCGCCG
GCTCCCATGA GTAGGACGTC CAGTACGGCG TCGAGTACGG ACGAAGCCAA ATCGGCTAGG
CAAAAGTTCC AGGACATTGC TCGGAAGGTC CGAATGTTGA ATCTCGTCGC TCCGGGCAAC
GCCCACACCA ACCACGGATT GACCAGTCCG AACGGCAGTA CGTCGGGGGG ATCACGGGGA
CACCGGAAAA CGGCCAGCCG TGCGCACGCG TTACTGGACA GTATCCAAGA AGCAACGGAA
GATGACGAGT CGAGGTCCAA CTGCGGGAAC GTCTTCTTTG AAGGCGTCGG AGGGGACGCG
CTCACGCAGG GCGAATCTCA CTTCTCGGTG GAAGCCACGG ATGCCGATCG ACTCGTCGCC
GGCGCGATAC AGGTGGAAAA ACTCTTCGCC ACGAATGATA CGGGGTCGAC GAGCTCGACG
GAAGAACTCC AAGATGCGGA CCACGACGGC GGCGCGTTGC CCGATTACGA AGAACAAGTG
CCCTTGCGGG ATCATAATGA AAGATACGGA TCCCTGGATG AGTTTAGCAA CGGCGGTTCG
AAGTATCCAC GCCGGGTCGT TAAAAAACGT ACGGACCGGT ATTCTTCAAA ACGCCTGCTT
CGTCGCATTG CGAAAATGTG TCACCCATTC ACTTTGCTGC AAGCCTTGTG GCATACTATC
CTGCATTCCT ATTTCGTCAC ACTAAGCCTA CCGGCGTTTG TTGCAGCGTG GGTGGTCTAC
TATCATCTCG GCAACCCCAA TTTGGAGTTT ATGCCCGGGC ACGCATCGGT GGCGTGGTGG
CTAATCTTTG TTGGGCGTCA GTGTATCACG TTGGAATTGG CGCGTCTTAC CCAATGGTTA
CTCGTGGACA AAATCATCCT CGGTACCCGG GTGGCCGTCA AGTGTTTGGG ACCGTTGCCG
ACATTGTACG CCATTCAGGC CAAGGGTTGG CCAATTATGT TGGCCTTATG GGCCATTGCC
GACCTTTTCT TGCTGCACGG AGACAATCGC TTCCAACAGC ACTGGTTCTA CTTTACCGGT
ATTTCCATTT TCAAAGACGC CAATTCGGGG GTGTACATTT TAACTTCCCC AACCTACTTG
CGCATGTTGC TTAGTATGCT GGTGGCAGGT CTCGCGACCG CGGGCAAGCG GACTGCCGTG
GCCATGTACT TTGGACGTCG TACGTTTTCC GAATTCAAAC CTCGACTGGA AAAGATTTTG
CGAGAAGTAG TGTTGCTTTC CGAAACTGCC GAGCTTGCTC AAGAAGCGGA ACGGGTATCC
TACGGGGTTG GACAGACCGG GGAAACTTTG GAAATCGATT TGAATCGTCA AGATAGTCGA
GTGCAATTCA TGGACGACGT TTCGTGGACC ACTGATCGAA ATTTGGTGAA GAATTCAGGA
CGGGGAAACG CTATGGATGA ATCGAGCGAC GATGAGAGCG AAGATAGACA CAGTCCAGCA
AATTTGAAAA GAAGAAGGAG CGAATCACTG AACGACGCAA TGATGGAAAA GACCGAGAGT
GGCAGCTTTC GAGTGAGCGA CTTGCTCGAA AATTGGGAAG AGCCAGTGAA CAAACTCGAC
AAGGTACGGT TATTTTTTGT TTGAGTATCA TTCTTTGTTT TAATCCTTAT ATACTTACAT
CTTTCGAATT CGTATATAGT CTTTGAATGC GTCCATTAAC GATATTTTGA AATTTCGACG
TGCTTTGACA TTTATGGACG AACAACATCC CTTTGGGGAT GCCTTCGGTC CTGCTGCATC
ACGAAACGAT GTCATCAGTT CGGCGCAGCA AGTATATCAA CGCCTTTTGA AAATGACGCC
CGAGAGTATC ATGCTGAACT GTGACGTCTT CACTATGTTG GCGGACGAAG ACGAGGGGGC
TACCACCAAC TTGGCCAAGA GGAAGGCCCT ACGCAAGCTC TTTCGTCCCG ACGCAAACAA
CGAGCTTTCT CAGTTGGCCT TTATTCAATC TTGCGATTCT CTGTACAAAA AGCTGCGCTT
CTTTCGTGCT TCTGTAGGAA ACGCCTCGGT TATTGACCAT GCCCTCGAAA CTATCATCGA
CTTTCTCTTC AACTTCATAC TGGCGCTTGC TTTGCTTTCG CTTATGCGCT TTAATCCTTG
GCCTCTGTTG GTATCGGTTT CAACATTACT CGTGTCTGTG TCCTTTGCTG TCGGATCTAG
TGCCAGCAAA TACATAGAAG TAAGTCACCC GTTTCAATGC CTCCGTCATT GCCCATCTCG
TTTCTCAGGT TCATTTTCTC CATAACGTTA CAGGGCATAT TGCTGATTGC GGCAAGAAGG
TGAGTTGTTC CCACTATTGT AGTAGCTATT TCTATCGCTT CGAAACGAAC CGTTCCTTAC
ATTACATATG TTCATAGACC TTACGATCTT GGTGACCGCA TATACATGCT GGATCCGTCT
GTTTTAAACA GCAACGACGG CCTTTTCTGG TCCTGGTTTA TTGAAGGTGC GATCGTGTTG
CATTGAAAAC ACCAAATGAT CAGTTTTTTC AGTTCTTCAG CTTTCTTACT CGTCCCACTT
CATTTGTACT TTAGATATTA ATCTTTTCCA AACCACGGTA CGCTACGCCG GTACCAACGA
AGTGGCGACC ATCAACAACG GTTCAATTGC AAATTTACGT ATCGTAAACG CTAACCGGTC
CCCCAACGCT GTTGTTTGGT TCCAGTTGCC TTTTCACATT TCTGTCTTGG AGGAGAAGCG
AATGGACCGC ACCCGTGTGG CGCTCGAAAA GTACGCTCAC GCGCGTCCCC GCAGCTGGCA
CAGTTTTTCC TATTGTCGTG TTGACGAGGT CCATGTCGAG TTGGAGAAAC TAATGGTCAC
CATAGGCTTT CAGCACCGGA CTTCTTGGCA AGACTTGGGT CGAATTTTGA TGGACAAGGC
CGATCTGATG TGTTATGTGT ACCAGCTGAC AAAAGATTTG GGCGTTGACT ATGAAGAGCT
TCCACAACGT GATCTAGTAT ACTACTCGGG TTTGCTCAAG AGTGGTGGCG TGCGTAACTA
CCGCAAGGGT CTCGTCAACC CCTTGAACAT TCAAAACTCT GTTGAAGGAG AAGTACCCAT
CAGACAATCA TCATCGGCTG CATCTACTCG TACCAACGAT ACCGATTCGG TGAATCGCGC
CTTTCTCGCT AATCTACGTC TTTAG
 
Protein sequence
MKPTNTEEER QRPPTHPKST TVAPESRDAG TNYTSGVPNA TSAADPSSNA STYPPPPPPP 
YPFAVEERAG RGVLSSVEKR RQHVRTASGE QWHAAVPGER PPLGGNVPFQ RMYNSMATPP
APMSRTSSTA SSTDEAKSAR QKFQDIARKV RMLNLVAPGN AHTNHGLTSP NGSTSGGSRG
HRKTASRAHA LLDSIQEATE DDESRSNCGN VFFEGVGGDA LTQGESHFSV EATDADRLVA
GAIQVEKLFA TNDTGSTSST EELQDADHDG GALPDYEEQV PLRDHNERYG SLDEFSNGGS
KYPRRVVKKR TDRYSSKRLL RRIAKMCHPF TLLQALWHTI LHSYFVTLSL PAFVAAWVVY
YHLGNPNLEF MPGHASVAWW LIFVGRQCIT LELARLTQWL LVDKIILGTR VAVKCLGPLP
TLYAIQAKGW PIMLALWAIA DLFLLHGDNR FQQHWFYFTG ISIFKDANSG VYILTSPTYL
RMLLSMLVAG LATAGKRTAV AMYFGRRTFS EFKPRLEKIL REVVLLSETA ELAQEAERVS
YGVGQTGETL EIDLNRQDSR VQFMDDVSWT TDRNLVKNSG RGNAMDESSD DESEDRHSPA
NLKRRRSESL NDAMMEKTES GSFRVSDLLE NWEEPVNKLD KSLNASINDI LKFRRALTFM
DEQHPFGDAF GPAASRNDVI SSAQQVYQRL LKMTPESIML NCDVFTMLAD EDEGATTNLA
KRKALRKLFR PDANNELSQL AFIQSCDSLY KKLRFFRASV GNASVIDHAL ETIIDFLFNF
ILALALLSLM RFNPWPLLVS VSTLLVSVSF AVGSSASKYI EGILLIAARR PYDLGDRIYM
LDPSVLNSND GLFWSWFIED INLFQTTVRY AGTNEVATIN NGSIANLRIV NANRSPNAVV
WFQLPFHISV LEEKRMDRTR VALEKYAHAR PRSWHSFSYC RVDEVHVELE KLMVTIGFQH
RTSWQDLGRI LMDKADLMCY VYQLTKDLGV DYEELPQRDL VYYSGLLKSG GVRNYRKGLV
NPLNIQNSVE GEVPIRQSSS AASTRTNDTD SVNRAFLANL RL