Gene PHATRDRAFT_50061 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_50061 
Symbol 
ID7198750 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011694 
Strand
Start bp278469 
End bp281915 
Gene Length3447 bp 
Protein Length886 aa 
Translation table 
GC content49% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002184936 
Protein GI219129522 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.29853 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATTTGG TGGACTGTGA GTCCTGTATC ATTACTCGTA GATCAAATTC ATAGTCCTCG 
TTTTGACTTG TCGGACTTCT CGAAGAAATC TTGCCAGGGT GGTCCACCCG TCAGCCGACC
TAAAACCTTA AATCTTTTTC TGTAAGGTTA TGATGGTTCG TATCCACTGA CTCCATCTCT
CCGGTTCGAT GACGGTGACT TCGAAGAGGT ATTCCACGAC CATGAAGGAA CGACCAAAGC
GATAGGACGG GAGTTTACTG GAACTCTCTT TCCAACGCCA ATCGCAGTGT AAGAGAGAGA
GAGAGAGATC CCCCGTGCCC GCTTCCAACA ATTTACGTGC CCCGGTTAGA ATCTGGAATG
AGGCGAGTTT CTCATTCATT ACTGCGCAGT ACTGGGACCA GCTGTAGACG AAGAACCATT
CAGCAATCCT CTCTGCGCTG GGCTCAATTG TCTACAGATG CATCGTCTTT AACCGACGTC
TCATCTCTAC AGAACAGACT AGAATACAGT GCCCGTCGTC AGCCAAAAAA AGTACGGTCA
TCCACAACGA TTAACGAAGA AACTCGCGCA TGGAACGTTT CTACCTCGAG TGGGCGGACG
TTCCCTCATG TCGAAGGACA AAATACAGAA AATTTACTCG ACATAGAGAC GTATCCGATA
GGTAGCTTGA CACCGACATT GTGGAGCCAA GGCCACGAAC TCTTAGCCTT TTGGGTAAAG
CAACGTACGG CTGACTCTGT CGATTTCTCG TTCCGGCTGT GGGATCGACT TTGGCAGGAG
CAAGTGCATC TCGAGACGGA CTGCGTGCAG CGCTCTACAG GCGATAGTCA TCCCCCATCC
ACTTGGCTCA CAACGCCTCT CAATTCGGAA ATGCTCTGTA CATTGGTCAA CAATTGGCGC
GTCTGGTCTT GGAATCAGCA CCCAGACTTG AACAAGCAAA ACCCACAACT GACAGTGAAA
GAGAGATACA ATGCTCCGTA CGTGCAAGCT CTGGTGGAGA GGTGTTCCTC GCTTTTGAAC
GAAAAAGTCT ACTATCTCAT TCTGGACGGA ACTCGCCGCT CGGGAAATCC CGCACAGGTT
GCGGACTTGG CATCCGACCT ACTGGCAGCT TCCATACACC GGTGGAAAAC TCAAATGAAC
CCCTCGTGTC GTCCGTCGAC GGAATTATAC AACTCGGCGT TGCTCGCTTG GTCTCGATCT
AACCGCAATA ACGCCATTAC CATGGTAGAA AAACTATGGG CGCAAATGCA AGAACACAAT
ATAGCTCCGG ATTCTCGTAG TTACGAGCGC ATTATTGCGG CCCACGTTTC CTCTACGACT
CCCTCGCGTT CCGAAAAGGC CGAATATTGG TTGCGCAAAA TGGAACAGGA TCACACTGTT
TGCATATCGA CACGGGCGTA CACGAGCGTC ATAGCTGCAC AAGATGATTG CGCGAAAGCT
GAGCAACTCT TGGAAGAACT CTTAGACCTC AAAGCACAGA GCATTGATAG TCCGAATAGT
ACGGAGGATT CCCAGTTTCA TCCTGGCGAT CAGGATGTTC CCGGGGCAGT CAATGCCACG
CTGCAATGCC ACATCAGAGC AGCGAATGTA GAGCGAGCAC AGGATATCTT GTACCGTATG
CGGGACTTGG GTTACATCGA TGTACTTTCC TACCGCACAG TTATGCTGGG ATGGTTGAAA
GCGTCGCAGC CCGTCCTATG CCAGGCAATT TTACAGTCAG CTTTGGAGTC GTATCGGGAT
GGGCTCTGTG GCGTTTGTCC GTCCGACGAG CTCGTTGCCT TGGCTGTTTC GGCGTGGGCA
AAGTCTTCGC ATCCAAAGAA TGTGGAGCAA GCAATGGCGC TACTCAAGAG TTTGACCTCG
TCTGAGTGGG ATCATATTGA TATGCAGGTA ACGACATCGA CTATGAATGC ACTGCTGGAA
GTTTTGATTC GGTCGAAGGA TTATGGCGAT ATTCAAAAGG CGGAAGAACT GCTCCGCCAT
ATTAAGGGTT TCTCCAAGAA GGATGCGAGC AAAGCCGCGA TGGCTCCAAA CGAGTCGAGC
TACAATCTCA TGATTGGCGG TTGGGCGCGA CTCGGTCAGC CGGTGAAAGC TAGAGCATGG
TTGGAAGAAA TGTACAAAGA CTACCAAGAG GGAAGTATTG AAACAATGCC GGGCCTGAAA
ACTTTTAATA CTGTATTGGC TTCGTATATT CGTTCCCGGG ACCGACACGC GGCGGAGAAT
GCCTGGGAAT TCTTAAGGCT ATTCAGAAGC AATGCGACGA GGGTCGTCTA CCGTTTGAAC
TTGACGTTTA TTCCTACACG TCCGTTTTAT CTGCGCTCAG TAACGCCTGG GATCTGAAGA
AGTACGGCGA GACTGCTCAA CGAGCTGAAG ATCTTTTAAA CGAAATGAAT CATCGTTACG
GACAAGGTCA AGCAAGCGTA CGACCTAATA CGATTTCGTA CAACGCCGTC ATGAATGGAT
GGGCTCGTGC AAAGAATCCC GAGAAGGCGG CGGCTGTTTT GCAAAAAATG TATGCCGACG
TAAAGAAGGA AGGCAACATT AATGCATTGC CCGACGACAA AACTTTCAAT ACACTAATTA
AAGCGTTCGC GTTGTCACAA GACCCAGGCG CGCCAGAAAA GGCGGAAGAG ATTCTTCGAC
ATATGGTGGA GCAATACGAA TTGGGAGCAT CGAAGGTAAA ACCGACGGTG GTCACGTACA
CGACAGTGAT TTTGTGCTAC GGACTTTCGA AACACCCCAA GGCTCCTTAT CGAGCTGATG
AGCTGTTACA GCTTATCAAG GGGTTGTATC AACGGGGAGA GTTGGATGAC GGTCCCAGCC
GTAGTACGTA CCAGGTTGTT CGTAAGGCTT GGGAGTTCTC GACGCATGCT AGAAAAAGGG
ATCGCATTTC CGAATTGGAT CGGGAATACG TTGCGTTGTT TGGGCAGACT GACAACAGCC
CGCGTTCCGG AAGTGGCCGT CCGAGAAAAG ACTACCTAAA CCACAATACG AAAGGGAAAC
GTGACAAACG ACTATAACCA GACAAGATTC GCGACGAGTA ATTTCACATT CGTCGACGTG
AATCATCCAT CAAGGAAGAA TAGAAGTTAC ATTCTAGAGC TTATTTGAAC AAAAAATCAC
TGTAGTATTT GATTTTGATT TTGTCCAAGA TGGCCCAAGG ACGCCTTACA CCTAACCCAA
AGCTTCAAGA AATGCGACTC GCGTGATCCA CCCAGGACTC AAATCCCTGT GCGATCTTTC
TGTGACAGCT ATCAAATTCT GCCTACTGTA AATGGCACGT ACGCACGCAG TTTCTGTCGA
GTCTCTCCAG ACTTTGTTTC TAACTGACTG TGAAGACCCT TTTTGCTGAG CTAAGAGTAA
ATAGTTATAT CGCAAGGAAC CGTCAAGTTT ACTGAGAGTT CGCTGTCCAA AGTGTAACTA
CATCTATTAG TAGTGTTGAA TACTGTG
 
Protein sequence
MNLVDYQIHS PRFDLSDFSK KSCQGGPPVS RPKTLNLFLR RTIQQSSLRW AQLSTDASSL 
TDVSSLQNRL EYSARRQPKK VRSSTTINEE TRAWNVSTSS GRTFPHVEGQ NTENLLDIET
YPIGSLTPTL WSQGHELLAF WVKQRTADSV DFSFRLWDRL WQEQVHLETD CVQRSTGDSH
PPSTWLTTPL NSEMLCTLVN NWRVWSWNQH PDLNKQNPQL TVKERYNAPY VQALVERCSS
LLNEKVYYLI LDGTRRSGNP AQVADLASDL LAASIHRWKT QMNPSCRPST ELYNSALLAW
SRSNRNNAIT MVEKLWAQMQ EHNIAPDSRS YERIIAAHVS STTPSRSEKA EYWLRKMEQD
HTVCISTRAY TSVIAAQDDC AKAEQLLEEL LDLKAQSIDS PNSTEDSQFH PGDQDVPGAV
NATLQCHIRA ANVERAQDIL YRMRDLGYID VLSYRTVMLG WLKASQPVLC QAILQSALES
YRDGLCGVCP SDELVALAVS AWAKSSHPKN VEQAMALLKS LTSSEWDHID MQVTTSTMNA
LLEVLIRSKD YGDIQKAEEL LRHIKGFSKK DASKAAMAPN ESSYNLMIGG WARLGQPVKA
RAWLEEMYKD YQEGSIETMP GLKTFNTAIQ KQCDEGRLPF ELDVYSYTSV LSALSNAWDL
KKYGETAQRA EDLLNEMNHR YGQGQASVRP NTISYNAVMN GWARAKNPEK AAAVLQKMYA
DVKKEGNINA LPDDKTFNTL IKAFALSQDP GAPEKAEEIL RHMVEQYELG ASKVKPTVVT
YTTVILCYGL SKHPKAPYRA DELLQLIKGL YQRGELDDGP SRSTYQVVRK AWEFSTHARK
RDRISELDRE YVALFGQTDN SPRSGSGRPR KDYLNHNTKG KRDKRL