Gene PHATRDRAFT_39238 details


Gene Information

Locus tag: PHATRDRAFT_39238
Symbol:
ID: 7194690
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011686
Strand:
Start bp: 689551
End bp: 691326
Gene Length: 1776 bp
Protein Length: 591 aa
Translation table:
GC content: 49%
IMG OID:
Product: predicted protein
Protein accession: XP_002183268
Protein GI: 219126026
COG category:
COG ID:
TIGRFAM ID:
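
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, that the CDS is unspliced (as the record states), and that the final codon is a stop codon that is not counted in the protein length. The function names are hypothetical and exist only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length.

    Assumes the CDS length is a multiple of three; a terminal stop codon,
    if present, does not contribute an amino acid.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    return codons - 1 if has_stop_codon else codons


if __name__ == "__main__":
    length = gene_length(689551, 691326)
    print(length)                  # 1776
    print(protein_length(length))  # 591
```

Under these assumptions the values agree with the listed Gene Length (1776 bp) and Protein Length (591 aa).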


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.975604
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTGTCTGC TGTCTGCACT CGAACACAAC ACTGTTGTGA AAACTCTAGA TCTTTGGCTG 
CCTGGCAAAG ACACGGCTCC TTCTTCCAAT AAAGACAGCG ATATCAACTA TTCCATTCTG
AGTCGGATAT TGGAGCAGAA CAAAACCCTT CAACGCCTCG TCGTCACGTC TTGGAGCAGC
GATAGCGACT ATGCTGCATG GCAAGCTCTC GTAGAAGGTC TCCGCCGGAA TGTGTCAATA
CAATCAGTAG AGCTTGGAGA TGCCGTCCAC GAAACAAGCA GTAGTACGGA CTACCAGGAC
GTCGTGATGA TCCATTGCGA TGAGCAGGAG GATTCAAAGT TTGGAAATGA TGAGTTGGGG
GATTTAGAAA TGATCATGGT CAGTGGAAAA CTAAAAGCAC TGACTCTGAA AGGACTCACA
CTGAGTCCAA CTGCTGCAGA GCGCCTTGCC AATGGCATTT CGCAATCCTC TTCGCTTTTC
ATGTTGGAAT GCATGCAAGT TTCAATGGAT CGGGGATCAT GGGGTACAGT CTTGGAATCT
TGCAGAAATG GTACACTTTC TGAAATTCAC ATCGCTCAGT GTGACTTGGG GTTTTCAGCA
ACGACAATGA TATCCTGTGC GCAGCCTTCT GACGAACACC CGCTTTCCGC CTTGTTACGG
AAAAACTCAA GTCTTCGTGT CTTGCGGGTA ATTTCCAGCC AATTTGGAGT GGCTGAGCTA
CGCGCTGTCA CCGATGCTGG CCAGCTCTTC TTGAAGCAAT TGGAGCTCCG TGACCACGAG
TTGACGGGAT GCGGTGGGCT ACTAGCTCAC ATTGCTCAAC AAGCTAGTAG CCTGGAACAT
TTGTCTCTGT CCGACACTGG CCTTGGAAAT GGCGATTTGA TAGAACTGTG TAGAGGGCTG
TACCTACACG CTTCTCTGAG CTCGCTCAAT ATCCAAAACA ACAATTTCTG CTCAGTTCTA
GCTGCCCAAA TGCTTGCGCA TACGATATCA TCTCTCACCA GAATCGAGTC TTTGGATATT
TCTGAATGCC CTTGGGGTGA CGAAGGAGTT TCTGTCCTGA GGACAACGAT CGCCTCACAT
ACTTCCTTGC GCAAACTCCA CTTGGCGGAC ATCAGAATGA CCGACTGCGG CTTTGTTGAC
TTATGTAGCA GCTTGTTGAA CAATGCTTCA TTGGGCGTCC TGGACGTGAG TAGGAACAGA
TTGGGAACAT CAGGTATGCA TGCCGTTGCA GACTTGCTAT CTCGATCTAA AACTACCATC
TACGATTTGA ATTTGTCCAA TTGCCACCTC ACAGACCACG GAGTGGAAAC TTTGGGGCGC
TCTTTGTCCA GCGCTAAATC GCTGGTCCGC TTGTCGCTGG CATCTAATTC GGCTGGTAAT
GACGCTTGCC GTGCAATAGC AAGTTCGTTG CCCCATTCTT CTTTGGCCTG CTTGGAGCTC
CAATTCAATC GCTTTGACGA AGAAGGCCTG GGCCATCTAG TGGAGGCACT ACAGCATAGT
GTCGATTTGC ACGACTTATT TGTATGGAAT GCTTGTGTAT TTACTGGAGG AGTAGTGTCA
AAACAACAGC AATCCAAACT ACAATTCCTG TCGGAAGCCA TGTTGCATTG GCTGAGACTC
AACAGAGCTG GGCGAGGCGT GATCCGAAAC TACAGCAATC TTCTATGGGA GATCTTACCG
ATCATTTTGC ACCGAGCTGG CCAGTTGTAC GGACCGGACG CATTGTTCCA CATGCTGCAA
GCCCGTCCAG AGCTCGTGCT TCAAAGTAAG CGCTGA
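
The gene sequence is printed in space-separated blocks of ten bases, so it has to be joined before any computation. The following is a minimal sketch of how the blocks could be joined and a GC fraction computed. The helper names are hypothetical, and the record does not say how its GC content figure was calculated or rounded, so this is an illustration rather than the database's own method.

```python
def join_blocks(text: str) -> str:
    """Remove all whitespace (spaces, line breaks) from a blocked sequence listing."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


if __name__ == "__main__":
    # Toy input in the same blocked layout as the listing above.
    toy = "ATGC GGCC\nAATT"
    seq = join_blocks(toy)
    print(seq)               # ATGCGGCCAATT
    print(gc_fraction(seq))  # 0.5
```

Pasting the full listing into `join_blocks` should yield a 1776-base string, which can then be passed to `gc_fraction` for comparison with the GC content given in the gene information.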
 
Protein sequence
MCLLSALEHN TVVKTLDLWL PGKDTAPSSN KDSDINYSIL SRILEQNKTL QRLVVTSWSS 
DSDYAAWQAL VEGLRRNVSI QSVELGDAVH ETSSSTDYQD VVMIHCDEQE DSKFGNDELG
DLEMIMVSGK LKALTLKGLT LSPTAAERLA NGISQSSSLF MLECMQVSMD RGSWGTVLES
CRNGTLSEIH IAQCDLGFSA TTMISCAQPS DEHPLSALLR KNSSLRVLRV ISSQFGVAEL
RAVTDAGQLF LKQLELRDHE LTGCGGLLAH IAQQASSLEH LSLSDTGLGN GDLIELCRGL
YLHASLSSLN IQNNNFCSVL AAQMLAHTIS SLTRIESLDI SECPWGDEGV SVLRTTIASH
TSLRKLHLAD IRMTDCGFVD LCSSLLNNAS LGVLDVSRNR LGTSGMHAVA DLLSRSKTTI
YDLNLSNCHL TDHGVETLGR SLSSAKSLVR LSLASNSAGN DACRAIASSL PHSSLACLEL
QFNRFDEEGL GHLVEALQHS VDLHDLFVWN ACVFTGGVVS KQQQSKLQFL SEAMLHWLRL
NRAGRGVIRN YSNLLWEILP IILHRAGQLY GPDALFHMLQ ARPELVLQSK R