Gene PHATRDRAFT_42799 details


Gene Information

Locus tag: PHATRDRAFT_42799
Symbol:
ID: 7196164
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011669
Strand:
Start bp: 1161122
End bp: 1165356
Gene Length: 4235 bp
Protein Length: 1343 aa
Translation table:
GC content: 49%
IMG OID:
Product: predicted protein
Protein accession: XP_002176733
Protein GI: 219109961
COG category:
COG ID:
TIGRFAM ID:
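
The listed gene length can be cross-checked against the coordinates. The sketch below assumes that Start bp and End bp are 1-based and inclusive, which is consistent with the reported 4235 bp. The function name `inclusive_length` is illustrative and not part of any database API. The second calculation assumes the coding sequence ends in a stop codon, as the gene sequence shown on this page does.

```python
def inclusive_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature whose start and end are both 1-based and inclusive."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


# Coordinates taken from the record above.
gene_length = inclusive_length(1161122, 1165356)
print(gene_length)  # 4235

# Assumption: coding nucleotides = 3 per residue plus one stop codon.
protein_length = 1343
coding_nt = 3 * (protein_length + 1)
print(coding_nt)                # 4032
print(gene_length - coding_nt)  # 203
```

Under these assumptions the 203 bp difference would be non-coding sequence inside the gene span, which fits the "Is gene spliced: Yes" entry.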


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.528448
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGTCCCG TTGGTGACGC CCTGTTGGGA AATGATGTCG ACTTTCACCC TCCCTTTCCG 
TCCGACAAAG CCGACTACTT GCAAGCTACA GTATTGCAGG GTCTAAGCTC TGCGAGTGCT
AGGGTTATTC TGTATTCTGT AGAGAGTCCA TCAGCAACGG TCACAGTCAG TCCTGCTGTG
ATGAACCCGC AAAAGTTGCC AGCAGGTTCG AGATACTCAA CGTTCAAACT TGCGGTGTAC
GATCCCTCGA TCCGAGAGAC GTTGTAATAC TGTTGTAATT TGATCCAGCA ATTATAGCAA
TCCAGCAAGA ATAGCGTCGT TGCTAAGTAA GAGCGCGGTC TCTACTTTCC GCAACAATTC
GAATATGAGC CACTCAGCAT TATCAGATTA TGAGGACCAT GACGAAGAGG AAGACGAGTG
CCGCGTGTGC CGCGGTCCAG AGGAAGAAGG GTAAGCATCA CTTTGATACA GCTGTCGATC
CTCGTGCCAA GCCATCCTGG ACCTTAAGCC GCAGACGAGC ATATTTTGGT CTTCCAATCG
TGCCTTTTCG GTTTTGGCCG AACGAGTTTG TATGAAGCAC TCTCTCCAAC TTATCCCTGA
TCTCACTTTG TCTGTTTTTG TGCGTCAGAC GACCCTTATT TAAACCTTGC AAATGCTCCG
GGAGTATCGG TCTGACGCAT CAGGATTGCT TGCAATCCTG GCTTGAGGTG CAGCGAGGCG
ACGGCCGGTG CGAGCTATGT CACACTGAAT TTCGCTTTGC TCCTCAATAC GACAATGATG
CCCCCGAGCG ATTGCCCGCT TCTCAAGTTG TCTTGAGCTT GATGCGACAG TTTTTCTCTC
GCTGGCTTCC AGTCTTGATA CGTTGCGTAT TTGCCGCCAG TCTCTGGCTA CTCGTAGCCC
CCCTTCTTAC GGCGTATGTC TACCACGCAT GGATGCATCA ACCGTCGGTG GTCTGGGACC
GCTGCTCCGA CTGGAGTCTC ATTCCTGGTG ATATGGTATC GGGGGCCGTC CTTGTGGGGG
TCATCATTGT CAGCTTTTTA TCAATAATGA GTTTTATTGA CTTTTTGCGA GTGGAATGGC
ACCCTGACGG TCGGCCCAGG CCGCGCTGGG GGGAAGAAGG ACCGGAACCA GCCGTGGGGG
AAGCCCCCGC TCCCGATGAA AACGCCATTG ACAATGCCGT TTGGGATGCC TTCCAGCGAC
AGGTTGTGGA GCGACATCGA CGGCAACGAG GAAGAGCGGT ACCTCAACGA GTCGAACATG
AACTTCTGCA AGCACATGCG CCAGGACAGC CACGAACAGA GGAACGGTTC TCGAATGAAA
ACGTGGAACT TGACGCATCT GGAAGCGACT CGGAATCGAG TTGGCAAGAC GACGATGACC
GTGACGATGA TGACGATACG GTTTCGGATG ACGAGTGGAT CGAAAACGTT GAAGAAGACG
ACGATAGCGT AAATGATGAC AACGATCGAG ATGAGCCACA GTTGCATCCA CCTGCTGTCC
CCATTGTCGA CGATCGAGAC GACGGTGACG AAAATCCTCA GGCATTTGGT CGTAACAATT
TTGACATGGA CCCGGATGAC GGGATGGATA TGGATATCAA CATCGCACTC GATGAGTTGT
TGGGTGCACG GGGACCAATA ACTTCGGTGG TTCGCAATCT TTTGTGGCTG CTGGCCTTTA
ACACTGTGTA CTTGGGCTTT TTTGGATTTA CACCAAAAGT GCTGGGGACC ATAACGTCAA
CGATCTTTAG AAATACTACA GTATGGTCAC CCATGGTTTT CACTATTGTC ACCAACGCTA
CGGTGCCCGA TGACACTCAA ATCGCAAATG AATCTCTGTC GATTTGGACA GCGTACAGGG
CCATTGAGTC TGAGAGCGCA AGTGCCAATA CAACATTTCG ATTGCACGAT CTTTTTTTGG
TACTCTTGGG CTATGCTTCG TGCGCTGGTA TGGTTGTTCT CATTCGCTTT CTGTGGTTAG
CGTCTCAAAA GATTCGAGCT CTTCGGAGCG GGCGTGCTGA CAACCCAGTA CCCTTGCGTG
AACTCCAAGA AGGGTTTGAG GAGATGAATC GAATCATGCG CATGGGTCCT GAACAAATGA
ACATGGTAGA TGACAATGTT GCCATTCACG TTTTCCTTAC CACGACTCTC GATGCGACTT
TGGCAATTAC GAAGGTCGGC GTACTTTTGT TCATGAAGAT GTTTTTGCTA CCTATTTGGT
TGGGTCTATG CTTGGATGTT TCTTCGCTCC CAATCTTAGG AAGCTCGTTC GAGGAAAGAA
TCGCTTACGC TGGGAAAGAC TTATTCTCTT TTCTCTTGCT TCATTGGGTC GTGGGCATAA
CCTTCATGCT TCTAGTGACG GTATCTGTCC TTCAGCTTAG GGAAGTAGTA CATCCTGAAC
TACTAGCGCA GACGATTCGG CCACAAGAGC CCCAACCAGA TCTTCTTGGT AACTTAATGA
ATGAAAGCAT CATTACGCAT ATGAAACGAA TGGTGCTTTC CCTTGTCATC TACGTGGTGC
TACTTGCTAT GTACATATAT CTCCCAATTC AGGCAATCAT GGCAAGTGGC GTTAGCGCAG
ACCTATCAAT GGCACAACTC AAGTTCTGGT ACCCTATAAT GCCAGAGCTT CAGGTTCCTC
TGGAGCTACT TACATTCCAT CTTTGCATGC TGGCACTTCT CGAAAAGCAC AAGAACTCAA
TCGGTGAAAT TCAGCATTAC TGGCTCAAAT TTATATCGAG GCTTGTGGGG TTGACAGACT
CCCTGATTCC CATGCGCGTA GATTGCTTTG AGTATGTTGG AGTGCTTCCA ATATTCGAGC
ATGAAGCTGT CTCGCCTTTT TGGTCCAAGC TGGCAAAAAA CGAGAATGAG AGAGAGAAAC
TTCTGGATGA AAGCGTAGCA ACTTTTCTGA AGAGCGACAT TCCACGAGTC AATATAGGTC
AGTCCAAAGC TAATGGACAA CGTGTTCTGG CATCTAAAGA CTATGTTCGG CTTCCGGATG
TTCTTCCTGG TAGATTGCTG CGCAGTCGTT CCGTTCTCAT GCCGACAACG ATTGGAAAGT
ATCGTCTTCA ACGAATTCTC TCGCTCGACG GTACTCCCTT GATTGAAATC TGGAAAGAAG
AACGAGGCAC GCCCATTGCA CGTCCACCAG AAGGGTGGGA TGATCTTGGC GCAGGAGGAG
CTGATGTTCA AGGACGCTGG GCCTGGGGAC GGGAGAAGAA ATCCGTCGTT GAAGAAGCTA
TTGCGCATCG AGTGGATTTT TTCGGTCGAC ATAAAGCATC AGTATCCTAT TGGGCCGTAT
GGGTTAAACT CATTTTCATG TTCTTCTCAG CGTGGCTGTC GACCACTTGT TTCATTTGCA
TTTCGCTTAT ATGCCCGTTG GCTTTTGGCC GAACTCTGTA TCATATACTC CAAGTACCTC
AAGCCTACGT TCATGACCCT TTGGCTTTCG TTGTCGGCTG TCTTATCTTT TTCCCTGTGG
CACGATGTAT TGGACGATGG AGTCTCTCTG GAGAGAACTC TCTGATGCAG AGGCTCTTCT
CATGGCTGCG GTCTTATAGA CGCCCGCCCT CTGCCAAAGC AAGACTTTTG CTTGTCACAG
CTATTACATG TTTCGGTCTT GCACCTGTCC TACTTGGCTT TATCTATCAT GCTCTCCTTG
TGAAATTGCC TCCTTTCTTT GCTGGAACAC AGGAGTGGAT CGCACTCTCG CTTTTCTCAT
CGTATTGGGC AACCGGATTT GTTTTGTTAT TTGCGTGGGC GCGGCTCTGT ATTGCTGAAG
CATTTACCAA AAAGTTTTGG AAAGGTCTAA TGGGAGCTGC TGGCGATGTC GACGATAACG
AAGGCAATGT CCAGAATAGT TTACGGTGGG CTTGGCAAGG AAAGGAAGGG CGCGTATCTC
GGTTTACCAA GTGTTGGAGA AAGGTTGTCT TAACCTGGGA ATTTGACCAG GTGGATCTGA
ATACGCTAGT CGACGACGTT GCCACTCCTG TTATCACTGC TCTAGCAAGA GTTGTAGTTC
CATACTGCAT TTTTATGATG CTAATTGTCA GAACTTTTGG ACTAGACCAG GTCACTGTTG
CTACCTTTGG CCGAATAGCA TTGGGGCTTC TGTGCGCCGA GGGAGTTTCA CGAACTTGGA
GGATACAGTT TTCGAATTGG CTTGAAGCCG CCCACCAAAT GGCACGAGAC GACCGCTACT
TGATTGGCGA GATGCTGATG AATTACGATG GATGA
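
The gene sequence above is printed in space-separated blocks of ten bases, so it has to be joined before any calculation. The sketch below shows one way to do that and to compute a GC percentage. The page does not say how its GC content value was derived, so the plain fraction of G and C bases is an assumption here, and the helper names `join_blocks` and `gc_percent` are illustrative.

```python
def join_blocks(lines):
    """Join space-separated sequence blocks into one uppercase string."""
    return "".join("".join(line.split()) for line in lines).upper()


def gc_percent(seq: str) -> float:
    """Percentage of bases in seq that are G or C (assumed definition)."""
    if not seq:
        raise ValueError("sequence is empty")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence, copied from the record above.
first_line = "ATGAGTCCCG TTGGTGACGC CCTGTTGGGA AATGATGTCG ACTTTCACCC TCCCTTTCCG"
seq = join_blocks([first_line])
print(len(seq))  # 60
print(round(gc_percent(seq), 1))
```

Passing all of the gene sequence lines to `join_blocks` gives the full sequence, whose length and GC percentage can then be compared with the values listed under Gene Information.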
 
Protein sequence
MSPVGDALLG NDVDFHPPFP SDKADYLQAT VLQGLSSASA RVILYSVESP SATVTVSPAV 
MNPQKLPAGS RYSTFKLAVY DPSIRETFNY SNPARIASLL SKSAVSTFRN NSNMSHSALS
DYEDHDEEED ECRVCRGPEE EGRPLFKPCK CSGSIGLTHQ DCLQSWLEVQ RGDGRCELCH
TEFRFAPQYD NDAPERLPAS QVVLSLMRQF FSRWLPVLIR CVFAASLWLL VAPLLTAYVY
HAWMHQPSVV WDRCSDWSLI PGDMVSGAVL VGVIIVSFLS IMSFIDFLRV EWHPDGRPRP
RWGEEGPEPA VGEAPAPDEN AIDNAVWDAF QRQVVERHRR QRGRAVPQRV EHELLQAHAP
GQPRTEERFS NENVELDASG SDSESSWQDD DDRDDDDDTV SDDEWIENVE EDDDSVNDDN
DRDEPQLHPP AVPIVDDRDD GDENPQAFGR NNFDMDPDDG MDMDINIALD ELLGARGPIT
SVVRNLLWLL AFNTVYLGFF GFTPKVLGTI TSTIFRNTTV WSPMVFTIVT NATVPDDTQI
ANESLSIWTA YRAIESESAS ANTTFRLHDL FLVLLGYASC AGMVVLIRFL WLASQKIRAL
RSGRADNPVP LRELQEGFEE MNRIMRMGPE QMNMVDDNVA IHVFLTTTLD ATLAITKVGV
LLFMKMFLLP IWLGLCLDVS SLPILGSSFE ERIAYAGKDL FSFLLLHWVV GITFMLLVTV
SVLQLREVVH PELLAQTIRP QEPQPDLLGN LMNESIITHM KRMVLSLVIY VVLLAMYIYL
PIQAIMASGV SADLSMAQLK FWYPIMPELQ VPLELLTFHL CMLALLEKHK NSIGEIQHYW
LKFISRLVGL TDSLIPMRVD CFEYVGVLPI FEHEAVSPFW SKLAKNENER EKLLDESVAT
FLKSDIPRVN IGQSKANGQR VLASKDYVRL PDVLPGRLLR SRSVLMPTTI GKYRLQRILS
LDGTPLIEIW KEERGTPIAR PPEGWDDLGA GGADVQGRWA WGREKKSVVE EAIAHRVDFF
GRHKASVSYW AVWVKLIFMF FSAWLSTTCF ICISLICPLA FGRTLYHILQ VPQAYVHDPL
AFVVGCLIFF PVARCIGRWS LSGENSLMQR LFSWLRSYRR PPSAKARLLL VTAITCFGLA
PVLLGFIYHA LLVKLPPFFA GTQEWIALSL FSSYWATGFV LLFAWARLCI AEAFTKKFWK
GLMGAAGDVD DNEGNVQNSL RWAWQGKEGR VSRFTKCWRK VVLTWEFDQV DLNTLVDDVA
TPVITALARV VVPYCIFMML IVRTFGLDQV TVATFGRIAL GLLCAEGVSR TWRIQFSNWL
EAAHQMARDD RYLIGEMLMN YDG