Gene PHATRDRAFT_49441 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_49441 
Symbol 
ID7195924 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011690 
Strand
Start bp256360 
End bp259512 
Gene Length3153 bp 
Protein Length1045 aa 
Translation table 
GC content50% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002184214 
Protein GI219128004 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0740088 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
AGAATACATC CTAACATGGC GTCTATGTTG ACGGCAATGT CTTCTTTACC CGCCAAGAAG 
CGAAGAGGTT CAAACGCTAG TAAGAGCAGT GTGGAAGGCG AAAGTAGTGA TGCGAACGAA
GCATTGCCAT CGTTGTCGCA AAGGAACAAT CCTTCCGATA CAACTGCCGC AATGGCAAGC
GGATCATCGG AAAAATCAAA AGCAGAATCT GCTACTCCTC GGTCTCCTGA AGGCCTTCAT
GAGTCAAGCA ATCATGCAAA GCCTCTGAAA ATTAGTGAAA ACCGGCTGCC GAGCAAAAAG
CGAAAGGGAT CACATGACTC CAATCCACAA GGCATCCACA GTCAATCGAA ACGGAAAGGG
TCGCAAGAGT CATCCTCTTC ACTTCGGAAT CATCTTCAAG TACCGCTTCC AGCACTTCAT
GATTCTAACG ATTGTAAACA ATCATCGCAA AGAGGCTCTG GTTGCTCTGA GATGCCTTTC
GAAACAGATC ACAAAGCACG AAAGCGAAAA GGATCGCACG ATTCGAATAC CGTCAAGTTT
GACAAGGACA CCAAAGTGCC CGCAGAACGG AAGGGATCTC ACGATGCATC GGTACAGTTT
AAAGAGGACG TCAAATCGCA AAAACGAAAA GGAGCACAGG AAAAATCGTC TACGCCTTCA
ATAGATTCGG CGGCGGAACA GAAACGAAAG GCTTCTCATG ATTTTACCGA ATCTGTTCTC
CTTAACGGTG TATCGTCGTC GATACCGGGA CGCAAAATGT CTCACGATTT ATCAGTACAA
TTCGTGACAG GCTTCAACGA ACGAAAGTGG TCGCAAGACA TTACCATTCC TAGTTTCGAG
GGAGTGCCTC GCAAAGATAG CTACGATTCC TCTTTGAAAC TGGACGAGTT GCTACCGTTC
CCCCCGCCAG AACCAGTTTC CAATCTTAGA CACCCGTCAT TCAACACTAC CAATACAGAT
ACCCCAAGTG TGCTAACAGG CGTCCATCCG ACGGCAATTA GTGCTATGGA CCATCTCAAG
GCACTCAGTG GCGCAAATGG TGATACAGCT GCGGTGGCAT CAAAAAATGA TCTAGAAGAT
GAAGATAGTC GAAGCAGTTC AACGTTTGCA AGTTCGGGGC AACGCATCCT GCTAGAAGCC
TTTCCCAATC CCTCGCAGGA AGGAAATTTT AGCTCGGGAA ATTTCGGGAG CCATTCAGAT
ACCTCACACC CGACGGGAAG ACATCGTTTA GAGTCTTGGG GCGCCATGTC TGATTTGAGT
GCCCCGTTGG CTGGTGGGGG CTGTTCAGAC TCCACAGCTG CTGCTTTGGC ACACTCAGCT
CTCCAGCACG CGGACTTGGC AGACGATGTT ATGGATGCCG CCGCAGGATT AGACTCTATC
AACGATTTGC ACGAGGCACC AGACGCAATT CCGAATCGTA TTTCTTTGGG ACGGGAACGG
TTTAACTCTG TGGCGTCGTT GTCGGAAGTC TCGCTCTCGG GCCTTTTGTT CGACGGCATT
GAGGTATCAG GGGACATGCA AGCTTTTGTA TCAGCGGCGA TGGCTACAAT GGGAGATCAG
CTCGAAGCAT TGGCGGGTGC TGTGGAGACG GTCGCGAACT CAGCCGGTCC ACATGCATTG
GATGCGCTTC GACGAGAGTT GGGCATGGAC AGTGATGCCG ATAGCGATGC GTCCCCTATG
ATAGGCGCCA TGTCCGATAA CGGGCGGCAC AAAGGCCGTC CAAGATCCTG GTCGACTTCG
TCCGGAAGAA TTTCGGTCGA TTACGAAGCT GTTGCGGCTG CGGTCGATGC GGGAGAAGCC
GCTGACTTGT CCGGAGTTGC AGCTATCGAT CCTTCAAGTA GTTCAATATC AAAAGAACGC
CAAAGGCATG CGAGTCGACG ACAGCTGCCG TTGCAACGAG CTCGTGACGA CAGCGATCTC
TCCTTGAATT CCGACGAGCG GGCTCACCTA AATGCTTCGT TGGTCGGGTC TTCTCTGACA
GATGATGAGA TCAAGCGTAT TCAAGAACGC GCTCGGAAGA AAGCTGGGTA CATTCCACCG
ACCGCAAGAA GCAAAGCTGA GAATAAAGCG AACAAAGAGA AGACGGGGCC TTTCAAAAAG
CGTGTCAAAC GCAATTCTCC TGAGCCGGCA CAGATACGGT CGGCTTCGCC CGGAGGAACT
CACACACCGA AGGCTTCAAA TAAGTCGATG ACCATGTCTG ACAATCTGCC GCTGGTACCG
GACTTGGTGC TCTCAGGGAG CACGGCAGCC AGCAAAGCTG CTAAAGGACA AGCAAGTCAA
AAGTGGGAAA GCATGTTTGA ATGCCTTGTT GAGTACGTTG ACCAATGCAA AAAGGAAGAG
ACGAAAGGAT TTTCTCAGAC CGAGGTTGAT CAGTGGCAAT GGGATGGCAA CGTACCCACG
AGTTACAAGT CAATCGATGG CAAAGCGTTG GGTCGCTGGG TTAACAATCA GCGTTCAGCC
AAAAGCAAGG GAACGTTAAA AGGTGAACGC GAGCAACGAC TTCTGGATGC AGGTTTAAAA
TGGAGTGTGC TAACTTCCAA CTCTTGGAAC GAAATGCTGG AAGAATTACG ACTTTATATC
AAAGAACAAG CGGCGAAGGG GAAGAAATGG GATGGCAACG TCCCGACGAA TTATCAGATA
AAGAACCGAT CAAATGGCCG GTTTGCTGGC GAAGACAAAA ATCTTGGACG TTGGGTCAAT
CGTCAGCGAA GTCAATTCCA TGCAGGAAAG TTGAGGAAAG ACCGTCAACT AGATTTGGAG
AAAGTTGGTC TGAAATGGTC AATGCTTGCC ACCAACTCCT GGGATTCCAT GTACGAAACT
CTTTGTGAGT ACGTGGACCA GAGGAAAAAG GAGGGCGGAG GATGGGATGG GAATGTCCCT
GCCAACTATC GAACCAACGA TTATCCGCCT CGTGCGCTCG GCCGATGGAT CAACCGCCAG
AGATCGGCGT TCGCAAAAGA CAAGCTAAAA AGCGAATACG TTGAAAAATT GAGCGAAACG
GGCTTGAAGT GGAGCGTTCA CGAGCGTACT TGCTCTGAGA AGGACGACAT GGATGGCGAA
GACCACGAGG CTTCCTCTCA ACCGAGCGTG AAGCCAGAGA CGGTAACGTC TACCAAATCT
AACGAAATGG CGGTGGAAAC GATTAAAGTG TAG
 
Protein sequence
MASMLTAMSS LPAKKRRGSN ASKSSVEGES SDANEALPSL SQRNNPSDTT AAMASGSSEK 
SKAESATPRS PEGLHESSNH AKPLKISENR LPSKKRKGSH DSNPQGIHSQ SKRKGSQESS
SSLRNHLQVP LPALHDSNDC KQSSQRGSGC SEMPFETDHK ARKRKGSHDS NTVKFDKDTK
VPAERKGSHD ASVQFKEDVK SQKRKGAQEK SSTPSIDSAA EQKRKASHDF TESVLLNGVS
SSIPGRKMSH DLSVQFVTGF NERKWSQDIT IPSFEGVPRK DSYDSSLKLD ELLPFPPPEP
VSNLRHPSFN TTNTDTPSVL TGVHPTAISA MDHLKALSGA NGDTAAVASK NDLEDEDSRS
SSTFASSGQR ILLEAFPNPS QEGNFSSGNF GSHSDTSHPT GRHRLESWGA MSDLSAPLAG
GGCSDSTAAA LAHSALQHAD LADDVMDAAA GLDSINDLHE APDAIPNRIS LGRERFNSVA
SLSEVSLSGL LFDGIEVSGD MQAFVSAAMA TMGDQLEALA GAVETVANSA GPHALDALRR
ELGMDSDADS DASPMIGAMS DNGRHKGRPR SWSTSSGRIS VDYEAVAAAV DAGEAADLSG
VAAIDPSSSS ISKERQRHAS RRQLPLQRAR DDSDLSLNSD ERAHLNASLV GSSLTDDEIK
RIQERARKKA GYIPPTARSK AENKANKEKT GPFKKRVKRN SPEPAQIRSA SPGGTHTPKA
SNKSMTMSDN LPLVPDLVLS GSTAASKAAK GQASQKWESM FECLVEYVDQ CKKEETKGFS
QTEVDQWQWD GNVPTSYKSI DGKALGRWVN NQRSAKSKGT LKGEREQRLL DAGLKWSVLT
SNSWNEMLEE LRLYIKEQAA KGKKWDGNVP TNYQIKNRSN GRFAGEDKNL GRWVNRQRSQ
FHAGKLRKDR QLDLEKVGLK WSMLATNSWD SMYETLCEYV DQRKKEGGGW DGNVPANYRT
NDYPPRALGR WINRQRSAFA KDKLKSEYVE KLSETGLKWS VHERTCSEKD DMDGEDHEAS
SQPSVKPETV TSTKSNEMAV ETIKV