Gene PHATRDRAFT_49989 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_49989 
Symbol 
ID7198774 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011694 
Strand
Start bp52983 
End bp58383 
Gene Length5401 bp 
Protein Length1498 aa 
Translation table 
GC content51% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002184819 
Protein GI219129278 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGACAGT CCGAGTCCAC AATAACTTCC CCCCCCAAAG GCACCTCGAG CAATCCACTC 
GAAGACGGCG GAGCGTCCGC GTCCTCCTCC TCCTCCTCCT TCACCACTCG CCAACCCCGG
ACCGCTACGA ATCTCACGAT GTCGCCCCCC CTTTCTCCTC AACCTCGTCC CAGCGGAGGA
ACCAATTCCC ATTGGGAAAC TTCCGAAACC GGCGCCTCCC AGGGTTCGGA CGGGGAAGTC
GTCTGCCACG CCTGCGGCTT TGAAGCCTCT TACGTTCCGG AAGGTATGTT GCACTGTGAA
CGCTGCCGGA ACGTTTCCTA TTGTTCCCTA CATTGCCAAC AGTGGGATTG GACGTCGGGC
GGACACTCCG ATCTCTGTGT CGACGCTCGG TCAACGGCTC ACGAAGACAG TACCACCGGG
AGTGCCAGCA ACAACAACAG TGGCAGTGGA ACTACTCGTC CCTCGGAGGA TTCCACCCTC
GTCTCCATGG ATATAGCCAA CGTATTCGGA CCCCGAAGTG TCGGATCTGC CTCCACTCCC
GCACCCGCCC CCGTGAATCG AGGTGTCCAG CGGAGACCGG AAACGAAGGA AGGCGGCCTC
GGGGCCTACC TCCACGCGCA CACCCCGCGT ACCACCCCAC CAACTCCACT CGCATCCCGG
TACAATTCTC CCAACGACCA GGACCTCGAC GACGATGATC CGAACGAGGG GTCCATTGTA
TTAGTATCGG ACTCGGAAAG TACCGACATT CTCGGAATGA TTCAGGAAGA ATCCGAAGGA
AACGAAGAAC CGGAATTGCA AGGTTCCTGG CGATCACGGG ACAATCCGTT CCACAAATCC
GCTTTGGGAG AAACCTCGTA CGACTCGGAA CGGGACGAAC GGGAACTCAT TCTCGCCACG
GCCCCCGCTG CGACGGTCCG GGACGCCCAC GAACACGCTC AAAACAACAA CAATAACACC
GTCACCGCAA CGGCATCTTT CGCCAGCCCT CAACGAGATT CGCTCAAGGC CTTTCGGGCC
GTAGCGTCCG AAACCACCAA CGTAGAGAGC CACGGCAAAC ATAGCCTCAA AGGCTTTCGA
CACGCCTACG ATGATGCCCC GTCGAAAGAA AAGATCGCTT TCTCACACAC ACTCATCCGC
CACAGTGATA CGGCCGATGA CACCATGAGT ACCACAGGAT CTCATCCTGG CCAAGTGAAT
GAGGCTACTG CGAACAGGAC AAACCAGAAC GCGACCGCCG TCACGAATTT GGCCCTCAAA
ACTAGCATCA ACAAAGCTTT GCGAGATTTC GAACGCTTGT ATGGAGAGGA AGCCGCACAG
CTGGCTGTAT TGCAACTCAC CCAAGGTCTA ATTACTGAAG ACGACGTTAT TGAACAAGCC
AGTCAAAGTC CGGACGAGTC AGAACAGTAC AATGAAACCA CAACCGAGTC TGGGGACTCA
TCCCCAGCAG ATCCGAAAAG TTCACCAACG ATTGTAGACC AGAGCCTGTC AAGTTGGGGC
TTGTCCGGCA TAGCGTCGAC TGAGGGGTTG GACAAGCCAT CCGCGCACAG CACATCGTCC
CTGATCACAG AAAATATGTC TCGGGACTCC AAAAATGCTG CTACCTTATC GTCATCCTCG
TCGTCGCAAC AGAATGCGTT GGCCTTTGCA AGTCCTCATT CCGTTGGGGC CACCACCACA
GCCGCATCTA CGGCTCCGCT TTTGACACCC GCTGAGCCAG AAGACGAAAG TGCTTGCAGT
GACGGATCCT TGTGCACACC TACTGAATCG TCGGTACAAG CAAAGCCATT TACGGTCCAT
ACCCCCCGGT ATCTTCAGTA CCGCAATTCC TTGTCCAAAT CTACCGCCAA AAGTGGTTTG
CCAGGAACGA CAGCTGTAAG AGTTGCACAC GACAGTGATA CAATTAAAAC CCAGGATACA
AAGAATCGTG CAAACCGACA AAAGGAGCCT ACCCAGAACG GAGCTGCCTT GGTCGGTGAA
ATAATAGAAG GTGCCGCTGC CGTGGGCACC AAAAGGACAC TTAGCCCCAA TGACCCACCA
GAGCAGCTAA GGGAAGCGTC ACGAGTCGTG CCGCCTCCTG CGATTGTGGC GACTCCGACG
GAGACGTCTA ACCAAGATTT TCCACCGTCT ATTCATGAAG AGAAGATCGC ATTATCCATG
CCCCGTTACC TGACGTATCG CTCGTCGCTG GCCAGATCAG TCGACAGAAA AAGCTTTTCT
GTCCAGCTTT CTGCGCAAGA ACTTGAATCC TATGAAGTTG GCCAGTTAAG AATGCCGGGC
AATGGAAAAC ACAATAATTC CGATGTTCCG GCGGTAGTCG CTACGGTTCA GGACAACATA
CATGAAACGG AAAGGGCCAA GACGGCGACG GTAGCTGCCG AGATGTCGAG CAATTACGAA
CTGGGGAATA ATGAATTCAG GCAATCTTTG TCTCCATCGC AGGTAGAAGT TGAGGCTAAT
AAGATGAGCG ATAGTACAGG ACTTGCCGAT GCTATTGGCG GAATTGCAGC CTTGGGGTCG
GGGGCTGTTG CTCTGGCAAG CACGAAGAAA AATTCTGACA ATAACCAGGT AATATCCAAT
GTGTTGCTTG CAGATTCAGA GCTGGGTCTG GAATCCCAGA TGGTGCCGAT CATGCGAAAA
CTGTCACCGA ATGCTATTAA GGAAGGGGCA CCGTCTCGTG CGGAGAGCTT TTACTCTCGC
TATCGGGCCT CGTTGGCTCA GAGACTTTCC CAACTCTTTA TTGTAGAAGA TGCGATGTTG
TCTGGCGATG ACTCGGACGA TTCTTTGAAT GCCGACGAAG AGCGAGAACT CACAGACCAG
CTTTCTTCGT ATTTGGAGAA GGGGAATTCC AAAAGAACCA TTGAAGAGAA TGCATCTCGA
CCCGGCGCTT TGGGACACAA GAGCTACGAT GGAATGAACA GCTCCTGGAG TGGTTTCGAC
GAAGAAAATT CTGTGGACGC AAACATCGAC GGCTCTCGCA GCAGCGAAGT CTGCGAGCGA
AAATTTCAGA CAGCCTTGGT TACTGACGCG GGTGCAATCA ATTTGCGAGA AGCTCGAGAA
AGCGCTCGAG CTGAACAAGC TCGAAAAATT GAGCTTGCCA GATCAAGCTC CAAACAAGTC
ATGTACCAGT CCGTATCTCA AGACTCGAAA CCCTCCAAAA TGACAGAGAG GCGCGCAGCT
CCTCTCACAG TAACCAGGAA TCAAGAAAGT TATAGTGAGA AGCGGATAAG CCGAAGCAAT
TCCAGCTCTT CTGTAGAAGA TGCTGAGACT GCGTTATCTG CAAAACGAAG AGCGCCAGCC
CCCATCAGTA GTCAATGTTT TGCTCCGAAA GTAGCAGAGT ACAGAAATCG GAAACGTTGT
GCGATGCTCG GCTTCATTTT CCTTCTCGTG GTACTTCCTC TCGCAATTGG ACTTGGGGTT
GGACTTCGTG GAAGCAATAA AAATCGCTCG ACTAACTTTC TACCAGACAC ACAACCCCCA
GCAGGATCAA ACCCAACTCC CTCGCCTGAC ACGCAAGAAC CTACGAATTT TCTGCGTACG
CGGGCGCCCT CGCAGTCAAC GGACGATACA CCAGGATCAC CGACCATTAA TGCTCCTACG
CAAAGCCCAA CAGCTCCACG AATGGAAACC CCCATCGTTT TGCCAATTGA ATCTCCTTCA
CCTTCCAATC TCAACCCGGT ATCTTCGTTG GTTCCCTCCA TCGCGCCCAC TCCAACAGTT
TCTCTATCAA ACGCTCCTAA TATCATGGAT TCATCCCAAG CCCCAACGGT ACTGTTGCTA
AACCAAGAGC TCTTCAGGAT GCTAAGCGAT TTGTCTGAGG ACAATGGAGC AAGCATCCTT
CGCCCCTTCA CACCGCAGCG CCGAGCGTTC GAATGGCTTG CATCTACTTC AGACCTCGAC
ACTTTGTCCA ACACACGAAA GGTTCAGCGA TTTTCTTTGT CTGTGTTCTT TTTTACTTCG
AATGGTAGCC TTTGGCGTAA CAATTCTGGA TGGCTAACAG AAAGTGATGA ATGCACATGG
TACTCGAGGT CTGGGCGCAC AACTTGCGAT GGCAGTGGAG TATACCTGCA CTTGGAATTG
GGAGACAACG ATGTGGCAGG AAGAATTGCC ACAGAAATTG GTTTGTTGAC AGGACTTCGA
CGTTTGGACT TGACAGGTGG TAGTGGAAGT CGCTTGAGCA GCACACTACC AACGGAGCTC
GGAGTACTTT CCGACCTTGA GTTTGTGAGT TTCCGGAACA ATTCCATATC TAGAAGCATA
CCGATCGAGT TGGGTCAGCT TACGAGACTC CAGCATCTTG ACTTGAGTAT GAACGTGCTT
AGAGACTCGA TTCCGACAGC CTTTGGTCAG CTGGCAGCCT TAGAAACGCT TGATCTAGGA
CACAATACTC TTTCCGGCTC CATCCCTACT GAGCTCGGCC GGTTGTTGAC TGCGCGAAGT
ATCAAGCTGA ACAATAATAT TCTTACTGGT GCATTGTCCA CTTTTATTGG TCAACTTTCT
GAATTGGAAT TGCTTAACCT TGCAACGAAC CAGGTGTCGA CCATCCCGAC CGAGCTCGGA
CAGCTCACCA GCCTTGCATC TTTTGATTTG CATGAGAATA GGGTGAGGGG TCGTTTTCCT
ACGGAAATTG GCTTCTTGAC CCGTCTTCAC TTTCTGGATC TCAGTAACAA TGCCTTTTCT
GGCACTCTAC CCACGGAGAT TGGGCTATTG CAAAATACTC TTCGCCAACT GAACCTTTCG
AATAATCGAT TTTCAGGGGA AATCCCTATT GAAATAGGAA ACCTCGTAGG GCTCTTCAGC
CTGCAGATGC AATCGAATCG ATTCATAGGT TCCGTTCCTG AAGAATTTGA TGGACTGCTA
TTGATTACTA CTATCCGCAT TGACAACAAT GATCTCTCTG GCATGGTGCC AGAACAAGTT
TGTGACCACT TCTCGAATCG ATTACCAAAG TTTTACTTGG ATTGCGGAGG TAGTCCCGCC
AAGTTGTCTT GCCCGCCAGG AACTTGTTGT ACTTACTGCT GCGAAGAAAG CACTGGATGC
GAGTGCGTGT ACGCTGGTAC CAGCTTTCAA TTCTTGTGTT AATGCAACAA AAGTGCGAGG
GTTGCTACCT AACGTAGCTT GAATTTTGTT TTCGCTTTCC CTATCCTTCA GGAATAAATG
TGAAAATTTG TAAACTGAAA ATGTAAGTTT TTTTATTTCG ACGCTTCAAA GAAAACGACT
CGAGACCTTC AAATACGCAC CAAAGGGAGA AGCAAAATAG TCTTTACAGT AATACTACTT
TAGTTCTAGA GTAAGGGTCA GGATCGTTGG TGCGCTTGAA CCACTAATGT CTAACAAAAA
AGCAAGCCTT GACATTAAGG TTGACTGTGA AGTGGACGAT GATGGAGCTA ATTACGAGTG
A
 
Protein sequence
MGQSESTITS PPKGTSSNPL EDGGASASSS SSSFTTRQPR TATNLTMSPP LSPQPRPSGG 
TNSHWETSET GASQGSDGEV VCHACGFEAS YVPEGMLHCE RCRNVSYCSL HCQQWDWTSG
GHSDLCVDAR STAHEDSTTG SASNNNSGSG TTRPSEDSTL VSMDIANVFG PRSVGSASTP
APAPVNRGVQ RRPETKEGGL GAYLHAHTPR TTPPTPLASR YNSPNDQDLD DDDPNEGSIV
LVSDSESTDI LGMIQEESEG NEEPELQGSW RSRDNPFHKS ALGETSYDSE RDERELILAT
APAATVRDAH EHAQNNNNNT VTATASFASP QRDSLKAFRA VASETTNVES HGKHSLKGFR
HAYDDAPSKE KIAFSHTLIR HSDTADDTMS TTGSHPGQVN EATANRTNQN ATAVTNLALK
TSINKALRDF ERLYGEEAAQ LAVLQLTQGL ITEDDVIEQA SQSPDESEQY NETTTESGDS
SPADPKSSPT IVDQSLSSWG LSGIASTEGL DKPSAHSTSS LITENMSRDS KNAATLSSSS
SSQQNALAFA SPHSVGATTT AASTAPLLTP AEPEDESACS DGSLCTPTES SVQAKPFTVH
TPRYLQYRNS LSKSTAKSGL PGTTAVRVAH DSDTIKTQDT KNRANRQKEP TQNGAALVGE
IIEGAAAVGT KRTLSPNDPP EQLREASRVV PPPAIVATPT ETSNQDFPPS IHEEKIALSM
PRYLTYRSSL ARSVDRKSFS VQLSAQELES YEVGQLRMPG NGKHNNSDVP AVVATVQDNI
HETERAKTAT VAAEMSSNYE LGNNEFRQSL SPSQVEVEAN KMSDSTGLAD AIGGIAALGS
GAVALASTKK NSDNNQVISN VLLADSELGL ESQMVPIMRK LSPNAIKEGA PSRAESFYSR
YRASLAQRLS QLFIVEDAML SGDDSDDSLN ADEERELTDQ LSSYLEKGNS KRTIEENASR
PGALGHKSYD GMNSSWSGFD EENSVDANID GSRSSEVCER KFQTALVTDA GAINLREARE
SARAEQARKI ELARSSSKQV MYQSVSQDSK PSKMTERRAA PLTVTRNQES YSEKRISRSN
SSSSVEDAET ALSAKRRAPA PISSQCFAPK VAEYRNRKRC AMLGFIFLLV VLPLAIGLGV
GLRGSNKNRS TNFLPDTQPP AGSNPTPSPD TQEPTNFLRT RAPSQSTDDT PGSPTINAPT
QSPTAPRMET PIVLPIESPS PSNLNPVSSL VPSIAPTPTV SLSNAPNIMD SSQAPTVLLL
NQELFRMLSD LSEDNGASIL RPFTPQRRAF EWLASTSDLD TLSNTRKVQR FSLSVFFFTS
NGSLWRNNSG WLTESDECTW YSRSGRTTCD GSGVYLHLEL GDNDVAGRIA TEIGLLTGLR
RLDLTGGSGS RLSSTLPTEL GVLSDLEFNK FVTTSRIDYQ SFTWIAEVVP PSCLARQELV
VLTAAKKALD ARINVKICKL KIVRVRIVGA LEPLMSNKKA SLDIKVDCEV DDDGANYE