Gene PHATRDRAFT_40968 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_40968 
Symbol 
ID7198902 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011695 
Strand
Start bp13080 
End bp16387 
Gene Length3308 bp 
Protein Length1059 aa 
Translation table 
GC content48% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002185029 
Protein GI219129718 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTTCCTG CCACCCGGCA AATGACAAGT GTAGCCGTCT ATGCCCATTT TCTGGACAAT 
GTACTCCTTC TTCCCCAAGG ACATCCTATT CGTCTTGCCT TTGATCAACA AGGATATGAA
TCGGCTGACA ATCTCTTGTG CATCTTTGAG AACGAACTCG ACTCTCTTGA GTACACTCCT
CCTGCCATTC CTGACGGTCC CGAAAATCCG TCGTGCATCC CTCTAATCAT GGCACATCGA
CAGATCATAC GTCACTTCCT ACGTTGGCAA GCATCCTTAA AAGACCAAAA GGGGGCTCCT
TTGAAGAACT CCGAGCTCGT TGCACTCAAC AACGAGGACT TTGTCCTGTA CCGTCGGTCA
GCACTTGGTC AGGTTTCGAC GGCCACTGCA CCTGCCACTG TTCCCCCGAC TGTTCAGAGT
CCCACAGGAA AGACACGTTC GGCTGTCGAG GATTTCAAGC GTGGGATAAA ACGTGATAAA
ACTCACTATC CTGTGCTCAA AGATGACCGA TACTGGGACA ACTTCTATTG TTCGTTTGTT
GTTACTGCCG TAACACATAA CGTTGACAAG GTTCTAGACC CGAACTACAT TCCTACCGAT
CCTTTGGAAA AGTCCCTCTT TGAAGAACAG AACAAGTTTG TATATTCTGC TCTAGAGCAT
ACTCTTCAGA CGGACATGGG AAAGAACATT GTCCGTGAGC ACAGTTTTGA TTTCAATGCC
CAAGAAGTTT TCCGTAAAAT TGTGAAACAT TATACGGAGT CAGCCAGCGC AAAGATCAGT
TCGTCTACTA CTCTGGGGTA TCTCACAACT GCAAAATACG GATCGTCATG GACAGGTACA
GCAGAAGGAT TTATTCTCCA CTGGAAGAAT CACTTGCGTA TTTACAATGA CACTGTACCT
ACTGGTGAGC AACTTCCTCA GCAATTGTGC CTTAGTCTTT TGGAGAATGC TGTTCATGAT
GTACCTGAAC TCCGACAGGT TAAAATCACA GCAACACTCG ACTTAGCCAA AGGTGGTAGT
CCCATTAGCT ACGATAGTTA CCTCAGTCTC CTCCTTGCAT CGGCATCGCT CTACGACAAC
GGTAATAATC TATCTAATTC TCGCAGTGGC AAGAACAAGC GCAATATCTA TACTACTGAA
CTAGCCTATC ATCCGACGGA TTTTGAAAGC GAACCAGATG TAGACTATGA TATAGATGTG
TCACCGACTG CCATATACGA AGCCAATGCC CACGTCCGTA ACAACAGTAC CCGTAACCGT
CCCCTGGCAA CTAATCGCGA ATGACCTTAC ATTCCTCGTG AAATGTGGAA TTTGCTCTCT
GATGATTCCA AGGCCATCCT CCAAGGTTTG GCTGCACCCG GCAAGCAGGC ACCATTAAAT
GGTAGCCCGC CTCATCAAAC GCTGCAGGCC AATACACACG AGACCATTGG CACGGAACAT
ACCGCAACGG ACACCTTCCA TGATTGCGCA CCTGAAACTG AATTACTCGC ACATCTTACT
GAGCGTGTCA GTCGCATGAG CAGCGGTGAT ATTCGTAAGG TACTCGCCGC ATCACGTGAC
GTATCAGAAA AGCCCAAATC ACTGCAATCT AACGTACTGC AATACCAAGT CTCTTGTCAT
ACTACCAACG AGACTTCTGC ATCCCTTGTT GACCGTGGCG CTAACGGAGG GCTTGCCGGT
GGTGATGTCA TTGTCCTGCT CAAAACAGGA CGTTCGGCAA ACATCACAGG TATCAACGAT
CATACCTTGC CAAACTTGGA CATCGTCACT GCCGCTGGAT GTGTTGAATC CCAAAATGGG
CCCATCATTC TCATTATGAA CCAGTATGCT CATCTGGGGA AGGGTAAAAC CATTCATTCA
AGTGCGCAGT TGGAACACTA TCGTAATCAT GTCGAAGACC GTTCACGCAC GGTAGGGGGT
AATCAGCGCA TTGTCACTTT AGATGACTAT ATCATTCCCC TCCATATTCG ACAGGGACTT
CCATACATGG ACATGCGACG CCCCACTGAT GCTGAACTAG CGTCCCTCCC GCATGTTGTC
CTAACCTCAG ACGTCGATTG GGACCCCTCT GTACTCGACA ATGAAATTGA CCTTGCGACT
TCATGGTACG ATGGCATCCA TGACTTGCCC CAGCCCCCAT ACGTTGAACC ACGTTTTGAT
CATACAGGCC AATACCTTCA CCGTCACATT TCTCTATGCG ACTACCGTGA TGACGCCATT
GCACGTATCA TGCAGTGTCA ACAGCATCAC GTCACACGTA ATGTGCACGA TTATGAAGCC
CTTCGTCCTT GCTTTGGCTG GGTCTCTGCC GACACCGTTC GGAAAACCAT CATGGCCACC
ACGCAGCATG CCCGTGAAGT CTATCACGCA CCGTTACGTA AACATTTTAA GTCTCGTTTC
CCAGCCTTAA ATGTACACCG TCGTAACGAA CCGGTCGCTA CTGATACCAT ATGGTCCGAC
ACTCCCGCCG TAGACAATGG TGCAAAATTT GCACAACTCT TCGTTGGCAG ACGGTCTCTT
GTCACTGATG CCTACCCCAT GAAAACTGAT AAAGAGTTTG TCAACACCCT TGAAGATCAC
ATCCGTTTCC GTGGTGCAAT GGACAAACTA ATCAGTGATC GCGCTCAAGT TGAGATCAGT
AAAAAGGTCA CTGATATTAC ACGCGCATAT AATATCGATC AGTGGCAGAG TGAGCCTAAC
CATCAGCACC AAAACTTCGC CGAACGTCGT ATCGCCACCA TCGAAGCCAA TACGAACAAC
ATTCTCAATC TTACTGGTGC TCCTGATAAC ACCTGGCTTC TTTGCGTGAC ATATGTTTGC
TATGTCTTCA ACCATTTGGC GCATGAATCT CTCGATCATC GCACCCCCCT CGAAGTGCTT
ACTGGTTCTA CACCTGATAT CAGTGTTCTC CTTCAGTTTC ATTTTTGGGA ACCGGTCTAT
TATAGAATTG AAGATGCGAC ATTCCCCTCT GGTGGTACCG AGCAACAAGG ACATTTTGTC
GGCATCGCAG ACTCCGTCGG TGACGCTCTC ACTTATAAGA TCCTCAACGA CCGCACCAAC
CGCATTCTAT ATCGATCTAG TGTTCGTTCT GTGGCCATTT CCGGGCAAAC CAACCTACGC
CTTGCGTCAC AGGATGGGGA GAATGGTCCT AAGCCCATCA ACTTTATCAA GTCGCGTAGA
ACCGAAAATC AAAATTCCTA TGCCATTAAG GAGTTGCCTG GTTTTACACC TGATGATCTT
ATCGGTCGCA CGTTTCTCAC CGACACTCGT GATGATGGAG AGCGTTTTCG GGCACGAATC
ACCCGTAA
 
Protein sequence
MVPATRQMTS VAVYAHFLDN VLLLPQGHPI RLAFDQQGYE SADNLLCIFE NELDSLEYTP 
PAIPDGPENP SCIPLIMAHR QIIRHFLRWQ ASLKDQKGAP LKNSELVALN NEDFVLYRRS
ALGQVSTATA PATVPPTVQS PTGKTRSAVE DFKRGIKRDK THYPVLKDDR YWDNFYCSFV
VTAVTHNVDK VLDPNYIPTD PLEKSLFEEQ NKFVYSALEH TLQTDMGKNI VREHSFDFNA
QEVFRKIVKH YTESASAKIS SSTTLGYLTT AKYGSSWTGT AEGFILHWKN HLRIYNDTVP
TGEQLPQQLC LSLLENAVHD VPELRQVKIT ATLDLAKGGS PISYDSYLSL LLASASLYDN
GNNLSNSRSG KNKRNIYTTE LAYHPTDFES EPDVDYDIDV SPTAIYEANA HAILQGLAAP
GKQAPLNGSP PHQTLQANTH ETIGTEHTAT DTFHDCAPET ELLAHLTERV SRMSSGDIRK
VLAASRDVSE KPKSLQSNVL QYQVSCHTTN ETSASLVDRG ANGGLAGGDV IVLLKTGRSA
NITGINDHTL PNLDIVTAAG CVESQNGPII LIMNQYAHLG KGKTIHSSAQ LEHYRNHVED
RSRTVGGNQR IVTLDDYIIP LHIRQGLPYM DMRRPTDAEL ASLPHVVLTS DVDWDPSVLD
NEIDLATSWY DGIHDLPQPP YVEPRFDHTG QYLHRHISLC DYRDDAIARI MQCQQHHVTR
NVHDYEALRP CFGWVSADTV RKTIMATTQH AREVYHAPLR KHFKSRFPAL NVHRRNEPVA
TDTIWSDTPA VDNGAKFAQL FVGRRSLVTD AYPMKTDKEF VNTLEDHIRF RGAMDKLISD
RAQVEISKKV TDITRAYNID QWQSEPNHQH QNFAERRIAT IEANTNNILN LTGAPDNTWL
LCVTYVCYVF NHLAHESLDH RTPLEVLTGS TPDISVLLQF HFWEPVYYRI EDATFPSGGT
EQQGHFVGIA DSVGDALTYK ILNDRTNRIL YRSSVRSVAI SGQTNLRLAS QDGENGPKPI
NFIKSRRTEN QNSYAIKELP GFTPDDLIGR TAFSGTNHP