Gene PHATR_44239 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATR_44239 
Symbol 
ID7204070 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011671 
Strand
Start bp1406812 
End bp1410580 
Gene Length3769 bp 
Protein Length1180 aa 
Translation table 
GC content52% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002186246 
Protein GI219113325 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.153589 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCGGAAG AACCGCCTGC CGATTGGAGG GAGAACTCCA CCGGGGGATT GGCGGATCCC 
AACGTCCCCG TCGCTCCGTA TCCACCGATG ATTTCCGCAC GTTTTCCTTC CCATCATCTT
CGCTCTTCCC TTTCCCATTC TCCACACTAC ACGTCTCCCA CTCCTAGTTT TATCGGCAGT
CGCGCCGAAT CCAATGTGAG TGGGACTAGC AGTCTTTTCC CCGGATACGC CACGAACGTC
GGAGGAGCCT CCACCGCGAC GAGTTTGGAA GACGGCGGCA CGGCGGATGA AGCTTCTCGG
CAGACCAACG CATTCGCAGT AACCGACGAT TCCGCGGCAA TCGCAAACAT TGAATTTTGT
CGACCGGATC TCCTAGCTTG TATTAATTGT GTTTCCTCCT CTAGCAATAC CCACACCACG
TCCAACACTC CAACGTCACT CAACAGCGCC AAATTCGCAG ATTCTACCAC CCCGGCGCGC
AGCAATCGCA AAATGGGACC TGTTCTCTTG GAGCTCCGAA AACTTCAGCT TAACGAAGGA
ACTCACGGAA ATCATGATTT CTTTACGACA TCACAAATTT CCGTATCGAG AGGTAACCCC
TCACTGGGCA TGAGCATGTC GAGCACCTGC TTGCATGTAT CGCCACAGGC AATGCGGACC
GGCGAAAACG CACTGCACAG ACCACCACCG CCAATTGCGA CGGGACTAAC GACCGGAGCC
TTATGTATTC ACACTTTTGT CATAAAGTCG GATGGCGACA ATGAAAACAA TCAGGATTGG
ACGCCCAACG TGGAGTACTA CCATACACCA CGACATCATC GTGCCTCCAC TGCAGTACAG
TGGTGTCCAA CGACGGTGCG TCCTCAGCAT GTAGCAATTG GTTTACTTTC TGCTTCTTCC
AGTGGTAGTC ACCCTACCAC AAACAGTGTC GTACCCGGAC GGCGAGGCGT CTCGGGTGGT
GGTGTGGCCG CTAGTGTGGG ACTAGGCGCC CGATCGGCTA GTACTGGCGA CAAGGATTAT
TGTTGTTTCG TATGGGATGT TGAGCACCAA AGCGCTTCTC GACGGACGAA GACATCGCCG
ATTTACAAAC TAAGTCATCA GTCCGGCGTT GCTTCATTAG GTTGGCTCAT GGGCGGGGAA
ACCTTGGCCA TAGGCGGACA ATTGCGTCAT GTTCAATTGT ACGACTTGCG CGAAGCCACG
ACGTCGGCAC CCATGACTGT TATGGCCCAC AATTTTGCCG TACACGGAAT TGTCCCCGAC
CCTCACAAGT CCTGGCAATT TGCTACTTAC AGTCGGGTAT CGAACGAGCC CGTAAAAATC
TGGGATTGTC GAAGAATGGA CACCAACTTG ACGGAAATCA AAATTCCTTC CCAGTCAATA
TCTCCTTCGT CAGTATCGGG TGTGACACCT CCCGTCTCGC AAGTGCACTG GTCACCACTA
GAAGCAGGCT TCTTGTCAGT AGCGGTTGGA GACGCCATCT ACGAATTCGA TACAACGACG
CCGGCCTCAC GACCGATTCA TGTCAATACG ATGTATGCTC GGGGATCGGT TCTCGACGTG
GCATTGTACC CGTTTGTGGC GGAGATGGGG ACCGCCAAGG AAGCGAGTGT GCATAAACTC
AAGGCTGAAC AACGTATTCC AACTTTATTG ATCCAAGAAG ATGCGATGGA AGCAAAGCAC
CTGGAGGAGA TGAAGCTCAA TCATTTTCTG GAAAAACGTT CGGTACGTAG CAACCAACGC
ATTCTTGGAG AACTCTACCC CAATCGTATG ATGGTAGTCT ATACAGACAG ATCTCTACAC
GATTTTCCAC GCCATACCAT TGCTCCGTTG GCAGTGTCCA GCAGGGATGG CAGGTTGGTG
CACTCCATCG GACGTACGCT ATGGGTAGGG TCCAGCAGGC AAGGACCGGC TGCCATTGAA
CGTCTTACCG CAGCGCAAGA TGAGGATGTA TCCGCCGTAA TGTTGCGCCG GGCGCGCTGC
ATACAGTCCA TCAATTATTC TATGGATCCA TCAGCCAATA TTCAAATCCT TGCGCATGAT
GGGAGCGGAG TCGACTCGCT GTTACGGCTA TGGAGTTGGA TTGAACGAGT GGAAGTCTTG
TGCTCCAGTA CAGAAACAGA CGATGGATGG GATGATGGCA TGTCATGGCC GGCGAAGACT
TTGATGGACG CGGGTGCTTG GCGCCTGTTG CATATTGCCG GATGTGGCGA GGGGGAAATA
CGGGGCTTTT CTGAACATTC CTGCTGCTCA ATTTATGATA GTCCAGGCCG CCGGTAAGGA
AAAGTGAATG AAAAGATCCT TTACATTGTG TGGCCATATC ATTTCTAAAC TACATGAGAC
TTTTCTATGC AGTGCGGCGT TAACTTCATG CGGGTGGGCG GGAAGGTTTG ATCTCTCGAC
GGTTATGGGG GAATGTGAGG AGCTCGGTGA GTACGAAAGG TCGGCGGCTC TGGCCGTATG
GCACGACGAC ATCGGGGCCG CAGTTGACTG CCTGCAGCGT GGAGCCTCGG TGATCAGGCA
ACAAATGAAG AGGGGTGGAG AAAGTGTTAA TATGTATTGC TCCTCCGAGC ATGCCGAAAC
GCTGGATCTC GTTTCTTTCT GTGTAGCTGG TCATCGGGGT GACAGCATGG ATTCACCGGC
TTCCGGAATC TGGAGAAGAA CGTGCGCGAC TTTGATGAAA CGAAGCAGCT TTTCTGGCCA
ATCCCGATGT TTTGCCTACG TTCGTGGAAT GCTCAAATTT CTCATGACCT CGGGGTCGGA
CCAAGGGCAT GACGAGGTTC TCTTGTGCGA TGATTTGAGT CTTTGCGATC GCGTTGGCTT
TGCTTGTCGC TTTCTCTCCT GGAACGAGCT TCTGCAATAC TTGGAAACGT GCATCGTCAA
TTGTCAAAGA TCAGGTGACA TCGAGGGTAT GATAATTACA GGGCTCGAAA AGGAAGGTAT
TAAAATCTTG CAATCCTTTG TGGATCGAAC TGCTGATGTG CAAAGCGCTG CTTTAATAAC
GAGTCGAGTC ATTTTTCCCG TTGGTTGGAA TGGTGAACGT CGAGCCAGTA TAGAGTGGTT
GGAATCTTAC CGATCACTGT TAAATACTTG GCAAATGTGG CAGTCTCGCG CCTTGTTTGA
TGTCGATCGT GCGGACCTTT TACGCAAGGT AAAGTCGCGT CAATTTGATG CGTCCGGCAA
ATTTGGCAGC GTTCCCATTA GTCGTCGGCA AGTGTCTGCT GGTGGTAAAC CAGGGCTGCG
CCAACCCGAT CCGGACATTC AAGCCACCAT TCCGGCACAG CTTGACGCCC GCTGTAACTA
CTGCTCCGCT CCATTGAGCT TGAAGCTAAA AGACACGCAC GCCAATCAAT GGCTGTCCAA
AATGAAACCG GTGCTACCAT GCTGTGCACA ATGTCGCAAG CCGCTTCCGC ATTGCGCTAT
TTGCATGTTA TCAATGGGTA CCTTAAATCC ATACATGGAA TTGACGAAAG ACCGATCAGG
GCGGTCGTCC CGTAGTGGCC TTTCGTCGCT GCAGACCGCG GATGACATGT CGTCTTTGGG
GAATTTGCCC TTTGCAGAAT GGTTCACTTG GTGTCTACGA TGCAAGCATG GCGGCCACGC
CCACCATTTG GTGGGATGGT TTGCGAAACA TGAAGTATGC CCCGTGAGCG GGTGTGACTG
TCATTGTCAA TTCGACGGAA TTCATGAGTT GAATCGATAT AAGCAATCTT CAGAGAGAGT
AACAAACGAA AACGAGCAGG ACACGACAAG CAACACCGAG GCCGACTAA
 
Protein sequence
MSEEPPADWR ENSTGGLADP NVPVAPYPPM ISARFPSHHL RSSLSHSPHY TSPTPSFIGS 
RAESNVSGTS SLFPGYATNV GGASTATSLE DGGTADEASR QTNAFAVTDD SAAIANIEFC
RPDLLACINC VSSSSNTHTT SNTPTSLNSA KFADSTTPAR SNRKMGPVLL ELRKLQLNEG
THGNHDFFTT SQISVSRGNP SLGMSMSSTC LHVSPQAMRT GENALHRPPP PIATGLTTGA
LCIHTFVIKS DGDNENNQDW TPNVEYYHTP RHHRASTAVQ WCPTTVRPQH VAIGLLSASS
SGSHPTTNSV VPGRRGVSGG GVAASVGLGA RSASTGDKDY CCFVWDVEHQ SASRRTKTSP
IYKLSHQSGV ASLGWLMGGE TLAIGGQLRH VQLYDLREAT TSAPMTVMAH NFAVHGIVPD
PHKSWQFATY SRVSNEPVKI WDCRRMDTNL TEIKIPSQSI SPSSVSGVTP PVSQVHWSPL
EAGFLSVAVG DAIYEFDTTT PASRPIHVNT MYARGSVLDV ALYPFVAEMG TAKEASQPTH
SWRTLPQSYD GSLYRQISTR FSTPYHCSVG SVQQGWQVGA LHRTQGPAAI ERLTAAQDED
VSAVMLRRAR CIQSINYSMD PSANIQILAH DGSGVDSLLR LWSWIERVEV LCSSTETDDG
WDDGMSWPAK TLMDAGAWRL LHIAGCGEGE IRGFSEHSCC SIYDSPGRRA ALTSCGWAGR
FDLSTVMGEC EELGEYERSA ALAVWHDDIG AAVDCLQRGA SVIRQQMKRG GESVNMYCSS
EHAETLDLVS FCVAGHRGDS MDSPASGIWR RTCATLMKRS SFSGQSRCFA YVRGMLKFLM
TSGSDQGHDE VLLCDDLSLC DRVGFACRFL SWNELLQYLE TCIVNCQRSG DIEGMIITGL
EKEGIKILQS FVDRTADVQS AALITSRVIF PVGWNGERRA SIEWLESYRS LLNTWQMWQS
RALFDVDRAD LLRKVKSRQF DASGKFGSVP ISRRQVSAGG KPGLRQPDPD IQATIPAQLD
ARCNYCSAPL SLKLKDTHAN QWLSKMKPVL PCCAQCRKPL PHCAICMLSM GTLNPYMELT
KDRSGRSSRS GLSSLQTADD MSSLGNLPFA EWFTWCLRCK HGGHAHHLVG WFAKHEVCPV
SGCDCHCQFD GIHELNRYKQ SSERVTNENE QDTTSNTEAD