Gene PHATR_43968 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATR_43968 
Symbol 
ID7204385 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011671 
Strand
Start bp565557 
End bp570692 
Gene Length5136 bp 
Protein Length685 aa 
Translation table 
GC content50% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002186373 
Protein GI219113579 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TGCTGTGGAG ACAATACATT CCAACAACTG TCCGTCTCAC TTTCTAGTGT GCTGCGACTG 
TTAACTTCTC AGTCTCGACG CGTGTGCTGC GCTCGCTTGG ACCTTTCTCG TCGTTGAAGG
AGATCAGACA CCCGAAGAAA TTCATTTGCG TTCTCTCTCG AACGACAAAG AATCGACGCG
GTGGGAGCGC TGGGTTCTAG TTCCGATTTG ACGATAACTT TGTCTTACCA GCATCGATTC
CAAGAAGAGT TTTAAATAGG AAGGTTTTCC GTCGTCACCA AGCATGGCTT CGCTGCCACA
TCATCAGCAA CAGCAGCCAC CGCTACACGC GCTCAGGGAA AATGGCGGTT CTGGGTCCGA
GGGAGGTCTG GCGACTCGAC AAAATGGCAT ACCGACGGAA ATACGGAGCA CCGGATCACA
GTCCTCCAAC AGTCGGAATC CGATTGAATT CCAAACACAG CAATACTATC CGCAGCGGCC
GACAGCACAA CCTATGAGCA CGCCTTGGAC ATTAGGGTCC ACGGATGGAT CGTACGCTAC
CACACTGACA CTCATGCAAC AACAGCAAGG CCAAATGGAT GAAAGCACCG TACCTTTGTC
ATCTCCGAAC GACACCGGTA TTGCTCCCGG CCTCGGCCCG TTGCCTAAAG GAAAAGCGGA
AGGCGGAGCG GCCATAGCCT CGTCGGGCGC CGCCGCGGTG GACCACAACG CCGCGCTCCA
AATTCTGGCG GCTCAAAAGT CGCTGTACGA AACCCGTTTG TTTCGCCAAC CATCCGCATC
GGTGGAAGCG GTCCTGGCAG CATCTTGTGA AGTCATGGGA TTTGATATTT CGGAAATGTG
GTTACGGACC GGAATTAAAA CCCATCAGCT CACCAACTCG CATTTGCGAC CAACCGCTCT
CGAAGACTCC GTCCGCAACG ATCTCGTGGA TGTGTACTAT GGAGACAAGT CTGCGGAGCG
GACCCATCGA CTCAGCCCAG CTCTGTGCAA GCGGGCCAAA GAGGCGAATG ATGTGGTATG
GGTCACGGCA CATACCCCTC ACGGAGCCGA AGCCTTGCGG TGCTCGATTT CCAACGTGCG
CACGGCCGTT GCCGTTCCGG TTTGCCATGA AGCATCCAAT ACCAATATTA CGATTATCTT
TTTCAGCATT CGAAGGTAGG AACGCGGGGA GTGGTTTTTG ACGTTTGCAT TCTCTGCCGC
TGTCTCTCAC TTTTTTGAAA TCATTTTAGA ATTGTTGTTC GTCCAACCGC AGTTGAGTTT
CTCGTCCACA TGTCGCTTGC AGCTGGCGGT GCGTCAGTGA ATTCACTAGC TGAAGATGGT
CTTATCGACC GTGAAGCACT CAGCAGGAAG GACGATAACG AAAAAATTGT CAAGAGTATG
AGTCGGTCCG AGCACGTCCC ACGCAAAGAA GATATTGCCA TTCGTCATCA GCGTGTAGAA
CGATATTCGA TTACCGGTGC ACCGCTGGAT CTCCAGTGGC GGCAGCTGCA CAACGTCGAG
TATTTGACCG ACGGTGGGAA CAGCTGGATT CATACAGCTG TTTTCCAAGG TAAACCAGTT
GTGGTCAAAA CGCTGAAACC AGAATGCCAA GACGTTGTGC TGGCAATAAA TGAAATCGAA
GGTGAACTTG CGGTGCATTC GCGATTGTAC CATACCAACA TTGTAGCGCT CATTGGTGCA
GGGACGACGA GCAAGGGCGT ACGCTTTGTT GTATTAGAAC GTTTAGACGG TGGCACATTG
ACACAGATGC TAGGGTACGA TACGCGTATT CGGGATCGTA GGAGACGTTT CTGGCGACGC
AAGCAATTTA GCTACGTCGA CGTGTTGCGA GTTGCACGGT CCATCGCTGA TGCCATGTCT
TATTGTCACC AAGAAGCGAT ACCAGGATGC ATGGTGCTAC ATCGAGATTT GAAACCAGAC
AATATAGGTA TGTAAAAACT GCATTGCACA CAACCTTTTC ATATGCCAAT GGATTCTTAT
TTCTTACACT TTCATTCGTG TTTTAAGGGT TCACTCTGGA TGGAACCGTG AAAATCATCG
ACTTTGGTTT GGCAAAAATC GTCGAGAATG CATCCGTAGA CTCTGACGAT ATCTACACCA
TGAGTGGAGA AACAGGGTCA CTCCGGTATA TGGCACCAGA GGTGGCCGAT GCTTTGCCTT
ACAACGCAGC TGCCGATGTT TATTCCTTTG GCATCATTCT ATGGGAAATG AACGCGACGA
AAAAGCCATT CGAGGGATTG AACCGCGAAT TGTTTTATGA GCGTGTTGTA CATGGCGGAG
AACGCCCATC ATTGAACCGA AAATGGCCGT CCCAATTGAC GAGTCTGATA TCTGAGTGTT
GGGACGCCGA TATGCACAAC CGTCCAAGAT TCAAAGAGAT TGTCGGGCGT TTAGATGCTT
TGTTAGCGAA GGAAAAGGGA GGGCCAGCGA GCGCTAAAAA GAAGCTGCTG CCCAAGATTA
CCGGGATGAT CGACCGACAT TCCACTTGGT TCTAGATTAC GCAAACCCTA CTCAATACTT
GTACGTTGAG CGATGTCGAC TTGTTTCGTC AGAAACTTTA TACTAAACGA AGTCTAGCAT
ATGCACCATA TTTGGTTAAG ACAGAATATC ATAAGCCAAA GGGCTCTGCA TCATTAATAC
AACGCTTGTG GTACTAGTCG CTGCCTCTGT GCTTCCAAGG TTTCGAGATC TTGATGCAAC
GGGGTGCTTC GTCAGTCTTG GGTTGATCCT GTGTCACGGT AGCATTGGAG TATGACTTTG
CCCGTCGGTA ACGCGGAATC CATTCGTTCA GTGTCCACTT CGGCCCCATA TCCTGCAAAA
CTTCCAGCAC ACCCTCTCTC ACTAGATCGC ACAGTGCCAT TACGACAACG TTGATAGAAC
AGTAGTCACG AACGGCTACC CATCCTTTGG GAACCTTGTC CCAGTCCCGG TTCATGGCAT
CCGAGAGGAC CTCGTCGCCC GTCAGCGTCT TGCTACTCAG AGCTTGTAAG AGCAAATCCT
TAATCGACTC TTTCATACCG CTTGAAAAAC TTTCGTAGGA TTTGAAGTGA CGAAGTCTCT
TCTGACCAGC TCCAATTCCT TTCCCAAACC CATTGTAAGA CTCTGCCGCA TCACGTAGTT
CCGCCCACAA ATGAGAGTTG TTCATGAGCC CAAGCATTGA TGTAGAGCCA TCTTCTCCTA
ATAAACCGGC CTTGGTACGA GAAACGAAGT TGCTAGAGTT CGCATACATC ACTGTGGGAA
CATCACCTTC TTCGCACGGT GTCGCAAAGA TGCTTTCGAC TGCTCGAGTC CTTTCGGAAA
TGGGCCGCCT TACATTTCGG CCGGTAGGTC CAAATATTAG ATAGCGATAT GACAACTTGT
TTCCGAAACC AGAGCTGCCC ATCCCTACAT CTTGCTGTCC ATCTGATTTG TCCTCGGGCG
CATCACAGCT CGCCGAACTT GGGTTCCACT TATTCATGTT GCAGTACCAG ACCTCTGGCA
ATTCGTCCGC GGAAATATGA GGTGGAAGCT TTCGCCACTT GTCACATTTC TCGCATTGCA
CCCACTCTAG ATTATCGGCT TCGTCCTGGT TTGTCGCGTT GGATACCTGA TTTGGCTTCA
CATTCAACCC GTTCCGACGA GGCCGTCCCC TTTTTCCCTT TATTTCCGCA CTCTCAACCA
GAGCATTGAG GGTGTCCTCG GATGCGTTTT GTCCCTTCTT TCCCATGCCT CCTGATTTTT
TATCTTTTGG GTTCGTGTCG CTACCAGTAT CAGAATGGGG TGACTGCATC GAAGGAGAAA
AGCGTTTGCT CTTGGCCTCG GCATCGGTAT CGTCTCGAGC GCGGGAGCTT TGCTGAAATC
CAACGGGTTC TTCTACAATT GCACTATCCG CCCCTTTCGC TTCTCCACCC TCGGCAGATG
CCGCAGCTTG GAGAATAAGT CTTTGCCGTT TCTTACGCCG TTTCTCTTTG GCTTTTTCCT
TCGCCACTTG TTTTAACGAT TGCTCGACGG CTTCGCACGT AGACCGTTTG ACGTCCCAAA
TATTATCCGA GCAGTACCAT TGTTTTGGTA GCGTTGCCAC CGCAGCGGAA GGAATGATTC
GCCATTTCCC GCATTTGTCG CACTGCACCC ATTGACTGTC GTCCTCTACA TCGACCGATC
CATACTGTTC GCTACCCCTT TTTTTCCTCT TTTTGGAAGC TTGCTCAGCG ACGTTACTTT
CTTGCTTAGA CGCGATGCGC TCGTTAGCTT GTTGTGCGGC TCGTCTTCCT GATCGAGCAA
CCGTACTCAA ATCTTCCGTG GAAATCAATT CTTCTTTGGT AAAATGCGGC AGACTTCGCC
TTTCGGTCTC GGCTGCAAAG TTCGAACCTG GATCTTCATT AGTCTGCGTC TCTTCTATAC
CACTTTTTGG TTGGTTCGCG TATTTCATAC TCTCGGACTT CTCAGCGGAC GATGAAGCTT
GTATGTGCTT AAGCTTGGAG GATTGTCGGT CTGTCAAACT TTCGACGGAG GCTGATTCGG
AGATCTCCAT TGCTTCATGC GTTGAGGGTG AAGCAGCTGA CGTTGTCGTG GAATCGACCA
AGTCCTTGTC ATGTGACTTC TCTATCGACT GTGATTTATC CGCAGACTGA GATTTGACCG
ATGAAATGTG AATATGCAAT CGCTTCGGAG CGGGTGGATC AGTCACGGGG CGTTCCTCAG
GCGGGGCGAG CATCTTGGAA GGCGGGGATC CCATATTGTC CGTCGATTGC TGTGCAATCT
CGCTTCTTGA AGCAGTCGCG CTACTGCATC TATCCATATG TCCCCGTCCC TCACGCTTGA
CGTTGGATAT ACGAATAGTT AGGGAGGACT TTGGCTTTTC TGTGGTTTCA CTAATGGGCT
TTGCATTGTC CGAAACACCG GCTCGACTGA ATGCTTCGAA CGGCAAGACT GTGCCAACAG
TAGACTGCTC GGATAAATTT GTCTCCGTCT GCATTGGATT TGCAATCTCG GCCACATCTT
CTTCGTCTTC CGCCGTTGTT CGATGCTTAG GAGGACGCCC TCGTTTACGC TTTGGTGGTA
CATCTAGTCT CTCATCGTGT AGCAATGCGC GACTCGGTAC CGAGATTTCC TCCGATTCAA
GTGTCTCCTC CAATGCCATG GGCTCCACAG CACCTG
 
Protein sequence
MASLPHHQQQ QPPLHALREN GGSGSEGGLA TRQNGIPTEI RSTGSQSSNS RNPIEFQTQQ 
YYPQRPTAQP MSTPWTLGST DGSYATTLTL MQQQQGQMDE STVPLSSPND TGIAPGLGPL
PKGKAEGGAA IASSGAAAVD HNAALQILAA QKSLYETRLF RQPSASVEAV LAASCEVMGF
DISEMWLRTG IKTHQLTNSH LRPTALEDSV RNDLVDVYYG DKSAERTHRL SPALCKRAKE
ANDVVWVTAH TPHGAEALRC SISNVRTAVA VPVCHEASNT NITIIFFSIR RIVVRPTAVE
FLVHMSLAAG GASVNSLAED GLIDREALSR KDDNEKIVKS MSRSEHVPRK EDIAIRHQRV
ERYSITGAPL DLQWRQLHNV EYLTDGGNSW IHTAVFQGKP VVVKTLKPEC QDVVLAINEI
EGELAVHSRL YHTNIVALIG AGTTSKGVRF VVLERLDGGT LTQMLGYDTR IRDRRRRFWR
RKQFSYVDVL RVARSIADAM SYCHQEAIPG CMVLHRDLKP DNIGFTLDGT VKIIDFGLAK
IVENASVDSD DIYTMSGETG SLRYMAPEVA DALPYNAAAD VYSFGIILWE MNATKKPFEG
LNRELFYERV VHGGERPSLN RKWPSQLTSL ISECWDADMH NRPRFKEIVG RLDALLAKEK
GGPASAKKKL LPKITGMIDR HSTWF