Gene PHATRDRAFT_10151 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_10151 
Symbol 
ID7197480 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011670 
Strand
Start bp951042 
End bp955020 
Gene Length3979 bp 
Protein Length1127 aa 
Translation table 
GC content46% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002177726 
Protein GI219111949 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00773421 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAGTG GTGTTGTTAT GCGGGCGGCG GTGATCAACG TTCTTTACGA GCACTCATTG 
AAATTGACAC CCAGGGGACG AGCTGGGCTG ACTTCAGGAG AAGTTACCAA CCTGATAGCC
GTCGATACAC AAAAACTGTA TGAAGTTGCT CAAGAAGGCC ATTTGATTTG GGCTCTTCCT
CTCTCGATTA CGCTTGTAAC CGTCTTCTTG ATACTGATTC TTGGTCCGAT TACTCTGATC
GGTATAGCGG TACTGATACT CTTCGTGCCA TTAGTAGAAA GAGTGACGTC AAGAATGCTA
AAGATACGAC AACGAAGGGC AAAGATGACT GACCAGCGCG TTGAAATTGT TAGCACCATG
CTTCAAGGGG TAAGTGCCTT GAGATTTTTG TGAGAGAAGT ACCCATACAT CATCCAACAC
TTCATTATCC AGATTAAGGT CACGAAGCTG AGCAATCTAG AAGAAAGCTA CGAAACAAGA
GTAGCCGAAG CTCGTCAGCT CGAGCTTAAT GAGCTTCGTA AAGAATTGGC TGTATGGGCA
CTGACTCTCG TTATCACGTA AGTTCTGATG GACGCTCGTG ATTTCTCTTG AGTAAGATGC
TGACATCTGT TTGTTTGGTG TCTATTTAGC GTATCCTCTC CAGTAGTAGC AAGTTCGGCG
ACATTTGCTG CGTACGTCTT GGTTGACGAG CGCAATGTTC TTACCGCGGC AGAAACTTTT
TCCGTTTTGT TGCTGTTCGG CGCTCTCCGA TTTCCAATAA ACTATGCAGG CCGGCTTGCT
GGAAGTAAGT AACGCCAAAA GATATTGATT TTGAGAGCAT GACATAGCTT ACAATTTCAT
TCATTCTCAT GGCAGAGATG GTGCAGGCCC TATCCGCGAT TACTCGCATT AACTCATTCT
TTGAACGGGA AACGAGAGAC GTCGACTTTT CTCTTGTGCC GTCAAATAAC GTTGCTTTGT
CTGAGTCATC GGACATACCC CTTATTCTTT CTAGAGCGGC ATTTTTCTTG CAGCCCGCCG
ATGAATGTGT GCGTAACGGA CAAGACAACG GAAAGAAGAA CATACACAAG GGAAGCTTTG
AGCTCAGCGC GGCGTCGTTC AAAGTCTCGA CATTCGATTT TACAATTCGG AAGGGCGAAG
TCATTGCCAT TTGTGGCCCT GTCGGTTCTG GGAAATCAAC GCTTATTCAT GGTATATTGG
ACGAAGTCCC GTCCATTGAA GGCACGGAAG TTTCCAGATA TGGACGAACA GCCTTTGTTC
CTCAAACACC GTTCATTTTA AACACAACTC TAAGAGAAAA TATTCTGTTT GGGTTGCCTT
TCGAAAGTTC CGTTTACGAG CGAGTTCTTG ACGTATGCTG TTTGCGACAA GATATTCAAC
AGCTGGGAGA ATCAAAGGAT CATACCGAGA TTGGGGAACG TGGGGTGACT CTTTCAGGTG
GACAAAAGCA AAGAGTTTCG CTAGCTCGTG CAGCTTACGC AAGGCCCGAT TTAGTTCTTC
TTGACGATCC GCTGTCAGCT CTGGATGCGG GAACTGCCAA ACTTGTGTTC GAACGCCTAA
TCAAGTCGAC TGGGTCTTAC TTCTCGGATA CTGCCGTTGT TCTTGTGACT CATGCCTCGC
ATTTTCTGAA CCGAGTAGAC AAGGCACTTA TCATCGTTGG AGGCAAGAAT GAGTTTTATG
GGAGTTGGAA TGATCTTGCT ACCTACCATG CAAACGACTT CGAAACAAAT GTAGCCATTG
ATTTCTTGCG TACTTCCGTT CAAGAAGTTG CGAGTGAGAG CACCGATAGC GCGGACCAAA
ACAAGGATGA AAAACTCCTG TGTAAGCAAG TGGACGTGAA GGATACTTTG ATGGCAGCAG
AAGAGCGAGA ACATGGTCTT TCCAGTCTTA GTGTTTGGCT CCTATGGTTC AAGCGCGCTG
GAGGCTTTTA TTTCATTTTC TTTCAAGTTC TTTTCATGGG TATCGATCGT TTTTCTTACG
TTGCTACGGA ATATTGGCTT GCAAGATGGA CGCAATCTGC GGATAAGCCA ATCAGCGTGT
TTGGTGTATC CTTTCCATCC CAAGAAGAGG GTCGCACGGC TCAATTCGAT TATCTCAAGG
TCTACAGTAG TCTCGTACTT GTATCAGTTT CAACTACGAT TCTAAGGTAT GCGTACTGAT
TTTTTACGTG CGATGCCTAT TATGAATAAT CTGCTCACAA AATGAATTCA TGGTATGAAC
AGATCTGAAT GGAGCGGTAA GCTTTTTGCT TTTGTGAGAT CAAGCGAGTG GACTGTTGAA
GTAATAGGCT GGATTGCTTT CTCAAACTAT TCGATATGTT CTTACCTCTA CAGTTACCGG
TGGAACTCGG GCTGCCAAAC ATGTATTCTC TTCCATGGTT TACAGCGTGC TACGGGCACC
CATGTCGTAT TTTGATACCA CTCCGATGGG GAGGATTCTG AACCGATTTA CATACGACAT
GGATGTGGTA GACATTTTGC TGACCCAGTC CATGAGCATG TTCATGATAT CATGTAGCTG
GTATTTTGCT GGAGTTATTG TAATGTGCAC AATTCTTCCT TGGATAGCGT TGGCAATCTT
TCCCGTTACA GTGATTTATT GGGTGCTGAT GCTGCATTAT CGAAAATCAG GATCAGATCT
ACAACGTTTG GATGCTGTGT CACGTTCTCC TATCCAAGCG ATGATATCAG AAGGTAAAGA
ATGCTGTCAC CTCATTTTCC TGTAGTATCC AGTTTAAACG ACTGATCCGA TTTTTGTCTT
TCAGGGCTCG ATGGATCGGC CAGCATTCGA GTATTCCAAC AAGAATACAA TTTTTTGAAA
CGATTTCGTG CATTGACCGA TCTCAACAGC TCTGCCTTGC TCAATTTCGT CTCTGCTCAA
CGATGGCTGG GTGTGCGTAT CGAGCTGCTG GGTTCTTTGG TAGTCCTTAT ATCTTCATCA
TTAGTTGTAA CTTTGAACGA TTCCCTGCGG TTGGATCCTG GAATTGGTGA GTGGAATTCT
TTTTAGAAAG TATCATGCCA TCAAATACTG AGTATACTCA ACAGTAACTT TCAAAGTTGG
ATTACTCATT ATCTGGTCGA GTAACTTCAC GATAACCTTG GGGTTCCTGG TAGACACATT
CGCGGAAACT GAAGCTGCTA TTACGGCGAT AGAAAGAGTT GATGCCATGG CTGAGCTCCC
TCGCGAAAGA TCGATGAAGA CGGACCCAGA ACACACTGTG CGCTCATCTT GGCCAGAGAA
AGGTGCGATC GAATTTAAGA ATGTTTGCTT GCGTTACCGG GCAGGGCTTC CTTTGGCGTT
GGACGGGTTG TCTTTTCGAA TTCCCCCAGG TCTGAGTTGT GGCGTTGTAG GACGCACTGG
TGCTGGTAAG AGCTCAATCT CAGTCGCGCT TTTTCGACTC GTCGAAATAG AATTTGGTGA
GATCCTTCTC GACGGTATAA ATTTGGCTAC TTTGGGATTA TCTGATGTTC GGGGTCGGCC
AAACGGGATG ACCATCATTC CGCAGGATCT ATTCTTGGCC GGTACAACTT TGAGGGAATG
CTTGGACCCT TTTGGTGTAC GAGAAGACGA GGACATCTTG CAAGCTCTCA AAGCAGTTCG
TTTGGCAAAG TCGAACGATT TGGTTTCAAA GCTAGAGACG GCAGTGCACG AAGGAGGCTT
GAACTACAGT GTGGGAGAAC GGCAACTCTT GAACCTAGCA AGGGCACTGT TGTCCAAGCC
CATGGTGCTG ATTTTAGACG AGGCTACAGG TAGTGAAAAG CAGACCATGT TGTCCTATCT
ATATCCATTT GTCGTATCTC ACTGGACCCC GATCACTTTC CCTACAGCTA GCGTTGACGG
GGAGACTGAT GCCTTTATCC AGCGGATGTT GCGGACGAAG TTTACTGACA CGACGCTAAT
CACGGTGGCG CACCGGTTAA ATACTATCAT GGACTACGAC TTGGTTTTGG TCATGGACCA
AGGCAAAGCT GTCGAGCTG
 
Protein sequence
MKSGVVMRAA VINVLYEHSL KLTPRGRAGL TSGEVTNLIA VDTQKLYEVA QEGHLIWALP 
LSITLVTVFL ILILGPITLI GIAVLILFVP LVERVTSRML KIRQRRAKMT DQRVEIVSTM
LQGIKVTKLS NLEESYETRV AEARQLELNE LRKELAVWAL TLVITVSSPV VASSATFAAY
VLVDERNVLT AAETFSVLLL FGALRFPINY AGRLAGKMVQ ALSAITRINS FFERETRDVD
FSLVPSNNVA LSESSDIPLI LSRAAFFLQP ADECVRNGQD NGKKNIHKGS FELSAASFKV
STFDFTIRKG EVIAICGPVG SGKSTLIHGI LDEVPSIEGT EVSRYGRTAF VPQTPFILNT
TLRENILFGL PFESSVYERV LDVCCLRQDI QQLGESKDHT EIGERGVTLS GGQKQRVSLA
RAAYARPDLV LLDDPLSALD AGTAKLVFER LIKSTGSYFS DTAVVLVTHA SHFLNRVDKA
LIIVGGKNEF YGSWNDLATY HANDFETNVA IDFLRTSVQE VASESTDSAD QNKDEKLLCK
QVDVKDTLMA AEEREHGLSS LSVWLLWFKR AGGFYFIFFQ VLFMGIDRFS YVATEYWLAR
WTQSADKPIS VFGVSFPSQE EGRTAQFDYL KVYSSLAGLL SQTIRYVLTS TVTGGTRAAK
HVFSSMVYSV LRAPMSYFDT TPMGRILNRF TYDMDVVDIL LTQSMSMFMI SCSWYFAGVI
VMCTILPWIA LAIFPVTVIY WVLMLHYRKS GSDLQRLDAV SRSPIQAMIS EGLDGSASIR
VFQQEYNFLK RFRALTDLNS SALLNFVSAQ RWLGVRIELL GSLVVLISSS LVVTLNDSLR
LDPGIVGLLI IWSSNFTITL GFLVDTFAET EAAITAIERV DAMAELPRER SMKTDPEHTV
RSSWPEKGAI EFKNVCLRYR AGLPLALDGL SFRIPPGLSC GVVGRTGAGK SSISVALFRL
VEIEFGEILL DGINLATLGL SDVRGRPNGM TIIPQDLFLA GTTLRECLDP FGVREDEDIL
QALKAVRLAK SNDLVSKLET AVHEGGLNYS VGERQLLNLA RALLSKPMVL ILDEATASVD
GETDAFIQRM LRTKFTDTTL ITVAHRLNTI MDYDLVLVMD QGKAVEL