Gene PHATRDRAFT_42729 details


Gene Information

Locus tag: PHATRDRAFT_42729
Symbol:
ID: 7196363
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011669
Strand:
Start bp: 915431
End bp: 918453
Gene Length: 3023 bp
Protein Length: 1003 aa
Translation table:
GC content: 48%
IMG OID:
Product: predicted protein
Protein accession: XP_002177189
Protein GI: 219110875
COG category:
COG ID:
TIGRFAM ID:
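
The Gene Length value follows from the Start bp and End bp coordinates if both are treated as 1-based and inclusive. That convention is an assumption here (the record does not state it), but it is consistent with the 3023 bp listed. A minimal sketch, using a hypothetical helper name:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature from 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be greater than or equal to start_bp")
    # Both endpoints are counted, hence the +1.
    return end_bp - start_bp + 1


print(gene_length(915431, 918453))  # 3023
```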


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGCAAGA AAAGGAATGC CAAGAAACGA CGTCGGGAAC AACGCCGCCA GATGGAAGAA 
CATTCCAGGC TTACTAATGA TTCCTTCCAG GACCAGCAAA GAGCTACCAG TACGAACAAA
CATTCGAAGG GGGATGGTGC ACCACCATCA TTTGACAAGA GTCGCAGCGT CAATTCATCT
TCTCTCCCTA CAGCAGTGGA TGGCTTCGGT TCCCAAAAGC GTTCCATCGC TGCAATCCAA
TTTGTTTACC CCGAACGGAA TGGAAGAAAA CTTTACAGAT TTCAAAAGGA GCTGCTCGAC
ACCTTCATCG TTCCTCCCCA AGAAAAGATT GAAAAAGGGC TCCCTGAATT CGTCGCGACG
AACGTGAGTT TGAACGACAT TCTATACCCG GACAGCAACA GAGAAAAACA ATCCGATTCT
GGTAGTCAGC TCAAAGGCGA CTCCTCTCAG AAATCTCCAA AAATCCTGCA TGGCGAGCAT
CACACCAACA ACGTAGTAAA GGAGCTGCGT CAGAAGAACT TGGATGTACA GTCACTTTCT
GGGCAATCGC AAGGCCAGCT GTCGGGGATG TCCGTGGAAG ATGCCGTTCG CGACAAACTT
GGATGTCGGC CTCGTGCGAA TTCGACAGAT GGAGAACTCA ATTTACCGCA ACGAGGGCTT
TGTGATGAGC GGAAGGTACT AGAGTCTTTC AAATGGATCC CCACAAATGT CAATTTGTCT
TATCCGAAAG GCTTTGTCAA TCTGGGTAAT ACGTGCTTTC TGAATAGCAC TGTACAGTGC
CTCGCCTACC TACCACCATT CTGCCAATCC TTGCTTTCAA TATTATCTCA TGAATCAAAA
CATGGTGAAA AAAGAAAGAC CAGTCAAGGA AGGAAGGTAA CATTTATCTT GCGCTCCCTC
TTTTCTCAAG TACACGGCAT TGATGGTGGT ACCACACACT CTGGAAGTTC ATTGGCTCCT
CGTGCCATTG TCCAAGCAGT GCCAACTCTT GGTTCCTGCG GTAGTCGGAA AGGGTACAAG
TTTCGACCCG GTCGACAAGA AGATGCACAT GAGTTTTTGG TGCATTTGCT AGATGCCATG
AACGACGGTG AGCTGAAAGA AGCGGGAATC AACCAGAATG CAAGTGGATG GCGTGATCGA
TTGCCAATTC CTCGCCTAGA TGAGACGACT TTCATACATA GAATATTTGG TGGATATTTC
CGAAGTCAAG TTCGCTGCAC ATCCTGTAAC AATCGCAGCA ACACGTACGA TCCCTTACTA
AACTTGTCTC TGGAAGTCAG TCGCAAGGCG TGCAACTCAG TTGCTCAGGC TTTGCACGAG
TTTACTCGCA AGGAAACCTT AGACTCTCAG AATCAGTGGA AGTGTTCGGG CTGCAAAAAA
TATGTTTGTG CTACAAAGCA GTTGACTGTA TTTCGACCAC CTTTGTCTCT TTGCATTCAA
TTAAAGCGAT TCACATACAG CGGTAGACTA AAATTTAGTG TTGGCTTCGG GAGCTTCGGC
AATGGGGGAG GAGGACAAAA GATATCGAAA TCTATCGAGT TTCCAGCCCA GCTGAAGCTT
CCTTTGAGTG ATGGTCGCTC CTGTGGGTAT TCCTTGACCG GAATTGTGAT CCATGTAGGA
GGTAGTGCTA GTTCTGGACA TTACACGGCG TATGTTCGCA AGCCAGGTGG TGGTAGCAAA
TCACAATGGT TTCACATGGA CGACTCTTTC GTCGAAGCTG TATCGGAACA AACAGTCCTT
CGACAAAGAG ATGCCTACTT GCTGTTCTAC TGTCGCGAAG AAGTTAAACT TGAGTTTCCA
ACACCTCCAA TGTCGGCCAA GGAAGCGCAA GAACTTGGCA GAGCCAAAGC TCGCGCCCGC
GCGGACAGTT TAACAGAATT ACAAGCAAGC GCATCGACAT CTTTAATCAC AATCACTAGC
AAGTCGCTTT GCCATGAAAG TACAGCGGCT TTGGAACAGA AAAGGAAGGT TAATAGAGTG
GAGGAGAACT CGAATGGTAC GGTGGCTTCT TTTCCACCAA ATCTGCGAAA GCAGGAACAA
AATGAGAATG GAGACTTGTT GGAGAAACAA CGGAATGAAA CATCCTTTTC AGCCCCAGGC
AAGAAAACTG ACAGCGCCGA ACTACTGCCA ACGCCAATCA CAGCTCCGGA TTTTGGCTTT
GAGATTCAAC GCTCGTCAGT AAAGCAATTT AGCCGTAGGA GTCCGGATCC GTCGAAAGTG
GTAGTTTCCT CGAAATCCCA GCGGGAAAGA TACGGAGGCA ACATTCAGCT CGGACAATCA
CCAATGATGA AACCTGCCAT TGCACTTCCG GATCAAGGCG AAGAATCCTC GTCGGAAGAG
TCCTCACCCA GCGACGATTC TTCCGTACAA GACCAGAACG ACAGCACCAG TTCCGCAAGG
ATTTCTCTGC CGGTTATTAA GTCGGAGGCC AAGACGTCAT TCGCCGACGT TTCTTCCTCG
TCCGAAAACG AAGAAAAAGC GTTTAGGACA ATAAATATGA CAGAATCCGA AAGTCCACAT
GCTGTCCAGA AAAAGGCGTT GGAGAAACCG CGGACTCGCA TCGTGCTTGA TCGTGGTGAA
GGCCGTGAAA AGGTGGAAGT CATGATGGGT CCGCGCTCCG AGACCAAGGC GTGGACGCCC
AGGGCTGGGG CGGTTACAAA AAGTGAGGAC TATGCGCTTT TGGGGAACCA GCGAGTAGGA
AGGTGGGATG ACGAAGGCAA CGATGTTGCA ACTCGCCAAC ATGACCGCGG CCGCGAGAAC
CTTATCCAAC AAATGGACAA GAAAGAATCC AACCGAAAGC GCAAAATGTA CTTAGATCGC
TGGGACGCCA TGCTGGATCA AGGCCAAACC AAAAAGGTGA AAGAAAAGAC TGATTCTATC
AAGCCGACGA CGCCCAAGAA AAATGTGTTC CAGCGAATCC AAAGCAGCGT GCAGCGTATG
AATCGAGGCC GTGCAAAAGG ACACTTTCGA CCCGAGACGC AAAAGAAGAA ACGCGGTCGA
CGCAGTCTCT GACAGTGTAC AAA
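
The record reports a GC content figure without saying how it was derived. One plausible reading, assumed here, is the share of G and C bases across the whole gene sequence. The sketch below uses a hypothetical function name and strips the spaces and line breaks used in the listing before counting:

```python
def gc_content(sequence: str) -> float:
    """Percentage of G and C bases in a DNA string.

    Whitespace (the 10-base grouping and line breaks of the listing) is
    ignored. Assumes the sequence contains only A, C, G and T.
    """
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("sequence is empty")
    gc = bases.count("G") + bases.count("C")
    return 100.0 * gc / len(bases)


# Toy input, not taken from the record: 6 of 8 bases are G or C.
print(gc_content("ATGC GGCC"))  # 75.0
```

Passing the full gene sequence above as one string would give a value to compare with the rounded figure in the Gene Information section.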
 
Protein sequence
MSKKRNAKKR RREQRRQMEE HSRLTNDSFQ DQQRATSTNK HSKGDGAPPS FDKSRSVNSS 
SLPTAVDGFG SQKRSIAAIQ FVYPERNGRK LYRFQKELLD TFIVPPQEKI EKGLPEFVAT
NVSLNDILYP DSNREKQSDS GSQLKGDSSQ KSPKILHGEH HTNNVVKELR QKNLDVQSLS
GQSQGQLSGM SVEDAVRDKL GCRPRANSTD GELNLPQRGL CDERKVLESF KWIPTNVNLS
YPKGFVNLGN TCFLNSTVQC LAYLPPFCQS LLSILSHESK HGEKRKTSQG RKVTFILRSL
FSQVHGIDGG TTHSGSSLAP RAIVQAVPTL GSCGSRKGYK FRPGRQEDAH EFLVHLLDAM
NDGELKEAGI NQNASGWRDR LPIPRLDETT FIHRIFGGYF RSQVRCTSCN NRSNTYDPLL
NLSLEVSRKA CNSVAQALHE FTRKETLDSQ NQWKCSGCKK YVCATKQLTV FRPPLSLCIQ
LKRFTYSGRL KFSVGFGSFG NGGGGQKISK SIEFPAQLKL PLSDGRSCGY SLTGIVIHVG
GSASSGHYTA YVRKPGGGSK SQWFHMDDSF VEAVSEQTVL RQRDAYLLFY CREEVKLEFP
TPPMSAKEAQ ELGRAKARAR ADSLTELQAS ASTSLITITS KSLCHESTAA LEQKRKVNRV
EENSNGTVAS FPPNLRKQEQ NENGDLLEKQ RNETSFSAPG KKTDSAELLP TPITAPDFGF
EIQRSSVKQF SRRSPDPSKV VVSSKSQRER YGGNIQLGQS PMMKPAIALP DQGEESSSEE
SSPSDDSSVQ DQNDSTSSAR ISLPVIKSEA KTSFADVSSS SENEEKAFRT INMTESESPH
AVQKKALEKP RTRIVLDRGE GREKVEVMMG PRSETKAWTP RAGAVTKSED YALLGNQRVG
RWDDEGNDVA TRQHDRGREN LIQQMDKKES NRKRKMYLDR WDAMLDQGQT KKVKEKTDSI
KPTTPKKNVF QRIQSSVQRM NRGRAKGHFR PETQKKKRGR RSL
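
The protein sequence can be related to the gene sequence by translating codons in frame from the first base. The Translation table field is empty in this record, so the sketch below assumes the standard genetic code (NCBI translation table 1). The names `CODON_TABLE` and `translate` are illustrative. Translation stops at the first in-frame stop codon.

```python
from itertools import product

# Standard genetic code (NCBI table 1, an assumption for this record),
# listed in the conventional T, C, A, G codon order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 0 until the first stop codon or the end.

    Whitespace is ignored. Codons with characters other than A, C, G, T
    raise KeyError.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
print(translate("ATGAGCAAGA AAAGGAATGC CAAGAAACGA"))  # MSKKRNAKKR
```

The ten residues produced match the start of the protein sequence listed above.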