Gene PHATRDRAFT_43016 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_43016 
Symbol 
ID7196229 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011669 
Strand
Start bp1801809 
End bp1805526 
Gene Length3718 bp 
Protein Length1066 aa 
Translation table 
GC content49% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002177376 
Protein GI219111249 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CGAACTACGA GAAGGGTATA CGACGCCGCG TCTCTACCTT TTACTCTACA GGAGCCAATC 
CTACCGCTCT CCGCTCTCCT AGTACTCTCA TTGTACGGCA CCAATCGTTT CGGCATCGCG
AGTAGTTCAC TGTCGAATCA TGTCGGAAGA ACCACCGGTG GGGGTCAATC CGAATGCGGA
AGATGAAACT CCCGCTCCAG AAGGGGAGAA GAAGCTATCG AAGAATCAAC TGAAAAAGCT
CGCCAAGGGA AAGGTACGAA GCGGCCCTGG GCGAACTCTG ACGTTTCGGG TCGAATGCGG
GTGTTTAAAT ACCAGCTTAC CTTTTCTCTT GTCCATTCCG TAGGACAAGA AAAAGAAAGA
CAAGCCTCAA TGGAACGCGC CGAGTAAGGA AAAAGTCAAA GCTCCGGTCG CTACGACTTC
CTTCGTCAAC ACGACCCCAA AAGGGGAAAA GAAAGATCTT TCGGCTCCCA TGGACGCCGC
ATATCATCCG TCAGCAGTCG AAGCCGCTTG GCAAGACTGG TGGGAAAAGT GTGGTTACTA
CAGCTGCGAT CCGAAAGATG CGGTCGATCG CCCCGTAGAT GAAAAGTTCG TCATGGTGAT
TCCACCACCG AATGTGACTG GATCGCTGCA TCTCGGACAC GCCCTGACGG CGGCCGTCGA
AGATACCCTG ACGAGGTGGC ACCGGATGAA AGGACACGCC ACTCTGTATG TTCCCGGTAA
GTTTCCATAG TGCATTTATT CGTTCGTTCG TCGTTGGCTT CTGTTTCCCC ATCACGCACG
GGTCCTCTGG GCAGCCACGG GTGAAATCAA TTCACAATAG ATGGCAATCG GCTCCGTGCA
AAAATTTACG CAAAACAACC CCTCGCACAA CGAGTTCTTC ATCAACGCAC TATTACTCAT
TGTTTTCGGG GCTGACATTT CACAATCACA GGTACGGATC ATGCAGGAAT TGCCACGCAA
TCGGTGGTCG AAAAAATGCT CATGAAGAGC GAAGGAAAAA GCCGGCACGA TTTGGGGAGG
GAAGAATTTG TGAAAAAGGT GTGGGAATGG AAGAAGGACT ATGGAAGTAA GATCACCAAT
CAATTGCGAT CATTGGGCAG CAGTGTCGAT TGGTCGCGTG AACGGTTTAC GATGGATGAA
ATGTTGAGTA AAGCTGTCGT CGAAGCCTTT AATCGATTTC ACGAAAAGGG ACTCTTGTAC
CGTGCGGATC GTCTAGGAAA CTGGTCGTGT GCATTAAAGT CCGCCATTTC CGATATCGAA
GTCGACTTTA TTGAACTCGA AGGCCGTACC TTTTTGGACG TCAAAACACA TAAGGGCAAT
CCCAACGACC CCAATGGTCG GTACGAATTT GGGACCTTGA CTTCGTTCGC CTATCCGATT
GAAGACTCAG AGGAACAGAT TGTTGTTGCC ACTACCCGAC TGGAAACCAT GTTGGGAGAC
ACTGCTGTTG CCGTTCACCC GGACGATCCT CGATACACCC ACTTGCACGG GAAGCATCTA
ATCCATCCCT TCAACGGACG CCGAATTCCT ATTGTTTGCG ATAAAGAACT GGTCGACATG
TCCTTTGGTA CCGGAGCAGT CAAAATTACT CCCGCCCATG ATCCAAACGA CTACGAGTGC
GGTAAACGTC ACGAGCTGGA GTTCATCACA ATGTTGACTG CGGATGGTTC AATCAACGAA
AATGGTGCCC CTTTCACCGG CATGATGCGG TACGACGCTC GCATTGCGGT AGAAGATGCG
CTTAAAGAAA AGGGATTATT CAAAGGCAAA GAACCCAACA AGATGCGATT GGGTCTATGT
TCGCGCTCGG GTGATATTTT GGAACCCATG ATTACTCCGC AGTGGTATGT TAACTGCGAC
GGTATGGCTA AGCGGGCTAC CGATGCGGTA CGCAATAAAG AGTTAACAAT TCTTCCAGAG
GAGCAAGAGA AGACGTGGTT TCATTGGTTG GATAACATCA AGGACTGGTG CGTCAGTCGA
CAACTCTGGT GGGGTCACCA GATACCGGCA TGGTTCGCCA CCAAGAAGGG AGAAAGTTTA
GAAAAGAATG ATATGGCCAA CAACGACCGG TGGGTTGTTG CTCGGTCCGC TGAGGTAGGC
AATAGGATGA AAGCATGCAC ACATTTTTTT GTTGCCGTAC TTCGTCTCAC AATACTGTAT
CTCGCAGGAA GCTCTCGAGA AGGCCGCTAA ATTACTTGGC TGTCCCGCTG GCGACATCTC
AATTGAGCGG GATGAAGACG TTCTTGATAC GTGGTTTTCC TCTGGACTGT TTCCTTTTTC
TGTCATGGGA TGGCCCGATG ACACTTCTGA TTTGAAGGCG TTTTATCCTA CGTCTCTACT
CGAGACTGGT CTCGATATTC TCTTCTTTTG GGTGGCTCGT ATGGTTATGA TGGGTTTGGA
ACTAACCGAC ACACTACCAT TCCACACCGT CTTCCTTCAT GCCATGGTGC GGGACAAGGA
AGGAAAGAAA ATGTCAAAAT CTCTTGGAAA TGTGATCGAT CCTCTGGAGG TCATCAACGG
GTGCTCATTG GCCTCTCTGC AAGAGCGTCT GGAAGGAGGC AACCTTCCGG CGAAGGAAGT
GGAACGATCG AAGAAGAATA ACGAGCTCGA GTTTCCTGAC GGTATTCCAG AATGCGGATC
GGATGCACTT CGGTTCGGCC TTATGGCCTA TATGGTCCAG GGACGTGATA TCAATCTTGA
CGTCAAACGT GTTGTTGGGT TCCGGCTGTT TTGCAACAAA CTTTGGAACG CCACACGTTT
TGCACTTCAA TTTGTTGCGG ACTTTACGCC TACTCCGACT CTGTTGGACG ACCTAATGGC
TAGCGGCAAA ATGGCGACGC GAGACAAATT TATGATATCT CGATTGATGA AAGCGGTGGA
AGCCGTCAAC GATTTCTTCT CGAGCTACCG GTTTGGCGAT GCACAACAAG CGGCCTATGC
TTTGTGGATT GAAGATCTTT GCAACACATA CCTGGAACTG ATCAAACCCG TCGTATACGA
CATGAGTGTC AACAACATAG ACAATCGGTG GGCAGCACAA GCAACGCTTT GGATCGCAAT
GGAAACAGGC CTTCGGTTAC TGCATCCAAT GATGCCATTT GTTTCTGAGG AGCTTTGGCA
GCGACTTCCT GGACGTGGGA CGCTAGGCAA AACGGAACCT GAAACTATCA TGCTCGCCCC
GTACCCCGAA ACTCACAACT CTTACAAAAA TGAGGCCGTG GAGCAATCTA TGATGAACAC
AATGGCTGTG GTTAATGCCT GCAGATCACT TCGTCAGTCG TACAACATTG CCAACAAGGT
ACAGACACAT TTCTTTGTGA ACGTATCTGG ACTCGCGCTA CATGCCGTTC TCGACCAACT
GGATGACATC AAGACACTTG GAAAAGCTTC TGCCATTGAT ATTAATCTTT CCCCAGCAGA
CACACCAGAA ACTGTCGGAA CTGCCATTGT CAATGATCAG CTGACTGTTC TGATTGACTT
ACAGGGACTG GTTGACTACA AAGTTGAGAT TGGGCGTCTG CAAAAGAATC TAAGGTCTAC
TCTACCAACA ATTTCGACTC TCGAAATGAA AATGGCTACT GATGGTTATA CAGAAAACGT
TCCAAACGAT CTTCAAAAAG CGAATCTAGA GAAACTTGAT TCGCTTTTGA AAAAGAAGTG
TGATCTCGAA GAGGCTATTG CAAACTTTGA ACGTCTGGCC TTATTGGATA AGAATTAA
 
Protein sequence
MSEEPPVGVN PNAEDETPAP EGEKKLSKNQ LKKLAKGKDK KKKDKPQWNA PSKEKVKAPV 
ATTSFVNTTP KGEKKDLSAP MDAAYHPSAV EAAWQDWWEK CGYYSCDPKD AVDRPVDEKF
VMVIPPPNVT GSLHLGHALT AAVEDTLTRW HRMKGHATLY VPGTDHAGIA TQSVVEKMLM
KSEGKSRHDL GREEFVKKVW EWKKDYGSKI TNQLRSLGSS VDWSRERFTM DEMLSKAVVE
AFNRFHEKGL LYRADRLGNW SCALKSAISD IEVDFIELEG RTFLDVKTHK GNPNDPNGRY
EFGTLTSFAY PIEDSEEQIV VATTRLETML GDTAVAVHPD DPRYTHLHGK HLIHPFNGRR
IPIVCDKELV DMSFGTGAVK ITPAHDPNDY ECGKRHELEF ITMLTADGSI NENGAPFTGM
MRYDARIAVE DALKEKGLFK GKEPNKMRLG LCSRSGDILE PMITPQWYVN CDGMAKRATD
AVRNKELTIL PEEQEKTWFH WLDNIKDWCV SRQLWWGHQI PAWFATKKGE SLEKNDMANN
DRWVVARSAE EALEKAAKLL GCPAGDISIE RDEDVLDTWF SSGLFPFSVM GWPDDTSDLK
AFYPTSLLET GLDILFFWVA RMVMMGLELT DTLPFHTVFL HAMVRDKEGK KMSKSLGNVI
DPLEVINGCS LASLQERLEG GNLPAKEVER SKKNNELEFP DGIPECGSDA LRFGLMAYMV
QGRDINLDVK RVVGFRLFCN KLWNATRFAL QFVADFTPTP TLLDDLMASG KMATRDKFMI
SRLMKAVEAV NDFFSSYRFG DAQQAAYALW IEDLCNTYLE LIKPVVYDMS VNNIDNRWAA
QATLWIAMET GLRLLHPMMP FVSEELWQRL PGRGTLGKTE PETIMLAPYP ETHNSYKNEA
VEQSMMNTMA VVNACRSLRQ SYNIANKVQT HFFVNVSGLA LHAVLDQLDD IKTLGKASAI
DINLSPADTP ETVGTAIVND QLTVLIDLQG LVDYKVEIGR LQKNLRSTLP TISTLEMKMA
TDGYTENVPN DLQKANLEKL DSLLKKKCDL EEAIANFERL ALLDKN