Gene PHATRDRAFT_49381 details


Gene Information

Locus tag: PHATRDRAFT_49381
Symbol:
ID: 7195772
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011690
Strand:
Start bp: 99226
End bp: 102641
Gene Length: 3416 bp
Protein Length: 969 aa
Translation table:
GC content: 44%
IMG OID:
Product: predicted protein
Protein accession: XP_002184182
Protein GI: 219127938
COG category:
COG ID:
TIGRFAM ID:
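
The Start bp, End bp and Gene Length fields above agree with each other if the coordinates are read as 1-based and inclusive. That reading is an assumption, since the record does not state its coordinate convention. A minimal sketch of the check, using a hypothetical helper name `span_length`:

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


# Values copied from the Start bp and End bp fields of this record.
gene_length = span_length(99226, 102641)
print(gene_length)  # 3416, matching the Gene Length field
```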


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.158773
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
AATTGCGTTG CAGGTGCCCT TCTATTGCAA CATGGATTGC TCTTTGAAGC AATTGCTTGA 
CAACAACATT GCCAGCAAGA AGGAGTGGAT GACTGAGTGC ATCCCTTTTC CATCGAAAGA
AGGGCAGGAA GTTAATCTAT GCGATTACAT TGGTCTAGAT TGTAAACGCA TTGGAGAGTG
TTTTATGTTT CCGGAAGGAT ACGACTCTAC CTCAAATTGC CGAAATAGTC TAGCCAAGGC
CATTAAGATT GCTGCTGCAA ATGGAAAGTT TCCCTTAGTC GAACGAGGAT GGGATGCAAA
AAAGAGGCGA CTACGTTTTG AATGCTTTAG AAGTAGATCC CATGATGCGG ATAGGTACAA
AACAAAACAA AACAGTCTAT CCAATGTGGC TATAAAACAT CGCAACGTTT CCTTCATACG
TCCACAAAAG GGAAAAGAGT GTCCTTTTCG GTTCAGTATT TTTTGGATGA CAGAACATAA
ACAATGGTGT CTTTTTGGAG GTGTAAAAGG CAGCTGCAGG TTTCACTGTC ATCATTTACC
AATGGATCCA TGTGAGGTAA AAAAGAGTAT TTCCTATATT GACGGTGGTG AAGTAAAAAT
AGCACTTGAT GCAGCAAAAA GTAATGCCCC ACCAAGCGTG ATTGGGCGGC TGCTGAACAT
ACGAACTGGA GATATCTTGT CAGGTTCATC TCTAAAAAAT ATACGGATGC AAGCTGAAAA
AGGTGAACGG AACAAATTTG GTGCAAACGA TAATTTCACG ACGCAAGCGG ATCAACTTCT
AGCTTATCTT GAGAGCACCC CGGATGTCAG CTTCTGTGCC ATATATGATG AACCAGATTC
CCCTTTATTC ACTGTTTACA AGCAGAGGGC AAAAACTGGA CGTCGGCACC TACACACCAG
TACACGGAGT ATCTCTGGAG GAATTGCCCA ACAAGAAGTG CTCAATGAAA AGGTGCTAGA
TGCTATTGAT CCAAGGGGAG AGCTTGATGA CTATATAGAT AGGACGCGGA GGGCGTTCAA
GTTGAAGGGC AACGAAAAAA TGCTTTTAGG AGTGGCCTGG ACCAACAACG AGAGTAGAAG
AATCTTTGCT CGCTATTCCG AGATCATGGT AGCAGATGTG ACAGAAGGTA CCAACAATGC
AAAACGGCCG CTGTTTTTGT TTTCGGGAAA GACATCAAAT CAAAACACGT TTACAGCACT
TTGGGCCTTT CTACCGCAAC AATCTCGTTG GGCCTTCCGA TGGGTGTGGA CAAGATGCAT
TCCACAGCTC TTACCTGAAC AGGGGCTGAA CAGAATGCGT CTATTGATTA CTGATGGAGA
CCCGCGAGAG TATGGTACTT TTTTAGATGC AATACCTACT TGGTATAGCT TGTGTCGGCA
CAAACTATGC CATTGGCATC TACTCTATCG CGGCAGTCTT ATGAAAGCAC AGACTGGAAA
CTGTGGAACA AAAGCAAAAA TTCTATTCCA TGTGGTCCTG AAGTGGATAG AAAGCTGGAT
GACAAAAATT GAGACGCAAG AGGAGTACAA TTTGTCAACT GGGCTTTTGA TTGACTGGCT
GAAATCTCCG GAGGCACTTG ATACAAATTT GGGCGGAATG GGTTGTGCCC TTGTTTCGCA
GATAAATGCA TTTTTGACGT CCTCTCTGTT TCCGCACGAG CAACGCTGGG CTCGATACCA
TTTTCTGAAC GTGAGAGCAT TCAACACGTC CGCAAGTTCC TACGGAGAAG CAGAGAACAG
TGCTCTAAAA CGACGGGGTG ATGGGGTCAA GCCAAACTTT TCGGTGCCAA AAGCAACACG
GGCAATAAAC GAAGGGACTC AATTGCGAAC AGTGAAGAGG CAACAAAAAG CAGTTCATAA
CCTCAATGCT ACAAAGAAGA CAAAGGCAGC AAACTACACC AATATATCCG ACCTAGTAGA
TTGCATACAG GAAACCATAT CCCATGAATT CAATGCAGCC AAAAAATATG ACCTCTTTTG
CCCGGGTCCA AAAGAATTTT GGGTAAAGCG AGCATGGTAC CAAATTCCAA GCGAGACCTA
CCAGGATTTC AACGACAGCA ACTTTTGCCA ATTTATGATT CCACAGTTTG AGCGCACCCG
CATCGTAAAA ATTACGGAAA TTGAAGGTGA ACTCTATCTG GAATGTAGTT GCGGCAAGTT
CCAACGACAA GCTTCTCCAT GTGCTCACAT CTACAAAGTA CTTAACCGAC CACCACAATC
AACAGACGTT TCTGTGAGAT GGACAAAAAT TTGGGATGTT TACCTGCATC GACCTGGATA
TCATGATCTG TCGGACCAGT TAGAGGAATT GTATAAGAAG GAGCGGCCAG GGCCACATTT
CGAAAACACA AATCAGTGGG AAGTTGGAAA GGGTGAGAGA GAGTACAACT ATTTTAAGAG
ATCACTTCCA AGCGAGCCCA CCATTATCCA GAAGTACAGC AGATGGGCTG ATTCTTTTTC
ACGACAACCT GGATGTTATG TGCATAAAAG CACTGAACAG GAAACAGTTC CTGCAGCAAG
CGGTATGGTG CAAGAGTTGA CCAGCCTTTC CCAGGGGTAT GCTATTGAAA CTCAATTGGA
TAGTGAAATG GATGTTGGAG ATGTAACTGT CATGCAGGTT GAAGAAATTG ATTCAAATCT
CTCAAAATCG GGTAAAAGTC CATACACAAA CAATCTTCAT TTTTACGAGG AAATCTCAAA
ACTTGCCAAA TTCAATTCAA AAGCTGCTGA CATAATGACA AAAGGAATGC AGGAAACTTT
GGAATTGCTA CAGAAACATG TTGCAGAAGG GTCAGGTATG GTAGATTACA GTATTGGCCC
AGCTATTGGA AAAGAACCAG TAGGCCAAAG GCTCAGGCCA AGCTACAGTC CTTCAAAGAG
CAAGAATCTA AGAGACAGAC AAAAAGAAAC AAAGGCAAAA TTTTGGTGGC TGAACAAGTA
ATGATAGTGA GACCTATTTG TCATCATGAG AAGCATTCTA TAGGAAAATG TTAAAGACTG
TAGCATCCCT CATTATAATT GTGAGCCATT CTTGGCACTC TTCCTCTGTC AGAACCATCA
TGCCTTATAC TAGAGTCCTC AGACCTTCCA ATGCAAAATC ACTAAGGTGT ACTTGTAGCC
CCAACGTATT GGCCATTTCC AAGCCTTTGG TGTCAACGCC GGCTTTGCTT TCATTGTCAC
CTTTGTTGAC GGCTGAAGGA CGGATACGGC AATAGTCTTT TACCATACTC AGCTCTGTTT
TGAGGACCAA GCCCATACCG CCAGCAGCAG CAACATGCCC GAATCGACGC TTATTGAACG
GCAAAGCGGC GTCTTCCACA GTAACAGCTG TAGTGGCACA AAGCGCGGAA TCCGGATCCC
AACCAAGGCA GCAGCGTTTC TCCCACAGCA TTATCCCCGG CCACAACCAC TACTTG
 
Protein sequence
MDCSLKQLLD NNIASKKEWM TECIPFPSKE GQEVNLCDYI GLDCKRIGEC FMFPEGYDST 
SNCRNSLAKA IKIAAANGKF PLVERGWDAK KRRLRFECFR SRSHDADRYK TKQNSLSNVA
IKHRNVSFIR PQKGKECPFR FSIFWMTEHK QWCLFGGVKG SCRFHCHHLP MDPCEVKKSI
SYIDGGEVKI ALDAAKSNAP PSVIGRLLNI RTGDILSGSS LKNIRMQAEK GERNKFGAND
NFTTQADQLL AYLESTPDVS FCAIYDEPDS PLFTVYKQRA KTGRRHLHTS TRSISGGIAQ
QEVLNEKVLD AIDPRGELDD YIDRTRRAFK LKGNEKMLLG VAWTNNESRR IFARYSEIMV
ADVTEGTNNA KRPLFLFSGK TSNQNTFTAL WAFLPQQSRW AFRWVWTRCI PQLLPEQGLN
RMRLLITDGD PREYGTFLDA IPTWYSLCRH KLCHWHLLYR GSLMKAQTGN CGTKAKILFH
VVLKWIESWM TKIETQEEYN LSTGLLIDWL KSPEALDTNL GGMGCALVSQ INAFLTSSLF
PHEQRWARYH FLNVRAFNTS ASSYGEAENS ALKRRGDGVK PNFSVPKATR AINEGTQLRT
VKRQQKAVHN LNATKKTKAA NYTNISDLVD CIQETISHEF NAAKKYDLFC PGPKEFWVKR
AWYQIPSETY QDFNDSNFCQ FMIPQFERTR IVKITEIEGE LYLECSCGKF QRQASPCAHI
YKVLNRPPQS TDVSVRWTKI WDVYLHRPGY HDLSDQLEEL YKKERPGPHF ENTNQWEVGK
GEREYNYFKR SLPSEPTIIQ KYSRWADSFS RQPGCYVHKS TEQETVPAAS GMVQELTSLS
QGYAIETQLD SEMDVGDVTV MQVEEIDSNL SKSGKSPYTN NLHFYEEISK LAKFNSKAAD
IMTKGMQETL ELLQKHVAEG SGMVDYSIGP AIGKEPVGQR LRPSYSPSKS KNLRDRQKET
KAKFWWLNK
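
Both sequences above are printed in space-separated blocks of ten residues, so they need to be joined before lengths or base composition can be compared with the Gene Length, Protein Length and GC content fields. The following is a minimal sketch only. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the short string in the usage lines is just the first two blocks of the gene sequence, not the full record.

```python
def clean_sequence(block_text: str) -> str:
    """Join a block-formatted sequence by dropping spaces and line breaks."""
    return "".join(block_text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string (0.0 for an empty string)."""
    seq = clean_sequence(dna)
    if not seq:
        return 0.0
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# Usage on the first two ten-base blocks of the gene sequence only.
sample = "AATTGCGTTG CAGGTGCCCT"
print(len(clean_sequence(sample)))  # 20
print(gc_fraction(sample))          # 0.55
```

Run on the full gene and protein text, `len(clean_sequence(...))` gives values to compare against the 3416 bp and 969 aa listed in the Gene Information fields.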