Gene PHATRDRAFT_49028 details

Gene Information

Locus tag: PHATRDRAFT_49028
Symbol:
ID: 7195284
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011688
Strand:
Start bp: 350407
End bp: 353656
Gene Length: 3250 bp
Protein Length: 1064 aa
Translation table:
GC content: 47%
IMG OID:
Product: predicted protein
Protein accession: XP_002183596
Protein GI: 219126715
COG category:
COG ID:
TIGRFAM ID:
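
The Gene Length field can be cross-checked against the Start bp and End bp fields. The sketch below assumes the coordinates are 1-based and inclusive, which the record itself does not state; `span_length` is a hypothetical helper name used only for illustration.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a coordinate span, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


# Values copied from the Start bp and End bp fields of this record.
gene_length = span_length(350407, 353656)
print(gene_length)  # 3250, which agrees with the Gene Length field
```

Under a 0-based, half-open convention the same coordinates would give 3249, so the agreement with the listed 3250 bp is consistent with the inclusive assumption.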


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATGCAAG ATGAGAATGA AATTATCTTA ACGCGCGAAG AGAATATACC GTTGTCTGAT 
GTCGAAGAAT CGCCGCACTT TATGGAATCC TCGTTAACGA CACAGCGAAA AAAGGCCGAC
TTGGCTGCGA CCGCTGCTCT AGACCACAAT GATGGCGACG CTACATCGCT CCAAAGTGCG
CAAAACAGAT TATCCATCAA GCTATCACAG AAACCAACTC CCTCGTGGAG ACAGGCTTTA
GCAGATACGT TGTTTGAGCG GAATTATGAA AAGAAGAAAC TTTTTAATGC GTATTCCTAC
TCCAGCAATA CTTCGACGAG CACCTTCACG CTGATTTCTG GCCAGACCGG CAGTGGCAAA
ACGCGCCTTG CACAAACGTT GCGGAAGCCT GTCGAGGATG CGGGCGGTTA TTTCCTGACA
GGGAAGTTCG ACCAACTTCG CCGTCCTGTT CCTTATATAG CGTACGGATC CGCGTTTACA
GAATTTACAC ATCAGGTTAT TGCCCGTGGA AAGGATACTA CACAATCGAT GCGTGACCGG
ATCAATATCG CAGTAGGCAA CGAAGCACAC GTCTTGACAA GCGTCATACC TGCTTTGGAA
ATTCTGATCG GTAAAAAAGA CGACGAAGAA TCCAGCATGC AACGGGATGA AGCTATACAA
CGTTGGTTGT TTACTTTTCG ACGTTTTACG AGAGCGGTAT GTTCGCCTGA AGAACCTGTG
GTTCTTTTGC TAGATGATCT TCACTTTGCC GACAAGTGCT CGCTCGACAT TCTTGCCTTT
ATGGTGGCAG ACTCTGAAAA TCGAGGCTTG GTGGTCATTG GAACGTGCGA CGACAGCGAG
ATTACGGCCG AGAGCTACTT GTCTACAAAG CTTCGAGAAA TGGAAGATAA ATCCCAAGCC
GAAATAACGA ACATTGCGCT CGGCAACATG GGCAAAGAAG CCGTAAATCA TGTGATTTCG
GAAATAATGG GATCGAAGGA CAGCAAAGCA GTGACCGAGT TTGGAAATCT TGTTTGGCGT
CAAACAGAAG GAAATGTTTT TTATCTCATC GAATTGCTAC GATGGCTTCA CAACTCAGAC
CTTCTCTATT TTGACGACAA GTCCGCTGAA TGGGTGTGGG ATGCAGAGGA AATCGATATA
ACACTATCGG ATCGGAAACT CAATGGCTTC TTGTTCGATA GGCTACAGCA GCTTCCACGA
CATTTAAAAG ACGTCCTTAA AGTCGCGTCT TGCCTGGGTC CGTATCCTGA TGCTGTTCTT
ATAGAGCACG TCCTTGACCT TTCGGTTGGC CATTTGTTGG AAGAAGCTAA GGAGCTTGGT
ATCTTAAGTT ATGATGAACA TACTAGAAGT TACATGTTCG AAAACGATTG TGCACAGCAA
GCTGCGTACG AACTGATTCC TGTACACGAA AAAGAACTTT TCCATTTGGA AGTGGGACGT
CGCTTGTGGC GAAAGCTTTC AAGTAAAGAC GAGCTCGACA GAAATATCTT TCAAATCCTG
TCTCAAATGC AATTCGGTCG TCGATTGATA GCAAGAGACA ACGAGAAAAT AAGAGTGGCA
TCGCTCTGTC ATGTCGCTGG ACTAAAAGCT GCAAGATGCT CTACTTTTCG AGTCGCAAAT
GCCTTCCTTT CGTTTGGTGT CAGCTTGTTA AGCTCACGAA GCTGGAGAGA CAATTACGAC
CTATCACTAG CACTTTACAA TGCAACTGCT GAAACAGAGA TGTGCCTCGC AAACTTCGAG
GCCATGGAAA ATCTGCTGAA GGACATATTT ATCCATGCTC GTTCGTTTCG AGACAAGCTT
CCGGCATACA GCACACAAAT GTATGCTTTG AATGTACGGG ACCGGCAGCG CGAATCATTG
GATCTTGGGG TCGAAGTTCT CAGAGGACTT GGTGAAAAGT TTCCTCGTCG TTACTGCAAA
GCGAGGCTGT TATCGGAATT GCGAAGTGTC AAGATTTTGC TTCGTGGCAA GAGCGACGAA
CAATTGTTGC GCTTGCCTGC GATACAAAGG AATGACAAGC TCCAAGCCCT GCAAGTTCTA
CAGCTTATGG TCCTTACTGC TCTTTCCACT CATCCAGACC TTGCTCCGTT TGTTATTTCT
CGTATGGTCA AAATCACGCT GGAATACGGT ATGAGTCCTT TCGCTTCCGG CGCCTTTGCA
ACATACGCAA TGATACGTAT CCCATTGGGA CCGTATGGTA ACGTTGACGA AGCCATTCGA
TTCGGAAACC TGGGGATGGC TGTACTCGAG AGGTACAACA TATTAGAATA TGCGCCGCGC
GTGTACGCTG CATACTACGG ATGCGTCTGG TGCTGGAAGT TTCCTTTGAA GGATTCAATG
GAACCCTTGC TTCGAGCTCA CCGAATTGGT ATTCAAACCG GCGACTCTGA GTTTGCGGTT
CTTTGCGCCG ATCTCTACCT CATGAATGCA CTCGAAGGGG GCGTACCACT TGATGCTATC
GATCGTGAGT GGACTGGTTT CTTTGATCTC ATGGTGTCGC GGCGACACGA GACAGCCATA
GCATTTACTC TTCCTTGGGC TCAAGCGATT CACCACTTTA TGGGGTACAC GGACAATCCA
TTGCTTTCTA AGGGTGATTT GGTTGATTAC GACGAGGCTA TGGAGCGTAG CGTTCAACGG
CAAGCGTTCA TTCAAGTTGT TTCGATCAGC TGCACTCGTA TGATGGTTTC CTACGTTTTT
AATGACTATG ATCAAGCCGC AAGATCGGCT GAGACACTTC CTGATCTGCT AAAAATTCCG
CCTAGTTTCG AACGAGTATC AACACTTTTC TATTCTACGT TGACATTTCT CGCCGTTGCA
CGGACTGGTA AAAATGTGCG ACGACACGTT GGCAAGGCAA AGGAAGCCAT CAAGACGTTT
CGACGTTGGG CGACGGATTC GCCCAAGAAT TGCCTTGACA AGCTCTTTTT ACTGCAAGCT
GAGCTTTTTT CCGTCCTCGG AAAACATTCC CGAGCATACG AAAAATACAT TGCCTCAATT
GCGTGTGCCA AGGACCAAGG ATTTTTGCTG ACGCACGCCT TAGCCAACGA GCGTGCGGCT
CGCCATTTGT ATGGTCTTGG GCGTACCGAC GAAGCTTTTC TGTTCTTTGA AAATGCGTGC
AAGTGCTACG GCGAGTGGCA TGGGCATGCT AAAGTCACAC GACTCAAAGC CGAAGTTGAA
GAACTTTTTT CTTGACTGGG GAAAGAAGTA GATACTGTAA CATACATTAT CAACATTATT
CTACAATTCG
 
Protein sequence
MMQDENEIIL TREENIPLSD VEESPHFMES SLTTQRKKAD LAATAALDHN DGDATSLQSA 
QNRLSIKLSQ KPTPSWRQAL ADTLFERNYE KKKLFNAYSY SSNTSTSTFT LISGQTGSGK
TRLAQTLRKP VEDAGGYFLT GKFDQLRRPV PYIAYGSAFT EFTHQVIARG KDTTQSMRDR
INIAVGNEAH VLTSVIPALE ILIGKKDDEE SSMQRDEAIQ RWLFTFRRFT RAVCSPEEPV
VLLLDDLHFA DKCSLDILAF MVADSENRGL VVIGTCDDSE ITAESYLSTK LREMEDKSQA
EITNIALGNM GKEAVNHVIS EIMGSKDSKA VTEFGNLVWR QTEGNVFYLI ELLRWLHNSD
LLYFDDKSAE WVWDAEEIDI TLSDRKLNGF LFDRLQQLPR HLKDVLKVAS CLGPYPDAVL
IEHVLDLSVG HLLEEAKELG ILSYDEHTRS YMFENDCAQQ AAYELIPVHE KELFHLEVGR
RLWRKLSSKD ELDRNIFQIL SQMQFGRRLI ARDNEKIRVA SLCHVAGLKA ARCSTFRVAN
AFLSFGVSLL SSRSWRDNYD LSLALYNATA ETEMCLANFE AMENLLKDIF IHARSFRDKL
PAYSTQMYAL NVRDRQRESL DLGVEVLRGL GEKFPRRYCK ARLLSELRSV KILLRGKSDE
QLLRLPAIQR NDKLQALQVL QLMVLTALST HPDLAPFVIS RMVKITLEYG MSPFASGAFA
TYAMIRIPLG PYGNVDEAIR FGNLGMAVLE RYNILEYAPR VYAAYYGCVW CWKFPLKDSM
EPLLRAHRIG IQTGDSEFAV LCADLYLMNA LEGGVPLDAI DREWTGFFDL MVSRRHETAI
AFTLPWAQAI HHFMGYTDNP LLSKGDLVDY DEAMERSVQR QAFIQVVSIS CTRMMVSYVF
NDYDQAARSA ETLPDLLKIP PSFERVSTLF YSTLTFLAVA RTGKNVRRHV GKAKEAIKTF
RRWATDSPKN CLDKLFLLQA ELFSVLGKHS RAYEKYIASI ACAKDQGFLL THALANERAA
RHLYGLGRTD EAFLFFENAC KCYGEWHGHA KVTRLKAEVE ELFS
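
Both sequences are printed in blocks of ten characters, six blocks per line, so the spaces and line breaks have to be removed before the text is passed to other tools. A minimal sketch of that clean-up follows; `join_blocks` is a hypothetical helper name, and only the last protein line is pasted in to keep the example short.

```python
def join_blocks(text: str) -> str:
    """Remove all whitespace (spaces and line breaks) from a blocked sequence."""
    return "".join(text.split())


# Last line of the protein sequence in this record.
last_protein_line = "RHLYGLGRTD EAFLFFENAC KCYGEWHGHA KVTRLKAEVE ELFS"
tail = join_blocks(last_protein_line)

# The protein listing has 17 full lines of 60 residues before this last line.
total_residues = 17 * 60 + len(tail)
print(total_residues)  # 1064, which agrees with the Protein Length field
```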