Gene PHATRDRAFT_36310 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_36310 
Symbol 
ID7201634 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011678 
Strand
Start bp131341 
End bp134474 
Gene Length3134 bp 
Protein Length882 aa 
Translation table 
GC content46% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002180948 
Protein GI219120419 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.278303 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACGAAA ACTTCATTAT CCACCCGTTA AGGCCTGAGT ATCATCGCTT GCAGAGTTCC 
GGGGGAACTG GCAGCTCGGC TGGTGGAGAA ACGGAATTCG ATGACAGCCA GGTGCTACGT
CAGACTTTCA TGGTGTATTT GCCTATACTG GTAGTTGTAG TATTGTTGTT CTCACTGATA
CGCCGAAAAT ATCGCCGCGC ATTCAACGTT CGTAGCTGGC TGGATCCAAT TAAGACACCG
CTTGCATATA ACCAGTACGG GTTCTTTTCG TAAGTGCCTC ACGCAGAGTG GAGAAGTGCA
TCGCAGTGTA TAGATCTTGT CTTACTTTTT TTCAGCTGGT TTTGGAAGCT GAAAAGCATT
TCTGATGACA AGCTCATGGA TGAGTGCGGA ATGGATGCTT TGTGCTTTGT CAGAGTACTG
CGAATGGGTT TCAAAATAAG CCTTTTGGGC GTGCTTTGCT CGGCCGTGCT CATGCCGTTA
TATGCCACCG CTGACGACTC GCAAAACACA CGCTCTATCA CCGATAATAT TGCACAGCTA
ACCATAAGCC ATGTTCCTGA AGGATCTCCA CGGCTTTTGG GGGCGGTTAT AGCTGCTTGG
ATTATTTTCG GATACACAAT GCGATTGATA TTGAAAGAAT TCGTTTGGTT CATTGAAAAA
CGACACAAGT TTCTTGCCAC CATCCGGCCT CGCAATTACG CGGTTTACGT GCGAAATATA
CCAAACGAAC TTCGATCTGA TGCCGAGTTA GAAAACTTCT TTCGTCAGTG CTTTCAAAGC
GAATCCATAT TAGAAGGGAA CGTGGCGCTT AAAGTTCCTG AATTGTCGAA ACTTGTGGCC
CAGCGCGAAG CTGCAATTAC CAAATTCGAG CACGCTGTGG CAGTTGAAGA CCGCACGGGC
GAAAAGCCAC AGCACGCTCC TTCTCTCGCG TCGGCAATCA GAGGTTCACT AAAGGGGGGA
GGAGAAAAGG TGGACTCTAT AAATTATTTT GCATCAGAAA TCAAGGAATT AAATCAAGTC
ATTTCGAAAC ACATTGATGA TCTAAACGAA AACAAGACTT GCTTCTCGCA TGATGTGGAG
CAGCCACATG CTTCGGGCTC AAACAGACAG ATAAGTAATG GGGTACGAAA AACCAGAAGC
GATTCAGAAA ACGAAAGGTA TGGTAATATG GAAAGAACGG GTCTCCTTTC TGTATCGGAA
GGCCGTAGCG ACCACAGCGT CAAGGAGGAT TGCTCTACTT CCCCACGAAA TGACATTGAC
ACCCCGAATG GAGACCCGGG CAAAATAGAA ACGGCCGAGA ATGCAGGATA CAATGCCTAT
CGCGCAACCA ACGACGCGAC TCCTGGAGAC TCAGAAGGAA AAACAAATCC TGGTATCGTC
TTGAAAAACG CCACCAAGAA ATGTAGAGAA TCTGCTTCAT CCATTGGAGG TGCTGTCAAA
GACTCTGCCA AGGCAGTGGC GGAAAACTCT ACAAGTTTGT TAAAGAGCGC GGAAGATGGC
AAACCCGAAA GCGCCGGATT TTTGTCCTTT CGCAGTCTCC GTTCAACTCA CGCTGCCCTG
CAGCTGATAC ATCATGGCAC TCCTTTTACC ATGGAGGTCC AGGAAGCGCC AGCACCAGAT
GATGTTTTTT GGTTCAACGT TGGTCGCGGA CATAAGGAGC TGCAAGTAGG TCGACTTATG
TCGTTCGCAG CAACTGCTGT TCTTTGCCTC TTCTGGACAA TCCCAGTTAG CTTTGTTGCG
TCGCTGTCTA CCATCGAATC TCTGCGAGCA GAAGTTGGTT TTGTCGACGA TCTACTTGAT
ACGCTTCCAT TCCTTGCTCC ATTCTTTGAG ATTGCAGCGC CGCTACTTCT TGTTGTCGTA
AATGCGCTGC TCCCGATGAT TTTGAGAGTG TTTTCCATGA TGGAAGGTCC AGTCTCTGGA
GCTGTTGTGG AGGCCTCCCT ATTCACCAAG CTCGCCGCCT TCATGATTAT TCAGACGTTT
TTCGTAAGCG CAATATCAGG TGGATTATTG CAGGTAAGAC ATCCTTCGTC GAGATTACAA
ACCATTGCAG AATTCTTGAC TGGTCTCGCT TACACGCACT TGAATAACAT CAATGGAACA
GGAACTCTCA TCACTGGTCC AAAGTCCCAC ATCAATAGTT GATTTGCTCT CCACGTCACT
GCCTGCGCAG GCGACGTACT TCATTCAAAT TATCTTTGTA ACAACGGTAT TTTCTTGTGG
AATGGAAATT CTCCGTGTTG TGCCACTACT CAAAGCAATG CTGCGTAGAT TCCTTGGACC
TCGACTCACA GAAAGAGAGC GACAACAACC CTTTCTCACG CTTCGACCGT TATCAAACCC
TCTTGACTTC GAACATGCTG GATTCTCTTC AAATATAGTA AGCTTGCCTG TCCCTCTTTT
CGGTGCACAA TTTCCATCAT CTCACAAGCT TCTCTCCCAC ATCCCAGGTG TTATACTATA
TCGTCTTTTT GGTATATTCC GTGATCTCGC CGCTGACAAG CATCGTTGTT GCATTTTGCT
TCGCGTTCAT GGATTCAATT TTTTGCCATC AATTTGTATA CATTTACCCC AACCGTTCTG
ATTCAGGAGG AAAGCTGTGG CTGAACTTTA TGCGGGTTCT AATTGCCTGT ATGTTCGTAG
CTGAGTTTAC AAGTGAGTCA AGGATGAAAC TTTGTACATG TGACCAGTTC GCCATTTCTC
TTATGAACTT GCTTCTTTAC TAAAGTTGTC GGCCTTTTGG CGTTGAAGAG AGCGCCCATA
GCCACTCCGC TCATGGTCCC ATTGATTGTA GTTACAGCGC TGTTTTCAGT ATACATCAAC
GAACAGCATT TCAAGGTGAC AAAAAATCGT AAGTTCCTGT GCTACGGTGT CCCCATAAGT
TTAGTTGTCG GAATCAAACG ATCCTTGTTC ATACTAACTT TTGCTGGCTT CCTTCCTTCT
GGCCAGTTCC ATGTCGAGAG TGTACTTTCA AAGACATAGA ACACAGTTCA ACTTTCGATT
CCGCTTTTCT TAAAGACGCG TATCTGCAAC CCGAATTACA AACCAAAGAA GGTATGTTCT
CACCAGTCAG TAGCTTTATG CTGTGTTTGG TGTCAACGAG TCTCACAGCA AGATTTGGTT
TCGCCCGACT GTAA
 
Protein sequence
MNENFIIHPL RPEYHRLQSS GGTGSSAGGE TEFDDSQVLR QTFMVYLPIL VVVVLLFSLI 
RRKYRRAFNV RSWLDPIKTP LAYNQYGFFS WFWKLKSISD DKLMDECGMD ALCFVRVLRM
GFKISLLGVL CSAVLMPLYA TADDSQNTRS ITDNIAQLTI SHVPEGSPRL LGAVIAAWII
FGYTMRLILK EFVWFIEKRH KFLATIRPRN YAVYVRNIPN ELRSDAELEN FFRQCFQSES
ILEGNVALKV PELSKLVAQR EAAITKFEHA VAVEDRTGEK PQHAPSLASA IRGSLKGGGE
KVDSINYFAS EIKELNQVIS KHIDDLNENK TCFSHDVEQP HASGSNRQIS NGVRKTRSDS
ENERYGNMER TGLLSVSEGR SDHSVKEDCS TSPRNDIDTP NGDPGKIETA ENAGYNAYRA
TNDATPGDSE GKTNPGIVLK NATKKCRESA SSIGGAVKDS AKAVAENSTS LLKSAEDGKP
ESAGFLSFRS LRSTHAALQL IHHGTPFTME VQEAPAPDDV FWFNVGRGHK ELQVGRLMSF
AATAVLCLFW TIPVSFVASL STIESLRAEV GFVDDLLDTL PFLAPFFEIA APLLLVVVNA
LLPMILRVFS MMEGPVSGAV VEASLFTKLA AFMIIQTFFV SAISGGLLQE LSSLVQSPTS
IVDLLSTSLP AQATYFIQII FVTTVFSCGM EILRVVPLLK AMLRRFLGPR LTERERQQPF
LTLRPLSNPL DFEHAGFSSN IVLYYIVFLV YSVISPLTSI VVAFCFAFMD SIFCHQFVYI
YPNRSDSGGK LWLNFMRVLI ACMFVAEFTI VGLLALKRAP IATPLMVPLI VVTALFSVYI
NEQHFKVTKN QHSSTFDSAF LKDAYLQPEL QTKEARFGFA RL