Gene PHATRDRAFT_41351 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_41351 
Symbol 
ID7199161 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011697 
Strand
Start bp249747 
End bp251786 
Gene Length2040 bp 
Protein Length679 aa 
Translation table 
GC content55% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002185347 
Protein GI219130385 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCAGCA CTTCTAATAC CGCAACGACT ACCGTCGCCT TGCCTCCCGC TCTGTCCTTG 
GAAAGCTTGT CTTGCTCGCA CAATGGCGGA GAAACCTGGC AGCTCCGGGA CGTTTCGTAC
GTACTTCCCC GGGGTGCCAA GGTGGCACTC GTCGGACGCA ACGGGACGGG CAAGTCCACC
TTGCTCCGCA TCTTAGCTTC CCGGGCGTGT GCGGACGCTG CTGACGAAGC ACAGAATATC
AAATATACCG GACAAGTCGT GACGCCACGG GACGTCAAGG TTGCCTACGT CGAACAGGAG
CCGCACCTGT CCATGGACTT GAACGTCGCC GACGCCCTCT TGGGGTTCCG TGGCGACGGC
ACCGTCGAGA CCAGTAGTGC CAAAAATAAA TACGCAGCCG TCCGGAAATA CCGTCTAGCC
GTCCAGGAAG CCGAAGTTAA GCCGGAAGCC TTTGCCCAGG CCTGCGCCGC CATGGATGCC
TTGGAAGGAT GGAACGTCTG GACAAAAGCC GAAGAAGTCG CCACTAAGCT CCGCGTACGG
CACCTACAAG ATCAGCCACT GGCCAAATTA TCCGGCGGGG AACGGAAGCG GGTCGCATTG
GCCGCTGCGT TGGTCCAAGA ACCCGACGTA CTTTTGTTGG ATGAACCGAC AAATTTCTTG
TCCCTCGCGG GGGTTCAGTG GCTGAGTGAT TTGCTGCTCG GTGACAAGAA GCTTACCATT
CTCATGGTGA CGCACGACCG AGCCTTTCTG GACGAAGTGT GCGATCGAAT ACTGGAACTG
GATCAAGGAT CCGTTTACGA ATACGTCGGA TCGTACGCCG ACTACTTGGA AGGAAAACAG
GAACGGCTCG CCGTGGAAGA CGCCGCCTAC CAGTCCGCCA AGGCCAAGTA CGCGGTCGAG
CTCGATTGGA TGCGCCGACA ACCACAGGCC CGTCAAACCA AGGCGAAAGC TCGCATCGAC
GCATTTTACA AGCTGGAGCA AGCGACCAAA CCGAGACCCC GCGATCCCAC TCTCAATCTA
GCCAGCGAAT CCCGACGTAT CGGTGGCAAA ATTATTTCCA TGAGAAACGT TTCGCTGAAA
TTCGGAGACC GGACCATGCT GAAAGATTTT TCCTACGATT TTTGCAAAGG CGACCGGATT
TGTCTGAGCG GCGGCAACGG CATTGGAAAA ACCACATTTT TACGCGTGTT GACAGGCGAG
CAACCGGCCG ACGCTGGCGA TATTGACATT GGCGACACGA TCGTGCTCGG CGTATACGAA
CAAAACGGCA TCGAAATCGA GGACCCCGAG CAGACCGTGC TCGAATTTGC CGTCGAGCAG
GCCCGAGCCC GGGACGGAGC CAGCGCCGAC GAAGGTCCGG ACGACGCCCG AAGGTTGTTA
CGGCAATTTG AATTCCCTCA AGCCCGTTGG GCCGAACGCA TTTCGGTCCT CTCTGGTGGA
GAAAAGCGAC GGCTACAAAT GCTTTCAGTC TTTAGCCAAC GGCCAAATGT GCTGATTATG
GACGAACCGT CGGTAGATTG TGACTTGGAT ACGTTGACGG CTCTCGAAAA GTATTTGCAA
GAGTTTGATG GGGTGCTGCT GATCGTCAGT CACGACCGCG CATTCGCCGA CAAGGTCACG
GATCACTTGT TTGTCTTTGA AGGACACGGG GAAATTAAGG ATTTTCAAGG AAGTCTTTCT
GAATACGCGA CCACCTTGAT TGAATTAGAG AACGACCGTA TTGCGGAAGG GTCCCGCGGA
CAGGCTGATA CGGAAGAGAA AAAGGGAGCC TACAAAGAAG ACAAGGCCAA ACGGAACGAG
CAACGCAATC AAGTGCGCCG GGCAAAAAAG GATATGGTCA ATGTCGAAAA GGCGATTGAA
AAACTAAAAG AAACCGCAGC CTCGTACGAG AAAGAAATCG ACGTTTGTAG CGGCGAAGGC
TGGACCATTC TGGCCGATTT GACTGACAAA TTGAACAAGG TGAACGAGGA AATCGACGAG
AAAGAAATGC GGTGGATGGA ATTGGGAGAG CTAGTGGAAG AGAGTGAGGT CGAAGCGTAA
 
Protein sequence
MSSTSNTATT TVALPPALSL ESLSCSHNGG ETWQLRDVSY VLPRGAKVAL VGRNGTGKST 
LLRILASRAC ADAADEAQNI KYTGQVVTPR DVKVAYVEQE PHLSMDLNVA DALLGFRGDG
TVETSSAKNK YAAVRKYRLA VQEAEVKPEA FAQACAAMDA LEGWNVWTKA EEVATKLRVR
HLQDQPLAKL SGGERKRVAL AAALVQEPDV LLLDEPTNFL SLAGVQWLSD LLLGDKKLTI
LMVTHDRAFL DEVCDRILEL DQGSVYEYVG SYADYLEGKQ ERLAVEDAAY QSAKAKYAVE
LDWMRRQPQA RQTKAKARID AFYKLEQATK PRPRDPTLNL ASESRRIGGK IISMRNVSLK
FGDRTMLKDF SYDFCKGDRI CLSGGNGIGK TTFLRVLTGE QPADAGDIDI GDTIVLGVYE
QNGIEIEDPE QTVLEFAVEQ ARARDGASAD EGPDDARRLL RQFEFPQARW AERISVLSGG
EKRRLQMLSV FSQRPNVLIM DEPSVDCDLD TLTALEKYLQ EFDGVLLIVS HDRAFADKVT
DHLFVFEGHG EIKDFQGSLS EYATTLIELE NDRIAEGSRG QADTEEKKGA YKEDKAKRNE
QRNQVRRAKK DMVNVEKAIE KLKETAASYE KEIDVCSGEG WTILADLTDK LNKVNEEIDE
KEMRWMELGE LVEESEVEA