Gene PHATRDRAFT_43785 details

Gene Information

Locus tag: PHATRDRAFT_43785
Symbol:
ID: 7197057
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011670
Strand:
Start bp: 1476032
End bp: 1477798
Gene Length: 1767 bp
Protein Length: 588 aa
Translation table:
GC content: 50%
IMG OID:
Product: predicted protein
Protein accession: XP_002177841
Protein GI: 219112179
COG category:
COG ID:
TIGRFAM ID:
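
The start, end, gene length, and protein length fields above are consistent with each other, which the following minimal Python sketch checks. It assumes the coordinates are 1-based and inclusive and that the 1767 bp include the terminal stop codon (the gene sequence in this record ends in TAG). The function names `feature_length` and `protein_length` are illustrative and not part of any database API.

```python
def feature_length(start_bp, end_bp):
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp, includes_stop=True):
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    codons, remainder = divmod(cds_length_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    # The stop codon does not encode an amino acid.
    return codons - 1 if includes_stop else codons


gene_len = feature_length(1476032, 1477798)
print(gene_len)                  # 1767
print(protein_length(gene_len))  # 588
```

Both results match the Gene Length and Protein Length values reported in the record.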


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000378872
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a

Sequence

Gene sequence
ATGTACTTTC GTCCTTTTGT GATGGTCATT TGGCATTCTT GTTTCTTCTT TATGGAGCTA 
ATTTGCTCTG TATCTAGCCT GCCCCCAACA ATAGTCCGGC ATCGTCCTAT GGGTATTGGC
TTGTTGAACG AGCATCGTCG TAGTCATCTT CGTCGGACAG CGCGTGACGT GCACTCAAAG
GATTCTACGG GAAAGTACTA TTCAGATATA TTGGGCCGGC AAGAACCTAA CTTGCCTGAA
AAGCAAATAA GCGACCGTCT AAAGTACGCG GAAGGTATTC AGGAGGTTCT GGCAAAACCC
ATGCACGTGG GGACTTCGCT AAAATTTCGA CCTACTGCGT ACGATTTATT TCAAGCTGGA
ATTGTAGGGA TTTTCACGGG ATTTTCTGTC GCACTGTTCA AGCTTTCAAT CAATGCCGTG
AAGAGCCTGT GTTACCGTCA AATTTTTTTT CAAACAAACC CAGTCTTGAT GGTCACTGTG
CCGGCTATGG GTGGTGCAGC TGTCGGAGTC TTGATGCTTC TCGGGGATTT CCCTCCTGGT
CTTCGCGGAA CGGTTATTGA AGTAGATAAA GAATCCCAAG GTACCGTCCA GAAGTTGCGA
GATCGTGTAC AGACTCAATT TCGCTTTCTG CGAAAGTCTG CTGCGGCCAC TGTCACTTTA
GGAACAGGGT GTAGTCTTGG GCCTGAAGGG CCGTGTGTCG AAATTGGAAT GGATGTGGCG
CGCAGCTGTA TGGATATTAG CCGACGCACA GCAGAGCGCC AGCGCCTATG GAACCGTATG
CTCTTGTCTT GCGGAGCAGC TGCTGGCGTT TCGGCAGGCT TTAACGCACC GATCGCGGGT
ACTTTCTTTG CGCTGGAAAT TATGCACCGA ATGTTTTCTT CAATTGATGG TGAGGAAAAC
GCAGACAAAG ATGCCTCCGC CGGGCTGAGT TCTCTGAACA CGGCGACCAT TGCTCCCGTT
CTAATCGCTT CGGTCATGTC AGCTTTATGC GCAAGAACTC TACTCGGAGA CCATCTCGTA
CTAGCCCTAG GCGGTTCTTA CTCTCTCAAA AAGCCTCTGA TTGAATTACC GTTGTACATG
GTTCTCGGGC TCGTATCCGG AACCGTTTCC TTTGCTTTTA GCCGAGCTGC CAACCTCAGC
CAAGCTGTGT TTGTCGGGGA TTATGGAAGC GATCGCTTTC GAATGGGAGT GCGTAGCCTG
TCACCTGCGT TCAAGCCCGT CATTGGTGGC ATTCTTTGTG GGCTCGTTGG AATCAAGTTC
CCGCAAATCC TTTTTTTTGG ATATGATTGC TTGAACCCAC TTCTAGCCAA CAACTCTTTG
CCAACACCCC TACTTCTTTC CCTCCTGGCA GCAAAGATAT CTATTACAGC AATTTCCGCT
GGCTCTGGAC TAGTCGGCGG CACTTTTGCT CCGTCGCTAT TTTTGGGAGC AGTAACTGGC
GCTGCATTTC ACAACATTGT TTCGAGCATT CTCTATTGTG GCCTTGGCCT GAGTGCTGCT
TCAGGACCTT TACTTGCCGA CGTCCCGGCC TATGCCATGG TAGGAGCGGG ATCTGTACTT
GCTGCTCTCT TTCGAGCACC TTTGACAGCT TGCTTGCTTC TCTTTGAAGT AACCCGCGAC
TATGACGTTA TTCTCCCATT GATGGCGAGT GCTGGCTTTG GCAGTGTCTT CGCGGATGTT
TTAGATGGAA AGTTCAGCAG AGCTCAGAAG AGAAGAAGGC TTCGTCGAGA TAAAGATGCA
GTATCTTGGG GCGACCTGTC AAGCTAG
 
Protein sequence
MYFRPFVMVI WHSCFFFMEL ICSVSSLPPT IVRHRPMGIG LLNEHRRSHL RRTARDVHSK 
DSTGKYYSDI LGRQEPNLPE KQISDRLKYA EGIQEVLAKP MHVGTSLKFR PTAYDLFQAG
IVGIFTGFSV ALFKLSINAV KSLCYRQIFF QTNPVLMVTV PAMGGAAVGV LMLLGDFPPG
LRGTVIEVDK ESQGTVQKLR DRVQTQFRFL RKSAAATVTL GTGCSLGPEG PCVEIGMDVA
RSCMDISRRT AERQRLWNRM LLSCGAAAGV SAGFNAPIAG TFFALEIMHR MFSSIDGEEN
ADKDASAGLS SLNTATIAPV LIASVMSALC ARTLLGDHLV LALGGSYSLK KPLIELPLYM
VLGLVSGTVS FAFSRAANLS QAVFVGDYGS DRFRMGVRSL SPAFKPVIGG ILCGLVGIKF
PQILFFGYDC LNPLLANNSL PTPLLLSLLA AKISITAISA GSGLVGGTFA PSLFLGAVTG
AAFHNIVSSI LYCGLGLSAA SGPLLADVPA YAMVGAGSVL AALFRAPLTA CLLLFEVTRD
YDVILPLMAS AGFGSVFADV LDGKFSRAQK RRRLRRDKDA VSWGDLSS
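
The relationship between the gene sequence, the reported GC content, and the protein sequence can be reproduced with a short Python sketch. The Translation table field in this record is empty, so the sketch assumes the standard genetic code (NCBI translation table 1). The helper names `clean`, `gc_content`, and `translate` are illustrative. Ambiguous bases such as N are not handled.

```python
from itertools import product

# Standard genetic code (assumed), with codons ordered over the bases T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used to group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def gc_content(seq):
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq, to_stop=True):
    """Translate a coding sequence in frame 1, optionally stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*" and to_stop:
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate("ATGTACTTTC GTCCTTTTGT GATGGTCATT"))  # MYFRPFVMVI
```

The printed peptide matches the first ten residues of the protein sequence above. Passing the full gene sequence to `gc_content` and `translate` in the same way would allow the reported GC content and the 588-residue protein to be checked.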