Gene PHATRDRAFT_49209 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_49209 
Symbol 
ID7195518 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011689 
Strand
Start bp231167 
End bp232964 
Gene Length1798 bp 
Protein Length526 aa 
Translation table 
GC content49% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002183834 
Protein GI219127213 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.0370698 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CTGCGTTCGA GCTTCGCTGG TATCCGAACC ATGACTTCCA AATTTCTATT CTCCGTTCTC 
GCGCTTAACC ATGCGTTTGC TTCGGCAAGT CCTGATTTTC TTATGATCGG AAACTCATAC
TACTTTACGA AACAATTTGG TGAATCAACT CAAATCACTT CTTCGGGAAG GGCTTTCTGA
TAACCGTGTG GCATCTAGTT TAACTGCCAG CGGAGGTAAG GCGCTGACAC AACATCTTCA
GGATGCCCAG GGCGCTAACG GCAATGACAA AGATCTGCGT CAGTGGCTTT ACCTTGATCC
GAAGCCCTTC GAGTGGGTCA TTCTACAGGA GCAAAGTCAA ACCCCAGGCT TTTATGGATA
CAGTAGCTTC ACGACAAGTT TAAACGCCGC GGTTGGCCTG AACGAAATGA TTTCCGACGT
TGGCGCCAAA ACTGTATTTT ATCAAACTTG GGGACGTCGT GACGGTGACA GCCGCAATGC
CTGGCTCTTC CCAGATTTCT CTACCATGCA GGACCGTCTT GATGAAGGAT ATGGTCGCTA
CAAAGCGGCG ACGAAAAACT CCAAATTGGC ACCGGTAGGC CCAGCGTTCC GTATCATATA
CGATACTTTG ATTGAAGCGG AGATAGACCC TTTGAAATCA GAAAGTGCCT TTCACTCTTT
GTACAGCTCA GATGGCAGTC ACCCTTCTGT CACTGGATCC TACCTTGCCG CTTGCGTCTT
GTACTCCACC ATGACTGGCA AGGATCCACA AGGACTGAGT TACCGGCCTA GTGGCGTGTC
AGAGGCCCAA CAGGCTATGC TCCAGAATGT TGCAGCTCAC ACTGTGCGAG ATGCGCCGTT
GGTGAAGGCA CTGATTGCTC CTACCGTCTT TGCACCACCA GACAATGAAG TAACTGACGC
TCCTACTAAT TCGCCAGTGA AAACTAGTGG TCCTTCACAG GCTCCGGTCA AAACCAGCAC
TCCTACTCAG ACGCCCGTGG AGACTGACGC TCCTACTCTA GCCCCCGTGA AAACAGACGA
TCCAAGTCAC TCTCCAGTTC GCCAGCCGCA ACCAGTGCCC GTAGCAGGAC CGTCTTGGGA
CAAACGATGC GATCAAATGG TATCTGATAG TGACTTTGAG TCCGGCCTCG AAAGCTGGAC
TGCTCAAGGT GCTGGTAAGA TCGAAAGTGT GTCTCCTGGC TACAAATCTG ACAAAGCCTT
AGCTTCAACG GGAAGACTCC GTTATTGGAA TGGTATTGGC CTCGGCATTT CGCGCCGAAA
CTACAATGGA TGCGTGGAGG CGGGATCCAA GTGGGAAGTG AGTCTGCAAG TCCGTCTGGT
CAATCCTGAA ACCGGCAAAG GTGTTTCCTG TGATCGCAAT CCCACAAGAT TTACACCAGC
GGACAAAAGA GGCTCCGGTT TTTGCCCCGC GGTGACCCTT TACCTACGTG ATGGATCCTG
GCGACTAGGC AAGTTTACAC TGCGTGACTA TACCTCTAGC TGGAATCCCA GCGAGTTCAA
CGAGCTACGA TCTGTATTTG AATTTCCGGC AGGCTCTTCT CAATGGAACG CTGATATTCG
GAATTTTATC ATAAAGATCG ATCAGGCAGA CTACGACCTC GAGATGATTG TCGACGACTT
TTCGATGAAA CGTATCGGCT AAAAAGGAGC CGCAACTCTT CTTTTCCATT CATTGTCGTG
TATGGAAAAC AGCCATGTAC AAGTAAAATC ATCGCATCTT TCAAGGAATA GTGCTATTGT
GTCTCAAACC GTTTTGACTA ATTGCTCAAT AGTGCCTTTT TCATTACCGT CCAAGAAA
 
Protein sequence
MTSKFLFSVL ALNHAFASAS PDFLMIGNSY YFTKQFGLSD NRVASSLTAS GGKALTQHLQ 
DAQGANGNDK DLRQWLYLDP KPFEWVILQE QSQTPGFYGY SSFTTSLNAA VGLNEMISDV
GAKTVFYQTW GRRDGDSRNA WLFPDFSTMQ DRLDEGYGRY KAATKNSKLA PVGPAFRIIY
DTLIEAEIDP LKSESAFHSL YSSDGSHPSV TGSYLAACVL YSTMTGKDPQ GLSYRPSGVS
EAQQAMLQNV AAHTVRDAPL VKALIAPTVF APPDNEVTDA PTNSPVKTSG PSQAPVKTST
PTQTPVETDA PTLAPVKTDD PSHSPVRQPQ PVPVAGPSWD KRCDQMVSDS DFESGLESWT
AQGAGKIESV SPGYKSDKAL ASTGRLRYWN GIGLGISRRN YNGCVEAGSK WEVSLQVRLV
NPETGKGVSC DRNPTRFTPA DKRGSGFCPA VTLYLRDGSW RLGKFTLRDY TSSWNPSEFN
ELRSVFEFPA GSSQWNADIR NFIIKIDQAD YDLEMIVDDF SMKRIG