Gene PHATRDRAFT_42255 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_42255 
Symbol 
ID7194985 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011687 
Strand
Start bp668045 
End bp669209 
Gene Length1165 bp 
Protein Length349 aa 
Translation table 
GC content47% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002183528 
Protein GI219126571 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones33 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGACTACG TTCAGCTTGG GGATTCGGAT CTCGTCGTTT CTAAGGTTTG CATGGGAACT 
ATGACTTTCG GTGAGCAAAA TACTCTAGAA GAAGGTGTGG AGCAGCTTAC AAGGGCTTTT
GATGAATTCG GTGTTAATTT TTTGGACACT GCCGAAATGT ACCCGGTACC AACGAAAGCC
ACAACGCAAG GCGCCACCGA CAAGGCGATA AAAATGTTTT TGCAATCACG GAAGCGCGAA
GACGTAATTT TGGCCACAAA AGTATGTGGT AGATCCGAGC GCATCAAGTG GTTGCCCCGA
CGAGATTCGG AAACGCCGGC CGCTTTGACC AGGGATCAGA TTCTCGATTC GGTAGATGCA
TCTTTGGAAC GACTTGGAAC CGACTACATT GATCTTCTGC AGCTTCATTG GCCAGGTAGG
AAATTAGTCA CGGATTTCTC GAGTACGTGT ATTTTTCTTC GAATTCTGAA TTATCTCATC
GTGTTTATTG CTATTTTTAG ATCGTTATGT CGGCTCGATG TTTGGATCAG GGGATTTTAG
ACCGTCGCAG TACCAAGATA ATCCCAAGCC AACAAGTTTT GAAGAGCAGC TTTCTGCCTT
GCAGGAGCTT GTGACGACGG GAAAAGTTCG TTACGTAGGC GTTTCCAATG AATCTGCATA
TGGCGTGTGT TCCATGGCTG CACTCACCAG GCAGTTTCCC GAGCTCTATC CCAAAATTGT
ATCGATTCAG AACAGTTTTT CGCTTGTCGT ACGCAAAGAC TTTGAGGCCG GTCTTGGTGA
AGCCTGCTTC CATCACAATG TAGGACTCTT AGCATATTCA CCTCTGGCAG CAGGTACGCT
GAGTGGAAAG TACCGCAAAA ATGTTCCAAA AGGTGCTCGC TTGACACTCT TTCCTGGATT
TATGGAACGA TATTTGGGTT CTTCTAATGA GGAAGCCGTG AACGCCTATT GTGATCTTGC
AAAGAAGGCA GATTTGACGC CGACACAACT TGCTCTAGGC TGGTGCTACC ACAATGAACT
TGTAGCGAGC TCCATTATTG GTGCTACAAC CATGGACCAA CTGGAAGAAA ATATCCAAGC
CTACGACGTC CGATTAAGCG ACGATGTCAG TAAAGAGATT GAAGCGATTT ACGCAAAGTA
CACGGACCCG ACCAAGGCTC GCTAA
 
Protein sequence
MDYVQLGDSD LVVSKVCMGT MTFGEQNTLE EGVEQLTRAF DEFGVNFLDT AEMYPVPTKA 
TTQGATDKAI KMFLQSRKRE DVILATKVCG RSERIKWLPR RDSETPAALT RDQILDSVDA
SLERLGTDYI DLLQLHWPGD FRPSQYQDNP KPTSFEEQLS ALQELVTTGK VRYVGVSNES
AYGVCSMAAL TRQFPELYPK IVSIQNSFSL VVRKDFEAGL GEACFHHNVG LLAYSPLAAG
TLSGKYRKNV PKGARLTLFP GFMERYLGSS NEEAVNAYCD LAKKADLTPT QLALGWCYHN
ELVASSIIGA TTMDQLEENI QAYDVRLSDD VSKEIEAIYA KYTDPTKAR