Gene PHATRDRAFT_50050 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_50050 
Symbol 
ID7198743 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011694 
Strand
Start bp250746 
End bp252290 
Gene Length1545 bp 
Protein Length482 aa 
Translation table 
GC content51% 
IMG OID 
Producthomeobox protein 
Protein accessionXP_002184846 
Protein GI219129334 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAGTCTC CAAACGGCAA TAATTGCTCA ATCAATCCGG CGACCCCGGT ACGATCGACT 
TCCAGCACGG CCCCTCCTAG CAAACGCCGC TCCGTCCGCC GTCCCAACGA CGATCTCGAG
CTCATGCTTC AGCACGAAAA CTTTCCTCAT CTCTTGAATT TGGCACACAA AATCAAGACC
GCGCATGTTC GAATTCAAAG CGTGACCACC GGCTTTGAGC CTTCCCAAGC GCTAGCCAAA
TATAGTTTGG CCCCATCGGA AGTCATGGAA GCCGTTACAC GACTACAGAA TCAATTCGTC
TCCAGCGAAA CCTCGCGTGA GACGATTCGG CCTGGAGTTT ATCCCATGAC TGATATTCTC
GTCATGGATT CCATAGAAAC CTTGCAGCGT GAGTGGGAGC ACTGCCAGAA ATGGCTGGAA
GCGTCGGAAA GTACTCGCCT TGACAAGGAT CCCACTGCCG AGCCTGCTGT ATCTGAGCCT
CCGACCCTTC CAAAGAAACG AAAAGCTTCT GGCGCCAACA GCGGTAGCAA AAAGGAAGCT
ATTGCGGTGA AGTACTCCAA ATGGCAGACA GATATTCTCA TGAACTGGAT GATCCAACAC
GTCGATGAAC CCTTTCCGAA ACAAGGCGAG ATTCATCAGT TGATGGACAT GACCGGGCTC
ACGCAATCAC AGGTCATCAA CTGGACAACA AATGTTCGCA AGCGCAACCG CAAGGCCACA
TGCCAAAACG GAAAGAAGCC CCACCACTTT ATCGACTTTG TATTCCTGGC ACACGATCGC
GAACGAAGAG CACGCAAGGC GTCTTCCATT GCGACGGCCC ACCCATTAAT GACTGGTTTG
GACTCTTTTC AGAATGCCCC CAGGGAGAAC ATATTGGCGG TCCCGCCATC ACCGAGTGCA
TGCGCTGTTC CTACCCGGCA AGCGGCCAGT TCGTACTCGT ACCCTTCTCC TCCTTCCTAT
CCACAGCCAA CGTACAATCA TACAAGTCTA AGCTACTTTG CCAACCAAAC GCCAGATTTC
AAGTGCGTGC ACACGGTGCC GTTTTCACCA GGTACAATGG CATCTTCACC TCCGCGTTGC
GCGGAAGACT CGGCAATTCA GGAACATAGC CAGACACTCA TGCAAGAAGA AATGTGGGAG
ATACACGATG ATTTTGATCC TGTACCAATG GAAGAAGAAT CAGACGAATT CATAATGGAA
GAATTTGCCA AATCTTGGTT GTTTGAAAAC CCGATGGACG TGAACGATCC GGACTCCTTG
ACAGTGCAGC AAGCTCCCTG CAGCACAATG CCTCGTCTGA ACGACCTGGG TTTGCTGCCC
AGTGTTACTG AGGACAGTCA CGAAAAATTA CACCGTAACC GAACAGCTAG TTTTGATCTC
GGAGAACTGG AGGACGAAGA CATTGACGCC TGGGCCGCCG ACATGGGACT GACGATTGAA
ATTCAGTAGC CGCGCGGGTC TATCATGTGA GGGTCACATG ATGGAACTAG GTTGTACATT
CGAAACGCGA CATAAATCTT TAGCTTAAAC AACGTAAATG AAACC
 
Protein sequence
MQSPNGNNCS INPATPVRST SSTAPPSKRR SVRRPNDDLE LMLQHENFPH LLNLAHKIKT 
AHVRIQSVTT GFEPSQALAK YSLAPSEVME AVTRLQNQFV SSETSRETIR PGVYPMTDIL
VMDSIETLQR EWEHCQKWLE ASESTRLDKD PTAEPAVSEP PTLPKKRKAS GANSGSKKEA
IAVKYSKWQT DILMNWMIQH VDEPFPKQGE IHQLMDMTGL TQSQVINWTT NVRKRNRKAT
CQNGKKPHHF IDFVFLAHDR ERRARKASSI ATAHPLMTGL DSFQNAPREN ILAVPPSPSA
CAVPTRQAAS SYSYPSPPSY PQPTYNHTSL SYFANQTPDF KCVHTVPFSP GTMASSPPRC
AEDSAIQEHS QTLMQEEMWE IHDDFDPVPM EEESDEFIME EFAKSWLFEN PMDVNDPDSL
TVQQAPCSTM PRLNDLGLLP SVTEDSHEKL HRNRTASFDL GELEDEDIDA WAADMGLTIE
IQ