Gene PHATRDRAFT_50253 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_50253 
Symbol 
ID7199024 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011696 
Strand
Start bp110146 
End bp111819 
Gene Length1674 bp 
Protein Length557 aa 
Translation table 
GC content48% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002185127 
Protein GI219129924 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCATCG TTCGGAATTC CACCGCGGAT TCCTCGGTTG TGTTGGTAAC TGCGCCTGGA 
TGGTCCGAGA GGTCTCGTTC GGATGGGTTG GTCCAGGTGG AAGCGGTGCG GTCTGAAAAC
GTCGATATTC CTAAAACAAA CTTGATTAAA CACTTTGTCG AGTCTAGGTT CGGCGGTGAA
ACAAAGGATG GTCATTTGAC AGAAATGAAG GAAATGCCGC TGAAGGTCAC GCCGTCTCCG
AATCAACAAA CCGAGGCGGA TGCAGACTTT GACGCCTGTC AACAATACAA CTGCGAAACG
AGTGGAGAAA ACGAAAAGGA GGAAGCTAGT GATCTTAATG GTCCTTTCGG TACTATGAGT
GCGTTTGCTT CCGCGGTTAT GTCCTCTATC CTGCTGGATG GCAATAATTC GGAAAGTCCG
CAATCAAACC CTTCTGTTGA AAGCGTTTCA ATGTCTTTTG TGCTTGGAAA GACATACCAT
CCGCTACACG ACTATTCCAT TCGTCGAGAT GATGAAAGGT CGCTCTTCTG GTTTACGTAC
CGCTGCGACT TTCCGGAGAT TGCCCCCTAC AACATTACAA GTGATGCTGG ATGGGGTTGC
ATGCTAAGGT CGGCACAGAT GATGCTGGGT CAAGCCCTTC GCTTGCATTT CAAGTCGCGG
GATTGGCGAC CTCCGCAACT TTTGGCACGC AGACGGCAGG ATTCATTTAT TAGAAGCGTT
TTGACCTGGT TTGCGGATTA TCCTTCTTCA AGTGAGAGCG TATACTCACT GCATAACATG
GTAGCAGCTG GACTTTCCAA GTACGATAAG CTTCCAGGGG AATGGTATGG ACCAGGTACA
GCTTGCTATG TGATGCGCGA CTTGGTACAT ATTCATGAGA AGCAACAAGC TTTGGGAAAA
ACTCGTCTTG ATCGGCGCAT ATTTCGGGTC TATGTTGCTC CACAAGGTAC CGTATATCGA
GATACTATTC ATGCCTTCAT GACGACAGAA GCTAGAGTAC GGATCGAAGA AAAAAAGAAA
GTGAAGGAGC AAACTCAACC TCAAGCTCAT CCCTTAGATT TGGAATGGGA AGAAGAGCTC
ATGGAATCGG CGAACACTGT TGAATGGGAT ACAGCACTGT TGCTATTGGT ACCGTTGCGG
CTTGGACTGA CTAGCTTAAA TGAAGAGTAC GTGCAATCTC TTGCCCACAC CTTCAGCTTG
CCACAATCGG TAGGTGTTTT GGGTGGTCGT CCGCGTGGAG CCCGCTGGTT TTACGGAGCG
CAAAAGGACG GGAGTAAAAT TTTCGGGCTG GATCCTCATA CGGTACAAAC AGCACCCGGT
CGACAGACGG CACGCGTCAA CGGTCAAGCT TCGTCGGTCG TTGAGCTATC TGACGACTAC
TTACGATCAT GCCACACAAC CTGCCCTGAA ATGTTTCCTT TTTGCAAGAT GGACCCAAGC
ATTGCACTTG GATTTTATTG TCGGACGAGA GCTGATTTGA ATCACGTTTT GAATTCCATG
GGGGCTTGGC AAAAAGAACA TTCATCTATT CCAGAGCTTT TTAGTGTTTT GGATAGGGCT
CCAGATTACT CGGCCAACGT CGACGATCTT CTTTTGGGAG GGGATTCCTC AATGATGGAG
ACTTCTGGCT TTGAAGACGA AGCAAGTGAC GCAGATGAAT ACGTTATGCT GTGA
 
Protein sequence
MSIVRNSTAD SSVVLVTAPG WSERSRSDGL VQVEAVRSEN VDIPKTNLIK HFVESRFGGE 
TKDGHLTEMK EMPLKVTPSP NQQTEADADF DACQQYNCET SGENEKEEAS DLNGPFGTMS
AFASAVMSSI LLDGNNSESP QSNPSVESVS MSFVLGKTYH PLHDYSIRRD DERSLFWFTY
RCDFPEIAPY NITSDAGWGC MLRSAQMMLG QALRLHFKSR DWRPPQLLAR RRQDSFIRSV
LTWFADYPSS SESVYSLHNM VAAGLSKYDK LPGEWYGPGT ACYVMRDLVH IHEKQQALGK
TRLDRRIFRV YVAPQGTVYR DTIHAFMTTE ARVRIEEKKK VKEQTQPQAH PLDLEWEEEL
MESANTVEWD TALLLLVPLR LGLTSLNEEY VQSLAHTFSL PQSVGVLGGR PRGARWFYGA
QKDGSKIFGL DPHTVQTAPG RQTARVNGQA SSVVELSDDY LRSCHTTCPE MFPFCKMDPS
IALGFYCRTR ADLNHVLNSM GAWQKEHSSI PELFSVLDRA PDYSANVDDL LLGGDSSMME
TSGFEDEASD ADEYVML