Gene PHATRDRAFT_50558 details


Gene Information

Locus tag: PHATRDRAFT_50558
Symbol:
ID: 7199387
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011699
Strand:
Start bp: 108755
End bp: 109904
Gene Length: 1150 bp
Protein Length: 319 aa
Translation table:
GC content: 56%
IMG OID:
Product: predicted protein
Protein accession: XP_002185486
Protein GI: 219130678
COG category:
COG ID:
TIGRFAM ID:
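
The Start bp, End bp, Gene Length and Protein Length fields above can be cross-checked with a little arithmetic. The sketch below is only illustrative: it assumes the coordinates are 1-based and inclusive (which matches the listed 1150 bp) and that the gene span includes the stop codon. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 108755
END_BP = 109904
PROTEIN_LENGTH_AA = 319


def span_length(start: int, end: int) -> int:
    """Length of a closed interval [start, end] (assumes 1-based, inclusive coordinates)."""
    return end - start + 1


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Number of coding bases for a protein, three per codon, optionally with the stop codon."""
    codons = protein_aa + (1 if include_stop else 0)
    return 3 * codons


gene_length = span_length(START_BP, END_BP)    # 1150, matches "Gene Length"
cds_length = coding_length(PROTEIN_LENGTH_AA)  # 960 coding bases including the stop codon
non_coding = gene_length - cds_length          # 190 bases not accounted for by codons

print(f"gene span: {gene_length} bp")
print(f"coding bases (with stop): {cds_length} bp")
print(f"remaining bases: {non_coding} bp")
```

Under these assumptions, the 190 bp left over is consistent with the record marking the gene as spliced, since the gene sequence would then include intron bases that do not appear in the protein.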


Plasmid Coverage information

Num covering plasmid clones: 40
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCCTTCT ATGGAGTTTC CAATAAATGT AGGTATGTCT CTCTCTTCAC AGTCCAGTGA 
CCGCATACCC GTTTTTGTAT TTATGATCTT CTTGGATTGG TTCGAATATA TCCTCCTGAC
GTCGTCGTCC TTCCGACTCG ACCCCAACCT CGACGAACAA AACACAGTCC AAGCCAAGTC
TCGGGAACGG TTTGTTGAGA GCGGGGGCGC TAGCTAGCTA CCGCTGTGAA ACCCCCATAC
CTATTCTCCA TCCACATCGA CACTGTGTTC ACACAGTCAT TGCTTTCCGC ATTTAGTGTT
TTCCCACCAC CGCTCGATGA CGTTGCGTAC GACTCCCGTG ATAGAGACGC GTTCGTCGCT
GCCCGACACG CCGGTGAGTG CGCTCACGAT CCCCACGTCG CACGCGTCGT CACTATCGTC
GCCGTCAGCA GCGGCAACAG CGAACGATTC TTTCGCGTTG GAAATGAAGG AACGGGCTCG
TACCCGTCGC GCCCGACAAG ATCAGGTCGT CTCGGACCTC AAAGTCCAAA TCCGTCGTGT
CGAAGCCGCA CTCCAGGCGG AAACCAAACG ACGGGTACAC GGACTCCAAA ATCTCCAACA
ACAGACGGAG GCCAGTATCC GCACACTGCA AGAAAATTTG GAAGAGTCCT GGCGACTCCA
ACAAACAGCC ACGGACGATC GCTGGCAACA ACTGGCCGAC CGATTGACCG CTTTGGAAGA
GTCCTGGCGT GTCCACGTGG CGCGCTGGGA AGACCGCACC GGTTCGGAAA GTGCACAGGC
GCGGGAGATG CTACGGGAAC TGCAGGAGCA GGCGGAAGTG GCGCAGAGGG AGCGAGAGAT
TCGTGAAGAG AGTTTGCGAC AGCGATTGGA AAACGTCGCG CAAGAAGCGG AACAAGCCTG
GGACGAAGCC CGACGCGAAC GGCGCGCGGC GACCGACGAA CTGCAAACGC GGCTCGAACG
ATACGACGAC AACGTGGAAG CGCACGTTCA GGGATTGCAA CAAACCCTAC GGCAAGAATT
AGCCGCCCTG CAGGCCGATC TGCAAAGGGA ACAAACGGAA CGAGCGACGG CGGATCAAGA
TATTGCCAAT GTCTTGAATC GCTACACGGA AACGATACAG AACAGTTTGG CCGTCGTTTC
GGACGTATAG
 
Protein sequence
MAFYGVSNKC SDRIPVFVFM IFLDWFEYIL LTSSSFRLDP NLDEQNTVQA KSRERVFPPP 
LDDVAYDSRD RDAFVAARHA AATANDSFAL EMKERARTRR ARQDQVVSDL KVQIRRVEAA
LQAETKRRVH GLQNLQQQTE ASIRTLQENL EESWRLQQTA TDDRWQQLAD RLTALEESWR
VHVARWEDRT GSESAQAREM LRELQEQAEV AQREREIREE SLRQRLENVA QEAEQAWDEA
RRERRAATDE LQTRLERYDD NVEAHVQGLQ QTLRQELAAL QADLQREQTE RATADQDIAN
VLNRYTETIQ NSLAVVSDV
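
Both sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any further processing. The sketch below shows one possible way to do that and to compute a GC fraction of the kind reported in the GC content field. The helper names are hypothetical, and only the first line of the gene sequence is used as sample input, so the printed fraction describes that line alone and not the whole gene.

```python
def join_blocks(text: str) -> str:
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence, copied as printed (six blocks of ten bases).
first_line = "ATGGCCTTCT ATGGAGTTTC CAATAAATGT AGGTATGTCT CTCTCTTCAC AGTCCAGTGA"

seq = join_blocks(first_line)
print(len(seq))                    # number of bases after joining the blocks
print(f"{gc_fraction(seq):.2%}")   # GC fraction of this one line only
```

To check the 56% figure, the same two functions could be applied to the full gene sequence pasted as a multi-line string, since `join_blocks` also removes line breaks.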