Gene PHATRDRAFT_42594 details


Gene Information

Locus tag: PHATRDRAFT_42594
Symbol:
ID: 7196281
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011669
Strand:
Start bp: 529321
End bp: 530692
Gene Length: 1372 bp
Protein Length: 353 aa
Translation table:
GC content: 47%
IMG OID:
Product: predicted protein
Protein accession: XP_002176601
Protein GI: 219109694
COG category:
COG ID:
TIGRFAM ID:
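The length fields above can be cross-checked against the coordinates with simple arithmetic. The sketch below is illustrative only: the function names are hypothetical, the coordinates are assumed to be 1-based and inclusive, and the coding-length estimate assumes a single uninterrupted reading frame ending in one stop codon.

```python
# Values copied from the record above.
START_BP = 529321
END_BP = 530692
PROTEIN_LENGTH_AA = 353


def span_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a span, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def coding_length(protein_length_aa: int) -> int:
    """bp needed to encode the protein plus one stop codon (assumes one uninterrupted ORF)."""
    return (protein_length_aa + 1) * 3


print(span_length(START_BP, END_BP))     # 1372
print(coding_length(PROTEIN_LENGTH_AA))  # 1062
```

Under these assumptions the span length agrees with the listed gene length of 1372 bp, while the 353 aa protein accounts for 1062 bp, which suggests that the listed gene sequence also covers some non-coding flanking sequence.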


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GCATCGTTCG AAGATATGTG GTTTTCATAC AGCGACCGAA GCCTCGGGGT ACCAATCGAA 
ATCGAGACGC CTGGTACTAG ACCTAGAAAT CAAAATAGAA GCCATCCCAG TCATGCCAGC
CATGCCCCTC TACGCTTTCG GCAGATGGAT TGCTTACTGC CAGTGAAACA TACAGCAACT
CACTGTCAAA TAAGCGCCAA TTGACTTTGT TCAGTTGGCG AAATCAAATG TTTGGAAAAG
AACCTTCCCA TAGCGCTATT TGTAGTTTGT TCCTACCAAG CGTCAATCAA CTCGATGGCT
ACCATCGACA GGAGATGCGA TACCACCCTT TCTGTGAGGT TACGATCCAA CGACAAGAGT
CTTGACCGCC TCATTCTTCG TGAGCCTCGA TCAGTTCACT GCTCTCCTAG TTTTGAAGAA
TTCGTCGAGG CTATGAGCGG GAACATCACT GTTAAGGTAG TTTTTGTTGG AGAATTGGTG
GTGCAAACGC TAGAGGAAGA TAAATTATGC ACGCTCTTGG AGAAACTCGG TTGCCTGACG
GCGCTGGAAA ACCTAGAAAT AGCTCTGCCT CATCTCGGCG CGAAGAAGCG CATCACGGCA
TCATCCGTCG TTCGATTTCT GGGCCGGGCC AAAGCTTTAA AGACATTCGT CCTATGGCCT
TTTCTCCGAA TCGACTCAGG AGAGGATGCT AAGCTTATTG CCTGTGTCCT CAAAGATCAT
CTCAATCTCC AACATTTAGG CTTTATGCAT TTGGTAATGA GTGGAGGAAG TCGAGGCATT
GTGATTGATC CAATTCTTCG AAGCTTGGCA ACTGTTCCGA AGTTAGAAAC CTTGCAGATC
TCTGCGGCTT TTAGTATGCA AGAGGGCCGT CAAGCTGTCA GAGAGGAATC CAGCTTGACG
AACCTCCTCA TGCCTTCTTG TGCCTTGAAG TATTTTGCAC TGCGCAATTT AAACCTCGGT
GACTCACACT GTACTGCCAT TGCCAATATG TTATCTAAGT ATGGCAGCGT TTGCTCCTTG
AAACTGTTAG ACCTCCGCTT CAATTCGATA ACTGAAAAGG GATTCACAGC GCTACTTGCT
ACACTTCAAG AAAATTATAT TCTTGAATGG CTGGAAACGG ATTGCAGGAA CAAGAACTGG
CTGGAGAAGA TATCTTTTGC TCTCGCCTTG AACCGAGCAG GGCGCTTTGC ACTGCTTCGA
GATCCATCAA CTTCACGGCA AGAGATGATG GGTGTGTTTG AGAAGTGCTG TGACAACCTC
AATGTATCCT ACCACCTGCT TCGCCACAAT CCATCGATCT GTGACCGAGG AAAGGACAAA
GTAGAAAACG GTTTTACACC AGCGTCTCAA ACTTAAACTT ACAGTTAGCA GT
 
Protein sequence
MATIDRRCDT TLSVRLRSND KSLDRLILRE PRSVHCSPSF EEFVEAMSGN ITVKVVFVGE 
LVVQTLEEDK LCTLLEKLGC LTALENLEIA LPHLGAKKRI TASSVVRFLG RAKALKTFVL
WPFLRIDSGE DAKLIACVLK DHLNLQHLGF MHLVMSGGSR GIVIDPILRS LATVPKLETL
QISAAFSMQE GRQAVREESS LTNLLMPSCA LKYFALRNLN LGDSHCTAIA NMLSKYGSVC
SLKLLDLRFN SITEKGFTAL LATLQENYIL EWLETDCRNK NWLEKISFAL ALNRAGRFAL
LRDPSTSRQE MMGVFEKCCD NLNVSYHLLR HNPSICDRGK DKVENGFTPA SQT
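Both sequences are printed in space-separated blocks of ten characters, so they need to be joined before any length or composition check. The following is a minimal sketch, assuming the blocks are pasted in as a plain string; `clean_sequence` and `gc_percent` are hypothetical helper names, not part of any database tool.

```python
def clean_sequence(blocked: str) -> str:
    """Join a block-formatted sequence by removing all whitespace and upper-casing it."""
    return "".join(blocked.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence, copied from the record above.
first_line = "GCATCGTTCG AAGATATGTG GTTTTCATAC AGCGACCGAA GCCTCGGGGT ACCAATCGAA"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(round(gc_percent(seq), 1))
```

Applying the same two helpers to the full gene sequence would allow the listed gene length and GC content to be compared against the sequence itself.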