Gene PHATRDRAFT_35099 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_35099 
Symbol 
ID7200524 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011675 
Strand
Start bp55917 
End bp57155 
Gene Length1239 bp 
Protein Length412 aa 
Translation table 
GC content60% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002179783 
Protein GI219117998 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTACTCT TCCAACACCG GTCCCAACCT TCACGACCCA CCCTGGCCGT GTGTGTTCGT 
GACCCTTCCC TCACTTCCGT TACCAGCCAG GATTCCGAGA TTGACAGTAC CACGCCGGCC
CACGGTACGC GTCGCCGCCA CGCACTCGTC CGCTGGCGGA GCCGTCGCGA GATCTCCACG
GTCGCCTCCA CTGCTCCCCA CCACCCCAAT CTACCCCGAC CACACGGTCC CACACTCCGT
CCTTTGGAAC GTTCACGGTC ACTCGCGAGC ATCATCAAAC CCGCTTCCGT CCGTTCCCTC
GTACAACGTC CGTCTCCGGA TCGTCACGCC GTATACCGGG AACTCGGGAT GGGTCCCTTA
CGGTTTGCTC CCGCTCCCGA CGTACCGTCC ACGTCACTGC ATGATCTCAC CGTCCCGCAA
CGTCTCGTCG TGCGGATCTT GTGGGAACAG TGGCGGGACC GGAATCCTGC CGACGTATCG
GCCGAATGGG AACACTGGTG TTTGTGCTTT GCCCGGTGCA GTCCGGGAGC AGCCAATTTT
GATTCCCGGA ACGCCTGGAA AGTCATGAAG CATTTCGATA AGCGTTACGT CAATCTCAAA
GCCGTCACGC TGGAATCGCG GTTGGCCGCC AAGACCGTCG TTCCCGTTCC GGGCCTACGC
ACACACCAGG GTTTGGACGT TGTCTACGTA CGCCCGTCAC GCTTCCATCC CAAAACGGAC
AACGTCCCCG CGATTCTTGA TCCTCTCGTG TACGTGCTCA TGAACATGAC CGTCACCCAC
GAATCCGCGT CGACCAACGG CCTTTGCATA GTCCTCAACA TGGAACAGTG GACCATGCGG
CATTATACGA CCGATTTTCT CCGACGCTTT TGGGCCGTTT TTCAGGGCTT CAGGGCTCCC
GTCCGGGTTC GTCAAGTCCT GATCGTCGAC CCACCATCCT GGTTTGCGAC TATTGGGAGA
CTCATGACGT CGTCCATGAT GACTGACGAC TTTGCCGCAC GCGTACACCG GACGCCGTCC
GCCGCGCTCG GCCAATACCT GGCTGACGGC TATACGCAAC ACTTGCCGGA CGATATGGTG
GGCGGCAGCG TTCCCACCGC CGACCTGGTA CGGGACTATC TCGCGTTCCG CAAATACGTC
GAAGCCGTTG AAGAAGTCCC ACCGTCGACG CGGCCTCCGC TGACCCGTGG TTTCCAGTCG
GAACGCCGCG TTCGCTTCGA ATTTCCTACT TTCAAATAG
 
Protein sequence
MVLFQHRSQP SRPTLAVCVR DPSLTSVTSQ DSEIDSTTPA HGTRRRHALV RWRSRREIST 
VASTAPHHPN LPRPHGPTLR PLERSRSLAS IIKPASVRSL VQRPSPDRHA VYRELGMGPL
RFAPAPDVPS TSLHDLTVPQ RLVVRILWEQ WRDRNPADVS AEWEHWCLCF ARCSPGAANF
DSRNAWKVMK HFDKRYVNLK AVTLESRLAA KTVVPVPGLR THQGLDVVYV RPSRFHPKTD
NVPAILDPLV YVLMNMTVTH ESASTNGLCI VLNMEQWTMR HYTTDFLRRF WAVFQGFRAP
VRVRQVLIVD PPSWFATIGR LMTSSMMTDD FAARVHRTPS AALGQYLADG YTQHLPDDMV
GGSVPTADLV RDYLAFRKYV EAVEEVPPST RPPLTRGFQS ERRVRFEFPT FK