Gene PHATRDRAFT_47737 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_47737 
Symbol 
ID7202727 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011682 
Strand
Start bp687269 
End bp688668 
Gene Length1400 bp 
Protein Length361 aa 
Translation table 
GC content59% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002181956 
Protein GI219123281 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTTGTTGGGG GAGCATGGCG TCCGAAGGAG ATATTGACAC GGCTGCATCA GCTTCTGGTA 
CTACCGAGGA TCCCGACTCT GGCGAAGTGA TTGGTTCTGC CGCTCTGCGT CCCGTCTTCT
TGGGGAATTT GGTCCCGAAT TACAGTACGG ATGAAGTGAC GACGCTCTTC GAACGGCCCA
TGCTGCCAGC CGCTGCTGCG GAAGGCGCGT ATCGCCCCAT TCCCGTGGAT CGGATCGATC
TCAAGCGGGG GTACTGCTTT GTCTTTCTCA AGGATGCTGC TACGCAAGCG GACAAGGAAC
AGGCCGAGCG ATTTGTGTCC GACATCAACG GCATGTGAGT CTGTCTACCC GTTTTTTCGC
AACGTACTTT CTACGTATTC TGACTACCGG AAGTTGCCTT GTGCGAATCT TTCTCTCGCT
CCCGCACCTA TATCATCGTG TCTTCCCTCT CTACCCTCTC CTCCGTTCGG GCACACTCTA
CTACGACACG TGGGAGAAGC GCGCCGTCTC CGGGCGCCGT CCGTCGACGC AAACTCGTCC
CACCCAACCA ATCCGTGACT GACCTCTACG TTTTCCCTTT TTTCTTTCTT TCTTTCTGTT
CTACCAATCG TCGCAGGCAA ATCGCCAACG TCTCCAACTC TCTCCGTGCC GAGTTTGCTC
GTGGCGATGG TCGGGTCAAA CGCAAGGAAG ACGAACGGCG CAAGAACATT GCGCCCTCCG
AAACTCTCTT TGTCGTTAAC TTTCACGAGG AAACCACCAA AAAGGAAGAC TTGCAAATGC
TGTTCGAGCC GTTCGGGGAA CTCGTGCGCA TTGATTTGAA ACGCAATTAC GCCTTTGTGC
AATTCAAAAC TATTGCCGAA GCGACCAAGG CCAAGGAAAC GACCAACGGA GGCAAGTTGG
ATCAGTCCGT GTTGACGGTA GAGTACGTGG CTCGCGAACG CAACATGAAC GGTGGGGGCG
GCGGCGGTAT GGATCGCCGC GATCGGGATC GTCGGGGACG CGACTACCGT GATCGCTACG
ATGACCGTCG GGGTCCTCCC AACCGGGGCA TGCCCCCGCC TCCCTACATG GACGATCGAC
GCGGCGGTTA CGACCGTCGC GGAGATCGAT ACGATCGTCC CGGATACGAT CGGATCGACG
ACCGATACCG ACGGGAGCGT AGTCCCCCCG GCTATCGGGG ACGCCCGCGG TCCCGCTCGC
GCAGTCCACC CCGACACTAC CGTTCGCGCA GTCCACCGCT GCGCTACGAC GAGCGCTACG
ACGATCACCG CCGCCGTCCC CCGAGCCCGC CCCCCGCGGC GGATTACCGA GACCGCCGTG
GTGGGGCTCC GAGTCCGGAT CGGGATTATC GTGGGGACCG TGACCGTGGC TACCGCTCGT
AAACAAGTCG GACCAGTGCC
 
Protein sequence
MASEGDIDTA ASASGTTEDP DSGEVIGSAA LRPVFLGNLV PNYSTDEVTT LFERPMLPAA 
AAEGAYRPIP VDRIDLKRGY CFVFLKDAAT QADKEQAERF VSDINGMQIA NVSNSLRAEF
ARGDGRVKRK EDERRKNIAP SETLFVVNFH EETTKKEDLQ MLFEPFGELV RIDLKRNYAF
VQFKTIAEAT KAKETTNGGK LDQSVLTVEY VARERNMNGG GGGGMDRRDR DRRGRDYRDR
YDDRRGPPNR GMPPPPYMDD RRGGYDRRGD RYDRPGYDRI DDRYRRERSP PGYRGRPRSR
SRSPPRHYRS RSPPLRYDER YDDHRRRPPS PPPAADYRDR RGGAPSPDRD YRGDRDRGYR
S