Gene PHATRDRAFT_47990 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_47990 
Symbol 
ID7203217 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011683 
Strand
Start bp635963 
End bp637193 
Gene Length1231 bp 
Protein Length391 aa 
Translation table 
GC content47% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002182266 
Protein GI219123926 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.077823 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
AACCAGATAT GCTCGCCAAC GTAACTTGCC AAATAAAACT AATCGAAAGC ACACAATGTC 
AGCGGCGATC AGGAATGAAG CTAACAGTAA GCCTCGAGAC CGACGCTTGC GATCCAACCT
TCTCATGAAG CTTGTGATTA TGTCCGGAGT CGGAGCACTC GGCGGTTTTT GGTTGTTCTA
TTCCGTGTGG GAAACACAGT GCTCGGATTT ACTCAATCAG GCACAGATTC GGCACAGTGC
AGTTATCGCC GAGCTTCAAG AACTACATAC TAAACAGACT AGTGCATTAC AGAATTGCGT
CGAAGGAGAT GCCACGAAGG AGAAAGCTTT GGCAGAACGA TTGAAAACCC AAAGCACCTT
AGTGATTAAG CACCATGATT TATTGCAACA GTACGATGAA GCAAAATCGC GAATAGCTCG
TTTAGAAGTG GAACTCGAGG GTGTGACGCA AACAAATCAA TCGTTGAATG AGCAACTCGA
AAGGCTACAA AATGAAGTTG CGGAGTCAGC ACGTTCGACA CAGGGTGCCG AGAAGCAAAT
CGTATTGCTG CGTACTCAAC TGGAGGCAAC CAGATCCGTG ATAAAGCAAT CGTGTGCTTT
CAACGAAACT ATCGACATGA GTTCGCAGCG ACACAAAGAA GAGGTTTTGC AAATATACGC
AGCCATTCAA AGACAGAGTT TTGCGCAGCT CTTTCAACGA TTCGGCGAAG GTCCTTACGA
AGTTGAATTT CTGCTTTCCA CCGAATCGAC TACGGAAACG GTTGCGCATG AAACTTTCCG
AGTAGAGTTG CTGCGATCAA AAGATATGCC TCACACAGTT CTAACATTTT TGAGCCTTGT
GGAGCTCCGT CTCTACGACG GAACAACGAT TGCAGGAACA GATGGGACAG TCATCAGTGG
TGGGATCCCC AAACAAGCTC AGACACGTGC GCAATCGTAT CTTATGAGAA TGTACGTGGA
GCATGGATTT GGTTTTTCTC CTCTCGTAAT TGAAGAAACA TCTCCAACAA TGCCTTGCAT
GGCGCATACT TTTGGTTTTA CTGAAAGAGG GCCTGGTTTT ATAATTCCAC TCGAGAGCAT
GTCAAAGAAC GAAAGCCCAT CTTGCCCCGG TCGTATTTCG AGCGGCCGTG ATGTATTGGA
GCGACTAGCA AGAGACCGAG AGAGTCAGCT TACAATTATT GAAGCTAAAC TTGTCAGTCG
AGATATCGGT TCGCACGACA CCGAGCTGTA G
 
Protein sequence
MSAAIRNEAN SKPRDRRLRS NLLMKLVIMS GVGALGGFWL FYSVWETQCS DLLNQAQIRH 
SAVIAELQEL HTKQTSALQN CVEGDATKEK ALAERLKTQS TLVIKHHDLL QQYDEAKSRI
ARLEVELEGV TQTNQSLNEQ LERLQNEVAE SARSTQGAEK QIVLLRTQLE ATRSVIKQSC
AFNETIDMSS QRHKEEVLQI YAAIQRQSFA QLFQRFGEGP YEVEFLLSTE STTETVAHET
FRVELLRSKD MPHTVLTFLS LVELRLYDGT TIAGTDGTVI SGGIPKQAQT RAQSYLMRMY
VEHGFGFSPL VIEETSPTMP CMAHTFGFTE RGPGFIIPLE SMSKNESPSC PGRISSGRDV
LERLARDRES QLTIIEAKLV SRDIGSHDTE L