Gene PHATRDRAFT_33054 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_33054 
Symbol 
ID7197278 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011670 
Strand
Start bp1414613 
End bp1415782 
Gene Length1170 bp 
Protein Length363 aa 
Translation table 
GC content51% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002177825 
Protein GI219112147 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.608605 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATGGACT ACCGACCTAG TTTGACCCCC GAGGAACAGC GTAAGTTAGA GTTGCTACGA 
CATGCCAGCT ACAACACTGC TCCCCGGGAA AGTACTCGTA TAGAGCGTCT GCCCCCTCCC
CAGCAAGCGA TGATATCGCC GTCGGAACTG AGTCGCCTAA GCCTAGCAAA CATCGAAGCT
CGGATGGCTT CTCAAGGGGC GCTGGCTGCA CGTGTTCAAG CTGCTCAGCA AGCTCATGCC
CAGGCTCAAG TTCAGAATAT GCAGATGATG GGGCGTCCAC AGCTCTCTCC CGCCCTGCTG
TCCAAACATC GACTGGCCTT TCACCCGTCG TCGCCCATAA AGCAAGGCCA GAGGCCGTAT
TCACCGCAAA AGCAAAGTGT CGGTGGCTAT CGTTCTCCAA TGAGACAACC CACCTACTAT
AGATCGCAGA ATGAATTCTC TTCCTTCCCT TCTCCGGCTT CATCATTTCT TTCACCAGTT
GAACACGCGC ACGCATATCC CTCTCCTGCT AGTTTCCCCT CGTCAACTAC TAGCTTTCCT
TCTTCGGCGG GGAGCTTCCC ATCGTCTACG GGAAGAATCA GAAGCCCGGC AGAAGGACTG
AACATGCTTC GAGCAGTAAG CTTGGGAATG CGAGGAGATC GTTCAGAAGG CCTTCCTCCC
ATGTTAGCCC CTCCACTGAC CTCTCATATG GCACCGCCCC AATCACACCC ACCGCCGCGA
CACAATAGCA ATGCGTCAAT GGATGACATG TCAACTTCTA CGATACACCC TTCAGAGGGA
AAGTTGTACA TTGACGAACT GCAACCTTAT GACGTTCTTT GTGGACGAGG TGGAAAGTCG
AACCATCACC CCGGAAAGTA AGTCAGTGTT GCAAATCCAT AGGTGGTTGG ACATGCATCT
CATTTTGTCT AACTCTCATC GCTTTGTGGC TTCAGCAAAC GGTACCGACA CGTCGTCAGC
GAAATGAAAA TGATGTATCG CAAAACAGAA GCAAAAGCGA TCAAAACCGA TCTTAGTCGT
GCTATTGTGG AACATGTATG CAACTATGGA GGACGGTTTA TAAAAAGAGA AGAAAACTCG
GGTCGATACT ATGTACTCAC CAAATCTGAA GCCAGGAAAA AGACCAGCCA AGCTCTGCGG
GAAACCAAGG AATTAAAGTG GACCGCGTAG
 
Protein sequence
MMDYRPSLTP EEQRKLELLR HASYNTAPRE STRIERLPPP QQAMISPSEL SRLSLANIEA 
RMASQGALAA RVQAAQQAHA QAQVQNMQMM GRPQLSPALL SKHRLAFHPS SPIKQGQRPY
SPQKQSVGGY RSPMRQPTYY RSQNEFSSFP SPASSFLSPV EHAHAYPSPA SFPSSTTSFP
SSAGSFPSST GRIRSPAEGL NMLRAVSLGM RGDRSEGLPP MLAPPLTSHM APPQSHPPPR
HNSNASMDDM STSTIHPSEG KLYIDELQPY DVLCGRGGKS NHHPGNKRYR HVVSEMKMMY
RKTEAKAIKT DLSRAIVEHV CNYGGRFIKR EENSGRYYVL TKSEARKKTS QALRETKELK
WTA