Gene PHATR_33342 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATR_33342 
Symbol 
ID7204403 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011671 
Strand
Start bp652665 
End bp653852 
Gene Length1188 bp 
Protein Length371 aa 
Translation table 
GC content46% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002186390 
Protein GI219113613 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.0652338 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCCAAAA CTGCTTCCCC CATTTTGACA GCAAGTCTTC CTCGCAACAG CGCTTCCGAC 
GCATCGAAAC CGGTCACCAA CACCTTTTTT GCTCAAACCG CTCTCGATGA GAGCGGTGAT
AAAGGAGAGT TCAAACGTGT GGATGCTTCA TGGAGAAATT GGGTCAAGAA AGGTATGTTT
ACTTTCCTTC GTAACGAGAA GCTAACGAAA CTTGCTCACC AATAATTTCT TTTCCTGGTC
TTAGAACCTG ATGCACAGTT TCCAGCCGAA AAGGATAGAT ATCACTTATT CGTTGCGTAC
GCTTGTCCCT GGGCCCACCG TACATTGATG ACGCGAGCCG TCAAAGGTCT TGATGATACA
ATCGCCGCTA CCGTCGTCCA CCCAATTTGG CAGAAAACAA AGCCTGACCA AGACGAGCAT
TCGGGTTGGG TATTTGGAAA CGCCGAGGGA GAAATGCTGA CGAATACTGA AGGTAACGGT
GGTCCTTTCC CTTCAATCTT CCCACACAAC GAACCGGAAC CATTCTTTGG ATCTCAAAGT
ATTCGCGAGC TGTACGAAAA GGCTGGCGAT ACCGATGGGA AGTACAGCGT TCCAATTCTC
TGGGACAAGA AAAGGAACAC GATTGTCAGT AACGAATCCT CCGAGATCAT CCGAATGCTC
AACTCGGAGT TCAATGACTT TGCAAAAAAC CCTGATCTAG ACCTTTACCC TATTGAGATG
CATGTCGCCA TAGACAAAGT GAACAGTTGG GTCTATCCAA CGATTAACAA TGGAGTTTAC
CGGTGCGGCT TTGCCAAATC CCAGGAAGCA TATGATACAG CGATCACTGA GCTGACGGAA
TCGTTTGATC GGATAGCGGA TATTCTACAG AAGCAGCGCT TTATTGCAGG AAACAAATTC
TCAGAGGCTG ATATTCGTCT TTTTGTTACC CTTGTGCGGT TTGATGAGGT TTACACGGTC
TATTTCAAAA CAAACACGCG CTCTGTGGCC CACACTCCGT CTATTTTGAA CTACTGCCGT
GAAATTTACC AGATGCCGGG AGTGAAAGAC ACTGTGAACA TGGAACAGAT TAAGGCCCAC
TACTATTGCT CACACCCCAT TCTTAATCAT TTCTCAATTG TTCCTCGCGG GCCCGATTTT
GTGGATTTGT TGGAACAGCC CCACAATCGC AATAATAGTT TAAACTAG
 
Protein sequence
MSKTASPILT ASLPRNSASD ASKPVTNTFF AQTALDESGD KGEFKRVDAS WRNWVKKEPD 
AQFPAEKDRY HLFVAYACPW AHRTLMTRAV KGLDDTIAAT VVHPIWQKTK PDQDEHSGWV
FGNAEGEMLT NTEGNGGPFP SIFPHNEPEP FFGSQSIREL YEKAGDTDGK YSVPILWDKK
RNTIVSNESS EIIRMLNSEF NDFAKNPDLD LYPIEMHVAI DKVNSWVYPT INNGVYRCGF
AKSQEAYDTA ITELTESFDR IADILQKQRF IAGNKFSEAD IRLFVTLVRF DEVYTVYFKT
NTRSVAHTPS ILNYCREIYQ MPGVKDTVNM EQIKAHYYCS HPILNHFSIV PRGPDFVDLL
EQPHNRNNSL N