Gene PHATRDRAFT_46145 details


Gene Information

Locus tag: PHATRDRAFT_46145
Symbol:
ID: 7201363
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011677
Strand:
Start bp: 401468
End bp: 402649
Gene Length: 1182 bp
Protein Length: 393 aa
Translation table:
GC content: 48%
IMG OID:
Product: predicted protein
Protein accession: XP_002180429
Protein GI: 219119333
COG category:
COG ID:
TIGRFAM ID:
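
The coordinates and lengths in this record can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (which matches the reported 1182 bp) and that the unspliced CDS ends in a stop codon that is not counted in the protein length. The function names are illustrative only and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


length = gene_length_bp(401468, 402649)
print(length, protein_length_aa(length))  # prints "1182 393"
```

Both values agree with the Gene Length and Protein Length fields above.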


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.0356931
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCCTTGT TGAATACTAT TTTCGGCAAA AACGCCGAGA AGAAATCGGA ACAAGTGTCG 
GTATTTGAAA CGAAAGTATC CGTTCCAGAA CCGAAGCCGT TGCCTCCATC AAAAATACCA
AAAGAAAAAC CGCCTTTGAC GCCAACTGCT ACTGGCGATG GGAGTACGAA GAAACGAAAG
CGCAAGGAAA CGAAAGTTGA GTTTACTTTG AAAGAGCCCC TCGAGGGTCC TACGGTAGAA
AAGCCCTCAG AAGAATCCAA ACAAGCGGAA GAACGAACGG TTTTCGTCGG TAACCTACCG
ACCCAATACA ATCGCAAAAG CCTAGCCAAA CTCTTTAAAG ACTGCGGCAA AGTAGAGAGT
TCACGCATCC GGTCGCTTGC CGTGACGGGA GTGAAGCTAC CACAAGAAAA TGCAGGCAAT
CAAAAGTTGG TTAAGAAGGT TTGCGCCAAC ACCTCCCAAG TCGACACTAA GGCAAAATCT
TCCGTTCAGG GGTACGTTGT CTTCGTGAAT AAAGATGCGA TTGAAAAAGC CTTGGTGCTG
AACAATACAG AAGTGAAGGA CGAAAGGACC GGTACAACAC GTCGGATTCG CGTCGACCAC
GCAAATGCTG AATACGACGC TGCACGTTCC ATTTTTGTGG GAAATCTCCC GTACACTGCT
GACGAAGACT CTTTGGCAGA ACATTTCTGC GAAGGCTGTG GTTTGAATGT AGACGACATT
CAGGGAGTTC GAATTGTACG TGACAAGGAG ACCTTTCAAT GCAAAGGCTT CGGTTACGTG
TTGTTTAGTG ATCAAAGCAT GGTAACATTG GCCTTGCAGC GTATGTCGGG AAGTTTATAT
GCAAAACGTG AACTTCGAGT GATGGTTTGT GGACGGCGCT TCAAAGGTAA GAAGGGAGAT
GCAATGCCGA AGGAAAACAA AAAGCGTAGC TTTGAAGGAC GACGAGCTTC GGCACCAGTA
TCACCGGCTG CATCCGTAGG CGCCTTGCGA CGCATAATCA AAAAGCAAGT TTCCGAGGCC
CCGACCAAGA AGCGCAGAGC TCGTGGGGAA AAGACCAGTG AAAAACCGAC GGCGCGCAAA
GCGGGAGTCA GTCGAAGAGC CGCTGTAGAA GCGAAGGTCG AGAAGCGTGT CAAGAAGTTA
CAGAAACGTG CTGCCAAAGG AATGGGAAAG AAGAAGATGT AG
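
The gene sequence is printed in blocks of ten bases separated by spaces and line breaks, so it needs to be joined before its length or GC content can be checked against the fields in this record. A minimal sketch of that step follows. The helper names are hypothetical, and the short `example` string is only the first three blocks of the sequence above, not the full gene.

```python
def clean_sequence(blocked: str) -> str:
    """Join a sequence printed in whitespace-separated blocks into one string."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First three blocks of the gene sequence, used here only as sample input.
example = "ATGTCCTTGT TGAATACTAT\nTTTCGGCAAA"
seq = clean_sequence(example)
print(len(seq), round(100 * gc_fraction(seq)))
```

Running the same two functions on the full sequence text gives values that can be compared with the reported gene length and GC content.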
 
Protein sequence
MSLLNTIFGK NAEKKSEQVS VFETKVSVPE PKPLPPSKIP KEKPPLTPTA TGDGSTKKRK 
RKETKVEFTL KEPLEGPTVE KPSEESKQAE ERTVFVGNLP TQYNRKSLAK LFKDCGKVES
SRIRSLAVTG VKLPQENAGN QKLVKKVCAN TSQVDTKAKS SVQGYVVFVN KDAIEKALVL
NNTEVKDERT GTTRRIRVDH ANAEYDAARS IFVGNLPYTA DEDSLAEHFC EGCGLNVDDI
QGVRIVRDKE TFQCKGFGYV LFSDQSMVTL ALQRMSGSLY AKRELRVMVC GRRFKGKKGD
AMPKENKKRS FEGRRASAPV SPAASVGALR RIIKKQVSEA PTKKRRARGE KTSEKPTARK
AGVSRRAAVE AKVEKRVKKL QKRAAKGMGK KKM