Gene PHATRDRAFT_45339 details

Gene Information

Locus tag: PHATRDRAFT_45339
Symbol:
ID: 7199975
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011674
Strand:
Start bp: 884613
End bp: 886247
Gene Length: 1635 bp
Protein Length: 334 aa
Translation table:
GC content: 51%
IMG OID:
Product: predicted protein
Protein accession: XP_002179530
Protein GI: 219117471
COG category:
COG ID:
TIGRFAM ID:
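
The length fields above can be cross-checked against the coordinates. The following is a minimal Python sketch of that arithmetic, assuming the start and end positions are 1-based and inclusive and that the translated region consists of one codon per residue plus a single stop codon. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 884613
end_bp = 886247
protein_length_aa = 334

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: three bases per residue, plus one stop codon.
coding_bp = (protein_length_aa + 1) * 3

# Bases inside the gene span that are not translated. For a spliced gene
# these are introns and any untranslated flanking sequence.
untranslated_bp = gene_length_bp - coding_bp

print(gene_length_bp, coding_bp, untranslated_bp)  # 1635 1005 630
```

Under these assumptions the computed span matches the listed Gene Length of 1635 bp, and the difference between the span and the translated bases is consistent with the gene being marked as spliced.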


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATGTTTT GGAAGAAGGT ACGTCCCGAT TTTTCATCTC TGTTGTATAT CGCTAGTCGC 
GTGATTCAGA AAGCATCATA CGCGTCCTAA GCAAAGCGGT CCATATGTTG TCGTGCCTCT
GGTGAACTTC CTTTCTTCGA CGCTTCATAT TGATTATATC AACATCTACG GTTGATATTT
CCCAATGACG CTACATACGA TTCTGCTTTC TACATTTTCT CTCACCTCCG GGGTTGTAAG
ATTCTTCTAC TGGTAAGCCT ATGCGCTTTC TCTAGAGTAG CGGAGGGCTT CGTTATCCGC
TCGAAGAGCT TCTCTTCAAC CAGACATACT GCTGCTACTT ACCGCAAGAT CGATACGTGT
CGTTTTTCCT CAAATCCCTC CGATCCCCAG CATGAAAGTT GTACCAGCTT GGTCTCAATC
GAGCAACAAC CTGCTACCCC TTCCCGTCGG AAGTTTTTGC AGAAAGTGAC TGCAACCACC
TTGGTCGTTA CCGCGGCCTC GTCCGGTCTC GTGCCCGTTG ATTCCACCGC TTGGGCTGCC
TCCGGCACCG ATGATACTGC GATTCTCAAT CTGCCATCTC TGAATCTTAT TCCTCAGTTC
TCCACCGCAG ATGACGTTCC CAGCGACTAC TTTTCCGACA ATCGCTACAT ATATGGTTTC
GTGGAACGTA TAATTGACGG AGATACTATC CGAGTCCGGC ATGTACCGGG CTACGGACTC
CGCCGCCAAT CCACACAACC GCTCCAACAG CGGGGCATCG CCAAAGACAC ACTCAGCATT
CGCGTGTACG GCATCGATAC TCCCGAAATT GGCAAGAATA AACGACAAGT TTCGCAACCT
TTTTCCGAAG AAGCCAAATC TTTTACCTCC AAACTCGTTT ACAACAAAAT GGTCAAAGTA
ACCTTTTTGC GGAAGGACCA GTACAGCCGG GCGGTAGCGT CGGTGGAAAC GGTACCACCA
CGATTCCTTT CTTGGATTCC CGGATTCGGG CCGAAAGATT TGTCGCTGGA ATTGGCCAAG
GCTGGGTTGG CGGAGCTTTA TACTGGCGGT GGTGCCCAGT ACAATGTACG TCTATTGCGA
TCCTTTTTGT TCGGCAGACG ACTTGGCTGT GGATTGGCAT GAAACACTGC AAAAGAGCTA
TCGGACCCTT ATCTGTGTTG CTCCTTATAT TCCTACAGGG CAAGCGCGCG GAACTGGAGC
AGGCTGTTGC GCAAGCGCAG CGTAAAAAGC TTGGTCAATG GTCTTTATCG GAATCGGAGC
GGGTCAGCGC AGCCGAACAA AAGCGTCTCC TGAAGCAAGC AGCAGTGACC GGCACTGCTC
CGGTACCAGT GTCCCGAAAC GACCGATCGT CGGGCGCGAT GCCTCTCGCG TCGACGAGTC
GGAACGGGCA GACGGGTGTC GGCGAATCAC TGCTCGACGC GGCTGTCACT GGTCTAGAGT
TTATGTAGAA ACACGTAGAC GACGGCCGTC TACGGCACAC CAGCTATACC GTCTTTTCCT
CCTTCTCATC TGCAATGAGA TGCTTGGAAG AATGCTTTGC ATTTCTACAT AGCTTCTAGA
TCTAGAAACA GTTCCAATGC AGGCTTTGAC AAATATAAAC GCTGAGTAGA TTCTAAGTGC
AAGAGCTCTT TTGGT
 
Protein sequence
MMFWKKIDTC RFSSNPSDPQ HESCTSLVSI EQQPATPSRR KFLQKVTATT LVVTAASSGL 
VPVDSTAWAA SGTDDTAILN LPSLNLIPQF STADDVPSDY FSDNRYIYGF VERIIDGDTI
RVRHVPGYGL RRQSTQPLQQ RGIAKDTLSI RVYGIDTPEI GKNKRQVSQP FSEEAKSFTS
KLVYNKMVKV TFLRKDQYSR AVASVETVPP RFLSWIPGFG PKDLSLELAK AGLAELYTGG
GAQYNGKRAE LEQAVAQAQR KKLGQWSLSE SERVSAAEQK RLLKQAAVTG TAPVPVSRND
RSSGAMPLAS TSRNGQTGVG ESLLDAAVTG LEFM
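
The sequences above are printed in space-separated blocks of ten, so they need cleaning before any length or composition check. Below is a small Python sketch of one way to do that. The helper names `clean_sequence` and `gc_percent` are hypothetical, and it is an assumption that the record's GC content figure is computed over the full gene span.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(dna: str) -> float:
    """Return the percentage of G and C bases in a DNA sequence."""
    seq = clean_sequence(dna)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence, used here as a short example input.
first_line = "ATGATGTTTT GGAAGAAGGT ACGTCCCGAT TTTTCATCTC TGTTGTATAT CGCTAGTCGC"
print(len(clean_sequence(first_line)))  # 60

# Toy input with equal numbers of G/C and A/T bases.
print(gc_percent("GGCC AATT"))  # 50.0
```

Applied to the full gene and protein sequences, the cleaned lengths can be compared with the Gene Length and Protein Length fields, and the GC percentage with the GC content field.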