Gene PHATRDRAFT_42439 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_42439 
Symbol 
ID7196624 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011669 
Strand
Start bp54762 
End bp56537 
Gene Length1776 bp 
Protein Length570 aa 
Translation table 
GC content48% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002177008 
Protein GI219110513 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00659721 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TGTGGTCGAA AGGCACTGTC AGTAAATAAC TTGATAAGCT GATCCAGAGA GCCCCCACAA 
ACGATGGCAG CCAACTCCTT GACTCCTTGG ACTGATCTCA TTTTGTCCAA TTCAGAGAAC
TTTTCATGCC AACGTCAGTT TCCACGAAGC GCCCCTGACC ACGAAAATGT TGACAATGAG
CAATCTACGG AGCCTTGGCC AAATCCAGCA ACGGATCATA TCGCCGAGGG ACATGGCGAC
GATAAGGTGA ATTCGCAACC CTTTCATGAG GACTCAATGT CGAGAGACTT GATGGCGGAG
CCAATCACTA CCGGCGCAGG GTTCAAGAAA GCCTTTTGGA AGTCATCTCG CTTCAATACC
TCGTCAAACA GCGTTTCCGA TCAAAATAAA GGCGTTGCCA ACACCGATTC CCTTGAAGAA
AAGACTCAAG ATTCTTTTCC GCAAAACGAT TTTGACGCCT CCTTCTCTGA TCTGGCTGAC
ACACATGACT ACATTTACCC TTTGGAGGAT TCATTGGCTG AACAGCCAAA CACCGGTTTC
GCTTTTCGTG CGGAAAAATC AAAACAGAAA CTCGTCGATG GAGAAGAGCA TGGCTGGACT
ATTGCTGGTG CACAGAGTTC CTCGGCAACT ACTCGGTCCG CCGCAGAGTG GGTCGAAAGT
ACTTCACACA CTAGCGACTT TGAATCTCAT ATACTCACTG AAGAAGTGCA TCAAGCCCAA
CAAAAAAGAA CGAATCGTAC TTGGTTCACT CACGCAATTG CACTTGTCTT TACGACTATG
ATGCTGGTTG AGTTGGGATT GAGCGGCTGG AGGTTTGCTC CGTTAAACAT CAACCCACTG
ATTGGACCAT CGTCCGAGCA GCTCATAGAT CTTGGTGCAC GTCAAACGTC GTTGATTTTG
GAGGAGGGGC AGTGGTTCCG TATTGTTACA CCTATTTTTC TGCACGCTGG AATAGTGCAC
TACCTAACCA ACATGCTGGC ATTCTGGTTT ATTGGTGGTG CCATCGAGGA AGCGCACGGA
ATTGCTACCG CCATAGTGCT GTTCTTCATT CCGGGCGTTG GCGGGAACAT TTTGGGGGCC
ACGTTTTTGC CTCAGTACAT TAGTGTGGGA GCTTCCGGTG GAACATTCGG AATGATTGGG
GGCTACTTTG CGGACATCGT GTTGAACTGG AATATTTTGT GCTCTAGGGA CCATGACGAA
GACGTATTGA ACTGGCGCAA AAACATTGCC GCAATAGCCC GTTTGGCTAT CGGAATCATT
GCTCTCTTGG TGCTGGGTGT TACCCCGTTT ATTGACAATT TTACCCATTT GGGTGCACTG
TGCTACGGTC TACTGTGTGG CTTGTTTGCC ATCGAGCCAG TTCCCTTGGA GGGATCCATT
GTTAGACTTC CATCTCGAAA AATGAGCGAC TTGCTATTTC GCCAGATTGG TGCTATAGTG
AGCGTCTTTT TGCTGGTGAT TACGTCGGTA GTCTTGAATT CTATGAATGT TGACGACAGT
CCTTGTCATG GCTGTCAATA CCTTTCGTGT GTACCATTTC CGTGGTGGAC GGAGGCAGAC
GAGCGTTGGT GGAGCTGCGA CGACTGTCCC TTCGTCACAG CGAAAGTACA CAATACCAAC
GGGGACTTAT TCTACGATCG TATGGATTTG GTATGTCCCA ACGATGTGGT TGAACAAATT
GATATTACGG GGAAGAATCT AACCAGCGGA GACGAGATCA GCAAGCTATT GCCTTCCTAC
TGCCGGGCTC GTTGTGAAGA CAAGTTCCAG TATTGA
 
Protein sequence
MAANSLTPWT DLILSNSENF SCQRQFPRSA PDHENVDNEQ STEPWPNPAT DHIAEGHGDD 
KVNSQPFHED SMSRDLMAEP ITTGAGFKKA FWKSSRFNTS SNSVSDQNKG VANTDSLEEK
TQDSFPQNDF DASFSDLADT HDYIYPLEDS LAEQPNTGFA FRAEKSKQKL VDGEEHGWTI
AGAQSSSATT RSAAEWVEST SHTSDFESHI LTEEVHQAQQ KRTNRTWFTH AIALVFTTMM
LVELGLSGWR FAPLNINPLI GPSSEQLIDL GARQTSLILE EGQWFRIVTP IFLHAGIVHY
LTNMLAFWFI GGAIEEAHGI ATAIVLFFIP GVGGNILGAT FLPQYISVGA SGGTFGMIGG
YFADIVLNWN ILCSRDHDED VLNWRKNIAA IARLAIGIIA LLVLGVTPFI DNFTHLGALC
YGLLCGLFAI EPVPLEGSIV RLPSRKMSDL LFRQIGAIVS VFLLVITSVV LNSMNVDDSP
CHGCQYLSCV PFPWWTEADE RWWSCDDCPF VTAKVHNTNG DLFYDRMDLV CPNDVVEQID
ITGKNLTSGD EISKLLPSYC RARCEDKFQY