Gene PHATRDRAFT_34943 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_34943 
Symbol 
ID7200146 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011674 
Strand
Start bp718507 
End bp722127 
Gene Length3621 bp 
Protein Length1005 aa 
Translation table 
GC content59% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002179277 
Protein GI219116965 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00018875 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCCCTTG AACAGCTCCC ACTTGGTAAG GAAAGTGTAA GTGTGAGCCA ATCCGATGGA 
CGCAGCGGTT ACATGTCCCA TATGGAGTTT GTGTCGGCCT TATTGCGCAA ATCTACCGTA
TTTGTTGATC AAACTCCTGG TAAGGACAAC GTCTCACCAG AAGTTCAATC GCCTACTGCT
AAAAGACAAG TCGCTTCCGT CCCACACACA ACGAATCACA GCCCCACAGT CCCTACGAAT
ATCCAAGAGT ACAATATAGA CTCATAATCT TCGTAGTTCA TCGTTTTCCT CAACTGTGAT
CTCCATCGTC GTTTTGTGTT CTTCAATAGA ACTACGAAGC AACCCCACCG TCTTTACGCC
CCTAACCCTG CCCTCCCACC CATCCGCAAC CTCTTCGGTC CCGATGTCGA CCTCGGCTCA
TTTCAAACTG AGCGACTTTC CTCACAAAGT CCTCGACCCG ATCGCCACCC TCACCGTCCC
ACCGACCTAC GCAACCATTA AGCGTGCCCA ACGCCAGCTC ATGACTAACG CCGCCGCCAT
TCCCACACTC AACGGAGGTG GCGCCCACGG CCATATGGCC TTGACCCTGA CCGCCCTTGC
CTACGCCGAC ATCAGCGACG TCCCGTTTGT CATTCCCGTC GCCCCTCCGG CCAATCCGCC
TCCCGGCGCC ACGCAACCGC AAATCACCGA AAACAACCGC ATTCATCAAC ACGACGCTGA
CATCTACAAC CTTTATGTTG CCGTCAACAA CGCGCTTCGC CAGCAACTTC TCGACGCAGT
CCCCCGCATT TATGTCCGCG CCCTCGCCCA TCCCATGTTC GAGTTTAGCA ACGTCACTTG
CCTTGACTTG CTCTCGCCCC TCTGGACCAA ATACGGTACC ATCAAGCCCG CCGAGCTCCA
GAAAAATTTC CAGTCCATGT ACACCCCTTG GAACACAACC GAGCCGATTG AATCAGTTTT
TCTTCAGCTC GACGAGGCCA TCGCTTTCTC TGTTGATGGT AACAACCCCA TCTCGGAAGC
TGCTGCTGTT CGCGCCGGCT ACGAAGTCAT TGCGCACTCG GGCCTGCTCC CCCTGGACTG
CAAAGAATGG CGCAAATTGC CTACTGCTGC TCACACCCTT GCCCATTTCC AGCAGCACTT
TTCCCTTGCC GACGAAGACC GGCGCCTCAC GGCCACCACC GGTTCCCTCG GCTATGCCAA
CGTGCTTGCT GCTGCCCCCT CTCTCGCTCC TGCCACGACC TCCGACACTC TCAGCCTTCC
TTTCTCCGCG CTCTCTGTGT CCCAGACTTC TGTCTCTTCG CCGGACATGA CCTATTGCTG
GACCCACGGT ACCAGCAAAA ACCGGTGCCA TACAAGCGCC ACGTGCAAGA ACAAGGCCCC
TGGCCATCGC GACGACGCGA CCGCCACCAA CACTCTCGGC GGCTCCACCA AGGTTTGGAC
CGCTCCCAAG CCCCCTGAAT AGGAAAGAGG GACGGCTACG CCAATGGTTA ACTCTAGTAA
TACCGATTAT TTAAATCATA TTACTAGTCT TAATTCATCT GTAGTCCCCT CCCCGCCTAG
TCCCCATACC TCGGCCATTG CCGACACCGG TTGCACCGGC CATTACATCA CCGTCAACTG
CCCCCACACC CACAAACGTC CGGCAAGCCC CAGCCTTGCC GTACGTGTCC CTAACGGCGC
CGTCCTCCGC TCAAGCCACA TTGCCACCCT GGCCCTCCCT GGCTTCTCCC CTTCTGCTTG
CCAGGCCCAC ATCTTCCCCG GGCTCACCTC ACACCCACTC ATTTTGATTG GACAACTTTG
TGACGACGGC TGCACCGCCA CTTTCTCAGC CACACGCCTC GAGATCCACC GCGACACTAC
ACTACTCCTC TCCGGCACTC GTGCACCCAC CACCGGCCTC TGGCACCTTG ACCTTACCCC
TGCCAAGCCT CCTGCCACAG CCCACGCTCT TGTTCCCAAC ACTCCCCTTG CTGACCGCAT
CGCTTTTGTT CATGCCTCGC TCTTCTCCCC GGCTATCTCC ACATGGTGCC AGACCCTCGA
CTCCGGCCAT CTTGCAACCT TTCCCGAACT TTCCTCCCGC CAGGTCCGCA AGCATCCACC
TCATTCCCCC GCCATGGTCA AGGGCCACCT CGACCAACAA TGCGCAAACC TTCGCTCCAC
CAAGCTTCCC CCTGTAGGTT CCCCCATCAC GACGGAACCC CTTGCCGCCG CTGTGCCCGA
CCTTGACCCT CCCGACGCCC ACGACGTCAC ATGCACACAC CATGTCTTTG TTGCCCACCA
ATGGGTTACC GGTCAGATCT ACACGGACCA ACCGGGCCGC TTCCTCACTC CCTCCAGTGC
CGGCCACAAC GATATGCTTG TTCTTTATGA TTATGACAGC AATGCTATCC ACGTCGAACT
CATGAAGAAC AAGTCCGGCC CCGAGATTCT AGCCGTCTAT AAGCGCGCTC ATGCTCTTTT
CACCCAGCGA GGCCTTCATC CCCAACTCCA GCGTCTTGAC AACGAAGCCT CTGCAGCCCT
CCAGTCCTTC ATGTCCTCCG AGCACGTGGA CTTTCAGCTG GCACCCCCCC CATCTACACC
GCCGTAATGC AGCCGAACGG GCCATCCGCA CCTTCAAGAA CCACTTTATT GCTGGTCTCT
GCACCACTAA CCCGGATTTT CCCTTGCATC TTTGGGACCG CCTCCTCCCA CAGGCCCTCA
TTACCCTCAA TCTTCTTCGT CGCTCCCGCA TCAATCCCAA GTTGTCCGCC CACGCACAAC
TTCACGGTGC CTTTGACTAC AACCGCACCC CGCTCGCTCC TCCTGGCACT CGCGTCTTAG
TTCATGTCAA GCCCGCTGTT CGCGAAACCT GGGCCCCCCA TGCTGTTGAA GGTTGGTATC
TCGGCCCCGC TCTCAACCAT TATTGCTGCC ATCGCGTCTG GATCACGGAA ACACGTGCCG
AACGTGTTGC TGACACCCTT TCCTGGTTCC CGACCCGCAT TCCCATGCCC GCCGCTTCGT
CCACCGACCG CGCCCTGGCC GCCGCCCGTG ACCTAGTCCA TGCCCTCCAG AATCCTTCCC
CTGCATCTCC GTTCGCCCCC CTTGATGCCA CCCAGCACCA GGCCCTTACC GACCTCGCCA
ATCTCTTTGC CACTGTGGCC GCCCCAGCCG ACGACGTCCC TGCACCCGCT CCGGTGCCTC
CGGTCCGTCC CCCTGCCCCA GCAACTCCCG GTCCGTCCCC CTGCCCCAGC AACTCCCCTT
GCGCAGGTCC GTTTTGCCGT TCCTCTTGTC ACGGCCGAAC ATGCCCCGGC ACTTCCGAGG
GTGCCCATTC CTGCCCCAGC ACTTCCGAGG GTGCCCACCC TGGCCACCTA TCACTCTCGC
ACCGGCAACC CCGGCCGTCG CCGCCGCACC GCACGCAAAC AACCGGCAAC CCCAACCCTA
GTCCCGGCGC ATCCCCTGTT ACCGAGTACT ACCAAAGCAC AAGCGTAGGA CAGATCGCTG
TTAGGACAGT GAGTGGTAAA CGAAGGTGGG CGTTGAGACG AACGACCTGC AATACACCTA
AGAGACTTAG GGACTGTAAA ACTCAACTCG TTGACGACGC TCCCCACGGG TTAGACAGCG
GATACGAGGA CCTATGCGTA G
 
Protein sequence
MSLEQLPLGK ESVSVSQSDG RSGYMSHMEF VSALLRKSTV FVDQTPELRS NPTVFTPLTL 
PSHPSATSSV PMSTSAHFKL SDFPHKVLDP IATLTVPPTY ATIKRAQRQL MTNAAAIPTL
NGGGAHGHMA LTLTALAYAD ISDVPFVIPV APPANPPPGA TQPQITENNR IHQHDADIYN
LYVAVNNALR QQLLDAVPRI YVRALAHPMF EFSNVTCLDL LSPLWTKYGT IKPAELQKNF
QSMYTPWNTT EPIESVFLQL DEAIAFSVDG NNPISEAAAV RAGYEVIAHS GLLPLDCKEW
RKLPTAAHTL AHFQQHFSLA DEDRRLTATT AKTGAIQAPR ARTRPLAIAT TRPPPTLSAA
PPRFGPLPSP LNRKEGRLRQ CLNSSVVPSP PSPHTSAIAD TGCTGHYITV NCPHTHKRPA
SPSLAVRVPN GAVLRSSHIA TLALPGFSPS ACQAHIFPGL TSHPLILIGQ LCDDGCTATF
SATRLEIHRD TTLLLSGTRA PTTGLWHLDL TPAKPPATAH ALVPNTPLAD RIAFVHASLF
SPAISTWCQT LDSGHLATFP ELSSRQVRKH PPHSPAMVKG HLDQQCANLR STKLPPVGSP
ITTEPLAAAV PDLDPPDAHD VTCTHHVFVA HQWVTGQIYT DQPGRFLTPS SAGHNDMLVL
YDYDSNAIHV ELMKNNPSCP PSTWTFSWHP PHLHRRNAAE RAIRTFKNHF IAGLCTTNPD
FPLHLWDRLL PQALITLNLL RRSRINPKLS AHAQLHGAFD YNRTPLAPPG TRVLVHVKPA
VRETWAPHAV EGWYLGPALN HYCCHRVWIT ETRAERVADT LSWFPTRIPM PAASSTDRAL
AAARDLVHAL QNPSPASPFA PLDATQHQAL TDLANLFATV AAPADDVPAP APVPPVRPPA
PATPGPSPCP SNSPCAGPFC RSSCHGRTCP GTSEGAHSCP STSEGAHPGH LSLSHRQPRP
SPPHRTQTTG NPNPSPGASP VTEYYQSTSV GQIAVRTTAD TRTYA