Gene PHATRDRAFT_44362 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_44362 
Symbol 
ID7197838 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011672 
Strand
Start bp334874 
End bp338357 
Gene Length3484 bp 
Protein Length1083 aa 
Translation table 
GC content49% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002178211 
Protein GI219114831 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CTTAATGTAG CCTCACCGAC TGGTAAATAT ACACCAGCAA GTGTCTGGTC TCCATGATGC 
AAGAGAACGT TGGCGGTTCG GATGGCAACA AGCGCCGCCG GCGGGTTTGG ATCGCCCAGG
AAGAGCTCCA TTCGCTCGCT GCCTATACGG ACGCAACGGA TCGACGTCTG GAGCAGACAG
AGATGAGTTT ACGAAAGGAA GTTACGCAAA GATTGGTGCT GCAGAAACGC CTCCGTGATG
CCGAATTGAA TCGGCAACAC TTGCTGGCGG TCCAAACCCA ACAAGAAATA CTGCGCGACG
CATTGGCCGA CTCGGTAGCG CAATTGGCGA AAAAAGTACT GGTGGAGGAG TCGCCCTCCA
ACGTGACAGA GTCCGACGGA GGGGCGGACG ATCTCGTGCG AATAATTCAT ACGGCAGGAG
ATAGGACTCC GCCTTCTGAC GTACATGGAA CAGGCAGTGC GGACGAGATT GCTCCACACT
CGCCACGGGA TGCAGCAAGG ACGGCACGCG ATGAACCAGA GATCTGGCCG GACAACAACG
ATACCAACAA CAACAACGGC GTGACGGTGT ACACTATTAC GGAGGCACGG CCACGCGGTA
CGGTACATAT TGACGCGTAT CCTCGCCATC ACGAAACGAC CACATCGCAC GGGCACCGTA
AATGCGGAAC CCCCCGAGTC TCCTGCTTTA TTCCGTGCCT GGCATCGCCA AACGCGGTGG
CGCGAATACT CGCCGTTTCA ACGAGAGAGG TCTTTGCTTT AGATGCTGTC ATGCGCGACG
CCGGTCATCG TCGTCAGCTA CTACGGTGGA CGGCACTGGA CTTTGGACTG TGGAGCTCGA
ATACCGAATC TTCCAAGGAT GGGTCCGGGG CGAAAAATGG TAACGGCGAT CTCCGTAGAG
ATGCTTCGCC GAAACGACTT GACCCACATC TTCCGCTTTG CCCCTACGAA TTGGCAGGAG
AGTGTGCCGA TCCTTTTTGC TCCTATCAAC ACATCACCCC GCGGTCGAGT ATAGCAAACG
TGATGCCGCG CGAGTTTTTG CCTCTACCAA CCATTACAAT CTCCAAAAAA GTGGGCATTT
TGTCCGATCG GAGGCCTCAT GATCAAAGCG AGGCAACTAG GAAATACGCG GCAAAAGTTC
GAACGAATGC GGTCGACTCG GAAGATGGAT ATATTCCGTT ACCACAACCA GCCAAGCCCC
CAATTGCTGC GGCTCCGCAA GATAGTAGTT GCAAAAACGA ACATTTAGGC TGCCTTGCAT
GGTGGGATAC CTCTGGTATC ATTTCCAACT TTATGCAATG CCCTCATCCT TTCTCTCAAA
TACGCGAGAT CTTTTCGTTT GATACTGATG AAGTGAGATT TCTTATTGAA GACTCGGATC
GCATTCCTTC CGCTCTTTAC ATATGGCTGG GTAAAGTTTC GGGAATATGC GCTCTTTCAA
CCCATGCCGG GCGCTTTGAT GTCGCATGTT CGCTACTGCA GGACATCAAT CGGAGGTTGC
GAGCAAAACA GCAGGAAACA CAGGAAACTG CCCCTAACAA ATCTCCGATG GTACTGACGA
GGTCGAGGAT TGCGGTGGGT ATAACGACTT GTTTTAGTCG CCTTTTGGAA AAGTCTTTTC
TATACGATTC CAATCCCGGG GACGACATTT TTTTGTTTGC GTTTCAAACG CAATTGAAGA
TTTCTCTTGT TTCTGCCTAT CTCCATAGTC TCTACACACA GGAGATCGAT CCGACATTTT
CGTCAACACA AACACTCCAT CACTTCGAAG AGATATGGGT TGGTTTGGAA GCTGCACTTA
CATCCGTACC GCCCGCTTAT TGTGAGGTGA TGGAATGGGA AAAGCTGCAG CAAGTACTCT
TTTCCGCTTC AAGGGATGAG GACAGGGGGG AGAAGCCTCG GAAACCTAGG CAAAACGATA
TCAACCACGG CACCGCTAGC CTATTCTTTT TGTCGTCAGG CGGAACTTTG GAGTGTTTGA
GCCTGATTAA TCGGGTTCAA TTTTCATTGA ACGCAATCGC TTCCCTTGAC TCTCTGATGA
ACACTGCTCT TCGTTCTTCT TGGAGCGCGT TCAACCACTT GTTCGAAGAC AAGAAATCAG
GATCAAGCTC CTCTCGACAC GATCTCCTAT TTTTTTCGCA GATTGGATCC ATTGTTCTCG
CTTGTTTGCG TCGAGCTTCG ACCGCTATCG AGTTGAGTGA TACAGGATTT GCAAAGTCGC
TGATGGACTA CATCGAACTG TACAATTTGA CCGAGACAAT TCTTTGCTGG CTTGAATCGA
TTCCGTCCAC TGAGTCTTGG ATCGATCTTC TCCTTGCACC GCTATTTGCG GCCAACATCG
CTCTCGGCTG TAGGCTGCAG CAATACGATA AGATTTGTCG CCGGCTTCAA GATTTTTTGA
TGCATAGGCC AAGAAATGAA AGCTGTGCAG GGCTTTGTAC ATTTTCAGAG CTTTTGTGGT
CCCAATATAT TCAGCTTCAC TTCACTTTAC CCTATAACAT AGCAATGGGT AACATGGACG
GTCAAAATGG ACATTCATCA CTGACGTGGG AGATTCCAAT GGAGGTCAAC AAATCTCATC
AGGCCATGTG TCAGGTGTTA TCATCTCGTG AAGTATCTCC CCATCATGTG GTTGCACCTC
ACGATCAAAC AATGGTTTAC GAGGTGATAC CTGTACTATC AAGTGAGTCA AAAATCTCCT
CCGAACGGGA AGACAAATTT TCCGAGAACG AGAAAGCGGA AGGCGGTCGG GTATTCACGT
CGTCATGTAG TTCCTCAATC CGTGACTTCT ACAACCTCGA ACACGAGCCA ATGTCCTTTG
TGTTCCACCG AGACAGTGCA TGTATGGAAA ATGGCGGTAG TCGGTTGTAC CCAGGTATTC
CACTTTCGAT ACCAACGGGT TTGCTTTTTG CCGGCTCATC ACTGAAGAGT CTTTCTCTGA
ATGGTTTTGG GCTGGAGCGG CTACCTGATC ATATGGATAT GTATTTTCCG AAACTTACAG
TACGTTCGCT GGAGTCGTCA TTCGTTTTTA GCTATCCGAT CTTTCTCACA ATATTTGTTT
GTCCAAGAGT CTTGAAATCA AGGAGAATTC GTTGGTCATT TTGCCCGAAT CAATTTGCAA
CCTAACGCAA CTAAGAGTTT TAAGGATCGA CCGGAATATG CTAGGCTCTC TACCGTCAAT
ATTTCAAATG GTCAGCCTCG TGGAGTTGAC TGCTGCCAGG AACAAGCTGA CGACCCTTCC
GGCTTCGCTG GCTGAATGTC TGAACTTGCA AGTGTTGGAT GTGAGTGGTA ATCCTATCAA
TTGCGATTTG CTCTGGTTTG CAGCCAGACT GAAGCATCTT CGAGTCCTCC ATCCTATGTC
GGAATTAGTA TAATATAGGC ATGTATAGCA GTATGTGAGT GTTGCGTTGT ACTTCGCTCA
ATATTAGATT CTAAATCAAA TTGAGACGGG CGAAGACCAA GAAAGTCGCT CGTGTTTGTG
GTAA
 
Protein sequence
MMQENVGGSD GNKRRRRVWI AQEELHSLAA YTDATDRRLE QTEMSLRKEV TQRLVLQKRL 
RDAELNRQHL LAVQTQQEIL RDALADSVAQ LAKKVLVEES PSNVTESDGG ADDLVRIIHT
AGDRTPPSDV HGTGSADEIA PHSPRDAART ARDEPEIWPD NNDTNNNNGV TVYTITEARP
RGTVHIDAYP RHHETTTSHG HRKCGTPRVS CFIPCLASPN AVARILAVST REVFALDAVM
RDAGHRRQLL RWTALDFGLW SSNTESSKDG SGAKNGNGDL RRDASPKRLD PHLPLCPYEL
AGECADPFCS YQHITPRSSI ANVMPREFLP LPTITISKKV GILSDRRPHD QSEATRKYAA
KVRTNAVDSE DGYIPLPQPA KPPIAAAPQD SSCKNEHLGC LAWWDTSGII SNFMQCPHPF
SQIREIFSFD TDEVRFLIED SDRIPSALYI WLGKVSGICA LSTHAGRFDV ACSLLQDINR
RLRAKQQETQ ETAPNKSPMV LTRSRIAVGI TTCFSRLLEK SFLYDSNPGD DIFLFAFQTQ
LKISLVSAYL HSLYTQEIDP TFSSTQTLHH FEEIWVGLEA ALTSVPPAYC EVMEWEKLQQ
VLFSASRDED RGEKPRKPRQ NDINHGTASL FFLSSGGTLE CLSLINRVQF SLNAIASLDS
LMNTALRSSW SAFNHLFEDK KSGSSSSRHD LLFFSQIGSI VLACLRRAST AIELSDTGFA
KSLMDYIELY NLTETILCWL ESIPSTESWI DLLLAPLFAA NIALGCRLQQ YDKICRRLQD
FLMHRPRNES CAGLCTFSEL LWSQYIQLHF TLPYNIAMGN MDGQNGHSSL TWEIPMEVNK
SHQAMCQVLS SREVSPHHVV APHDQTMVYE VIPVLSSESK ISSEREDKFS ENEKAEGGRV
FTSSCSSSIR DFYNLEHEPM SFVFHRDSAC MENGGSRLYP GIPLSIPTGL LFAGSSLKSL
SLNGFGLERL PDHMDMYFPK LTLSDLSHNI CLSKSLEIKE NSLVILPESI CNLTQLRVLR
IDRNMLGSLP SIFQMVSLVE LTAARNKLTT LPASLAECLN LQVLDILNQI ETGEDQESRS
CLW