Gene PHATRDRAFT_43038 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_43038 
Symbol 
ID7196237 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011669 
Strand
Start bp1871969 
End bp1875641 
Gene Length3673 bp 
Protein Length1112 aa 
Translation table 
GC content49% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002177391 
Protein GI219111279 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00196864 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCCTCGT CTCGTTGGAA GCGCTTTGCC TTTTTCGAGC GAAGCACGCT CGACGTCCCA 
CCGGAAGTCA TCGACGACCT TATTCCCGTT GATGGAGTCA GCCGAGACAA CCGTCGCTCG
GTCAAATCCT TAAAATTGGC TGGTAAAGAA GTGAGCAACG ATTCCGTTAT CTTGATAGTA
ACAACTGCCG CGTTGCCCTT GTATTCCAAG CCAACGGAAG TCCGTACAAC TCATATTTCC
GCTGGTCAGA GAAAAAAACA AGAAACGAAA GATGACTCTA TAAGCGCCAT GTGGTCGAGT
CTGACAGCTT GCACAGTACC TGCTTTTGCT GAAGAGGGCA ACAATTCATC TGGGGCTGCG
GGGGAGGCGC TCGCTCATGG TATCTCCATG CCGAGCCAAG CGCAGCTTCC TAGACAGGGC
ATGAATGGGA GAATTTCGAA AGCATTAAAG GGCGCTTCCA TGGATGGACT CGTACTCTCG
TTTGTGACTT CTCGAAGTAC AGAGTTTGTT CACTGTGTCG ATGTTACAGT CCGTTGCAAT
CCCCCTCGGG GCAAAGAAAG TCTCGAAGAT CTTGACGGTT GGCGTGGTTA TTTTAGCCCA
TTTTTAAGAA CCCCGGGCGA TCGATCTGGG ACGACGTCTG AAACTCCTAT GGTAGAAAAT
ACGAACCACT CTGACGTTTC GGTCTTAGGC ATAGCGGCCT GTCGGATGTC AACAGGCGGT
CAAATCGTCG ACGAGCACGA CGTTGTTCAC TTGGTTTGTA TTTCTGCTCA GCAGATTTGC
GTGTGGGAAG ATCCCCACAT TCATCTTTCT TGCCGGCGTC CATTGACGCC ACCTCCTGTC
CCCAGTGAAG CGAGAACAGT TTATCTTCAG TCATCTTGGC GACCTACTGA TGGGAATTGC
CGTGTGGTAG ACATTATTCC AGGCATCGTC GCCGTGGGTA CCGATACAGG AGCCGTTCTT
ATATTAACGT ATTCGCCGGA CCTCTCAGTG AGCACTTCGC GACCTCAGGG CCTTCGGACG
TATTTGCGTA TTCCCCCACC ACCGACTCAA AATTTGGAAG TAGTGAGTGT CAAAATATCA
TTGGTAAACG ATAAGGCTAG TGTTTTTGTG GCCTACAACT TCACGGGATC AGTTTCGGTA
TCTCAGATGT CTACAGCAGG AGTTTGTTGC TATGATTTTC CGGTACCAAC ACCCAGCTCA
CCCTCTTTGT CGGCCCCCTC CGCCCGCCAC GATTTGGACG GACGATATGT GGGTTCCTCG
ACATTGGTGG ATGCTCTTAC GACTCAGCGG GGACTAGAAC TGAGTGTGGT ACGTACGTGC
AACTGAAATA TTAGAATCTG TTACGTCGTG TTGCGTTCTT CTCACACAAT TGTCGTTGGG
TCTATTGTTC CAGGCGCGTC CTGACGGACT GTACTCCTAT TCCCAGACGG AACGTATTGG
AGTCGCTCCA GTTGACGGTA CAAAGCTTGC TCTCTGCTTG ATTCCACCGA CGATCCCAAT
GGAAAGGACC CGAGAAGTAG ATCTTGAAGG TATACCTCAT GGTTATGCTC TAGTAGCGTC
AACAGACGCA AAATCTGGTC GTGATGCTGT GGACATATAC GATTCCACCA ACAAGCTTGT
TGCGTTCCAT TTACTCCTTT CACCGGGACA CCGCGCCGTG CGTACGGTAG GAATTACCAC
TCCGCCAGTC CAATGCACTG ACGGAGGCAT CAAAGGTGGT CGATCGTCCG CGTTGGTCCT
TACTTCGGGA GGCTCATTTG TGACTTTAAC CGAGAAACTG ACAGATGAAA AGATCTCATT
GTTAGTGCAA AAAAATCTTT TTTCGGCAGC CATTGTGGTT GCGTACGCTG ATCCATCTTA
TCAACCAGAA GAAATTTCTC AGCTTTATCG ACAGTATGCG GAACATTTGT ATCACAAAGG
CGACTTTAGT GCTGCAATTG ATCAGTACAT ACATACAATC GGATCGCTAG AGTCGTCGCA
TGTCATTTTT CGGTACCTCG ATGCACCAAA GATTCCTCTT CTTGTCAAGT ATTTGGAGAG
TTTGCGATCA CGAGACCTTG CTACAGCGGT GCATAATGAG CTTTTACGAA CGTGCCATCT
CAAGATGAAC GACCGCGAGG CAGCAGAAGC AATTATCACC ACATCGAGTT CGATCGACAA
GGCGTCATTG TCTTCTATAC TAACAAATAT TTCCAGCAGT CCTAAGGAGG CTCTGGCAAC
CATATGTTCT CTGGATGCAA GGCAGACAGC CGAAATCGTT ATTCAACACG GAGCATCATT
GACCCGAGTA CTTCCACGAG AGATGGCTGG AATTGTTATT TCACTATGCG TTGGAACCTA
TTCACCGAAG GGTCTTGCAG ATGCGGCATC CGCTGCATCG ACTGACTTAA ATCGAATGAT
TGCGTATGCC ACGGACGACA AAAAAAAAGC CTGTGAACCC TTCCCTGCGA ACTTATTTGC
GTCAGCCTTC GTTGAGCACC CAAAAATGTT GCGACTCGTC CTTGCTCATT GCAATCGCAA
CAAATGCGTT CTGACACCAT CTTTACGTCG TACCCTGCTT GAATTGACGC TGGCGGAATG
GAACCAAGCG AAGCGCTCGG GTGATACCGA GGCTGAAAAG CTGCGCCACA AGGAAGCAAT
AGCGGTACGT TGTCTATCTC TCCCTTCGTT TTAGGTGCAC ATGGTCTCAC ACTGATCGTA
TCGATTGCAC AGGCACTGAC AGACTCTCAT TCTCGTGAGA TCGGAGACTA TGATGCACTC
GTTATTGTCC AATCAGCTAG CTTTGACGAA GGTGAACTGC TCTTATATGA ACGATTACAG
ATGGCACCAT TGCTTCTAGA TCGATACGCT AAAGATGGAG GTGAAAAGTC GCGGCGCCAA
ATGCTCGCCA TGTGCCAAAG CGATCCAGAG ATTTTGGCAG ACGTGTTGGG TCGCTTCGTC
GATATGGCAG GGAGAAGGCT GTCGCAGTCC AGCGTAAAGA TTAGCAGTGA CTACGATTCG
GATTTTGATG AATCGGAAGA AATCTTGAAT GACATTCAAG AGGCCCTTGC ATTAGCTCGT
CGGCAACGAG TGATTCCGCC AGTTCGAATT ACCCGAATCC TAGCTGGTGA GGGAATAGGA
CAATTTACAG ATAGGAATGA TAGCAGCTCG GAGAGCAATC TGAATAAGAG AACAGTTCCC
CTCTCGGTCG CTTTAGAATA CGTTGGGACG ATCCTTGAAG AATCAAGAAG AGAAATCTCT
CGCTTGAAGG CCGAGGTCGA GGAGTACAAC CAGATGTGCA ATTCCATGGC AACTGAAATC
GAATCTTTGC TGCGAGCGTC TCATTCTCTT CCTTCGTTGG CTCCCACCAG CAGTGCGGCT
CCAAGGCGTC TGAATATCGA CGGCCTGTAC GCGAAGATTC GGTCTGGGGA GAATGAGAAC
TATTCTCCAG GTCAACCCCG CGAAGCCTTT TGGAGGGATA TGGAGCAGAG TGAAGACAAA
TTTGATACTA TTGCGCGTTT TTTTGCCAAA GGAGTCATCA GCTAGTGGGT TAAGGTTCCA
GCATGTCGTT TCCTTACAAA CATACGCTAT ATTAAGAACG GGTGGTGCAA CGCAGTAATC
TTCTGGGTCG TGTTTTAAGG TAGCGATTTT AATTGCATCG ATCCAGTATA TAGCCAAAAA
AAACGAAGCC ATT
 
Protein sequence
MASSRWKRFA FFERSTLDVP PEVIDDLIPV DGVSRDNRRS VKSLKLAGKE VSNDSVILIV 
TTAALPLYSK PTEVRTTHIS AGQRKKQETK DDSISAMWSS LTACTVPAFA EEGNNSSGAA
GEALAHGISM PSQAQLPRQG MNGRISKALK GASMDGLVLS FVTSRSTEFV HCVDVTVRCN
PPRGKESLED LDGWRGYFSP FLRTPGDRSG TTSETPMVEN TNHSDVSVLG IAACRMSTGG
QIVDEHDVVH LVCISAQQIC VWEDPHIHLS CRRPLTPPPV PSEARTVYLQ SSWRPTDGNC
RVVDIIPGIV AVGTDTGAVL ILTYSPDLSV STSRPQGLRT YLRIPPPPTQ NLEVVSVKIS
LVNDKASVFV AYNFTGSVSV SQMSTAGVCC YDFPVPTPSS PSLSAPSARH DLDGRYVGSS
TLVDALTTQR GLELSVTERI GVAPVDGTKL ALCLIPPTIP MERTREVDLE GIPHGYALVA
STDAKSGRDA VDIYDSTNKL VAFHLLLSPG HRAVRTVGIT TPPVQCTDGG IKGGRSSALV
LTSGGSFVTL TEKLTDEKIS LLVQKNLFSA AIVVAYADPS YQPEEISQLY RQYAEHLYHK
GDFSAAIDQY IHTIGSLESS HVIFRYLDAP KIPLLVKYLE SLRSRDLATA VHNELLRTCH
LKMNDREAAE AIITTSSSID KASLSSILTN ISSSPKEALA TICSLDARQT AEIVIQHGAS
LTRVLPREMA GIVISLCVGT YSPKGLADAA SAASTDLNRM IAYATDDKKK ACEPFPANLF
ASAFVEHPKM LRLVLAHCNR NKCVLTPSLR RTLLELTLAE WNQAKRSGDT EAEKLRHKEA
IAALTDSHSR EIGDYDALVI VQSASFDEGE LLLYERLQMA PLLLDRYAKD GGEKSRRQML
AMCQSDPEIL ADVLGRFVDM AGRRLSQSSV KISSDYDSDF DESEEILNDI QEALALARRQ
RVIPPVRITR ILAGEGIGQF TDRNDSSSES NLNKRTVPLS VALEYVGTIL EESRREISRL
KAEVEEYNQM CNSMATEIES LLRASHSLPS LAPTSSAAPR RLNIDGLYAK IRSGENENYS
PGQPREAFWR DMEQSEDKFD TIARFFAKGV IS