Gene PHATR_43826 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATR_43826 
Symbol 
ID7203959 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011671 
Strand
Start bp149284 
End bp152986 
Gene Length3703 bp 
Protein Length1003 aa 
Translation table 
GC content46% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002186287 
Protein GI219113407 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.911661 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCACAA TGCGCCCCGA AGCTCGAGCA TCAGTTTCGG ATGGATTAGA ACGAGTAGAC 
GAGGACGATA TCTCCTTGTC CAGTACTTCC GAAAGTGGGT CGTCGGGTGC CCAAGTCATT
CTCACCTCCG ACGAACGAGA TAAAAAGATC CGAGATCAGA TTATCAAAAA AGAAGAGGCA
GATGTTAGGA AAGCGAAACT AATCGTCGGC TCTGCTCTGA TTCTGTCCAC TATCTTAGTT
AGTGTCTGCA TCTACATTTT CGCATCTAAA GCAGAGGTCC TCAATTTCGA GCTCGAGGTA
AGCACCATTG TTGAATCATA GAGGTAGCGG TTGGATAATG CTATCTCACA ACTCACTACA
CCACTCTCAC AGCACGAAGG ATATGCTAAG AACATTGTGA ACTTGGTAAA ATGGGAAAAC
CAGTATAATT TTGCTCTTAT GCAACAATTA AGCGCATCAG CTACGGCATC TGCCGCCATG
ACTGGCTCGG TGTTCCCCAA CGTGACCCAG AAGTACTTCG AGATCACAGG TGGCTATGTG
GATGGCTTGG GCGGTATAAT GGCAACTGCC TACGCTCCAA TTATTGCAGC CGAGGAAGTG
ACTCAGTGGG AAACATATTC ACAAGAGAAC CAAGGTTGGA TTGGAGACAG TACTGTACTT
CGGCAAGTCC ATCCGGGACA TAGACAACCC ATGGAAGGTA CCATTCAAGA CCACGAATTC
GATCGACGAC TTGATTCCGG GTCCATCAAG CCATATATTT GGCGTTGGGA AGATGGCGAG
CAAGTCCAAG AAACCACCTT TTCGGGCAAT GTACTGGCTC CTTTTTGGCA AAGTTCTCCG
GCCGATGCCG CTTCTGTAAA CCAAAACTCC TCGCAAACAA GGACATAGCT CACCTATTTT
CTCTGGTTGT AGAAATGAAC CATACAGTCA TCTCCCACGC CGTCCAAATT GACAGGCTCT
TTGATTTCGT CTTTGACCTC CATGAAAAAG AAAGAAAGAA AGAAGAACCC GCATTCATTT
ATCATGGAAC CTGTCTACGC AGAGTTCTCC GAGAATCCCG TTCTTGTCGG TATTTTGATT
GCCATTTCTG CATGGGAAAA TTTATTTGAT CGAGTGTTGC CAGAGGGAAC AAACGGGCTA
GTCTGTGTTG TAAAGGATAC CTGCGGCAAC GTTTTTACGT ACGAAATCAA CGGCGAAATT
GCAACTCATC TTGGATATCT TGACCTCCAT GACGAGAGAT TTGATCAATA CCAAAGAACA
ACACCTATCG AGTTGTATGA TTCCGAGGCA GCCAGCCTTT GCAAACATGA CCTTTACATT
TATCCATCGT CGACTTTCCG AAGTGCATAC AATACCAACA GACCAGCCAT CTACAAAAGT
GTAGTCGCGT TAGCGTTTGC TTTCACTGCA CTACTGCTTC TCATGTACGA CAAGCTAGTA
AGTCGACGTC AGGAGAAGAC AATGACGTCC GCCATTCGCA CCAACGCCTT GGTATCATCA
CTTTCCCCGA AAATATCCGC GATCGACTTA TCGGTTACAA CGGACTTGAC CACGGTTCAA
AAATGATTTC CGGGAATGAA AAAACAATGG AAAACTACGG GAATAAAGCT ATTGTAAATG
CACCATTCCA TTCAAGACCC ATTGCAGATT TTTTCCCGCA AACAACGGTA CGTATATTGA
AAGACAAAGA GCGACATTTG GCTTCCTCCT TCTGACTTTT ATTTTTAAAG ATCATGTTTG
CCGATATCAC GGGCTTCACA TCCTGGGCCA GCGCAAGAGG TAAGTGAGGG AGGGTTAGAT
CTGACGAATG ATGAAGAGAC CTCATCTTGG TAAATTATTC CACGCACAGA GCCATTCCAG
GTATTTGAGC TCCTTGAAAC AATATACGGC GCCTTTGACG AGATTGCAAA GAAACGTAGG
GTTTTCAAAG TTGAAACTGT TGGGGACTGC TATGTGGCTG TTGCCGGTAT CCCGATGCAA
CGCAAAGATC ATGCTGTTAC AATGGCCCGA TACGCTCGTG ATTGCCACCA CAAAATGAAC
GAGCTGACTC GTCGGCTGGA ACTCGTCTTC GGGCCTGATA CCGCTGATCT TGCCTTTAGA
ATTGGTCTTC ATAGCGGGCC TGTGACTGGC GGGGTACTAC GTGGGGAAAA CGCCAGGTTT
CAGCTCTTTG GAGACACCGT CAATACAGCT GCTCGGATGG AAAGCACAGG TGTTCGCAAC
CGTGTGCATA TCTCTGAGAC CACTGCCGAT CTACTGGTTC AAAGCGGAAA AGAACATTGG
CTCAAACAAC GAGATATGAA GATCATTGCA AAAGGGAAGG GAGAAATGTC TACGTTCTGG
CTCCAACTTG GGACCGAGCA CAGCGACGGA ACGTCAGTCT CTGGTACCAA TCACGTTGCT
GACAAGAATG AAACATTGGA GGAAGAAAAA CATAAGCTTC AATCACTAGC TTCCGATAAA
ACAAGACGAT TGATTGATTG GAATGTCGAA GTGCTCTTGC GTCTCCTCGG TCAAATCGTT
GCGTGCAGAA TTACACACCC GGTCAAGATT TCCGGAGTTT TCGTTCGGAA TAGCGCGTCT
CCGAAGGGGC AAACAGTTCT TGAGGAAGTT AAAGAAATCA TAACTTTGCC TAATTTCAAC
GCCAAAAGCG CAGAGCTCAG AAAAAAAGAT TCGGCAACAA CACAGCTCAA TGATGATGTG
GTCCAGCAAC TACGTGAATA CGTAGCGAAT GTTGCAGCTC TGTATCGCTG TAATCCTTTC
CATAATTTTG AACACGCTTC GCACGTGACC ATGTCAGTAG TCAAACTGCT CAGCCGAATA
GTAGTCCCAG CGGATGTTGA CTATGAAAAT CTTGATACGG ATAAAATTGC GTCAACCCTG
CACGATCACA CCTACGGAAT CACTTCAGAT CCTTTGACTC AATTTGCCTG CGTCTTTTCG
GCTTTAATTC ACGATGTCGA CCCATAGTGG CGTTCCGAAC TCGCAACTAA TTAAGGAAGA
CACGAAACTT GCTGCATTTT ACAAGGGCAA GAGTATCGCC GAACAGAATG CGGTTGATTT
GGCATGGGAT CTGCTCAACG AAGACTCATA CAGCAGTCTG CGGGCGGCGA TATATCGCGA
CGACATCGAG CGAAAACGAT TCCGACAGCT GGTGGTCAAT TTGGTCATGG CCACAGACAT
AATGGATGCG GATCTCAAAA TCCTGCGCAA TGCTCGATGG AACAAGGCAT TTTCCGAAGC
AAGTTTGCAA GAATCCATGG TCCAATCAAC AAATCGTAAG GCAACAATTG TGATTGAGCA
CTTGATTCAG GCATCAGACG TTGCTCACAC GATGCAGCAC TGGCATATCT ATCGCAAATG
GAATGAGCGA TTGTTCGAAG AAATGTACAA CGCGTTTATT GATGGTCGGG CAGAGAAGAA
TCCGGCGGAG TTCTGGTACC AAGGGGAGCT GGGATTTTTC GACTTTTACA TTGTTCCGCT
TGCAAAAAAA CTGGAGGAAT GTGGAGTCTT TGGGGTGTCG AGCGAAGAGT ACTTGAATTA
TGCGCTACGC AACCGTCAAA AATGGTCAGA CAAAGGGCAA CAAATGTAGG GGATATGATG
CAGAAATTGT CTCAAGGAGC GAGCCAAGTC AAAAGAAATG ATTGCAAGCA AGACATGCTT
TTCCTTTAAC CGTGGCGATG TGTATGCACA TGGCCGATTT CGT
 
Protein sequence
MTTMRPEARA SVSDGLERVD EDDISLSSTS ESGSSGAQVI LTSDERDKKI RDQIIKKEEA 
DVRKAKLIVG SALILSTILV SVCIYIFASK AEVLNFELEH EGYAKNIVNL VKWENQYNFA
LMQQLSASAT ASAAMTGSVF PNVTQKYFEI TGGYVDGLGG IMATAYAPII AAEEVTQWET
YSQENQGWIG DSTVLRQVHP GHRQPMEGTI QDHEFDRRLD SGSIKPYIWR WEDGEQVQET
TFSGNVLAPF WQSSPADAAS SSPTPSKLTG SLISSLTSMK KKERKKNPHS FIMEPVYAEF
SENPVLVGIL IAISAWENLF DRVLPEGTNG LVCVVKDTCG NVFTYEINGE IATHLGYLDL
HDERFDQYQR TTPIELYDSE AASLCKHDLY IYPSSTFRSA YNTNRPAIYK SVVALAFAFT
ALLLLMYDKL IMFADITGFT SWASAREPFQ VFELLETIYG AFDEIAKKRR VFKVETVGDC
YVAVAGIPMQ RKDHAVTMAR YARDCHHKMN ELTRRLELVF GPDTADLAFR IGLHSGPVTG
GVLRGENARF QLFGDTVNTA ARMESTGVRN RVHISETTAD LLVQSGKEHW LKQRDMKIIA
KGKGEMSTFW LQLGTEHSDG TSVSGTNHVA DKNETLEEEK HKLQSLASDK TRRLIDWNVE
VLLRLLGQIV ACRITHPVKI SGVFVRNSAS PKGQTVLEEV KEIITLPNFN AKSAELRKKD
SATTQLNDDV VQQLREYVAN VAALYRCNPF HNFEHASHVT MSVVKLLSRI VVPADVDYEN
LDTDKIASTL HDHTYGITSD PLTQFACVFS ALIHDVDPYI AEQNAVDLAW DLLNEDSYSS
LRAAIYRDDI ERKRFRQLVV NLVMATDIMD ADLKILRNAR WNKAFSEASL QESMVQSTNR
KATIVIEHLI QASDVAHTMQ HWHIYRKWNE RLFEEMYNAF IDGRAEKNPA EFWYQGELGF
FDFYIVPLAK KLEECGVFGV SSEEYLNYAL RNRQKWSDKG QQM