Gene PHATR_44005 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATR_44005 
Symbol 
ID7203978 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011671 
Strand
Start bp669627 
End bp673216 
Gene Length3590 bp 
Protein Length1135 aa 
Translation table 
GC content47% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002186393 
Protein GI219113619 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.861173 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGATCAC CACGCCGTCT CATCTTTGAG CCACTTCCTC CGCCCCCAAC CAGTCCAGAC 
ATGGACCACG ACGCATCTCC TTTTCTTTCA CAAAGGTACA CGACTAGCAG GCGCCGAACG
CACACTTTCC GACGAGTCCA ACGTGTTTCT CGCCACGCAG CTCTAACCAC GAAACAACAT
CTATGGGATT CCAATGATGC AAGCATTGAC GCTGCCGTCG CAGCTGTCGA TCAGTGGGAA
TCTGCCTACA ACGCGTTAAG AGGTCTGGTC GTTGCCGGTG TTCAGTCTGC GAAAGGTGTC
TATGGCGGTC TCAAGGAAGG GGCGGGAAAA ATTGAAAACG GTGTGTTGCT ACCTGTTAGA
GATTGGATTA TCCTACCAGC CTTCTTTGGT GTTGAACACG CAACGGCGGA AACAGCGAAG
TTTCTTCAAA GTGAAGCGGC GCATCAGCTC GCGGGTCAGT CACTTGAGCT CGTTAAAAAG
GTTCCGATCG TAGGGGACAA CGTGCTCGCA CCTGCTATGT ATTTTTCGGT TGGACTCGTT
CAGCGAACCT GGGAAATTGT ACAGTACCCA ATTCCATCAA AACAACAAGT TCGAGGCTCG
GTCGAGCTTG CTTTAAATGG TACAAAATGG GCGCTTTCGA CTGTTGGACG TGAAGTTTAT
TTGTATTTTA AACGTGCGGA TGCCAATATC ACTCGCACAT TAATGCATAC GCAATGGAAA
GTGCTAGGAA GTGGCCCCTA TGCAACGCTT GATAAGCTGA ACAAGAGTGA AGTCATCAAT
CATTTATGTG AACGATACTT CAGTCTTGAG GATGCAATAT CACGATATGA GCTGGCGGCT
CACATTAAAA GCCATAATGT ACCATTGTAC CATGATCTTG TCGTGTCAGG TTTGTTGAAG
GATAGAGGAG GGGGCATTAC AGAAGACGAT GAATGGCTCA GTTCATGGCC TGTATATCGA
CATCTTGAAG ATCCATTTCT GATCCCTGAG GAAGAGAAAC TTTCCGGCAT CGAAATATCG
TCTCTTTGGT TCCGTTTGCC CTATCTGAAC GGAAAGCGAC CTTCTGGTAG GGATCAGCGC
TGGGTTTGTT TCGGTCGCAT TGAGCAGAAT ATCCTAGAGG ATCGGTACCG CCATGTGATT
AGAGAGGGTG TTACGGTTTT AGGCATTGGT GAAGAAAGGA AAGCTGGTGC GATGGAACCA
AGCGGTTTGG TTAATGAGGC GGACAATCAG ATTTCTGCCA ATCATACTAC AAATAATTTG
TCAAATGTCA CGTCTGCGGA GACCAATTGC GTAGAGTATC CCACCATCGC AAAATGGTAC
GTGCCTAATG CTCAAACCGA TGTCTTTTTA GATCAGAGGC GTCATACTGT GTCGCTGTTC
CTTTGCTGTC CAAAATGCCG AAACGAAATC GCAAGGTCCA TCCCGCCACT GGTGGCGAAA
GAATACGGGG AGCTCTGCGA CTCTTGCAAT GAACAGGATG CCCAAATGCC ATCCGTTTCT
ACTCTACTAG CTCCACCGCC AATTGGCGCA GTCATGCGTC CAACTTTTTG GAGGTTTCAT
GGTCCGGGGG ACGAAGTTCG AAGAGCAGCA TGGATACTTG ATACTCCGAG GCACGGTCTG
CAGCCTTTTG ATGAGGAGGC GCAATCTGTC TTGGAAGATG CCTATCTTTT CTTAAAATGG
ATGTCCGTTC GCCAAGAGTT TCGTCATGAG TCTGATTTAG ACAGCGCTCT TCTAACAGGT
ACATGGAGCT CTCGATACTT CGAAGGGGTA GTCCTTTGCG GCTCAAAACG AACTCATTTT
GTTTGCTTTG TTTTCAAATA GTGGAGGTAC CTTGCCCTGA TGGGACGGAT AGACTTATCC
AATTTAGTTC CTTGACTCAA GCTACAGCAA TTCAAAAAGG TATGCTGCCA GCCGTTGCCA
TCTTCAAGAG AAGGGTATAT CGGGGAGCCT GGTTGCAAAA TTCTGCTGCA GCTCTATTAG
AAACAAAGAC CCAAAAGCAG GAACTAGTGT CAGTTCAGGA ATCTATTTTG CAGGCAGTTC
AAGATAATGG TGCTCTCGGC GAGACAATAG TACCCGATGC AGCGCTGCGG ACCGTTCTCT
CTCCTGGCAA ACGTCAAGAT GAAAGCCGTC TCGTGCTCCA CACGAGCGGC GACGATCTTG
CCGTTCCTCC CAGTAGACTT TGGGAGGAAG GATCATCTCC CGTAATGGAG AGCCATCCTC
AGGCCGATGT CGATCACCTC ATCCTCGTCG TTCATGGAAT TGGGGAAATG CTACGTTCAA
TAGATGTTTT TGGTCTTGCA ATGCCCAATC TCTCTTCGAT TGTGGATTGT TGTGGTTTTC
TGCGGAAGAA TCACTCCGAA GTCCAAGATG CCCACTTTGC TCAGATGTAC CCAACGGCTG
ACGCCACTTC AAGAGCCTCG ACTGGTAGAG TGGAGTATCT TCCTATTGAA TGGCATGAAT
CTTTTTCTCT TTTATCTCAA CGACGGTCGA CTTCGGAAGC TACACCCAAA CACAATGTTA
TGATCAAAGA CATCTCATTG CGAACGATTC CGAACATGCG AGAGTTCGCG AATGACACTC
TGATGGATGT TTTATATTTC ATGAGTCCAG AACACCACGA TATGATCATG TCCATCGTAA
CAAATGAAAT GAATGTTGTT GTTGAAAAAT TTGCTGCCTT AGCTGGTTTC TCTGGACGAG
TATCTTTAAT TGGGCACTCC TTAGGATCCA TTATTTCATG GGACATCCTT GCTAATCAGT
CGCTGGACAT ATTGGGGGAA AGTGCCAAGC AATCCTTACA TGGTGTGCCT TCAATTGAAA
CCTTTGGCGG CACGGGGTTT TCTAACTACG GTAGTGCTAC TTCAGTTGGA CATGACGCAC
CAGAGGTGAC GCAGCAGGCG ACTCGATTTG AAGGATTGAA GCCCTATCCA AAGCTCAGAT
TTGCGGTTGA CAATTTTTTC TTACTTGGTT CTCCGGTTGC CGTCTTTTTG ATGATACGAA
ATCAACGAAA GCCTTTGTGT GAGAATTATT TTCTTTCTGG GTGCAATCGG GTATTTAACA
TCTTCCATCC ATATGATCCT GTCGCTTATC GAGTGGAGCC CTGTATTGAT CCCAGAAACG
CGGACTTCGA GCCTACCATT ATGAAACATT GGAATGGTGG CTTTCGCGTT CAGTACCAGA
CCAAGCGGCT TTGGAAAAAG TTTGTTGACT CAACTTGGAA GACACAGCAG AGTGTTGTTG
AGGCATTTGA AGCAAGCATG GCTGGAATGG GTCTGCTTGA TGCGACAACA GACACATTCA
ACGACGACGA TACTTCCGCC TCAGAAATAA GTTCAGACGA TAATCGAAGT ACCGCCAATG
TCATCGCTGG AAAGTTAAAT CAAGGAAGGC GTATTGATTA TATGCTCCAA GAGAAAGAGA
TCGAGACAGC CAATGAGTAC GTTGCCGCAC TGGCGGCTCA CAGCTCTTAC TGGATTGAAA
AGGATCTTTC TTTGTTCGTT GCACGCCAGA TTTACCTCAG CACTCTTGAA CAATCAGCAG
AGGCTGCTGA AGCCAGTTTG TGGGAGTCTA TTGGGAGTAA CTCTGTGTAG
 
Protein sequence
MGSPRRLIFE PLPPPPTSPD MDHDASPFLS QRYTTSRRRT HTFRRVQRVS RHAALTTKQH 
LWDSNDASID AAVAAVDQWE SAYNALRGLV VAGVQSAKGV YGGLKEGAGK IENGVLLPVR
DWIILPAFFG VEHATAETAK FLQSEAAHQL AGQSLELVKK VPIVGDNVLA PAMYFSVGLV
QRTWEIVQYP IPSKQQVRGS VELALNGTKW ALSTVGREVY LYFKRADANI TRTLMHTQWK
VLGSGPYATL DKLNKSEVIN HLCERYFSLE DAISRYELAA HIKSHNVPLY HDLVVSGLLK
DRGGGITEDD EWLSSWPVYR HLEDPFLIPE EEKLSGIEIS SLWFRLPYLN GKRPSGRDQR
WVCFGRIEQN ILEDRYRHVI REGVTVLGIG EERKAGAMEP SGLVNEADNQ ISANHTTNNL
SNVTSAETNC VEYPTIAKWS IPPLVAKEYG ELCDSCNEQD AQMPSVSTLL APPPIGAVMR
PTFWRFHGPG DEVRRAAWIL DTPRHGLQPF DEEAQSVLED AYLFLKWMSV RQEFRHESDL
DSALLTVEVP CPDGTDRLIQ FSSLTQATAI QKGMLPAVAI FKRRVYRGAW LQNSAAALLE
TKTQKQELVS VQESILQAVQ DNGALGETIV PDAALRTVLS PGKRQDESRL VLHTSGDDLA
VPPSRLWEEG SSPVMESHPQ ADVDHLILVV HGIGEMLRSI DVFGLAMPNL SSIVDCCGFL
RKNHSEVQDA HFAQMYPTAD ATSRASTGRV EYLPIEWHES FSLLSQRRST SEATPKHNVM
IKDISLRTIP NMREFANDTL MDVLYFMSPE HHDMIMSIVT NEMNVVVEKF AALAGFSGRV
SLIGHSLGSI ISWDILANQS LDILGESAKQ SLHGVPSIET FGGTGFSNYG SATSVGHDAP
EVTQQATRFE GLKPYPKLRF AVDNFFLLGS PVAVFLMIRN QRKPLCENYF LSGCNRVFNI
FHPYDPVAYR VEPCIDPRNA DFEPTIMKHW NGGFRVQYQT KRLWKKFVDS TWKTQQSVVE
AFEASMAGMG LLDATTDTFN DDDTSASEIS SDDNRSTANV IAGKLNQGRR IDYMLQEKEI
ETANEYVAAL AAHSSYWIEK DLSLFVARQI YLSTLEQSAE AAEASLWESI GSNSV