Gene PHATR_44174 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATR_44174 
Symbol 
ID7204094 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011671 
Strand
Start bp1204419 
End bp1209278 
Gene Length4860 bp 
Protein Length1370 aa 
Translation table 
GC content51% 
IMG OID 
Product5'-Nucleotidase or metallophosphoesterase 
Protein accessionXP_002186201 
Protein GI219113235 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CTTTTCCTCT GCTATTGCAG TTTCCAGCAC ACAGAGCCGA CACCACCGCC ATCCATCCAG 
TATGAAAATA CCATCGCGGA ATCGATGGCG TTCTCGACCG GGATGGGCTG TGGCCATTCT
GGTGTCTTGC GGATCCAATC GTGTCGACGG ATTGGCCTTT CAGCAATCAG TGCGTGGGGG
AGGTCGAGCC GCGAGTACAC CGGATAAAGT AGACAGGGTT GAAACGTGGG AAGCCTCGCC
GTGCCAAGAT TCGGAATGTC GATTGATCCT CTGTCACATA ACGGACGTCT ACACGCTCGA
GCATCTCGCC CATTTCAAAA CGCTCGTGGA AGAGACCAAG AAAAACTCCG AAGGATCCGC
CGTGGTTTCT GTACTGACGG GGGATTTTCT GTGTAAGTGC ACGAACGAAA GCACGCATTC
CAACACCGAC AATTAATCAA TTGCTCCATC TAACACTCCT CGGCTCCTTT ATACAAGATC
AGCACCCTAC TTACTTTCCA GTGTGGATCG AGGCGAAGGG ATGATGCACG CGCTTGGGCG
GATTCCCCTC GATTATTTGA CATGGGGAAA CCACGAAGCC GACATCAACC ACCGCACCGT
TTGTCAACAC GTCCGCAACT TTGCTGGTAC GTGGCTCAAT TCCAATATGA TCGATCACGA
AGCCATGGAT GCACAGAAAG AATACGATGT CATTGAACTG ACCTCACCTG ACGGATCCAA
CCACCGCAAA ATTGGACTCG CCGCGGTCCT TTCCAGCGAC CCGGCCTTGT ACGCGCAATT
TAAAGCACCC GGGCCTTTTG GTGGCGCAAC CGTCACTGAT CCGTGGGAAG CCCTGGCAAA
GTACAAACGC TTGTTGGAAA AAGATCACGG TGTGGATTTG GTGGTACCTT TGCAGCATTT
GTACGTCCCG GACGATCACA AGACTTGCCA CAAATTTGAT TTTCCGGTGG TGCTGTCTGG
ACACGATCAT CACCGTGTTG ATGAAGTGGT GGATGGCACT CGGCTCATCA AACCGGGCAT
GAATGCTGCG TACGCAGCTG TTGTGGAAAT TTCGTGGAAG ACGTCGTCGA GCGAAAAACC
CGTTATTCGA TCACGATTCC TGCGCTGTCA GGATTGGGAC TGCCGATCCA GTTTTGGCCG
AGGAGAACGA GCGTGCTTAT GATGCCCTGA TTCCTTTACG TAATACCGAA TTGGCTCGTG
TTCCATCAAA CTTTGAACCT TTGACTTCCA ACAATAGTCG CGGCAGTGTC TGTACCATGG
GCAGTTTCAT TTGTTCGCTG CTGCGATCGT CGCTCAATGT TTCGCGGCGG CAGCGGGATA
ACAAAGTGGA CGCCGTCCTA CTCATGGGTG GCAATGTAAG AGGAAATGCC GACTACCCCG
AAGGGTCATT CTTCTCGCTG GAAGCTCTAG AGGCCGAAAT AAAATCGGAC GAAGTAATAG
CAGTAGTGAA CATGCCTGGT TGGCTGCTGG CCGCAGGCAT CGAAGCGACC CATGCCGGTG
ACCCAATTCC AGGATGGATG CAGTATGATG TGGGCATCCG ACAAGATGAG AATAACGTGG
TTACGCAAGT AGCTGGGCTT CCCATTGATC GCGAGCGTGT CTACCGAGTG GCGACCAAAA
TTGGCGACTT GACCAACGGA CAAAGTATTC CGCTCACGCA ATACTACACA GAGAATCGAC
ATCTTCTTCC ACCAAAGGGA GCGTACGTTA ATATACAGTC CGAGCTGATG GCCTACTTTG
CAAGAAATCT TTGGCGAAAG CTGTGGGATG CCGTCTCCTT GGAGCTTGCG GAGACTTGCG
ATACCGACAA CGACTGCAAC CCCGTAGGCC GGCTGGAAGT CCTTGACAAA ACCGGGGATG
GTGAAGTCTC GGTGGCCGAA ATTCAAACCG CTTTACGAGA TTTGCTGGGG TATTCTGTGG
ACGACCGAGA AACATCGTTG GCCAAGTTTG TTCATTCGTT TGCTGATACG ACCAGTACTG
GACGGGTAAC ACTACGGGAT TTTGAAATTT TCTGCGACGA AATGGAGCAA ACTTACGAGC
GAGACAGCTG GCGATTGTCG TATCCGAAAC CGTCCAAACC GACTGCTGGT ACTGCTAAAT
CAACATAGTT CAAGCTTCGA GATTGTCTGC GATGCTCGCG CACCCAGAGG TTCCCAGAAA
TTCTGCAAAT TCGGATTCTT TGTTGTCGTG ACCAACTTTC CAAGTGGTCC GATTCTTCCT
TTGCCTTCCA CGATATCCAT TTTTACGGTC TTTATGCGCA GTGTAGGAAA TGGATGGAGG
AAGACGACCA ATGGTTAGGA ACGGTGATGC TGGTTCGTGT GTTAACAGTT AGCAAAGCTG
CGAATTGTGA GGAATTGACA CGTGTGCGCA TTTCGTAACG ACAGAACATG TGACATGGAT
TCCGTGAAAT GATTGGCTCA ACCACAATGA CAGAAAATGA TTATTAGAGA ACATATACCC
GTCAAGATTG TAGCTTCAAG TCCGGTGCGT AGATAAAATT AGAACTGTAC TCAAATTCCG
GTAGGTATCG TAGTTTGAGC GCGACTGACT CTGAGAAAAA GCGTTCGGAG AAGGATCGCG
ATCGGAACGA TCGATCCATC GAGATCGATC GAAAACTGCT AGAATGCCCT GCCCTCCCCT
GCTGTGCAAA GACTATTTTG CGTGCAGAAC GATTCTGCAC AAATTCTGAA AAAAGACGAT
TTCTGTACGA AAGGCACGAT GAAGACGCTT CAAGGCTTGC TCCCGCCTAC GTTCTATCTG
ATGGGCTTTG TGCCTTTCTG GAGTCCAGTA CGTACGGCTG AATCCTTTTC GGTCGCTCGC
CAGCCCCACG CTGGCGGGGG CCGAGCACCG GGACAACCCA ACAAGATCGA CACTACGGAA
AGTTGGGAAG TCTCGCCCCG CAGCGATTCG GAATGCCGGC TCATCATTTG TCAGATCACC
GACGTGTATA CGCTCGAAAA CTTGGCGCAC TTCAAAACGT TGATCGCTGA GACCAAGAAG
CGGGCACCGG GGTCCACTGT AGTATCCATG TTGACGGGTG ACTTTTTGTG TAAGTCAAAA
TAAGCGACAA TGCTCGTTCC ACCAATCTGG ATACGTAGCA TCCTTCACTC ACGTTTCCTT
TTTTGCTTGG CAGCTCCGTA CCTTTTGTCC AGCGTCGATA AGGGCGCCGG AATGATGCAT
GCTCTGAACA GCATTCCGCT CGACTATTTG TGTTGGGGCA ATCACGAAGC CGACATCGAG
CACCACATCA CCTGTCGTCA CGTCAGGAAT TTTCACGCCA AGGGGGGCAA GTTCATCAAT
TCCAACATGC TCGATCACGA CGCAATGGAC GCTCAACAGG AATATGATGT CATTGAATTG
AAATCGGAGG ATGGTACCAA CACGCGACGC GTCGGCCTTA CGGCAGTCCT CTCCGACGAT
CCAGCTCTGT ACTCTCACTT CAAGGGGAAA GGTGCTTTTG GTGGAGCCAC TTTGACGGAT
CCCTGGGAAG CTTTGACCAA GTACAAGAAG ATTCTTGAAG ACGACGAAAA GTGTGATGTC
GTTATTCCGC TTCAGCATTT GTACGTTCCG GACGATCACA AAACGTGCGA ACAATTCGAT
TTCCCCCTAA TTCTGTCGGG TCACGATCAT CACATCGTGG ACGAAGTAGT GGAAGGAACT
CGTCTCATCA AGCCCGGTAT GAACGCCGAC TTCGCCGCCG TGGTCGAAAT ATCTTGGAAT
GACGCGACGG AAGAGAAGCC CAAGATACGC TCACAGTTCG TTCGTTGCAA GGATTGGGCT
CCGGATCCTG TCATGGCGGA AGAAAATGAA CGAGCCTACG ACTCACTCCT TCCACTACGC
AATACGGAAC TTGCGCGCAT TCCTTCGTAC TACGAACCGC TCACTTCCAA TAACTCTCGT
GGCAGTGTCT GTAGCATGGG AAAGTATATT TGTTCGGTTC TGAAATCGGC ATTTAATATG
AACCGTGGCA AAAAGAACCG TGTTGATGCT GTGCTGATCA TGGGAGGTAA CGTTAGGGGC
AATGCGGACT ACCCCATCGG CTCCTACTTT TCCTTGGAGG CACTCGAGGC CGAAATAAAG
TCGGACGAAA CAGTAGGCGT CATCTCGATG CCCGGTTGGC TTTTGGCGGA AGGGATCGAG
GCAACGCACT GTGGCGACCC TATTCCTGGA TGGTTTCAGT ATGATGTAGG TATCCAGCAG
GACGAAAACA ATGTGGTCAC ACACGTCGCC GGGCTCCCTA TCGACCGTGA TCGCATGTAC
CGGGTTGCGA CGAAGATAGG TGACTTGACG AATGGACAGA GCCCACCATT TACGGAATAT
TACCAGACCA ACCCCAAAAG TCTTCCACCC AAAGGTAACT ACGTCAATAT TCAGTCCGAG
ATGATGAGTT ATTTTGCCCG GAACTTATGG AAACGCCTCT GGGACGCAAT TTCTCACGAA
GTTGAAGATA CTTGCGATAT TGACGGCAAT TGTAGTCCCG AAGACCGCCT AGATGTTTTA
GATTCGAACG GGGACGGGAC TGTCACCGTC GAAGAAATTC ACAATGCCCT CCGCGATCTT
TTAGACTATT CGGTAGACGA TCGCGAAACC ACCCTGGCCG AATTTGTACA TGCCTTTGCC
GATACGGATG GTAGCGGCAA GGTGACCGTC AAAGACTTTG AAGTTTTCTG TGACGACATG
GCGGAGCAAG CTGTGATCAA TCGAGCTTTG GCAAGGGAAG CGATGGAACG GCAGCGCGAG
ATAGCAGCAG CCGCCTCCAC ATAGTCCACG GAGTGCCCTG CCGCTAGTTT AGGGACAACC
TGATCGATTC ACAACCTTTG ATTTAAAGTA GCATTAGTAG TATCAGAAAA TCCCATTAGT
 
Protein sequence
MKIPSRNRWR SRPGWAVAIL VSCGSNRVDG LAFQQSVRGG GRAASTPDKV DRVETWEASP 
CQDSECRLIL CHITDVYTLE HLAHFKTLVE ETKKNSEGSA VVSVLTGDFL SPYLLSSVDR
GEGMMHALGR IPLDYLTWGN HEADINHRTV CQHVRNFAGT WLNSNMIDHE AMDAQKEYDV
IELTSPDGSN HRKIGLAAVL SSDPALYAQF KAPGPFGGAT VTDPWEALAK YKRLLEKDHG
VDLVVPLQHL YVPDDHKTCH KFDFPVVLSG HDHHRVDEVV DGTRLIKPGM NAAYAAVVEI
SWKTSSSEKP VIRSRFLRFL AEENERAYDA LIPLRNTELA RVPSNFEPLT SNNSRGSVCT
MGSFICSLLR SSLNVSRRQR DNKVDAVLLM GGNVRGNADY PEGSFFSLEA LEAEIKSDEV
IAVVNMPGWL LAAGIEATHA GDPIPGWMQY DVGIRQDENN VVTQVAGLPI DRERVYRVAT
KIGDLTNGQS IPLTQYYTEN RHLLPPKGAY VNIQSELMAY FARNLWRKLW DAVSLELAET
CDTDNDCNPV GRLEVLDKTG DGEVSVAEIQ TALRDLLGYS VDDRETSLAK FVHSFADTTS
TGRVTLRDFE IFCDEMEQTY ERDSWRLSYP KPSSFEIVCD ARAPRGSQKF CKFGFFVVVT
NFPSGPILPL PSTISIFTVF MRSKMIIREH IPVKIVASSP NDSAQILKKD DFCTKGTMKT
LQGLLPPTFY LMGFVPFWSP VRTAESFSVA RQPHAGGGRA PGQPNKIDTT ESWEVSPRSD
SECRLIICQI TDVYTLENLA HFKTLIAETK KRAPGSTVVS MLTGDFLSPY LLSSVDKGAG
MMHALNSIPL DYLCWGNHEA DIEHHITCRH VRNFHAKGGK FINSNMLDHD AMDAQQEYDV
IELKSEDGTN TRRVGLTAVL SDDPALYSHF KGKGAFGGAT LTDPWEALTK YKKILEDDEK
CDVVIPLQHL YVPDDHKTCE QFDFPLILSG HDHHIVDEVV EGTRLIKPGM NADFAAVVEI
SWNDATEEKP KIRSQFVRCK DWAPDPVMAE ENERAYDSLL PLRNTELARI PSYYEPLTSN
NSRGSVCSMG KYICSVLKSA FNMNRGKKNR VDAVLIMGGN VRGNADYPIG SYFSLEALEA
EIKSDETVGV ISMPGWLLAE GIEATHCGDP IPGWFQYDVG IQQDENNVVT HVAGLPIDRD
RMYRVATKIG DLTNGQSPPF TEYYQTNPKS LPPKGNYVNI QSEMMSYFAR NLWKRLWDAI
SHEVEDTCDI DGNCSPEDRL DVLDSNGDGT VTVEEIHNAL RDLLDYSVDD RETTLAEFVH
AFADTDGSGK VTVKDFEVFC DDMAEQAVIN RALAREAMER QREIAAAAST