Gene Information |
Locus tag | PHATR_44174 |
Symbol | |
ID | 7204094 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011671 |
Strand | + |
Start bp | 1204419 |
End bp | 1209278 |
Gene Length | 4860 bp |
Protein Length | 1370 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | 5'-Nucleotidase or metallophosphoesterase |
Protein accession | XP_002186201 |
Protein GI | 219113235 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CTTTTCCTCT GCTATTGCAG TTTCCAGCAC ACAGAGCCGA CACCACCGCC ATCCATCCAG TATGAAAATA CCATCGCGGA ATCGATGGCG TTCTCGACCG GGATGGGCTG TGGCCATTCT GGTGTCTTGC GGATCCAATC GTGTCGACGG ATTGGCCTTT CAGCAATCAG TGCGTGGGGG AGGTCGAGCC GCGAGTACAC CGGATAAAGT AGACAGGGTT GAAACGTGGG AAGCCTCGCC GTGCCAAGAT TCGGAATGTC GATTGATCCT CTGTCACATA ACGGACGTCT ACACGCTCGA GCATCTCGCC CATTTCAAAA CGCTCGTGGA AGAGACCAAG AAAAACTCCG AAGGATCCGC CGTGGTTTCT GTACTGACGG GGGATTTTCT GTGTAAGTGC ACGAACGAAA GCACGCATTC CAACACCGAC AATTAATCAA TTGCTCCATC TAACACTCCT CGGCTCCTTT ATACAAGATC AGCACCCTAC TTACTTTCCA GTGTGGATCG AGGCGAAGGG ATGATGCACG CGCTTGGGCG GATTCCCCTC GATTATTTGA CATGGGGAAA CCACGAAGCC GACATCAACC ACCGCACCGT TTGTCAACAC GTCCGCAACT TTGCTGGTAC GTGGCTCAAT TCCAATATGA TCGATCACGA AGCCATGGAT GCACAGAAAG AATACGATGT CATTGAACTG ACCTCACCTG ACGGATCCAA CCACCGCAAA ATTGGACTCG CCGCGGTCCT TTCCAGCGAC CCGGCCTTGT ACGCGCAATT TAAAGCACCC GGGCCTTTTG GTGGCGCAAC CGTCACTGAT CCGTGGGAAG CCCTGGCAAA GTACAAACGC TTGTTGGAAA AAGATCACGG TGTGGATTTG GTGGTACCTT TGCAGCATTT GTACGTCCCG GACGATCACA AGACTTGCCA CAAATTTGAT TTTCCGGTGG TGCTGTCTGG ACACGATCAT CACCGTGTTG ATGAAGTGGT GGATGGCACT CGGCTCATCA AACCGGGCAT GAATGCTGCG TACGCAGCTG TTGTGGAAAT TTCGTGGAAG ACGTCGTCGA GCGAAAAACC CGTTATTCGA TCACGATTCC TGCGCTGTCA GGATTGGGAC TGCCGATCCA GTTTTGGCCG AGGAGAACGA GCGTGCTTAT GATGCCCTGA TTCCTTTACG TAATACCGAA TTGGCTCGTG TTCCATCAAA CTTTGAACCT TTGACTTCCA ACAATAGTCG CGGCAGTGTC TGTACCATGG GCAGTTTCAT TTGTTCGCTG CTGCGATCGT CGCTCAATGT TTCGCGGCGG CAGCGGGATA ACAAAGTGGA CGCCGTCCTA CTCATGGGTG GCAATGTAAG AGGAAATGCC GACTACCCCG AAGGGTCATT CTTCTCGCTG GAAGCTCTAG AGGCCGAAAT AAAATCGGAC GAAGTAATAG CAGTAGTGAA CATGCCTGGT TGGCTGCTGG CCGCAGGCAT CGAAGCGACC CATGCCGGTG ACCCAATTCC AGGATGGATG CAGTATGATG TGGGCATCCG ACAAGATGAG AATAACGTGG TTACGCAAGT AGCTGGGCTT CCCATTGATC GCGAGCGTGT CTACCGAGTG GCGACCAAAA TTGGCGACTT GACCAACGGA CAAAGTATTC CGCTCACGCA ATACTACACA GAGAATCGAC ATCTTCTTCC ACCAAAGGGA GCGTACGTTA ATATACAGTC CGAGCTGATG GCCTACTTTG CAAGAAATCT TTGGCGAAAG CTGTGGGATG CCGTCTCCTT GGAGCTTGCG GAGACTTGCG 
ATACCGACAA CGACTGCAAC CCCGTAGGCC GGCTGGAAGT CCTTGACAAA ACCGGGGATG GTGAAGTCTC GGTGGCCGAA ATTCAAACCG CTTTACGAGA TTTGCTGGGG TATTCTGTGG ACGACCGAGA AACATCGTTG GCCAAGTTTG TTCATTCGTT TGCTGATACG ACCAGTACTG GACGGGTAAC ACTACGGGAT TTTGAAATTT TCTGCGACGA AATGGAGCAA ACTTACGAGC GAGACAGCTG GCGATTGTCG TATCCGAAAC CGTCCAAACC GACTGCTGGT ACTGCTAAAT CAACATAGTT CAAGCTTCGA GATTGTCTGC GATGCTCGCG CACCCAGAGG TTCCCAGAAA TTCTGCAAAT TCGGATTCTT TGTTGTCGTG ACCAACTTTC CAAGTGGTCC GATTCTTCCT TTGCCTTCCA CGATATCCAT TTTTACGGTC TTTATGCGCA GTGTAGGAAA TGGATGGAGG AAGACGACCA ATGGTTAGGA ACGGTGATGC TGGTTCGTGT GTTAACAGTT AGCAAAGCTG CGAATTGTGA GGAATTGACA CGTGTGCGCA TTTCGTAACG ACAGAACATG TGACATGGAT TCCGTGAAAT GATTGGCTCA ACCACAATGA CAGAAAATGA TTATTAGAGA ACATATACCC GTCAAGATTG TAGCTTCAAG TCCGGTGCGT AGATAAAATT AGAACTGTAC TCAAATTCCG GTAGGTATCG TAGTTTGAGC GCGACTGACT CTGAGAAAAA GCGTTCGGAG AAGGATCGCG ATCGGAACGA TCGATCCATC GAGATCGATC GAAAACTGCT AGAATGCCCT GCCCTCCCCT GCTGTGCAAA GACTATTTTG CGTGCAGAAC GATTCTGCAC AAATTCTGAA AAAAGACGAT TTCTGTACGA AAGGCACGAT GAAGACGCTT CAAGGCTTGC TCCCGCCTAC GTTCTATCTG ATGGGCTTTG TGCCTTTCTG GAGTCCAGTA CGTACGGCTG AATCCTTTTC GGTCGCTCGC CAGCCCCACG CTGGCGGGGG CCGAGCACCG GGACAACCCA ACAAGATCGA CACTACGGAA AGTTGGGAAG TCTCGCCCCG CAGCGATTCG GAATGCCGGC TCATCATTTG TCAGATCACC GACGTGTATA CGCTCGAAAA CTTGGCGCAC TTCAAAACGT TGATCGCTGA GACCAAGAAG CGGGCACCGG GGTCCACTGT AGTATCCATG TTGACGGGTG ACTTTTTGTG TAAGTCAAAA TAAGCGACAA TGCTCGTTCC ACCAATCTGG ATACGTAGCA TCCTTCACTC ACGTTTCCTT TTTTGCTTGG CAGCTCCGTA CCTTTTGTCC AGCGTCGATA AGGGCGCCGG AATGATGCAT GCTCTGAACA GCATTCCGCT CGACTATTTG TGTTGGGGCA ATCACGAAGC CGACATCGAG CACCACATCA CCTGTCGTCA CGTCAGGAAT TTTCACGCCA AGGGGGGCAA GTTCATCAAT TCCAACATGC TCGATCACGA CGCAATGGAC GCTCAACAGG AATATGATGT CATTGAATTG AAATCGGAGG ATGGTACCAA CACGCGACGC GTCGGCCTTA CGGCAGTCCT CTCCGACGAT CCAGCTCTGT ACTCTCACTT CAAGGGGAAA GGTGCTTTTG GTGGAGCCAC TTTGACGGAT CCCTGGGAAG CTTTGACCAA GTACAAGAAG ATTCTTGAAG ACGACGAAAA GTGTGATGTC GTTATTCCGC TTCAGCATTT GTACGTTCCG GACGATCACA AAACGTGCGA ACAATTCGAT TTCCCCCTAA 
TTCTGTCGGG TCACGATCAT CACATCGTGG ACGAAGTAGT GGAAGGAACT CGTCTCATCA AGCCCGGTAT GAACGCCGAC TTCGCCGCCG TGGTCGAAAT ATCTTGGAAT GACGCGACGG AAGAGAAGCC CAAGATACGC TCACAGTTCG TTCGTTGCAA GGATTGGGCT CCGGATCCTG TCATGGCGGA AGAAAATGAA CGAGCCTACG ACTCACTCCT TCCACTACGC AATACGGAAC TTGCGCGCAT TCCTTCGTAC TACGAACCGC TCACTTCCAA TAACTCTCGT GGCAGTGTCT GTAGCATGGG AAAGTATATT TGTTCGGTTC TGAAATCGGC ATTTAATATG AACCGTGGCA AAAAGAACCG TGTTGATGCT GTGCTGATCA TGGGAGGTAA CGTTAGGGGC AATGCGGACT ACCCCATCGG CTCCTACTTT TCCTTGGAGG CACTCGAGGC CGAAATAAAG TCGGACGAAA CAGTAGGCGT CATCTCGATG CCCGGTTGGC TTTTGGCGGA AGGGATCGAG GCAACGCACT GTGGCGACCC TATTCCTGGA TGGTTTCAGT ATGATGTAGG TATCCAGCAG GACGAAAACA ATGTGGTCAC ACACGTCGCC GGGCTCCCTA TCGACCGTGA TCGCATGTAC CGGGTTGCGA CGAAGATAGG TGACTTGACG AATGGACAGA GCCCACCATT TACGGAATAT TACCAGACCA ACCCCAAAAG TCTTCCACCC AAAGGTAACT ACGTCAATAT TCAGTCCGAG ATGATGAGTT ATTTTGCCCG GAACTTATGG AAACGCCTCT GGGACGCAAT TTCTCACGAA GTTGAAGATA CTTGCGATAT TGACGGCAAT TGTAGTCCCG AAGACCGCCT AGATGTTTTA GATTCGAACG GGGACGGGAC TGTCACCGTC GAAGAAATTC ACAATGCCCT CCGCGATCTT TTAGACTATT CGGTAGACGA TCGCGAAACC ACCCTGGCCG AATTTGTACA TGCCTTTGCC GATACGGATG GTAGCGGCAA GGTGACCGTC AAAGACTTTG AAGTTTTCTG TGACGACATG GCGGAGCAAG CTGTGATCAA TCGAGCTTTG GCAAGGGAAG CGATGGAACG GCAGCGCGAG ATAGCAGCAG CCGCCTCCAC ATAGTCCACG GAGTGCCCTG CCGCTAGTTT AGGGACAACC TGATCGATTC ACAACCTTTG ATTTAAAGTA GCATTAGTAG TATCAGAAAA TCCCATTAGT
|
Protein sequence | MKIPSRNRWR SRPGWAVAIL VSCGSNRVDG LAFQQSVRGG GRAASTPDKV DRVETWEASP CQDSECRLIL CHITDVYTLE HLAHFKTLVE ETKKNSEGSA VVSVLTGDFL SPYLLSSVDR GEGMMHALGR IPLDYLTWGN HEADINHRTV CQHVRNFAGT WLNSNMIDHE AMDAQKEYDV IELTSPDGSN HRKIGLAAVL SSDPALYAQF KAPGPFGGAT VTDPWEALAK YKRLLEKDHG VDLVVPLQHL YVPDDHKTCH KFDFPVVLSG HDHHRVDEVV DGTRLIKPGM NAAYAAVVEI SWKTSSSEKP VIRSRFLRFL AEENERAYDA LIPLRNTELA RVPSNFEPLT SNNSRGSVCT MGSFICSLLR SSLNVSRRQR DNKVDAVLLM GGNVRGNADY PEGSFFSLEA LEAEIKSDEV IAVVNMPGWL LAAGIEATHA GDPIPGWMQY DVGIRQDENN VVTQVAGLPI DRERVYRVAT KIGDLTNGQS IPLTQYYTEN RHLLPPKGAY VNIQSELMAY FARNLWRKLW DAVSLELAET CDTDNDCNPV GRLEVLDKTG DGEVSVAEIQ TALRDLLGYS VDDRETSLAK FVHSFADTTS TGRVTLRDFE IFCDEMEQTY ERDSWRLSYP KPSSFEIVCD ARAPRGSQKF CKFGFFVVVT NFPSGPILPL PSTISIFTVF MRSKMIIREH IPVKIVASSP NDSAQILKKD DFCTKGTMKT LQGLLPPTFY LMGFVPFWSP VRTAESFSVA RQPHAGGGRA PGQPNKIDTT ESWEVSPRSD SECRLIICQI TDVYTLENLA HFKTLIAETK KRAPGSTVVS MLTGDFLSPY LLSSVDKGAG MMHALNSIPL DYLCWGNHEA DIEHHITCRH VRNFHAKGGK FINSNMLDHD AMDAQQEYDV IELKSEDGTN TRRVGLTAVL SDDPALYSHF KGKGAFGGAT LTDPWEALTK YKKILEDDEK CDVVIPLQHL YVPDDHKTCE QFDFPLILSG HDHHIVDEVV EGTRLIKPGM NADFAAVVEI SWNDATEEKP KIRSQFVRCK DWAPDPVMAE ENERAYDSLL PLRNTELARI PSYYEPLTSN NSRGSVCSMG KYICSVLKSA FNMNRGKKNR VDAVLIMGGN VRGNADYPIG SYFSLEALEA EIKSDETVGV ISMPGWLLAE GIEATHCGDP IPGWFQYDVG IQQDENNVVT HVAGLPIDRD RMYRVATKIG DLTNGQSPPF TEYYQTNPKS LPPKGNYVNI QSEMMSYFAR NLWKRLWDAI SHEVEDTCDI DGNCSPEDRL DVLDSNGDGT VTVEEIHNAL RDLLDYSVDD RETTLAEFVH AFADTDGSGK VTVKDFEVFC DDMAEQAVIN RALAREAMER QREIAAAAST
|
| |
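The coordinate and composition fields in the record above can be cross-checked with a few lines of Python. This is a minimal sketch, not part of the source database. It assumes that Start bp and End bp are 1-based inclusive coordinates, which is consistent with the reported gene length, and that GC content is the percentage of G and C among all bases. The helper names `gene_length` and `gc_percent` are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def gc_percent(sequence: str) -> float:
    """Percentage of G/C bases; whitespace (e.g. the 10-base grouping) is ignored."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# Coordinates taken from the record: Start bp 1204419, End bp 1209278.
length = gene_length(1204419, 1209278)  # 4860, matching the Gene Length field
```

The 1370 aa protein needs 4110 coding bases (4113 with a stop codon), fewer than the 4860 bp gene span, which fits the "Is gene spliced: Yes" field. To check the reported GC content, pass the full Gene sequence field to `gc_percent`.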