Gene Information |
Locus tag | PHATRDRAFT_48146 |
Symbol | |
ID | 7203296 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011684 |
Strand | + |
Start bp | 338938 |
End bp | 343983 |
Gene Length | 5046 bp |
Protein Length | 1348 aa |
Translation table | |
GC content | 53% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002182515 |
Protein GI | 219124448 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATCGTTGAGC GCTTGTCTTG TTGCAGCAGC TGTTGCTATT GCTGTTAAAG TCTCCTGTTA CAGTTTCTTC TCAACTGGCA ACATCGAGTG GACGTTCTCA TCATGTTCGA AGTGTCGTGG GGCGAGATGG GCGTGTGCAT CGGTTTGGGT CTCTTCTTAA TCGGCCGTGA GGATTTGCCC AAGGCGGCGC GTATGGCCGG TACGCAGGTG GGCCGAGTGG TGGGTTTGTT GCAGGGTGCG CGGGCCCGCG CCGATCAGTT CGCCGGCCAG AACGAACTCC ATCAGCTGCA GAATGAACTG CGGTCGGGAC TCCGGGATTT GGATGCGGTG AAATCGGAAT TGGCCGTCTC GTTGAGTTCG CGGGGAGTGG TAGGACGACA ACTGGGAGCC ACCGTGCCGG GGGTCAACCG GGCAACCGTG TCGCCGATGA CCACACATAC CATGTTGACG GGAGGGACTG CTGCGGGAAG TCTCGCGGCT TTAGGGACGA TGGGGCTGCC CGTAGGACGA CCAGCCGGTG GGGACGATGA CGGCACGAAC GTCGATGGGA CAACGAATGC CGCGGGCTCG GAAGGACTGT CCTGGACGTC GTCGCCCGGT AGTCCCATAC ATGCACCGCC CCGAACGTTG CCTCCCGTCT ACCAAACGGT GGCAGCCGTC GCCGAGTCGG AATGGGAATC CCAAGGGATT GCCTTTCGTT CCCGAGCCGA ACAAGGATCG GGTCTGGCCA ACGCGTCGGA TCAAGCGACG ACGGGGTCGG CCCTTTTGGC CAGTCTTTTG CGACAATCGC TCGTCTTTGA CCAATACGAT CGAGTGGTAG CGGATCAAGA CGAAGCTCTC CAATCCAAAA TTAGTCAAAT TAAACAAAAG GCTCGAGATC AGGTGTCCGC ATCCAGTGAC ACGGAGGTCC CCCGATGATT GGTAGATGGG AGGACCAAGC CCTATCCGGA ACGTCGTTGG AGGCCCAAGC CGTGTTGCTT TGTTTCTGAC CGCGGCGCTG TCGGCAATTT TCCGAGTTGA TCCACACCAG CTCCACGAAA CTCGTTCCGC TCCCGTAGTG TTGGCCACCC ATCGGGGACA GTATCTTCTG CTGCGTAAAA TTCATGTTTT TCAAAGCTAG CACGCACGGG TCGAAAAATC ATCTGTCTAT CCAACAATCG AGAGTGTACG AAAGGTCTGC AAAGCCATAT ACTACTCAAT TAGATTTGGG CTTGTGATCC AAAGTGCACG GTTTTCTATG TAATGTCCGT TAGAGCAGCC TAGGCAACAA GCCTCGATAA AGTTCGAATG CTGGTGTCAC CGAATAGGTA GTGACTGTGA AATAAACAAG GTATAGATAG CATAGTATCT CACTGTCAGA GTGTTTCGAG TCGACACGAT TTACTGTTCG TTGTAGGTAG CTAGCTCATC GTCTTGCCGA CTGTACGGGA TTCAATTGCT CACAACGGTA GGTATGCTTT TTTCGAGCAG TAGGGTAGGT TCAATACACT CTCTTTACAG TTAACAAAGA GTGGGAAGCG CCGTCTGTCG AGACATGGAC GACTCCCCAC CTCGAAACAC AGTCGCAAAG GGATCACGAT CAGATAGTGC GGGGGGTTGT AGGAAGTCTG GAAATGCTCC ACAACTGTAC CGACACTTGT AGTCATCCAC TTCCCCATCC ATTCCTAGCG GTGCCAAAAG GACCGTCGGC AAGTACGTAC TGCTTGTGGA ATCACTAGGG TTGAGTATAC TGTATCCAAA GAAAGTGTAG GCCCGAGTGA CACCGTACGA GGCCTAGTAG AGAGAGAGAG TGTGTGTGTG 
AGTCGGATTG TTCTATACGA ACAGTATGAG TACGGACACG AACGAGGACA ATAGCTTGCT CGCCGCGGAC GAAGCATCGA TTCTGGACGG AATGGACGGA CCGCCACAAC CTCCCCCCGG AGAGATCCGG GAAGATCCCT GGATCCTACA GCCTCCCACC ATGGCGGCCC TCGGACAGCC CCCTCTCGCT ACCCCGCCGA CGGCTCTGGG TGATCCGTCC GCGGCGGCTC TGTGGCGGGA ACGCGAACGC CGCCAGCGTA CCGTACGACT CCTCATGATG TTCCTGCTCA TGCTCTTACT CATGGACGGC GAAGACGGGA ACAATAACAA CGCACCGCAC AAACACGGCT TGCGATCGCG GCGGGACCCA CGGAGAAAAC GCGACGCCGG GAACCGTCCG CTCGATTCCG TTGTCTGGGA CGCACGCACG TTGCAAGAAT CGCGGTTACG GGAGATTGCC CGGACGCATC CGCGGTATCA GGCCTTGATT GATAAGAATA AGGGAGAAAA TGTAGAAGCT GGTGTTTGGG AATGGGCGCA GCAGTTGGCG GAACAAGAAC GGGACGAGTT TGGGGTCGAA GCGGCTCCGG CCAGTAAACC AATGTTGTCA CCAGCAGGTA CGATTGTGGA TGAAGAAGCC ACGCGAAAGG TGTGGACGTA TCCGTGGAAT GCGACGGGAT TTTATCGCGG TGAATGGTCC CGACGAATCG CTAAACAAGC AGACGAAACG ACCCCTGCGG TTTCCAGACC AAAAGTCCAA GATGCGAACG CGACTGTGGC CCAAGAACGA AGCCTTCTCG ATCCGGTGGC TCTGGAAGAG ACACTCTTGG AAACGCTTCA GAGTCGTGGG ATCCAGGCGG GCGTCGTTTT GTTACCAAAC GAATTGGCCG TGGCTACCCG GGATGACAAC AATCTAACGT CGATTCGTTG GGAGCAAATG ACCATGGATG CCAACGGGAA AATGTACAAA TCGGTCCAAG CCACTACCGA AGACCAACAT ACTAACAATG ATAAAGAGGG CAGCCCATCA ATCACCCTTA CTCACAACTC AGGTCGCGCC GCGTTCCAGA TATACGCTAG ATCGATTCCC GCGATGAAGG AACTGAGTTT GGTGGATGGG TTCGTCAAGC TCTATGATTC TACGAGTCCG GGATACTCAA CCCGCAAAGA TATTCTGTTG CGTGTACGGG GCGTGCTGAT ACACGCCATA GGGCAACTTT CCTTGGTGTC GAATGTGGAC GTGTCGAGGA GCGCCTTTGT GATTATGCCC TCCGAAGCGC GTGCAACGCA ACATCGGCGT TTACGGACGG CTTTGGAAGA TTTGGACGGT GCCAGTTTGC CCCAGATCCG TCAAGATGCG CTGGACTTGT ACGGTGGTCG TTTGTCTACG GAGGACCGTG ACTGGGCGTT GCTTTCTACC CCCTTTGGTG GTCGTCGTTT ATCAAGCGAA ACTTCCACCC CAAGGAATGC GAGAGCAGCG ATGGAGGAAG CCACGGAAGT GGACGCGGAG ACTTTACCAG TCTCCTCGAA TGCTACACAT CAACAATCCG AATCAACGTT GGAACCTATT CGGATTCCGT GGTCGGATGT TGTTGTTCCT TATCCCTTTG TTCGAGATGA CAAGGAGGAG ACGGTCCGTC GGGTTCGAAC ACCAGCAGCA CGTCGTATGC CTCCTCGCGA GCAACTGCTA GAATCCAATG CTGCCTCCTG TGAATTTGAA ATTACAATGA GTATTGAGCC GACAGAGTGG ACCGTGGGTG CGTGGCGAGC GCTTTTAAGC CGACACGCAA ACGAGGCCAA GCGATTGGAT 
CCTTCCAAAG CTGGGGATAT CACAGAGGAT TCACCAAGTG GGAAGGAAAA TGTCAGTACT CCACCCATGG GCCCCACCAA TCGTCGGAAA ATACTTCAAG ATCAGGCTTT GGTTATGAAT ATGGCCGGAT CAATCCACTC GCCAAACTGC GACTTTACAG CTACCTTGAA TGTCACGGCA ATTCGGACGG ACTGGGAAGC TACGACGGGC AAGGTCATCA ATTACAGTTT TTACATGATG CTGGTTTGCA TGGCGCAGAT TATATTACTA CTGCGTCAAC TATTACACTC GCAGGCTCAA TCCGCGGCGG TTCGTGTATC AATGCTCTGC ATTGGATGGC AAGCAGTCAT TGACGCACTC GTGTGTTTGG TACACATTTA CTTCAGCCTT GCCATGCAGC CGCTTTTTAC GGCCTTTGCC AGTGTGGCTT TTTTCAAACT ACTGATATTT TGTGTCATTG AGATGAAGTA CATGAGCAGT AAGTGGATCG ATGCTGATGT TAAAAAAAAA CTCTCGGCAT CTGAATATAG CCTCATGGAA TTCTTTTGCC GTGCGTAGTT ATCATTCAGG CGAGAAACAG CAGCAATGGT GGTCAGTCAA CAGAAGTTTT GAGAAGACAA GTGGCAATGT TACATCTTCG GTTTTATCTT GCATTGTTCG CGGCTTTTTT AATGTTGTTT TACATGGGCG AAAAGTACCG CACATTTTAT GTTCTTGGGT TGTACTCTTT CTGGGTTCCA CAGATCATTC TGAACATAAT CACCGAAGCG AAGAACCCGC TGCACAAGTA CTTCATCAAT GGTATGAGTC TATCGCGACT TGTGGCGCCG CTGTACGTGT TCGGTGTGCC GAACAACTTT TTGAAGGAAA TCCATGCCGA TGCTCCGACG GACGCATGGT TGTGCCAGCT ACTTGTGCTG TGGGTCGGGG TACAGGTGGC TATTTTACAC TCCCAATCGA AGTACGGAAC TCGATTTATG ATTCCAGCAC GATTTCTGCC ACCCAAGTTT GACTACAGCC GGCCAATTCC CTCATCCATG CTTCCACCCG GGGCCTTGGA ATCGCCGGTC CCTGAGTTGG CGCTGGAGCG GAATAGCGAA AACCAGCCGT TGGTGACATC CAGCCCGGAC ACCTCGTCAC CAGCGCGCGA TCGATTACGG CATACGACGG CCGTGACGAC AAGGAATCGA ATGCGACGAA CGAACCGAAC CGAAAGCAGC GGAATGACTA CCGAAACGCT CGATCATTCA CGATGCAATA GTCCGTTGGC GCCGACGCTA GACTGCAGTA TTTGCTACGA CGCAATCAAT GTACGAGATC AGCTTGGCTA CATGCTGGCT CCGTGCAATC ACTTGTTCCA TCGGGATTGC CTTATACAAT GGATGGATGT GAAAATGGAA TGTCCCATTT GCCGCACCGA ATTACCCGCA TTATAG
|
Protein sequence | MFEVSWGEMG VCIGLGLFLI GREDLPKAAR MAGTQVGRVV GLLQGARARA DQFAGQNELH QLQNELRSGL RDLDAVKSEL AVSLSSRGVV GRQLGATVPG VNRATVSPMT THTMLTGGTA AGSLAALGTM GLPVGRPAGG DDDGTNVDGT TNAAGSEGLS WTSSPGSPIH APPRTLPPVY QTVAAVAESE WESQGIAFRS RAEQGSGLAN ASDQATTGSA LLASLLRQSL VFDQYDRVVA DQDEALQSKI SQIKQKARDQ VSASSDTEVA SSSSCRLYGI QLLTTSSTSP SIPSGAKRTV GNMSTDTNED NSLLAADEAS ILDGMDGPPQ PPPGEIREDP WILQPPTMAA LGQPPLATPP TALGDPSAAA LWRERERRQR TVRLLMMFLL MLLLMDGEDG NNNNAPHKHG LRSRRDPRRK RDAGNRPLDS VVWDARTLQE SRLREIARTH PRYQALIDKN KGENVEAGVW EWAQQLAEQE RDEFGVEAAP ASKPMLSPAG TIVDEEATRK VWTYPWNATG FYRGEWSRRI AKQADETTPA VSRPKVQDAN ATVAQERSLL DPVALEETLL ETLQSRGIQA GVVLLPNELA VATRDDNNLT SIRWEQMTMD ANGKMYKSVQ ATTEDQHTNN DKEGSPSITL THNSGRAAFQ IYARSIPAMK ELSLVDGFVK LYDSTSPGYS TRKDILLRVR GVLIHAIGQL SLVSNVDVSR SAFVIMPSEA RATQHRRLRT ALEDLDGASL PQIRQDALDL YGGRLSTEDR DWALLSTPFG GRRLSSETST PRNARAAMEE ATEVDAETLP VSSNATHQQS ESTLEPIRIP WSDVVVPYPF VRDDKEETVR RVRTPAARRM PPREQLLESN AASCEFEITM SIEPTEWTVG AWRALLSRHA NEAKRLDPSK AGDITEDSPS GKENVSTPPM GPTNRRKILQ DQALVMNMAG SIHSPNCDFT ATLNVTAIRT DWEATTGKVI NYSFYMMLVC MAQIILLLRQ LLHSQAQSAA VRVSMLCIGW QAVIDALVCL VHIYFSLAMQ PLFTAFASVA FFKLLIFCVI EMKYMSIIIQ ARNSSNGGQS TEVLRRQVAM LHLRFYLALF AAFLMLFYMG EKYRTFYVLG LYSFWVPQII LNIITEAKNP LHKYFINGMS LSRLVAPLYV FGVPNNFLKE IHADAPTDAW LCQLLVLWVG VQVAILHSQS KYGTRFMIPA RFLPPKFDYS RPIPSSMLPP GALESPVPEL ALERNSENQP LVTSSPDTSS PARDRLRHTT AVTTRNRMRR TNRTESSGMT TETLDHSRCN SPLAPTLDCS ICYDAINVRD QLGYMLAPCN HLFHRDCLIQ WMDVKMECPI CRTELPAL
|