Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_37731 |
Symbol | |
ID | 7202607 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011681 |
Strand | + |
Start bp | 821451 |
End bp | 825974 |
Gene Length | 4524 bp |
Protein Length | 1373 aa |
Translation table | |
GC content | 58% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181632 |
Protein GI | 219122605 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGAACG ACCGGATTCG GTCCGTTGAG ACACGCTCGC CACAGTGGGC CCTTACGGTA AGTGGCGAGC GACGAGCCGG GAGTGGCGAG TACCGAGTGG CTACCGGTCC CGCAGGAAAA TGTCCACGTA CAATGCGGGG ACGGGGAGTT TCTACGAACC ACTACCAAAA CTACTGCAAC ACTCTCTACT ACTACTAGCT ACTACTGCTA CTACTAGAGT AGTACTACGA GTATACAATT CGACCGCATT CGAGGTCAGT CGGTCTTGTC TTTTATTCGT CACAGCGATC GAATCGCATC CACGGTCCCT TTGGCGTCTG TGTTACATTT ACTCCCTCTT CCGGTAGTGG ACACACAGAA AGACAATTCA ATCGCTCACG AAGGGAATTC CTTTCTCCAC ACGCAACACA CATACACAAC ACACAGCACA CACTGCGCCA ATGCGGCGAC GTTTGCAAGT CGACGTGGCG GTGGGGTCGA CCGATCCTTC CGTGCCGTCG TCACCTACCG CACCGTCCCC CACGACGACT CCTCGTCGTC GCTCTCCGTA TCAAAGTTTT CCTCTCCCCG CGTATTTACC CTTGTGTCCC GGTGGTGTCG CCTATCCTGC CGTGGGGACC GGCACAGACC CTCGCGACTG GCACGTTCTC GGGTCCCCCG TCGGTACCAG CAGTAGCACC AGTACTAGTA GTACTAGTCC CAGTGGCAAC CAAAGTCCGT CTCCTGCCGT TCCCATCAGT ACTACCAATA CCAGCGCCAG TACTAGCACG ACGAGTCCGA ATGCTGCGAC GGCGATCCTC CCCCGGGCTA CACTACAACA ACTCGCCGCA CGTGCCAAGA CCAACGACGA TCAACTCCGA CAGTACGCCT TGTTGTACGC GTCGCACGAG GCCAACACGG CAAGCTCCCT GTCCAACACT GCAACAACAA CAACGACACA CGTCTTGCCC AGCCTCGCGA TACTCACCGA AGCCGTCCTG GGGCGCCTGA CGGGGGTCGC ACCCTCTCAC GTCAAAACGG CGGTAGATAT ACTCCTCGGC CCCTTTTTTC AAGATCCCAA CGATCCCCCG TGCCCGCAAT CCGCCGACGA CGTTGGGAAC TGCAAACCTT CGGCCGTTGC TACCGACAGC CGTATCGGTA CCGGTATTGG TACCAGTGAC GGCAGCAGCC GTACCCGTAC AACCAGCTCT AGTACTACCC ACCACCACCA CAGCAACAAT CACCACCACA ACCACAGTAC CATCACCACG GCGCAACGAT TGCAAACGCT CTTGGACCAG GCCGAACGCG AGCACCAACA ACAGCTCCAC CAGGACAAAG AACCCTCACC CACCACTACT CGGCGAACCC GGCCCTGTGG ATACGTCTTT CAGCGAGGAG ATATTGCCTG GAACTGTAGA ACCTGCCAGA CCGACCCCAC CTGCGTCATC TGTGACGCGT GCTTCCGGGA CAGCAACCAC GAAGGTCACG AGGTGTACTT CCACCGTACC ACACCGGGAG GGTGTTGCGA TTGTGGGGAC ACCGAAGCCT GGAACATTGC CGGTTGTTGC GAACGACACC GTCCTCCACC AGCGCTACTT TCCGACGACG CCCCGCATCA CCCGGACGAT CCTTTTGAAG CCGTCCGCGC CAGCCGCCGC GGATACACAC TGGCCCACGA AACCCTCACC CTCGAACCCA CCGCTTTGCC GCCGCGTCTT ACTGCCGCGT TGGCCGTCGT GGTCGGAGCC GCCGTACACG CCCTACTCGA TGCCGTCAAC GGCGCCGGGA TTGGCGCCGA TCCCGTACAG TGGAAACGAC AATGGGCCGA TGAGGCGGCC AAAATTGCCA ACGGTGTCGT GCACCCCGAA GATTACGCCC TGGTCCAACC CAAAGACCGT CAGGGCAATG ACGACGACGC GGCGGCGGTC CCCAAAACAT CGACCCACCC GTTTCTCTGC AAGCTTTTTG GGAGGAGGCC CCTGCCACCA CTACTGCATT TCCCCACGGC TTTCATTTGC AACTACGACT CCACAATGAC GACGTGCACA CGTTCGATGA AGTCATTGAG GCTCTACACA AACCCCGGTC ACTGCGACGC CACTCCGAAG ACCGACAATC TCTCGTGGCC CTCCGTGATC ACGCTACGGA AATGACGCAC CACGTCGACG CTGACGGACA AGTCACGGTC AAAACATTTA CTTCCTTTCC GGCCGCCCTC CAGGGATATC AAAGTCTCAA ACGACGGGGC CTGCACTGCG CCGTGGTTAG TTCCGTGCAA GTCCAAGCCG AACACCGTGC CCGTGCCTTG GCGTCGTGGC TGTCCGAAAT TTCTGCCGCC CATCCCGCCG CGGCCGCCCT GGTCGTCCAC GCCTTGGTAC AAGTAGGCGA AGGCGATGAA ACCCTGGCGG ACTTTAGTGT GTGGCCCCGA GCGCGTTCGA TTCCACCCTG GGCCGCCACG GACGCGTCCT CGGAGGAACA GGCCTGTCTG CGGCGCTTCG CCGCCTTTCC ACCCCATTTA CCTTCGAGTT ACGTCACCCG CGAACAAGCC GAACTCCTCC ACGGCATTGC CCTGACCGTG CAAGTGGCGG ATTTTGTTCA CGTCACGGGC GCAGATCCGC ATTTTTACGG ACGCGTGCCT TACCGACTGT GTGCCGATCG GTACAAGAAA TCTCCCCACG CCCTGTGGGG GACCCTCCCG CAGTGCTACG TCGACCCTAC ACCACCACAC GCCAAACACC CCCTCCTGCA GCGATTGGCG ACCGTGACTA CTGACGGTAC GGAGAATACA ACGAATCGCG CGGACTGCTG GAAGGATGTG CCGAATGCGT TGACGGAAAC GGTGTACGTG GTCGATACCG ATTTGCGGAA ACAGCAAGAG GCGGACCGGA TCACGTCGAC GGTCTTTCCA CATCGCTTGC CTGGCTTGCA ACTGGTCAGT GGTGTTGGCA CGATTCGTCT GGATCATCTC GACGCACGTC GACCCCCGTT GCCCAGTCCA ATGGATTGGC GGCATCTTTT GGCTACGTCA TCTTTTCGGG CCCCCGCGTC GACAATTCTC TGGTTGCTCT TGCTCGATCC CTATCCGACG AAACAAGTTC GCGGTGCCAT TCACGCGCTT ATACTGTCGC TGCTCACGGA CGCCCGCTTC AAATCACGGG TTGCTGGCGC TCTTGGTGTC GCGTACCGAC CGTTGAGTAC TCTGTTTTGC GCCGGCGTCG GGACGGAGGC GGACTCGCCT CTGCATTTTA CGGTGCAAAT TTTTACGGCA GGCAGTTTGG TGCGGGCTCT AGGGAGCGGA CCCGCGACTG AGGCCTTGCT GATTTCCGAC GACCCGAATC GGGCAGGGCA CAGCGAGGCA TCGATTGGTG TTTTCACATC ACCCATCGCG CACACGATCG TCCGATGCAT CCACACGAAC TTGCTGGGTG CCACGAAAGA GGTCAATATG ATTTTGAACC ACACGACTTC CGGGACGGAT GATGCGGAAG AAGATCCCGT CTTCCAGCCG TCGAACGATA GTTTGTTGCC GGCCTTGACG TACGTGGCGG GAGAACATCC GCTCATGACA CCCTTACCAG CGGCCCCGGA CGACGGCTTT TTGGATTCCC GGTCGACCCG TCACAAGCGG CTGCCGCATA TGCTGCGCGA TTTAGAATAC GTCATTGAGA CACCTGGTAC CGCAATCCGA CTTCTCCTAC CACGGCGTTT CCCGGTATAC CAAGGGCCAC CTTTGTCCAT GCGAGGAGAA GACGTCTTGG CATTTCCCGT CGTCTTTTCC CGCATGCTGC GGCTGGCTCA AGGAATGGAT CCACAAAAGC GTAAGATTTC GGGAGGGCAT GTCGAGTACG AACACATACG ATGGCTGGAA GCGTTTGGGT TGAGTCTTAA CTTTGCCGGC GCGCGCGACG CCTTGTCGGA AAGCCCGCCG CGAAGCAGCA GCGTCGCACT TGGTGCAGAT TATATGGAAA ATGTTATGGG CGTGCGTGAA GCTCTCGGGA ACATTGGTGC ATCCTTGCTA CGTGAAATAA AGCTTTGGCT GTATCGCGAA GGTATGCTCG AGACAGGTCT TCCGCTACCA CCCGGGGGCG CCCACGGAGC GACGGATATG GCCCAGGTGG AGTCACTACA GCGGAGCACT CTACACGTAT CGGGATCGCA GAGCACGTCC AATGATGCTT TATCCAATAG CAATGCCGGT GCTGTTGCTC TGGCTTGTGC CACGGGCGTG AAAATGACAG AGGCCCAATT AAGTCTCATT GAGAACGTGT TGAAGGCCGA AGCGGTGGAG CGCTTCCACT CCAAACAGGG ACAAGTGTTG ACGCCCAAGT CTATGGGTCC AGTGATGGGC GACTGGCTGC GCGTTCCGCA TTCACCACTC GCGGGAGATT CTCTTTCGTT TCACATTCCC TTGCATCGAG CTTTGGCGGA AAGCATTCGA TGCGTGTGCG CCCTTTCTGT TTCCGAGGCA TCGAGAAAAT CGGAACCATC CGGATGGTGG AAGCTTCCAG TCCTTGACGG AATTCCTACT AGTGTGTCCT CTGA
|
Protein sequence | MKNDRIRSVE TRSPQWALTS STTSIQFDRI RAHTAPMRRR LQVDVAVGST DPSVPSSPTA PSPTTTPRRR SPYQSFPLPA YLPLCPGGVA YPAVGTGTDP RDWHVLGSPV GTSSSTSTSS TSPSGNQSPS PAVPISTTNT SASTSTTSPN AATAILPRAT LQQLAARAKT NDDQLRQYAL LYASHEANTA SSLSNTATTT TTHVLPSLAI LTEAVLGRLT GVAPSHVKTA VDILLGPFFQ DPNDPPCPQS ADDVGNCKPS AVATDSRIGT GIGTSDGSSR TRTTSSSTTH HHHSNNHHHN HSTITTAQRL QTLLDQAERE HQQQLHQDKE PSPTTTRRTR PCGYVFQRGD IAWNCRTCQT DPTCVICDAC FRDSNHEGHE VYFHRTTPGG CCDCGDTEAW NIAGCCERHR PPPALLSDDA PHHPDDPFEA VRASRRGYTL AHETLTLEPT ALPPRLTAAL AVVVGAAVHA LLDAVNGAGI GADPVQWKRQ WADEAAKIAN GVVHPEDYAL VQPKDRQGND DDAAAEAPAT TTAFPHGFHL QLRLHNDDVH TFDEVIEALH KPRSLRRHSE DRQSLVALRD HATEMTHHVD ADGQVTVKTF TSFPAALQGY QSLKRRGLHC AVVSSVQVQA EHRARALASW LSEISAAHPA AAALVVHALV QVGEGDETLA DFSVWPRARS IPPWAATDAS SEEQACLRRF AAFPPHLPSS YVTREQAELL HGIALTVQVA DFVHVTGADP HFYGRVPYRL CADRYKKSPH ALWGTLPQCY VDPTPPHAKH PLLQRLATVT TDGTENTTNR ADCWKDVPNA LTETVYVVDT DLRKQQEADR ITSTVFPHRL PGLQLVSGVG TIRLDHLDAR RPPLPSPMDW RHLLATSSFR APASTILWLL LLDPYPTKQV RGAIHALILS LLTDARFKSR VAGALGVAYR PLSTLFCAGV GTEADSPLHF TVQIFTAGSL VRALGSGPAT EALLISDDPN RAGHSEASIG VFTSPIAHTI VRCIHTNLLG ATKEVNMILN HTTSGTDDAE EDPVFQPSND SLLPALTYVA GEHPLMTPLP AAPDDGFLDS RSTRHKRLPH MLRDLEYVIE TPGTAIRLLL PRRFPVYQGP PLSMRGEDVL AFPVVFSRML RLAQGMDPQK RKISGGHVEY EHIRWLEAFG LSLNFAGARD ALSESPPRSS SVALGADYME NVMGVREALG NIGASLLREI KLWLYREGML ETGLPLPPGG AHGATDMAQV ESLQRSTLHV SGSQSTSNDA LSNSNAGAVA LACATGVKMT EAQLSLIENV LKAEAVERFH SKQGQVLTPK SMGPVMGDWL RVPHSPLAGD SLSFHIPLHR ALAESIRCVC ALSVSEASRK SEPSGWWKLP CVL
|
| |