Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_48002 |
Symbol | |
ID | 7203001 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011683 |
Strand | - |
Start bp | 667861 |
End bp | 675452 |
Gene Length | 7592 bp |
Protein Length | 2001 aa |
Translation table | |
GC content | 54% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002182440 |
Protein GI | 219124289 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCACCAT TAGCGACGCA CCAGTCCGAG CTGGACTTCC TCCAGACGCA GCTGTCTGCG CTGGCGAGCG AAGGCGATCG ATTAACGTCC TCCTTCACGT ACGGTCAGGA AGCCACCACG AAATTGCCGG GTACGTCGGC TACTGTTTTC CTTTGCCGTC TATCTGTATT TGTCAGTGTG CAGCCTACGT CCTCGTCCCA TCTCACTTCA ATTGCTGGTC TTTGTCACTG CTTACCCTAC CACGACAGAT CCTCCGCTAG CCTCGCAATT CGTTCGGCAG CAAGCGCATC GCGTACGAGC GTCGGAACGT CATTTGCTGC ACTTGACCGA ACAGATTCTA CAGTATCGTC GTAAGGTCGT CGATGACGAC CAGGACGACA CCGATCGTCC ACAACCAATG ACTCTAACGG AACTCGCACA CGTCGTACGA TCTATACACC ACCTAGAAAC CCAACAAATT GCTCGTTTGG AGGAGCTGTG GGCACACCTC GACAAAGATG CACAACAAGG TTCGACACCG CTTCCGTCAC CGAATAATCC TATCGAAGAC CCATCAACGT CGTATCCCTG GTCCATGAAG GGTCCATTGG ATGCAGTGGA GGAAACCGAC AACGAAACGG ACCGCACGGC TCGATCTTCC CGTTCGTCGG GTTTCGGTTC GGCACAATCA CGTCGATACC GTACGGATTC GCTCCGGACT CCATCCACGC CTTCCGGACA CTCCGTACGA CTATCCACCA GTACCACCAA AACACTCGAA CGTCGAGGCT TGCTCCGCCA TTCAACGGGT ACGCAACCGG AACGGCCGTC ACCGGCATCG CGACCTAATC CACTGTTTCA GCAAGCGTTT GCGGCGTTGG AACACAAACT GGCGACAGAA TTACACCAAA TCAACGGCAA GGTGACTCCA CACGTACAGT CCGTCGAGAC TGACACCACC ACCAACACCA ACAAAGGCTT CGACTACAAC TATAACAAGG AGAACAACGC CGTTTCCGAC ACGGACGAAG AAAAATCCAT GCTCGACGGC TCGTTGGACG AGAGTAGTTA CGACGGACAC ACCATTGCTA CAGTGTCGAC CAAAGTTAAT CCGCATAAAA GACTCTCTAC TGTACGCGCA GCGGAACCTT TACCTCCACA CTCAGATCCA ACCACACAAA GCATCTCGTC GGCCAGTGAA TCATCGGATA TCTTTGACGA TGACGACGAT GATCACGGCA CATACGATGA CTCGGCCATG TACTGTCCAA CCCTGGTGGC TCCACCGCAT CGTAGTCTGA CGCGCGCCGT CGGTACAACA CCCCAGACCG CAGGAGACGC ACACGACGTG ACGCACATCG AGGTTGACGT AACAGCGTCG CCTTCCAATA CCAATCTCAC CATGGACGTC ACCACTCTCT ATGACGAAGA CGAGCTTTTG GACCACTCGG CTACTTCGGA CCACCTTTTG GACGCCAGCT TTGCATCGAC TGAGACGCCC ATCCTGGACC GATATCGCTT GGATCCGGAC GATGAGTTCC CGAACGGATT TAAGGTTGTT CCCAATCGCC GAGGCCAGCA CCATCGCAGT AAGGTTGTTA GTGACCACAA AGCCGAAGCA CATCGTCCAC ACCCATTGAT TTGTAAGGAA CACAGACGCG TCGCCGGATC TATACACGAC GTACAAACCC AACACGTCAC TCGCTTGGGT GAGCGACGGG GACATCTCGC CACGAATCGG CACGGTTCGG CGCCGCTTCC GTCAACGGAT TGTCGTCGCG CCAGCGCAGC AACATCGCTT CCATGGTCAA TGAAGGGTCC CCTGGACGCT GTGATGGAAA CTGAGAACGA AACAGAGCGC AGCGTTGGAT CGTCCGAAAC GCCCATACTG GACCGGTATC GCTTGGATCC GGACGACGAG TCCCCGAACG GATTCCGGGT CGTCCCCAAT CGTCGAGGCC AGCACCACTG CGGTAAAGGT CACGACGCGA GTCGAAAATC CCTGCGCAAC GAAGGAAACC ACGCGGCCGA GGAAAGTCCT ACCGTCTCGA CACGGCGTGG TTCCTCCCGA CGGGCGTACC GAAAGACTCC GTTTCCCACA CAGAAGAGAA CACCATTGGA GCCACTTATT GACGAGAATG AAGTGCTAGA AAGCGACAAT GATGCTTCGT TCGACAAGCC ACAATTCCCA GAGGGGGCGG GGATGACTCC CCGGCGTCCC TCAATGAATC TGTCTAGTAG CATGAGACGC TGGAGTTCCT CTCCGGCCAC GGAATATCCA GCCAGTGGCT ACGGGAGTGG TCGTGCTTCG CTACCAACGT CACTGTTGTA CGCCACCGCA CAACTTTCGA CACCCCGACC GCCGCCGTTG GCGACTCGGA CCCAATCTAC GATTCCGCCG TTGCGTCCGA AAGCGTCATT ATCGGAACAC CAGTCGTCCC GTTCTGCCTA TCTTATACCA CCGATTGGCG ACGCGGAATA TAACGAGGCT CCACGCGTTG TTCGCATGCA AGTTGCCTTA AACGACGTAC AACTTGCTGT GTCGGCGTTG AACGAGGTCT TGGCGGTGGC ACAATCGCGG TCGCCAAAAA GTGTGCGATA CATCCGAGAG GAAGAAGCGA AATCGATTTT GGAACCGTTG GGTTTAGATG GGCGCAAAAG CAAGAGTATT CTGATGAGCC TATGTCATTG GAGACGGTTG GTAATGCATC GGGAGGAGCA CGGAATTGTG TTTACAATTG TAGGTGACCA AGACTGACAG TGAACCCATT CCTTGGTCGA GTTGCTTGTA GCTCTCGTGT TTGAATGTAT TTGTAAACTT GTGATGGTTT GTGCTACTAT CACTTTTTTC TACTGCTACT GCTACTTTTG GATGCAGTTT CGGGTGGTCT GCTCCTTTCA AATTTGCGCC AAAGTAAAAC AATAGCGTGT GACGCGTCGT CAGGAGTTAT TTACTGTCAT GTCGTTGGAG AGCGGAACGG CGATTCCACA TGATACGCTT TATTAGGCAT TGGTACTGTC CAAATGTAAA CAGAATGTCG TAGAGGCATT CCTACTTTCC AAAAAGTTAT AATTGTTTGG CGGGAAAAGC CGACACCTTA CGTTTTTCAA GTTGTTCAGT TAGCATTCTG AATACCACTG GTAGTACTAC TGGTTCGCTG GCTGGATTTC ACTGCTCCCA ACTATTGGCA CTGACAGTGA TCCGCCGCAC CCTCGCCTTT GGAGTGCCAA CCTTGTGGCT TGGGTTTGTC TACCCTTCCT ACTATTGCGG GTTATCCCGA GCTTCGAGAC CATAGCACAC CGCCAATGAA GCAGCTCTTC TGTTAGAGAT TGGATCCGAG ACGCGAATCG GTCTAATAGA GGGGACAGCC AGTCGGATCG CCGAGGAAGA CGCTAGTAGC AATCGACTCT GACGAATCGA TCCGAGAGGT GATACAGAAG TAGTGCTTAT ACTAATACTT CTAGTACTGC GGCTGTGCTA GGCTATTGCT CATGACGGAC GCTCCTTGCT CTTCCGCGAG AGAGCAGTCG ATCCCGTCAT CAGCTACCAA GCCGGCTACG AATGCGGGTG CGGGTTTAGT GCAGCAACTG TTGTTTCCAG GCTTTACCGA TGCCGGTTCC GACGACACCT TCCAAGCACC GTCGCCGTCA TCGTTACCCT CCGACGGAAA TCTGGAGCCT ACTAATCCCT CAGAAGTAAC GTGTGTGGAC GACACAAACG ACAGCGTCGT ATCCGAAGAA GCTCCGACCG AAATACACGA AAATTTGACC CGGAATCAAT CCCGTGCTGG TGAATCGTCT AGGTTGGTGT CGACCAACGC CGTCCGCTTG CCCCGATCCA TCGATACCCC GACGACGACA ACAATGGACA CTATGGTATC CCACTTTCCT CCCTTTTTCG CCACCACCAA GCCGTCCCGC CCGACTACCA CTCTCGCCTG CACGTCTCCA TCGGATCCTG TATCCCCGCC CCGAGTGCCC CGGGTGGGGG TCCCGGCGAC GCTCACGAGC TCCATCGTGG TGGGCACCGG ACTGAGCAAT CCCGTGGAGA CGGCGGACCA CAGTTTGGCC GATGATGATG ACGACGACGA CCACAGTATC GATAGCGACG ACAGTGACGA TCTTCAACGG AAGGAAACGA ATGATACGGA AAACGTTTTT GGGGACGATA CAGCGACAAT CGATGCCACC GTGACACGGC CACTCAATAT TCCACTTGAT GAAGCCATGC AACAACGAGC GCTGCATGGC CTCGACTACC CCATTGATAA CCACGATCAC TGGGAAGAAG AAGCCAATCC ATCCCAAACG GACTTTGGAA AACAGCTCAT GTTTCAACGA AATATGGAAA CGTCGTCGGA TGTCTTGTCG ACCGCCTCCG AAATGGGTAC CGTCATTGGA ATAGAGCGCA TGCACGACAT TGAAGCCATG GAAGCCCCAT CAATGGAGAG GCCCGAAACC AAGACTCCCA AGGTGGTTGC GGCACAATCA CCCCGGATTT ACTCATCTTC TCCGGATGAC GAGCTCTACT TAGAGAAGAT ATGGGATGCG GATGCGTTGA ACCTTCCCAT GTTGGACCCC GCCCAACGTA CGTTCCCTCA AAAGCCCGCG GGGTGCGCTC GCGGCGAAGT TTACAATTTT GACAATTCTC AGGTCCGGAA ATTGGACAAA GCCTCGACTA TATTGTCGTC CGCTATTCGG AGAATCGGAT CCGGCCTGAC GGGTGCCAGT GGTCACGGTC CCAAGCATTC GGCTTCGGCA CCGTCTACTC CACGTACCTT GGCTGTCCGT CGCGGATACC CTTTTTCGTC GCCCCACACG GCTTGCGAAA TATCGCCGTC CAAAACCCCG TCCTCCTTTC TCTTGCGCTC CCCGGGATCG TTGGCACACC TTGCAGCGCA CTCTCACTCC CGACGAAAAA GCCGCCAAGA CCCGTCGCCG CGCGCCGCCT CCGGCTTCTT TTGGGAACGA CCGAGACCGA CCAACGTGAT GAGAATACCG GACGGTCGGG TCGTCAATAA TCAAAGACAC TCGTCCGTTT TGCCACTCAG CATACCGCGT TCCTGTTTTT CCTTCGACAC CAACGACAAT GCCATGGACG AGCACAAGAC ATACCGACCA GACGCGGCGC CGCGGTCATC CTGGCACGAG AGGAACCACC CCGGCCAGAA TGGATCCCAA CGCTTCGCCA CAAGCCGAGA ATCCAGAGCG CCACTGCTTC GCAGCAGTGC GAGCTGGGAC ACAGCGTCTG GAGTATCCCA AGAGCACTCT TTCGACCTTC GTCGTCGTTT GTCCGGAATC GTGGTGGCCC ATTGGCCCAC CGTTACGGCA GCGCTTTTGA TTTTCCGCAG CGGGGAGGTG AATCCCAGCT TCCGGAACGT CGGGTGCGGA TCATGAGTGA TTATAGTCCG CCCGTGAACG CGGACACTTC CTTGGAACTC CAGACTCCGC AGCGTATCAA ATTGGAACGT GAAGACGCTC TCGACATTCT GACATGTTTG GTGGAGAGAG GAGTCGCTTT CAACGAGCCA TTCCAGCCGT CTCCGACAAA GATTAATAAC GGAAACGCCC GAGATTTCTT GATCGAACGA AAACCTGTCC AGCGATTTGA GTCGGCCGAT ATTGACCAAG TGATAAGGCA CCTCAAGCAG CTCAGCGAGA GAGACAGAGA CGCTGATGGA GAAAAAGGGG AGATTGAACA TGCACACATG GACGTTCTGG AAGAACTTGT ACGCTCGCAC GAATACGCAT TGGAAATGAA TCGAGCGTCA CAGTCGGCAG CGTCTTGGCT AAAATCAATT GGACGTTCCC AGCCAGACAC TCGCGAAGAG GAGGGCAAAC ACAGCTCGAG CAACAGCGAT GTAGATGACA AAAACTTCGA TCGTGTGACA TCTAAGGAAA ACCATTCTGC TGGTGAAGGC GCTGCCGAAT GTATGGACTT TGTGACTACG AAAGCAATGC TGAATTCGGC ACGAATGAAA CTTAAGGAAA AAGCCATATA TGCGGATCGT TTGAACGAAG AGCTGGCCAA ATGCAGAGCA GAGATCGGTC GGCTTAAAAG CGCTTCTCAT TCCATTTCTT TTCGGTCGCC GAACCGTAGC ATTCTGGACC AGAGCGAAGA TGTTTCGGTC GAAGAGGAAG AAGCTGAGGA GGCAATCGAA CGGTCTACCG AGTCCGGATT TCCCCTCGAC AAGACGGATG ATTATTTGAA TTTCGATACA TCTTTTCTTC AACATACTGC AGATTCTCCT TTGCTCTTGG AAGAGCAGAG CGAGCTAAAC AAATACAAAA CCGCCCTTGA AAAAGCCAAT GAGCAGATAC GAATACTTTA TGAAGATTTG CAGAGCGGGT CTGAAGTACG CGGCATGGCT AAAACGAAAG CACCAATTGT TGTAGTAGAG TCACCGCCCT CTTCACCTCG CAAGGTCGAA AGCCCAAGCA ACAAAGAGGA GCGTATGGTG AATGTGCGAA TGCTAGACGC GGAAAACTTT GTTACGGAAT GGGACGGTAT TACACCACCG CTACCCCCTC CACCAGATCA CGGTCTACGT TCCCCAATCG TCGCAGCTGT TTTGCAAGAG TGGAGTGAGG ACAGCGGTTT GCACGAATCG TTGCTTTCCT GGATAGATCA GATACTCGGA GGCGCGGATC CCTACACTAT ACCGCCACTT ACTATTTCGA GCTTAGATCA TCAGGTCCGA GACGGCTTTG TCATGCACGT CCTTCCATTA TTACTGCGGC GAGCTGACAT TCGGGTCGAT GTCAAGACGC GGACACACCG TCGAACAACG TACGATTTGG CGATAGCCGT GGAGAGTTTA TCACCTTCCG GCTTTGACGC TACTCACTTA GACCTTAACC GACACCTGGA GACGCGGTCG GCACGGTCGG ATGTTGGCTC GGCCAGCGTA GCGTGCAGTG CGGCCACGAC AGCGATCACA ACTAACATTC CTACGGCAGG AGTACGTCCC CAACTGTTGG CCTCGTCATT GGGGCTGAAT ACAAACATTA CTCCACGCAA CAGCGAGCCC CGGGACAACA TATCGTCGCG CTTGTCGTAC GACGAAATGT CTGAAAGTAT TTCCACAAGG GCACAACCAG GTCTCATGAG CACCCTTGGC GGAGCCTTGG GGGGCTTGCT GACACGCCGT AAGTCTACCC CCGGGTCGAC ATTATACTCG ACGCAGCCTT TTCCGCGCGG ACCGGGCGAC AGTATTGCTG AAGGTGCGGT GGTAACGACC ATCCCCGAAG AAGGCTCCAA CAACCTACAA CATGCTGAAG ACGACAACGA CGACGAGGAG GGACAACCAT ATCATCGGTT AGTGTCGGCC CCAGCCGGTC GTATCGGTGT GACCTTTGTT GACTTTCGAG GCCACGCGAT GGTGAGCGAT GTGGCCACGG ACAGCCCCCT TGGAGGCTGG ATTTTTCCTT CCGATATTTT GATTGCCATC GACGAGTTGC CCGTTTCGGG AATGCGTATT CGCGACATTA TCAAGATTTT GAAGGACCGG CACAGCCGGC AGCGGGCGTT ACGTGTCATT AGTAGCCACA CAATGAACGA TGCCATGTTG GCGAGCAACC TGTTGAACGA CTCGGCCTCT GATATTCATT GA
|
Protein sequence | MAPLATHQSE LDFLQTQLSA LASEGDRLTS SFTYGQEATT KLPDPPLASQ FVRQQAHRVR ASERHLLHLT EQILQYRRKV VDDDQDDTDR PQPMTLTELA HVVRSIHHLE TQQIARLEEL WAHLDKDAQQ GSTPLPSPNN PIEDPSTSYP WSMKGPLDAV EETDNETDRT ARSSRSSGFG SAQSRRYRTD SLRTPSTPSG HSVRLSTSTT KTLERRGLLR HSTGTQPERP SPASRPNPLF QQAFAALEHK LATELHQING KVTPHVQSVE TDTTTNTNKG FDYNYNKENN AVSDTDEEKS MLDGSLDETE PLPPHSDPTT QSISSASESS DIFDDDDDDH GTYDDSAMYC PTLVAPPHRS LTRAVGTTPQ TAGDAHDVTH IEVDVTASPS NTNLTMDVTT LYDEDELLDH SATSDHLLDA SFASTETPIL DRYRLDPDDE FPNGFKVVPN RRGQHHRKRS VGSSETPILD RYRLDPDDES PNGFRVVPNR RGQHHCGKGH DASRKSLRNE GNHAAEESPT VSTRRGSSRR AYRKTPFPTQ KRTPLEPLID ENEVLESDND ASFDKPQFPE GAGMTPRRPS MNLSSSMRRW SSSPATEYPA SGYGSGRASL PTSLLYATAQ LSTPRPPPLA TRTQSTIPPL RPKASLSEHQ SSRSAYLIPP IGDAEYNEAP RVVRMQVALN DVQLAVSALN EVLAVAQSRS PKSVRYIREE EAKSILEPLG LDGRKSKSIL MSLCHWRRLV MHREEHGIVF TIFRVVCSFQ ICAKYYWFAG WISLLPTIGT DSDPPHPRLW SANLVAWYCG CARLLLMTDA PCSSAREQSI PSSATKPATN AGAGLVQQLL FPGFTDAGSD DTFQAPSPSS LPSDGNLEPT NPSEVTVVSE EAPTEIHENL TRNQSRAGES SRLVSTNAVR LPRSIDTPTT TTMDTMVSHF PPFFATTKPS RPTTTLACTS PSDPVSPPRV PRVGVPATLT SSIVVGTGLS NPVETADHSL ADDDDDDDHS IDSDDSDDLQ RKETNDTENV FGDDTATIDA TVTRPLNIPL DEAMQQRALH GLDYPIDNHD HWEEEANPSQ TDFGKQLMFQ RNMETSSDVL STASEMGTVI GIERMHDIEA MEAPSMERPE TKTPKVVAAQ SPRIYSSSPD DELYLEKIWD ADALNLPMLD PAQRTFPQKP AGCARGEVYN FDNSQVRKLD KASTILSSAI RRIGSGLTGA SGHGPKHSAS APSTPRTLAV RRGYPFSSPH TACEISPSKT PSSFLLRSPG SLAHLAAHSH SRRKSRQDPS PRAASGFFWE RPRPTNRGGE SQLPERRVRI MSDYSPPVNA DTSLELQTPQ RIKLEREDAL DILTCLVERG VAFNEPFQPS PTKINNGNAR DFLIERKPVQ RFESADIDQV IRHLKQLSER DRDADGEKGE IEHAHMDVLE ELVRSHEYAL EMNRASQSAA SWLKSIGRSQ PDTREEEGKH SSSNSDVDDK NFDRVTSKEN HSAGEGAAEC MDFVTTKAML NSARMKLKEK AIYADRLNEE LAKCRAEIGR LKSASHSISF RSPNRSILDQ SEDVSVEEEE AEEAIERSTE SGFPLDKTDD YLNFDTSFLQ HTADSPLLLE EQSELNKYKT ALEKANEQIR ILYEDLQSGS EVRGMAKTKA PIVVVESPPS SPRKVESPSN KEERMVNVRM LDAENFVTEW DGITPPLPPP PDHDHQVRDG FVMHVLPLLL RRADIRVDVK TRTHRRTTYD LAIAVESLSP SGFDATHLDL NRHLETRSAR SDVGSASVAC SAATTAITTN IPTAGVRPQL LASSLGLNTN ITPRNSEPRD NISSRLSYDE MSESISTRAQ PGLMSTLGGA LGGLLTRRKS TPGSTLYSTQ PFPRGPGDSI AEGAVVTTIP EEGSNNLQHA EDDNDDEEGQ PYHRLVSAPA GRIGVTFVDF RGHAMVSDVA TDSPLGGWIF PSDILIAIDE LPVSGMRIRD IIKILKDRHS RQRALRVISS HTMNDAMLAS NLLNDSASDI H
|
| |