Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_45703 |
Symbol | |
ID | 7200474 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011675 |
Strand | + |
Start bp | 982735 |
End bp | 990451 |
Gene Length | 7717 bp |
Protein Length | 2426 aa |
Translation table | |
GC content | 47% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002179758 |
Protein GI | 219117946 |
COG category | |
COG ID | |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.235875 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAATCCA ATAGTGGTGG GATGCCTGGA GGTAGTATGA ACGCAACGTC GATGCAAGAC ATGCAACGTT TGCAGCTCCA AATGGCGCAG TATCAACAGC AGCAGCAACA ACAACAACGA CAGGCGCCCG TCGGAAACCA GCTACTCCTT AACAATCACA ACAGTGTGTC AAACCTAAAT ATGCAGCAGC AGTTTCCCAG CAACACAAAC AACGCGCCTA CCGCATCATT TGTGAACCTG TCGACACAAT CTGGCGCCGC AGGTCGTATG AGTAATCCGG CGCTTGCCAT GATGCAACAA CAGCAGCAGG GAGTTGTGAC AGGTAGCAAT GGCGCTTCAT TGATGAATTC CGGGGGTCCC AACGCGGCTT CCATGTTTAG TTGGAATGGA ATGCAGCAGC CACAGCAGGG TCAGAACGCG TCGTCGATGG ACGCCAGCAC CGGAAGTAGT GCTCGTCTCA TGGCTATGGC TAACATGAAT CGTATGAGTA TAGGGGGAGG GGCCGGTACT ATTTCAGGGC AGGGGAATAG TATGAATCCG TCTACGAGCA CAATGCCGAA TATGCAGACT TTACTTCAGC AGCAGCAGGT GAACGCCTCT CATACACCAA ATCAGATGGG CTTTCAGCAG CAGCATCACT TGTCGGGGTC TCAGATGGGA TCGTCCACAA ATACGAATCA CACCAACGGT GGAGGAGCAC AGCAGCTTAT GCTACAGCAG CAGATCGCGA GTTTACAGAA GCAGATGCAA TTTCAACATC AAGGTGGCAT TGGAACCGTA TCAGCTATGC AGAATCCTTC TATATCCAAT GCAACTGTTG GCAGTGCGGG TCCACGGGCG GCAAACTCGC TGCAATCTCA CCAGCAGCAA CTTCTGCAGC AAATACAGCA ACAACAGCAT GTTGGGCCCG GGCCTCCTTC TATGCCTGCG CAACACCAGC AACCCTATCA ACAACACCAA ATGTCTGCCG GAATGCAGTC TCTGCATCAG CAAGACTCGA CTCCACAAAA TATGATGAAT ATGCTTCAAC AGCAACCTCA ATCTCACGCC AGAAATAATG CCATGGCTAA CGTGATGAGT GATCAATCAA GTCAGACTCT TAGTCGAAGT GGGTCCTTAA ATGAGCAACA GTTGCGAATT CATCAGCAGA ATCTGCTGCG TGCTTCATCC GGGCAGCAAA CCACGGTGTC GGATACGCAA GAAGCGGCGA AGTCCGTAAG ATCGCAGCAG CAGTCACCAA GTCAACCGTC CAAACAACAT GGAATGCATC CACAAAATGC AACGTCGTAT CAACCATCCA ACAATATTGT ACAGGGTCCA TTTGGCGGAA TGCATGGCTT GCCCAATCAG CATAGCATGC AAAGTGTGTC CAACCAACAG CTAAACAATC ATGGGAAAGC AATGCCGATG CATTCCGATG GTAGCACTAC TTTAGGCATG TCCTCACATG GGAACAATAG CATGTACAGT GGGCAAATGA GTGGTAGTAA CGCTTCTCAG CAGCAGGGTA GCGACGATCC AATTTCCATT TCACAGCACA GTAATTTAAG TGCTGGTCAG CTGTCACGCA CTCATCAAGC GAGCAGCAAT GACTCTGGAC AAAAGACTTT TCTGGATGGT AGCTTTGCTG GGGGCTGGCA ATCTAACGAT GATCTGCCAG ATCGACGTCG CGTTATATTT AGCATTTTAG AGGTGATTCG GCAGATTCGG CCCGACGATA CGAGCAAAAT GTCAAACAAG TAAGTTCTTT TCGAGGTTTA TTACGTGGTG CCGGCTTATG ACAAGTTTCT CAGCTAGCTC TTTCTCTCTT AGACTACCTC ATATGGCAAA GAGCCTGGAA GAGCATTTGT ATCGATCGGC ACACAGCAAA GACGAATACA TGGATTTTTC AACTCTGAAG GAGCGTTTGC AAGCAATTGC GCATGGACTT GACCTGCACA GAGGTTCCTC TTCGCCAATG GTTTCCAAGA ATCATGATAC GACGCACTTG CCCCAGCAAA GTAGTAATCC AAGCTATTCA AATATTGAGT CTCAGCAGAA TTCTTTGCAA ATCGGCTTTC CGCCAAGCTT GACTGCATCT GGTCCGACAA GTCAGCAGCA TCAAAATGCG GGTTGGACGG GTCCATATGA TGTAAGTTCC AAGGATGTGA TGAAAATTCA AGGCCAAAAC AACGCCGACA ATTTAGTTGT GCAGAGAAAC GCAGCTAGTC AGCAGAGCTT TGGACGTATT GCTGGTTCGA ATAGTCAACA TGGAGGCATT ATGTCGGGAT CAAATACCGC TGGTCCAAAC CACAACAGCG GAATTTGGCC AACGAATATG GGATCGTCGG AAAGTTTGGG TCAACCGAGC ATAGGGAATG TGGCAATGAA CGGCGGCTCG CAGCATCAAT CCTCAATGAA TCAAGGGATG AACGATATGG CGTCGATGAG TCAGACTTCG CAACAGAACG ATTTTGCTGG GTCTTCCCTG TTTATTGATC CTTTGCAAGG CTTCAATTGG CAGAGCGGTT TCCTTTCGGA CTCAAATATG CCTCCCCCTG TCGGGAATGG TATAGTTAAC TCGGATTATC CAAATACACC CAAGTACCAG GATCCGGGCG TAGCGCAGAA GCAGAAGGTC ATATTGCAGC AGCAACAGCG ATTGCTGCTA CTTCGGCATG CCAGTAAATG CAAGGCGGGA TCAAACTGTA CGACGAAGTT CTGTTCTCAG ATGGTGACCT TGTGGAAGCA TATGAAGACT TGCCGTGATA AGAATTGTAA GACTTCTCAT TGCTTGAGCA GTCGTTGTGT TTTGAATCAC TACCGTATTT GCAAAAATCA AGGCAAGACG TCGACTTGTG AAGTATGCGG TCCTGTGATG GCGAAAATCC GTCAACAGGA GCGCGACGAT GGTACTGGTG ATCCCTTGGC CACCGATTCC TCTGCCATGA ACTATCTTCA GCCAAGCTTG AATGCTCTTC CAAATGTGAT TCCGACAAAA CAAATCGGTG GTTTGTCACA GGTTCGACGG AGCGATAATA TTTTGGAAAA TTCTTGTCAA AGTGAACAGG TCCAGCTGCA GCAATTGCAG GCGCAGCAAA TGAAACTTCA AACACAGTTG GATTCATTGA AGCAGCTTCA GAAACAGCAA GAGCAATTGC TCGAGCAGCA GTCGAGAATA CAGGAGCAGG CGCATAAGGT CAAGGACCCA AGCTCCCAGC AAGCACAACA ATTGCAACAA CAGCAGCTTC TTCTGCATCA GCTACAGAAA CGATGCGAAC AACAGCAGCT TCAGCTACAA CAAGAGATTC AGTCCCAATC GAGAACAGCT GGTTTGGCCC AAGCTCAGGC TCAGCAATTC CAAGCGGCAG CACAGTTTCG TACAAGTGTA CAAGAGGCCC AGATGTTGCA GTCTTCATCA CCAATTATTC CTGGATCCTA CGGGGAACCA ACAGAGTCTA AGAAAAAGCG GCATACGGTA ACAAAATCCA AACGAATTTC GTCGAAAGGG AAGCGTGGTG GGAAAGGGAA AGGACTTCGG GCTGCGGTTG AGGTTCTATC ATCCCATGAT CCAGCCGAAG ATAACTTTGA TCCATATGCC TCGCCAAAAA AGAGGGGTCT GTCTTCTTCT TCGAAGCCAG CGCAAAAGAA AAGGAAGGCA ACTTCCGATA AAGAGGCTGA CCCAGGCGAA AGGGCGACAG GAACTGATAT TGTGGAAGAC TCGACGCTGG CGTATGAAGG CAATACGTCT TTGCTTCCGT TCATGAGTCT AGTCAGCGTC AGAAAACATG TGGATTCTCT GAATAAAAAA ACAAGTCTTT GGTCTCGCAT GGTGACTTAC AAGTGTCTTC CAGTCATTCA AGAGCTCATT GACGACCAGT TTGGGTGGGT TTTCCACGAC GCCGTCGATC CAATTGCACT TGGCTTGCCC GACTACTTTG ATGTTGTGAA ACATCCCATG CATCTCGAGC TTGTGAAGAA AAAACTGGAA AATGCGATCT ACTGTGACAC AGACAGTTTT GCGCATGACG TTGAGCTAGT TTTTGAGAAT GCTATTTTGT ACAATGGGGA AACCAGTGAA GTTGGAGAGC TAGCGAATAG TTTCTTGGTC AAGTTTGCTC AGATATACGA GAAGCTCATT GCAGGTATGT ATTAGTTGTG ATCGAAGTAG TCGTGTATTT CGAGGCTTTT GTTCACTTAT ATTCAATCAA TTTTTCGAAA AGGAATCGAG TCGCCGCAGC AACTCGTGAA AAAGAATGGG GAGGCTTGTG CTCTCTGTGG TCTCCAAAAG AGACAGCTTG AGCCATTATC GCTTTATTGT CATGGGAACT GTGGTATGCA GCCTATCGAA AGGCATTCAT CTTACTTTAC CGATCACTCA AAATCAAATC TTTGGTGTTT ATTGTGTTAC GATCAGTTGC ACGAAGAAAA AATCATATTG CTGGACGACG GAAGTGATAT TAGAAAAAAG GATTTACAAG AGTTCAAGAA TGACACTTGT CCTGAGGAAG CATGGATCAC TTGTGACGAG TGTAATTCTC AAGTTCACGA AGTTTGCGCT CTTTTCAGCA GGAGAAACGA GGCAAAAGCT TCGTACACCT GCCCAAACTG CTATACCTCG AAATCTTTAG CGTCGCAAAG CACGAAGTCT GTGGCCAAGT TTGTAAAGGG GGCTGATTAT TTACCACACT GTAAAATGAG TATTGATATC GAAAAGGGAC TTCATAGAAC GCTCCAAGAT CTCTATGATG CCAAAGCGAA AGATGAAAAA TTGGGGGCCG GCCAAACTGA GCAAGCGGAG GGTCTCACTG TTAGAGTGCT ATCAAATGTA GAAAAGAAAC AATCTGTAGG AGCGAGGGTA AGTAAAAAGC GGAGCTTCCA GTCAAATTTC TTGGAAGTTG TTCTCATTCT GAGGTTCCTC TTCACAGATG CAACGCTGTT TTTCCGAAAA GGGGTACCCT TTAGAGTTTC CTGTACGCTC GAAATGCATT GCCCTCTTTC AAAAAATCCA CGGTGTTGAC ACCCTTCTTT TTTCAGTCTA TGTGTATGAA TACGGGCAAG AATGTCCAGC TCCGAACAAA AGAAGGGTGT ACATTTCTTG CTTAGATTCT GTTCAATATT TTGAGCCCAG CTGCTACCGT AAAGCGGCTT ACCAGGCAAT CATTGTCGAA TATCTGCGTT ACGTAAAGGA GCGAGGCTTC CATACGGCTC ATATATGGAG CTGTCCTCTG ACGCCCGAAG ACGGATACAT TTTCTATTGT CACCCATCGC ACCAACTTAT ACCGCGAGAA GATATGCTTC AGTCATGGTA TCATCAGCTA CTAGAAAAGG CGAAGTCAAG TGGTGTTGCT ATTAGCACCA CCACGCTCTA TCACGAGTAT TTTGAAGGTG GGGCTGATTC TACGAAAATT GAGCAACAAA GGTTGCCGAC CTGTCTCCCA TATTTTGAAG GTGACTACAT ACCTGGTGAA ATCGAGAATA TCCTGGAAAC AATTGATGAA AAAGAAAATC AGAGTAGTGT CCAGAAACTG ATCATGTCCC TGCTTGGGCA GAGGATCATG AAGATGAAAG ACAATTTCCT CGTTGTTCAT TTACACAATG ATGGTGTTGC TGCGGCTAGC GAGCAAAGCG AAGACGTTTC AAAAGGGTGT GACGGCTGCG ACGAGAAAAT AGTGCTCAGC AAGAGATCAA GTACAACTGA ACCGGGTTTG ATGCGGATCG ATGTAAGGGA CGATGATGTA GCAATGACGG AAGCTGACGC TTTTCCTGCC CGGGAGGATC CTACTGTATT GAAAACAGCT GCTCCACCGA AGAAGGTAAA TACTCCGGAG AAAGCTACAC GTTCAATGGG AGAGGCAACA TCCAAATCTG AAAAAACTGA AGACAAGAGT GTTCCAACAC CTGGTATGTT GCTATTTGAA AAGCCTGGGA GCGACACAAG TCTTGTTGAT TCAGCTAAAG ACGCAGCAAA TGAGGGTGTG GCCCCAATAT CAGTTTCAAT GGGAGAACCA ACAGCCGAAT CTGAAAAGAG GAAAGATAGA TATGTTTCGA CAGCTATTGT TTGTGAGAAG CCTAGGAGTA ACTTCAGTCT GATTGAATCA ACGAAAGATA CAGCAGAAAC CGCTGCGGCC CCCGATTCAA TTTCGATAGT AGATTCAAAA GTTGATTCCA AAGACACGGC TTATTCAACA ACTGGCGCTT TGCTTTGTGG TAAGCCTGGG AGCGACATAA GCCCGATTGA TTCAGCCGAT AACGTCAAAA ATGAAATTGA GCTTCCTGGT GTAAGAGTAG CTGGAGTGAA AGAAGAAAGT GGAAGCGAGG GATTGCGGGA GAAAGTCAGC CTTGCGCATA CTGGTACGTC TAACGACATA CATCTTTTGT CAAGAGGCGC CACTGATGCT GATGTGTCTA GAGATAACGC TCATGTTGGC GCAAACGCAG TTTGCGTTGT AGAATTAAAA GCTAACGATG AACCTCCGCT AGAAGAATCG GGCGGTAACG GAGGCCTGAC AAACGAAAGC GATGGGGTCG CTGCTTCACT CATAGAGAAA CAAGCTACCA TCCAGATAGC TGGAGGGAAT CTTTCCGAAA CCCAAACGGA GCCAATCGAT TCGGAGGATG GATGTATCGA CGATTCTGTC AACACTGCAG TCCAATCTGG CGAGTTGGAT GAAAAGGAGG GAAGTGCAAC AGAACAAAAT CGGGATGAAG TGATTGCCAC CATCGACAAG AAAGCGAGCA AAAGGCTTAT GGACAGCGCG ATCTCAACCC ACACTGAACC CACCGAATCT TCGAGTGAAA TTTCGACAAA AAGTGCTCTG GCGAGCAGAA GCCCTCTCGT CAATAGAAAG AGGCCGCTGA ATTCGGTTGA ATCCAACACA TGGGATGAAG ATGCTCCCAT TGAAAATGCT TTGTTTGAAA CCCCACAGCA TTTCTTAAAT TTTTGTAAAA CAAAGCACTT TCAGTTTGAT GAGCTTCGAC GAGCCAAACA CTCCACTTTG TCGATACTCT TTCAGCTGCA CAATCCTATG GCTTCACACG TTCTTCAGCA GTGCGGATCG TGCTACCGAG ATATAACCTG CGATGCCAGG TACCATTGCA ATGTTTGCTC CAACTTCGAC TTGTGCCAAG AATGCTACAG CTCAGTAATG AAGAAGGAGT TTGTTCTGAA TGACTCCCGC TTCGCTCATG ACACGAGCCA CACGTTTTCT CCCATTGATA CGGAAATGCT TGAAGAAACG AAAACACGCG AAGAACGTCA GAAATCCTTA ACGGCGCATG TTGAACTCCT GGAGCACGCT GTACCTTGCC AAGGCCCACC AGCATGCTCT CTGGAGAACT GCCAGCGCAT GAAAAAACTC GTCGAGCACG TGGGAACTTG TATGATCCAA CCAAAGAAGG ACTGCAAGAT TTGCAGTCGA CTCCTGTCGC TATGTACAAT ACATTCGCGT TTGTGCGCTA TTCGCGGACC TTGTCCGATT CCCTTTTGTG ACCGAATCCG AGAGCGCAAC AAACGACTAC GCCAGCAGCA AGATCTTGTG GACGACCGGC GCCGACAAGC TCAAAATGAA TTGTACCAAT CCTCTGAAGA GCCATCTATA ACAACTTGAG ACTGCAAGTC ACTGTTTAAC GTCAGAATGT CGTTGCTGTA GATATGTTCC TTTTCATTGT TGATGCTGGG ATGAGTAAAG TTGGTTTTTT AAAAGACTGC AGTAGTAGGT TACTGTTAGT TGCCAAG
|
Protein sequence | MQSNSGGMPG GSMNATSMQD MQRLQLQMAQ YQQQQQQQQR QAPVGNQLLL NNHNSVSNLN MQQQFPSNTN NAPTASFVNL STQSGAAGRM SNPALAMMQQ QQQGVVTGSN GASLMNSGGP NAASMFSWNG MQQPQQGQNA SSMDASTGSS ARLMAMANMN RMSIGGGAGT ISGQGNSMNP STSTMPNMQT LLQQQQVNAS HTPNQMGFQQ QHHLSGSQMG SSTNTNHTNG GGAQQLMLQQ QIASLQKQMQ FQHQGGIGTV SAMQNPSISN ATVGSAGPRA ANSLQSHQQQ LLQQIQQQQH VGPGPPSMPA QHQQPYQQHQ MSAGMQSLHQ QDSTPQNMMN MLQQQPQSHA RNNAMANVMS DQSSQTLSRS GSLNEQQLRI HQQNLLRASS GQQTTVSDTQ EAAKSVRSQQ QSPSQPSKQH GMHPQNATSY QPSNNIVQGP FGGMHGLPNQ HSMQSVSNQQ LNNHGKAMPM HSDGSTTLGM SSHGNNSMYS GQMSGSNASQ QQGSDDPISI SQHSNLSAGQ LSRTHQASSN DSGQKTFLDG SFAGGWQSND DLPDRRRVIF SILEVIRQIR PDDTSKMSNK LPHMAKSLEE HLYRSAHSKD EYMDFSTLKE RLQAIAHGLD LHRGSSSPMV SKNHDTTHLP QQSSNPSYSN IESQQNSLQI GFPPSLTASG PTSQQHQNAG WTGPYDVSSK DVMKIQGQNN ADNLVVQRNA ASQQSFGRIA GSNSQHGGIM SGSNTAGPNH NSGIWPTNMG SSESLGQPSI GNVAMNGGSQ HQSSMNQGMN DMASMSQTSQ QNDFAGSSLF IDPLQGFNWQ SGFLSDSNMP PPVGNGIVNS DYPNTPKYQD PGVAQKQKVI LQQQQRLLLL RHASKCKAGS NCTTKFCSQM VTLWKHMKTC RDKNCKTSHC LSSRCVLNHY RICKNQGKTS TCEVCGPVMA KIRQQERDDG TGDPLATDSS AMNYLQPSLN ALPNVIPTKQ IGGLSQVRRS DNILENSCQS EQVQLQQLQA QQMKLQTQLD SLKQLQKQQE QLLEQQSRIQ EQAHKVKDPS SQQAQQLQQQ QLLLHQLQKR CEQQQLQLQQ EIQSQSRTAG LAQAQAQQFQ AAAQFRTSVQ EAQMLQSSSP IIPGSYGEPT ESKKKRHTVT KSKRISSKGK RGGKGKGLRA AVEVLSSHDP AEDNFDPYAS PKKRGLSSSS KPAQKKRKAT SDKEADPGER ATGTDIVEDS TLAYEGNTSL LPFMSLVSVR KHVDSLNKKT SLWSRMVTYK CLPVIQELID DQFGWVFHDA VDPIALGLPD YFDVVKHPMH LELVKKKLEN AIYCDTDSFA HDVELVFENA ILYNGETSEV GELANSFLVK FAQIYEKLIA GIESPQQLVK KNGEACALCG LQKRQLEPLS LYCHGNCGMQ PIERHSSYFT DHSKSNLWCL LCYDQLHEEK IILLDDGSDI RKKDLQEFKN DTCPEEAWIT CDECNSQVHE VCALFSRRNE AKASYTCPNC YTSKSLASQS TKSVAKFVKG ADYLPHCKMS IDIEKGLHRT LQDLYDAKAK DEKLGAGQTE QAEGLTVRVL SNVEKKQSVG ARMQRCFSEK GYPLEFPVRS KCIALFQKIH GVDTLLFSVY VYEYGQECPA PNKRRVYISC LDSVQYFEPS CYRKAAYQAI IVEYLRYVKE RGFHTAHIWS CPLTPEDGYI FYCHPSHQLI PREDMLQSWY HQLLEKAKSS GVAISTTTLY HEYFEGGADS TKIEQQRLPT CLPYFEGDYI PGEIENILET IDEKENQSSV QKLIMSLLGQ RIMKMKDNFL VVHLHNDGVA AASEQSEDVS KGCDGCDEKI VLSKRSSTTE PGLMRIDVRD DDVAMTEADA FPAREDPTVL KTAAPPKKVN TPEKATRSMG EATSKSEKTE DKSVPTPGML LFEKPGSDTS LVDSAKDAAN EGVAPISVSM GEPTAESEKR KDRYVSTAIV CEKPRSNFSL IESTKDTAET AAAPDSISIV DSKVDSKDTA YSTTGALLCG KPGSDISPID SADNVKNEIE LPGVRVAGVK EESGSEGLRE KVSLAHTVCV VELKANDEPP LEESGGNGGL TNESDGVAAS LIEKQATIQI AGGNLSETQT EPIDSEDGCI DDSVNTAVQS GELDEKEGSA TEQNRDEVIA TIDKKASKRL MDSAISTHTE PTESSSEIST KSALASRSPL VNRKRPLNSV ESNTWDEDAP IENALFETPQ HFLNFCKTKH FQFDELRRAK HSTLSILFQL HNPMASHVLQ QCGSCYRDIT CDARYHCNVC SNFDLCQECY SSVMKKEFVL NDSRFAHDTS HTFSPIDTEM LEETKTREER QKSLTAHVEL LEHAVPCQGP PACSLENCQR MKKLVEHVGT CMIQPKKDCK ICSRLLSLCT IHSRLCAIRG PCPIPFCDRI RERNKRLRQQ QDLVDDRRRQ AQNELYQSSE EPSITT
|
| |