Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_49839 |
Symbol | |
ID | 7198567 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011693 |
Strand | - |
Start bp | 45190 |
End bp | 48864 |
Gene Length | 3675 bp |
Protein Length | 1164 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184721 |
Protein GI | 219129071 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGGATTCCG CATCGACTCC AATTTGCAAC TGACTGTCAC AATCCATTCT GATAGCTGCA CTGCTTTGAG TAAAGACTGC GGGTGCCATA ACGAAGCACT CGGCAGACTT TCCCTCTTGG GTCTAGGCTA GGTATCAGTA GAACCGTTGC CGAGAGTCGA CGCTATCCAT ATTTCGCGCC ATGAATGAAC AGTCGGGAAT AAGCAGTGAC GAATCTTACG CACAGGATTG GGTTGTGGAA TTCGAAGTGA TTGATGAAGC TGCAGAAAGT GCATCGCACG ACGCCGTTCC TGCTGCGGAT CAGCGACACC TGCAAGAGAA TGTTGAAGAA GTTGCCGGAC TTGAAAATTC CGGGGTCAAC CTCGTGAGCA ACGGCAGCCT GCCAGAAGAA CAACAATTGC AGCAACAGGA ACAGTATCCT AAAGCATGGC ACTTCGGCGT AAGTGATGGC AGCGACGATC CTTTGAACTT TTTGCTTTCG GAAGATGATT CTGGTGTTCG CGCCTTCGGA ATTTCTGGTT CGTGTATACA GCCGACAGAG CCCCCCGAAT CTTTTTCTTT CAATGTTGAA CAAAAGCAAG CAGTAGAAAC GTTTCTTTCC GGCGAAGCGA TACTCCAGAC GCCCGGAAAC CACGTAATCT TGGGTTGCCG GAACTTCGCA CCTTCCCCGG CGAATCACAA GTTTGACCAT CTGTTGCAGC AATACTGGTT GGATGTCCGC GCAGTCGAAG GCGCACTGGA AAGTCGGAAG CGTAAAACGG CCGATGTTAT CGAAATCATC CATCGGCACG GTGACTACTT TTGTTTTTGG GACGCCCACG ATGGGCCCCG GGACGACAAT GGTAAACTGT GCAGTCTTAG TGCTACAACA TGGCAAGCGT TTTGCTTTCG GAGGCTCGAA AGTGATAAGG AGCTCAAAGC CAAGCTACGC TTCCCACTCG AAACGAGTCT GAACCGGCTG ACCGGACGAA CTCATCACCA GCACTCGTCC TCTGGAGCTT CAAGCGTATT GGCTGAGCAA GGTATGGGTG GAATTTCGAG GCCACTCATG AACCTTTCTA TTCACGCCGA ACGTGAGCCG TCGGATGCAG CGGAAGAGGC CGTTCCGGCA TCTACCCGAA AGCGTTCCAT ACCTAAAAAA TCCCATCTGC AAAAAGAAGC CTCCTGGAAG TCAACCCTTC CCAGTTCCAA TATACATGAG ATTGATTCAG AAAGTGAAGA GCCCGGAGTA TCGGAACCAC TGTACGGCAG ATACAAAGAA ATGAAGGATT TGAAAGAAAA GGTCTCGCCC GACGGTAAAT TAGTGTCTTT GGTGGGTCGA TCGGGTGTTG GAAAGACCCA TCTTGCACGT GTCTTTGCAT GGGAATGGAC AAAAGAGCGA GACAAGAACT GCACTCGTTT TGGCTTTTGG CTCAATGCAG CTACGGAATC AACTTTGCGC GAAAGCTACG AAACCGCAAT TCGACGACTT CGCCACGGGA CGAGTCTTGA AGAGCCTTCG GCGAAACGAC GAATGGTAAC TATCCAAAGC CTAGCCCTTC GTCTTTGGGA AACTCTTGCT CAATTATCTT TGTCGTTCGA GTGGATTCTG GTATTTGACA ATGTCCCGGC CTTTGTAGAA GCCTTAGATG GAACAAAAAG AGAGGGTCCT TTGGGATTTC AAGAGTGGTT CCTTCCAAGA GACTGGAGAA ATGGTCGTGG CCGGATACTA TTGTTATCGA CACACGATGG ATATGTTGGG ACAACAAAAT CCAGCATGGG ATATATCGCT CAAATCCGTG TCGACCTCTT GGATGAAGAA TCAGCTGTGC AAATGTTACA AGCAGATCTG CCGGAAGAAC AGGGCAGCGA GGCCTCATTG CATCGCTTGG TTTCGTTATT CGATTGTTTA CCGCTAGCTA TAGCTACGGC AAAGGGAGAA TTGCTGAATG ACGTGATAAG TGTCGACCAG TACATAAAAC GGAACAATTT TGATGCTGTG CAAAATAGGG TCCAGGCGGC CATCAGAAGT TCTTTAAATA ACGCTTACCA ACGGGGTCTC GGGAAAGTTC TGGATGTGGC TGCTTATGTG AACCCCGATT CTATACCGCT GATTTTGTTG GGAGGAGAGA GCGCCAATAG AGCAGCGATA CAACTCACGA AGTGGAACAT TCTGAGGAGG GACTGGAAGA CAGATAGTAA TGAAGAGGTA TACTCCATGC ATCGGCTGCA TCAAAATGCT GCACGGCAGG TCTCATTGGA AAATGGATGT TCGCCTGGAG CTGCTCTCCG AGTGGTCCAC GAAAATATTA CAACATTTGA TCGCGACACT CCTGCACATT GGAAACTCCC AGCTGCCATG GTAAAGCACG TGGTCGCGCT AAAGGAGCGA GTGGAAAGCT GGCCTTCGGA GCTTTGTTTC GTATGGGCGC AAACTCTTCA GAAGACTGCA GAAGTGAACC GGTGGGTAAA TCATGACTTT GGAGGAGCGA AGAAAATGTC GCACTCTTGT ATTTCGGTAT GCTCCAGCAT TTTGAAGTCG GAATGCCTGC TAGCAGAAGT GCGAGAAGCT GTCAGTGAAG AAATGGTCAA GGTACACATG TTCCTTGGAA AACTGCACCG ATCCTGTTCG CAACCGGACG ATGCAAAAGA AAGCTTTGAC CGAGCGCGCA ACCTTTTACG ACTTTGCCCA GCAAGTGATG CGCGTTCCTG GCTGGAGGCT GACATTTTGG ATGACATTGG GAGACTGGAG CACAACAAGA GTTGCTATGC CCAGGCCCTG GAACACTTCC GGAAAGCTCT TGAAATTCGA TACAGAGCGA TACCAGACTC TACGCGAGGT GAATTGGTAG GTTCCCCTTT TTATTCCAAC GATGAAACCT ATTCCACAAG GCTCGTTGAA CAAGAAGCAC TGCGCGACTT AGGCCGAAAT CTATTGAAGT CACTATTAGA GTCAGCGGAC ACGGACGTTT CCCTGCCGAA ACCGCAACCT GTAGAGCGAG AATACAGCGC TCAGCGAAAA GTTCTCGGGG CGCTTGCGGA CACCCTCGTG AATTTTGGCC GATCGTACCG TGAGGGCGAA GATTGTGAGG GCGTACAAAC CTGGACTAAA GAAAATTATG GACGCAGCTA CGGCGAAAAT GGCTACCTCG AAGAAGCCTT GTTCTGGTTT GAAGCAGCTC TTGGAGCGCA ATCTCTGCAG TTTCGGAATG AGATTCAGAA TGACTCAGTG GCAACTACCA TAAGTCACCT CGGCCGAGTC CATATAGCGA AAGGGAACTT TGCTACAGGG CTGGAGATGT TTCAAAAATC TCTCGCCATG AAGCAACATG TATATGGCAA GGAGAAGGGC AACGAATCGA TTGCTACAGC TTGGGGTAAC GTGGCTACAG CTAAGAGACT TATGGGAGAG TGTCTCGTGA GCGAGCATCG TTTCGAAGCG GCATTTCAAT GTTACACTGA AGCCTTTGAG AATTACAAAC GAGCCCTTGG TATGCTGAAG TCTCTTTTTA ACGATTCGAA CCACAAAAAA GTGCGAGAGA CTTGCGCCGG GATTATGAGC GTAGCAAATT CGGTGTCACT TCTGACCGAC TTGTTGATCG CCATGAATGA CCCTGACGCC ACACTAGAAA TTAATTGCCG TGGGCTCGTC GAGTGCTGCT CTAATTCTAT ATCCGACCAG CAATTCGAAC AGTAA
|
Protein sequence | MNEQSGISSD ESYAQDWVVE FEVIDEAAES ASHDAVPAAD QRHLQENVEE VAGLENSGVN LVSNGSLPEE QQLQQQEQYP KAWHFGVSDG SDDPLNFLLS EDDSGVRAFG ISGSCIQPTE PPESFSFNVE QKQAVETFLS GEAILQTPGN HVILGCRNFA PSPANHKFDH LLQQYWLDVR AVEGALESRK RKTADVIEII HRHGDYFCFW DAHDGPRDDN GKLCSLSATT WQAFCFRRLE SDKELKAKLR FPLETSLNRL TGRTHHQHSS SGASSVLAEQ GMGGISRPLM NLSIHAEREP SDAAEEAVPA STRKRSIPKK SHLQKEASWK STLPSSNIHE IDSESEEPGV SEPLYGRYKE MKDLKEKVSP DGKLVSLVGR SGVGKTHLAR VFAWEWTKER DKNCTRFGFW LNAATESTLR ESYETAIRRL RHGTSLEEPS AKRRMVTIQS LALRLWETLA QLSLSFEWIL VFDNVPAFVE ALDGTKREGP LGFQEWFLPR DWRNGRGRIL LLSTHDGYVG TTKSSMGYIA QIRVDLLDEE SAVQMLQADL PEEQGSEASL HRLVSLFDCL PLAIATAKGE LLNDVISVDQ YIKRNNFDAV QNRVQAAIRS SLNNAYQRGL GKVLDVAAYV NPDSIPLILL GGESANRAAI QLTKWNILRR DWKTDSNEEV YSMHRLHQNA ARQVSLENGC SPGAALRVVH ENITTFDRDT PAHWKLPAAM VKHVVALKER VESWPSELCF VWAQTLQKTA EVNRWVNHDF GGAKKMSHSC ISVCSSILKS ECLLAEVREA VSEEMVKVHM FLGKLHRSCS QPDDAKESFD RARNLLRLCP ASDARSWLEA DILDDIGRLE HNKSCYAQAL EHFRKALEIR YRAIPDSTRG ELVGSPFYSN DETYSTRLVE QEALRDLGRN LLKSLLESAD TDVSLPKPQP VEREYSAQRK VLGALADTLV NFGRSYREGE DCEGVQTWTK ENYGRSYGEN GYLEEALFWF EAALGAQSLQ FRNEIQNDSV ATTISHLGRV HIAKGNFATG LEMFQKSLAM KQHVYGKEKG NESIATAWGN VATAKRLMGE CLVSEHRFEA AFQCYTEAFE NYKRALGMLK SLFNDSNHKK VRETCAGIMS VANSVSLLTD LLIAMNDPDA TLEINCRGLV ECCSNSISDQ QFEQ
|
| |