Gene Information |
Locus tag | PHATRDRAFT_54143 |
Symbol | |
ID | 7197024 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011670 |
Strand | - |
Start bp | 1331261 |
End bp | 1336799 |
Gene Length | 5539 bp |
Protein Length | 1695 aa |
Translation table | |
GC content | 53% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002178120 |
Protein GI | 219112737 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
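The coordinate and length fields above are consistent with 1-based inclusive coordinates (End bp - Start bp + 1 = Gene Length). The sketch below checks that arithmetic and compares the gene span with the minimum coding length implied by the protein length. The function names are illustrative and not part of the source database. Counting one stop codon is an assumption, and so is reading the leftover bases as introns or other non-coding sequence; the record itself only says the gene is spliced.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def min_coding_length(protein_aa: int) -> int:
    """Bases needed to encode the protein plus one stop codon (assumed)."""
    return 3 * (protein_aa + 1)


# Values copied from the record above.
START_BP, END_BP = 1331261, 1336799
PROTEIN_AA = 1695

length = gene_length(START_BP, END_BP)
coding = min_coding_length(PROTEIN_AA)
# Bases in the gene span not needed to encode the protein
# (introns or other non-coding sequence, under the assumptions above).
non_coding = length - coding

print(f"gene length: {length} bp")
print(f"minimum coding length: {coding} bp")
print(f"non-coding remainder: {non_coding} bp")
```

The strand field does not affect this calculation, since the length depends only on the two coordinates.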
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGACGA CGACGACGGT ACCGCATGCC ACGGTTCCCG CCGTCCCCGC AGCGCGCCTC GTGGGTCAGG AAGCCCGAAT GGTCCTCACA TCCTTACGGG GTGGACCCCC GTACGTGGCA CGCGGAACGC TAGCGCAAGA CTTGTTGGAT CTACGGGATC GGCTCCCACA GTTGCGGACT ACGACGAACA CCACGCTCGC TACCGGTAGC CCCACGACGC CGGTGGGTGA GACTACCCAC ACCCACACCA ACACCAGCAT TAGTGTACAC CCCAATGCGT CCGATACAGC CGTGGACGAT ACGAACGACG ACCGTTCGTA CGATTTTGTC CGTCCCTTTC TACAAGTCGT CACGGATCCA CGAGCGGCAG GTCCCCACAC ACTCGTCGCC TTGCGATCTC TCCACCGCAT GCTGCTCAAC AAATCCTTGT TTGTCCTTTA CGATCATCAA CATCAGCAAC TACAACACCC ACCACCACCA TTGAAATATC TGCAGCAAAA CCACAGTGCG GTGTTGGCTT CGATTGTCAA GGCCGTCTTG ACGTGTCAGT TTGAACAGAC GGATGCCGGG GCGGACGAAG CCGTAGAAAT GGCCGTCGCC GAAGTCCTCG GACAAGTCGT CGCGCTACTC CCGTCGCTCG GGCTACCGGA AACCGTGGAC CCTCGTCACG CCGCTATCAC TCCCGAGACG TTAGCGGAAA TCTTCCACGC CGTCTTTGTC ACGCGCACCA GCAGTGCCCT GGCCAATTCC CCGGCACTGG TCCTGCGCTT GGAAGATATA CTCCTCCAAA TGACCCAACA CGTCTTTCGG CCCGGCTCGC CACCGAACGA AGCAACCGCC ACGCCCAATC GAAAAGTGTG GCTGGGACGA TGTCAAGCCG TTTTGGAGTT CTGGACGCAT CCGCTCCTGC ACACGCCCCT GGTCGGTGGG GATGGATTGG ACGAATCTAC TCGGGAAGAT CAGCGTCTTT ACGACGCCAC CCGGGTACTC TGCTTGCGAG CGGTACGCAC AGCCCTCCAA ACCGGATGGG CCGAAGCCTC AATCGCTACC AGTTACGATA TGGACGACGA GGAAGACGAC GACGAACACT ACCAAAGCCT AATCAGTATC ATTCAGGACG ATTTGTGTTT GTCCTTGCTC ATGACTGGTC AGGCCATTTG GGCTTACCAC GATGCCCACA CCAATATCTC TCCCGGATTC GTATCCCTCG AAGTCTTGTC GGAAATTTGT GCCACCCTCA CGACGCTTTG GAACACCCTC CCACTACGCA CCGTGTTAAT TGCGCAGTTT GAAACCATCT GGACCGGCTT TTACACCCGG GCTCTCGTGC TCTTGCGCAA ACGCCACCCA CCCACTAATT CCTTGTCCTT CAACGCCAAC TTGACCTTTG ACGCCGAAGT GGAAATTATT CTCGAATCGC TCGTGGACGT TCTCTGTCTC CACGATCACG TCCGCTCTAT TGCCGACGGA GACGGCGGAG CGCTGGAGAC AATGTTTGCC TACTACGATT GTCATTTACG TCGCTCCGAC GTGGCCGTCG GGCTCATGGT GGAACTCTGT CGTTGTTGTG GCGGGGCCGT GGATCCGGAC GGGGAGACGC TCGTCCTAAC CCCGTCCACC AGCTTTTTGG CGCGGCCCCC CACTTGTGAG GATTCGGTGC ATTCGGATAC TTCCGATACG GCGACACCTC CCGATACGGC CGTATCGTCT CCCATGGTAC AAGTCGATCA CGTATGGCGA CCGGTTCCGC CCCATCTCAA GGAACTGTGC GCACAAGCTC TCATGGGAGG AATGAAATGC 
TTGTTCCGCG ACGACAAGGC CTCGGCCGAA ACCCTGCTGG AACGATCACG GCGCAAACGG TCCATAATGA GTCGACAACT GAAAGACAGT TTCGAGGAGG CCTTGTCCGA GACGATCCCC AACACAAACG TTGCCGAATT ATCGGTGTCA CCCTCCTGCA CGCACGTGCT CCGAGACGTG AAAACGAAAA AGCGTCTCAT GCGCAAGGCG GCGCGCATCT TTAACCACAA GGCCTCCCGG GGCATCGAAT TTTTGCTCGA TGCCGGGCTG GTGGCCGACC CCGTTACTCC CATGAGTGTC GCTACCTTTT TGCGCAACGG CATTGTGGTC GGTCTGGACA AGAAAGCGGT CGGCGCGTAT TTAGGCGAAG CCGGTAAGGC TCCCATTGCC GGCAAGTCGC CACTGTCCTG GGAACGCGAC TGGTTTCATA AGGATGTCCT ACAGAGTTAT TGTGGACTGT TTCGATTCGA AGGACAGTCC TTGTTGGACG GGCTGCGCAT GTTCTTGGCG GCGTTTCGTT TGCCGGGGGA GGCTCAACAA ATTGATCGCA TTCTACAAGC CTTTTCCGAT TCGTGTGGAC AGGTCTGCGA AGAATCAGCC GACGGCCGTC TCCAGTTGTT CTCGGAAGAC CCTAAGCGGG CAAGTGACGC GGCCTATCTA CTTTCTTTCA GCATCATCAT GCTGAATACC GATCGACACA ACACCAATAT CCGGGAAGAC CGCAAAATGA GTGCCGCTGA CTTTGTCAAG AACAACACGG ACTACGGACG TGACATTACC GAAAAGGGAA AGGAATTTCC GAGCGAATTT CTGGAAGGAA TTTACCACAG CATCAATGAT GAAGAAATTC GTACGGAAGG GGAAGGAGCG GACGGCGCCA TGACTGTAGA ACGGTGGAAA GATGTCCTCC GTGGCTCCAC CGAAGAAGCC GAAGATGAGT TTCTGCCCTC CTTGCACGAT GCCGAAGACC TGACCGAACT AGTTCTGGAA CACGTGTGGA AGCCAATCAT GTCGTCCATT GGAGCTTTCT GGGGGATGCC TCGTGTAGCA GACGATGAAC CCCTGTCGCC AAGCGATCCG GCACAAAACG GCATGCTTGG GGTACAAGGT GCTCGTCTCG GAATGGACAT GGCGTTAGAA ATGCTGCACG GAGTCCGGAA GTTGGGTCGT ATCGATATCT TCCGTAAGAT TTTTTCTTGG ATCTGTGACT ATACCGGTCT AATTGGGGAT TACTCGGTAG ATGCTGTGGA ACGCACCTGG TCGCTGACCA ACTCGGTCGA AGCACAGAGT GCCGTTGTTG CTGCGATTCG TACGGCGCTG GACGCTGGCG AGGACCTGAA CGGAGACGGA TGGAAGCGAC TCTGGTCCAT CCTTTTCGAA ATGCGCGACT TGAAGCTTCT TGCGTACGGT GGACCTTCTG CCAAGTCAAG TCTACTCCAC GAATCGGATC CGGACATCCT CGACGAGAGC GCTCGTCGAG ATTGGACAAT TGTCTCGTTA AGGGTGATAT GGATTTCTTC AATCGCCCTC GTAAGGAAAA AAAGTCAACT ATGAGCAGTT CCGTGTTCGG TGCTTTCGGG CGAGCGCTCT TTGGAGCGGA CACGGAAAAC GATGACGAAA GATCTGCACA ACTGGATAGC CCAAGTCGAA GAGCACCCGT AAGCTCGGTT CACGGCAAAG AAGACCTTGT CGTGTGGGAC GATTATGCAC CTAGTGACGA CGAAGAGGAG CCACAATCTG TGGAAGAGTG TGATGATTTA TCGAGCGAAA TGGAAGGGCT CAGTCCAGGC GCCGAGTTTG AAAACCTTTT 
GATTAGGGAA AGCCTAGGCA TGAGTCGTCA ATTGGACTTA CCAGTCACTG GCCTGGAACG AATGGACGAA GCCAGGCGGC ATCTCGTGTC TCCACGCGCT CGCGTCCGTG GCCGACTGAC AAATGCATGC AACTTTAAGG CACTTGTTTC GGACAGTCGA TTTCTCAACG ACGCGGGAAT TCGTGTCCTC TTGCAAGCGT TGGCGGAGCT CATCGCAGGC ATGAGTCGGT CGACGAGACT TGCTGAAGCT CCACCGCTTC CTCCCCCAAG CGGAGGTCTC GAACGCAGCT CTAGTAGTGA TTCCATCGCA ACCCCGGTTT TCTTGCCGAC TAGTGGGTAT CTTCCGATTT CCCCGGCTTC CGAAGCGTTT GCTGAAGTTC TCATATGCGA AATTGCTTTG AAGAATAGAG ACAGGTTGAA GATGCTTTGG AAAGATGTTC TGCAAGACCA TTACCTGAGC TCTTTGACGA GTATTCTTGT CAATCCAGTC GAAGGTGCTA GTACCGCAGT TCCCCAACCA GATCCAGGTC TCGAGAAGCG AGTCACGGGA TTGCTTCGCA TCAGTATCTG CGCTGTGCAG CGCGACGAGC TTTCCAACGA AATTTTGTCT GCATGGAAAT ACTTGCTTCC TATAAGCGAT GAACAACGAG CGTCGTCGCC TTTGCGTGTG CTCGACAAGC ACATTGGTGA AGGATTGTGG AGAACCGCAT CTTCGGTTGA TGGCCTTCAT TCACTCAACG CCGATGGCTG GGAAGGTTTG ATGTCGCTTT TAAAGTGGTG CGCAAAATGT GGCGGTATGT CAAAGCCTGT CATCTCGCAC GGCAGTCAGG TGTCGGCGCC TCTTCCTGAA AACGATCCAG CATACCAAGG ATATCGAACA GCGCATCTGA TACTTAACAC AGAAGACTTG GATAAACGTG TCCCTTGCTC TATTGTGGAT GCTCTTAAAG CATTGGTAGA AGCAGGTCAA AACCGCGCCT ATCCACAGCT GAGTATCGCA TCTTTGGATT TGCTTCACAC GCTGCACGAG AAAAAAATAA ATTCGTTGCA AACGGAATCA TTTTCCGACG AAAACGCTGC TTTGTTCTGG TCCGGATGCT GGCGAGAGAC CGTTGCAGTG ATGGCGGAAG CGGCTGAGCT GTCTTCCGAT ACGGTGCGTT CAGCTTTCCC TCTATGTTTT GTTTTAGTGC TTGACTAAAC CTGTTTTCTT TCATTTTAGA ATGTTCGACA ACATTCATTG TCAATGCTGA CTGATTTATT TTTGGAAAAG CGAAAGACTG CAATACCAGT AGCACACGTT GCTGGTGTCC TCAGTGAAAT ATGCGTCCCA CTGGCAGGGC GCTGCATCTT ACGCCTTCAA ATGGGTGATG ATTCGATTGA GAATTCAGAC GCGTTGATGA TTGAGTTTGA GCTGTGCATC AGTCTTATCT TCAAGCCCTT GCGACACCAT CTCAACACGG GTATGTCGGC GATTTCCGAC GGAAACCTTT CATCCATTTG GAAGTCGGTG TTATCTGTTC TCGAAGAACT GCTGCGCGAA GACAGCCCTT CGCTGGACAG CAACGAAGGT CAACCTTCGT TACCGGTGAA TCTGAAAGCT ACAATGAATC AACTTGTTAA CGAACATCTT CAGAATGCCA TATCAGTACT TATTGCCGCC GGGGTCTTGC TGTCGGAAGG CTACTCCAAA GCTTCAGAGG ACATTTCGTT TATAACCTGG GAATCTGTTG GTCGAATGGG GATTCCTGAA AGTGCCGTTG TGGAATGGCG ACAGCAAGCT TTGCATGAAT CATAATAAAT TGTATCGATT 
TATCTCAATC AAATCAAAAA CTGTCCCTTT ACGGATCAAG CCACTATGTA TTTGCAATAT CCAAGTAGTA TGCAACTGGG ACTAATGTAG TGAATATTTA AAAGAAGCGC AAGTTTCGC
|
Protein sequence | MTTTTTVPHA TVPAVPAARL VGQEARMVLT SLRGGPPYVA RGTLAQDLLD LRDRLPQLRT TTNTTLATGS PTTPVGETTH THTNTSISVH PNASDTAVDD TNDDRSYDFV RPFLQVVTDP RAAGPHTLVA LRSLHRMLLN KSLFVLYDHQ HQQLQHPPPP LKYLQQNHSA VLASIVKAVL TCQFEQTDAG ADEAVEMAVA EVLGQVVALL PSLGLPETVD PRHAAITPET LAEIFHAVFV TRTSSALANS PALVLRLEDI LLQMTQHVFR PGSPPNEATA TPNRKVWLGR CQAVLEFWTH PLLHTPLVGG DGLDESTRED QRLYDATRVL CLRAVRTALQ TGWAEASIAT SYDMDDEEDD DEHYQSLISI IQDDLCLSLL MTGQAIWAYH DAHTNISPGF VSLEVLSEIC ATLTTLWNTL PLRTVLIAQF ETIWTGFYTR ALVLLRKRHP PTNSLSFNAN LTFDAEVEII LESLVDVLCL HDHVRSIADG DGGALETMFA YYDCHLRRSD VAVGLMVELC RCCGGAVDPD GETLVLTPST SFLARPPTCE DSVHSDTSDT ATPPDTAVSS PMVQVDHVWR PVPPHLKELC AQALMGGMKC LFRDDKASAE TLLERSRRKR SIMSRQLKDS FEEALSETIP NTNVAELSVS PSCTHVLRDV KTKKRLMRKA ARIFNHKASR GIEFLLDAGL VADPVTPMSV ATFLRNGIVV GLDKKAVGAY LGEAGKAPIA GKSPLSWERD WFHKDVLQSY CGLFRFEGQS LLDGLRMFLA AFRLPGEAQQ IDRILQAFSD SCGQVCEESA DGRLQLFSED PKRASDAAYL LSFSIIMLNT DRHNTNIRED RKMSAADFVK NNTDYGRDIT EKGKEFPSEF LEGIYHSIND EEIRTEGEGA DGAMTVERWK DVLRGSTEEA EDEFLPSLHD AEDLTELVLE HVWKPIMSSI GAFWGMPRVA DDEPLSPSDP AQNGMLGVQG ARLGMDMALE MLHGVRKLGR IDIFRKIFSW ICDYTGLIGD YSVDAVERTW SLTNSVEAQS AVVAAIRTAL DAGEDLNGDG WKRLWSILFE MRDLKLLAYG GPSAKSSLLH ESDPDILDES ARRDWTIVSP SRRAPVSSVH GKEDLVVWDD YAPSDDEEEP QSVEECDDLS SEMEGLSPGA EFENLLIRES LGMSRQLDLP VTGLERMDEA RRHLVSPRAR VRGRLTNACN FKALVSDSRF LNDAGIRVLL QALAELIAGM SRSTRLAEAP PLPPPSGGLE RSSSSDSIAT PVFLPTSGYL PISPASEAFA EVLICEIALK NRDRLKMLWK DVLQDHYLSS LTSILVNPVE GASTAVPQPD PGLEKRVTGL LRISICAVQR DELSNEILSA WKYLLPISDE QRASSPLRVL DKHIGEGLWR TASSVDGLHS LNADGWEGLM SLLKWCAKCG EDLDKRVPCS IVDALKALVE AGQNRAYPQL SIASLDLLHT LHEKKINSLQ TESFSDENAA LFWSGCWRET VAVMAEAAEL SSDTNVRQHS LSMLTDLFLE KRKTAIPVAH VAGVLSEICV PLAGRCILRL QMGDDSIENS DALMIEFELC ISLIFKPLRH HLNTGMSAIS DGNLSSIWKS VLSVLEELLR EDSPSLDSNE GQPSLPVNLK ATMNQLVNEH LQNAISVLIA AGVLLSEGYS KASEDISFIT WESVGRMGIP ESAVVEWRQQ ALHES
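The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch, with hypothetical helper names, of how one might strip the block spacing and compute a GC percentage to compare against the GC content field. The example string is only the first three blocks of the gene sequence, not the full record.

```python
def clean_sequence(blocked: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(blocked.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First three blocks of the gene sequence shown above (illustration only).
example = "ATGACGACGA CGACGACGGT ACCGCATGCC"
seq = clean_sequence(example)

print(f"length: {len(seq)} bp")
print(f"GC content: {gc_percent(seq):.1f}%")
```

Running the same functions on the full gene sequence would give a figure to compare with the reported 53%. A short prefix like the one used here is not expected to match that value.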