Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_44609 |
Symbol | |
ID | 7198105 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011672 |
Strand | + |
Start bp | 1031837 |
End bp | 1036505 |
Gene Length | 4669 bp |
Protein Length | 1541 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002178351 |
Protein GI | 219115111 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCTTCAG AGCCGGAAAA GGAATGGACT CTTGAACACC ATTTGGTAGG GTCGACGTTG CCGAGATCCG TGCCGATCCG AGTTAGCGGA CAGCACTATG CCATTACCGA ACAAAACACG ACGTTGCAGT TCTTTCAACC CGAAGACATT CCTGCTCGAA AGCCTGCGCT ATGGAGATCC ATCACAACAG GATGGGAAGA ACACGGCGTC GCTCTAGCCT TGGACAACGT AGCTTCCGAT ATGCAAGATC CTGTTGTGCT CGACATTTCT CACAATCAAA GAGTCTTGAC GGATCGGATC CAGTCGATTC TATTAAAGGA GGTTTTGGAT GAGTACCTTC CCGATCTTAT TATGGTAGCT GCGTTAGAAG AGGTCACTGC TCAGTTTCTG AAGCAAAGAA TGCTGGAGGA AAATGAAACG GCCATCACAC AGTCGTCTGC TAGCAGTAAA TCCAACCTGC CAATGTCACC GCCGCGCTAT TACAGCAAAC CACCCAGCAG CTACCTGACT CCGCCTGTCT TTAATGATCC TGCGGGTAAA GAATCATCAG AACTGCCCGT CGCTGTGCAG TTCACTCCCA ACCAGCTAGT GGCGCGGATC AAACAAGGAT GTGCCATGGT GTACGCACTA GAGCAGGCAA ATAACCCTCC GCCGAGCTCG ATCAATACGA GCGCAATGTC TGAGCGTCGA TCCAGCAGTC GGATTGCTCG TAAAGAAACG CAGAAGGTAC AAGACGAGAC GGATCGAACA GCTTTATGGA AAATTGTGCC TGGTGGCAAA GTTGCTACTT TTCTTCACAA TCGTTTGTAT CAGAAATCAG TCCCTGCCTC GGAACGAACG AAATCAGAGG TACTAGATTC AACGACCTAT GATGAAAGGG GAAAGGAGAA ATCGAGTGAA TTAATGAAAC AAGAACCTAC GGAAGAATCT AATCAGGCCC AAATCAAATC GGACGACGTT GTCAAAATTT TGCCTAAAGA AATCACCAAG GTTGATGAGC ATCAAAACAG TCTTAACACT TTACAATTGC ATGCAGAACT ACCTCTTCTG GGAGCGATGA GGGTAGACGT TGACCAAAAA AACGATCATG CTGAAGACGA GGAAATCTCA GAACCAGCTG ATGAGACGTA CTCTCCCGAA GCCGATGCCG AAGGTGATGC TGACGAAGAA GACGAGACAG GAAGTTATAC AAAGGAAAAT TTGGAGGACC TACAGCTCGA GAACGTGCAC AATGCGAAGG ATAGGAGAGA GGTCACGGTT GATAAAAACG GTGATGGTGA GGAAGAAACC GACGGTGAAG GTATCGAAGT AATCGACGCA AAGAACCCTT ACTTGCAGCC AACGAACAGC TTTGTTTTAG AATGGCTAGG TCGAAAGACT GGAAAATATC TTTCTAACGA AGACCTGCAA AATTCCTTCC CGCAGCTAAT TGTCACTGCG TCAAAGAAGA AGAAGATGAA AGCATCACAG AATGAAATGG GGCGTATGTT ACAAACGCTT GTTACGCCAT TAGACCGCCT TGTGTGGAGA TACACGATGG CTTCCGAGAT TGGGATGGAT AACCCAGAGG CTAGACTGGC CTTAGAAACG GTTGTCGAAG AAGACCCTCC AGATGTCATG AAATGGGAAA CGTCTTACTT CGGTAGAACC CAATTTACGC TTCGTGTTCT ACACAACTCT GTCGCTGAAG AAGTCAAAGT GCGTGAAGCC AATGGACTAG TCGAAGCGGA AGCTGAATAC AAGGCGAAGA AAACGTGGGA CTCACATCGG TACAAGGGAA TCCATGGGGG CCACACTTGC TGGCCGTCTT GGACGGATGG CGTTGCTAAT TGGTACGAAG AGCAGAAAAT GCATCGTAAG GATGAAGATA AAAATGATGC TCAGATCAAC AAGACGTCCA TTAAAGAAAA CACAAATGAC ATATTGTTGG CGCAATCAAT AGCTGACCTT GAAACAGAAG GCCGAGGTTC ATCGCGAAGG GCAAAAAGGC GTGGAGTAGA AGGGGGCGGG GTGTTTTATG GAACCCAGTC GAATATGACT CAAAAACAAT TGATGGACAC GCTTTTGAGG TTAGTTTCGC AAAATTCGTT TCAAACCGCT GCTTCCCTGC TTTCCGCGGT TCCTGAAGAA AGCAGCGACC CAATGAGGCG AATTCGAACG GCACTTGGCC GAGTTCTTTG GAAACGGAAT CAAGTGGCCA GAATATCTGT CAAGACAGCG TGGACTGATT TGTCCATTTG GAAAACTCTG CAATCTGGTC CCCTACTTTC GATCGCAGAT TCTGCCTATT CCAACAACGC AAACGTCAGT ACCTGTTCTT TAGAGACAAA CCTTACTTCT TCCGATTTTG AATTGATCGA GTATGTTCGC AGGCTGCATC AGATAGAGCT GGGACTCCGT AGCCTTGTGA TTAAGCACCT GACTGAAATA CCACTTGCTA TAATCGCAAC AGCGGCTGAT GAGAGACCTG GAACTATGGA AGCGATGGAT GACGCAGACT TTGAAGGCTC AGGTGGTGTC CAGTGGCAAT CGACCGGGCA CTCTTTTTTA ACAAAGCGAA TTTTTCGACC ATCGGAGACC CAGAACACTA ACGATTTGAC ACCTTGCTAC TGGTTCACCG TTCATGACTT TGCTCCTTCA ATTGCATCGG GGCCAGAAGA TGAACAAGCC CCATCTTCTG AACGAGCTGT CATTGGAAAC GCAAAAGATT GTAACACCGT TGAACGCCGA ATGCGATTTC GGGCCAGTCT TGAATCATCT GGACCGAATG GCGATTTACC TGCAATGAAC CTTCTTCTTA CAGAAGCACA AGTATTGGCT GGGCTAAAGG CTGCAGAGAT TGAGTCGAAC CAAAAAGGAG CTGCACTTTT GCGAGAAAAC CCTTTTTCAA ATGAAATTGG GGCAAAAATA ACCTTGCTCT CGTCGAATAA TCAAGAAGAA GCATGCTCGT TCAACGCAGC CATTGCTGGT CACAATAGCA TTGTTGGGGC AAACGGCCAA ATGCTTCACA CGGTTCTAAT GCTTCCTGAC AAGGGCTCAT CCCGACAAGA AGCTTTCTGG GCGACTTTGG AGGCTTCGTC GAGAGGGATG TGGTCGTGCA AGATTACAGG AGATCCATAC ACCTACTTTA TCCAACAATT TGACTACCAC TCGGGATCTG CCGCGTTTGA AGAGTGTCGG GCGATCGTCA ATTATATTCG TCGGCACCCA AAGACAGCTG CCTTTCAAGA ACCCGTTGAT CCAGTTGCGC TTGGTGTCCC CGAATATTTT AGTGTTGTCA AAAAACCAAT GGATATCTCG ACTCTTTCGA ACAACTTAGA GGAAGGAAAA TATTCTAATA TCCCTCCAGC CACTGCTATC GGCAGAACTC CAGTGGCCCG TATGCTGAAT GGCCCATTTC GGAAAGACGT CGAACGCATC TTCAACAACG CTATGCTTTT CAACCCACCA GACGATTGGA TCCACCAGTC CGCAGCTACA GCGAAAAAAG CTGCACTGAA AAAGATTGAA CAAGCCACAA AGGTTGCCGA AGAAAAGGCA GCTGGAACAT CAAGGCAGAA GAAGAGCATT TACATCGATT ACGATTCGGA TGTAGATATG TATGTCGACG AAAGTGATCA GGATGAAGAT TTTGGTGGAC TGCGAAAGAG TCGAAAACGG AAAGCGGTCA GTCGTCCCAA AGCTAAAGAA GACGCATCAT CTCGGGCCGT TGAGGCGCCC ATTCGCTTGC AAAAGATGGT TAGTGAATCC TTGGGGCTGC GGGGGCCTCT CGCGAATCTC CCCATCAGCA CGGATCCTCT TACTTTTTCT CTCCCCGCAG ACTGGACCTG CAAATCGGGG GCGGTGGTTA CCGTTACGGA TGCGGGAATT GTGGAGACAG AAGAACGTGA GCTCAGTGAA GAGCTAGGAG AGCTGATCAC ATTGCACAAA CAAACAGAAG CAGCGGAGGT TCTGAATTTG CGGCGTTCTA CTAGAGGACA TTTGTACGAG GAAGACGAGC AGAATGGAAC CAGAGAAATG AACTTGACAC AGTTCGAATA CTTGACACGG AGCTCATGGT TTAACATTGA TGACGGCGCT TCGCTCGGGC CTCTCCGGAA CCGACTGGAG GTTGAGCTTT CTTGTGAACG TCTTCACGAG GAGTATTTTG CAAGAGAGTA CGATAAACGG AGAAAGCAGA TCGTCAGTAC TGCAGAAGAC GGAAATAGAT TTGGTCATTA CACGGAGGGA TCGTTCCCTC CGTACCTCGG ACACTTGGTT CCCATGCGTT CGCCTGATTC AGATACGGAG ATGTTGTGGG AGATTCGGCC GGCCTACATT GCGCCAGCCT TGAGATGGGT GATTCGAGGG TTGGTGAATT CGCAGCATCT GGCCGCCTTA GAGCCGCTGA CAACCGACTC GATGAACAGT GGTGTCGTTT TGGCAAATCA TATATATTAT CTCGATCCAA GTACGAAAGC TTGCGAAGTT TTGAACTTGA AGGAGATCCA AAGACGCAAG CGTGCCGACC ATGGTGGGGA CCAGGAAGAA AGCGAGGACG AAATTGAACT CAGTGAGTAT GATAAGCTTC GCGCGGAGCG TGTCGCACGA AATGCCGATC GCTTGAAGGC ATTGGGGTTA GCTTAGTTTA AAGTTTGCCT TGCTTGGCGA AAACGAGTAC ACCATAGTG
|
Protein sequence | MPSEPEKEWT LEHHLVGSTL PRSVPIRVSG QHYAITEQNT TLQFFQPEDI PARKPALWRS ITTGWEEHGV ALALDNVASD MQDPVVLDIS HNQRVLTDRI QSILLKEVLD EYLPDLIMVA ALEEVTAQFL KQRMLEENET AITQSSASSK SNLPMSPPRY YSKPPSSYLT PPVFNDPAGK ESSELPVAVQ FTPNQLVARI KQGCAMVYAL EQANNPPPSS INTSAMSERR SSSRIARKET QKVQDETDRT ALWKIVPGGK VATFLHNRLY QKSVPASERT KSEVLDSTTY DERGKEKSSE LMKQEPTEES NQAQIKSDDV VKILPKEITK VDEHQNSLNT LQLHAELPLL GAMRVDVDQK NDHAEDEEIS EPADETYSPE ADAEGDADEE DETGSYTKEN LEDLQLENVH NAKDRREVTV DKNGDGEEET DGEGIEVIDA KNPYLQPTNS FVLEWLGRKT GKYLSNEDLQ NSFPQLIVTA SKKKKMKASQ NEMGRMLQTL VTPLDRLVWR YTMASEIGMD NPEARLALET VVEEDPPDVM KWETSYFGRT QFTLRVLHNS VAEEVKVREA NGLVEAEAEY KAKKTWDSHR YKGIHGGHTC WPSWTDGVAN WYEEQKMHRK DEDKNDAQIN KTSIKENTND ILLAQSIADL ETEGRGSSRR AKRRGVEGGG VFYGTQSNMT QKQLMDTLLR LVSQNSFQTA ASLLSAVPEE SSDPMRRIRT ALGRVLWKRN QVARISVKTA WTDLSIWKTL QSGPLLSIAD SAYSNNANVS TCSLETNLTS SDFELIEYVR RLHQIELGLR SLVIKHLTEI PLAIIATAAD ERPGTMEAMD DADFEGSGGV QWQSTGHSFL TKRIFRPSET QNTNDLTPCY WFTVHDFAPS IASGPEDEQA PSSERAVIGN AKDCNTVERR MRFRASLESS GPNGDLPAMN LLLTEAQVLA GLKAAEIESN QKGAALLREN PFSNEIGAKI TLLSSNNQEE ACSFNAAIAG HNSIVGANGQ MLHTVLMLPD KGSSRQEAFW ATLEASSRGM WSCKITGDPY TYFIQQFDYH SGSAAFEECR AIVNYIRRHP KTAAFQEPVD PVALGVPEYF SVVKKPMDIS TLSNNLEEGK YSNIPPATAI GRTPVARMLN GPFRKDVERI FNNAMLFNPP DDWIHQSAAT AKKAALKKIE QATKVAEEKA AGTSRQKKSI YIDYDSDVDM YVDESDQDED FGGLRKSRKR KAVSRPKAKE DASSRAVEAP IRLQKMVSES LGLRGPLANL PISTDPLTFS LPADWTCKSG AVVTVTDAGI VETEERELSE ELGELITLHK QTEAAEVLNL RRSTRGHLYE EDEQNGTREM NLTQFEYLTR SSWFNIDDGA SLGPLRNRLE VELSCERLHE EYFAREYDKR RKQIVSTAED GNRFGHYTEG SFPPYLGHLV PMRSPDSDTE MLWEIRPAYI APALRWVIRG LVNSQHLAAL EPLTTDSMNS GVVLANHIYY LDPSTKACEV LNLKEIQRRK RADHGGDQEE SEDEIELSEY DKLRAERVAR NADRLKALGL A
|
| |