Gene Information |
Locus tag | PHATRDRAFT_45771 |
Symbol | |
ID | 7200912 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011676 |
Strand | + |
Start bp | 224341 |
End bp | 227935 |
Gene Length | 3595 bp |
Protein Length | 947 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002179993 |
Protein GI | 219118442 |
COG category | |
COG ID | |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | GAAATCACGA AGCCTCCGCG CGACCCCGGA ACTATCCGTG TAGAATTCCT GGTGTCTCGA ATTCGGATGC GCGCAACAGT CAGCTTTCTC ACGTGAAATT TGCGTTTGCC GTTTTTCCTT CCGTGCACCC TTTCATCCGA GATTTTGTTA CCGGACAATT ACTCTTAGTC TCGTACTTGA AAGTTGGAAT TCATTTCCCT TGCTGGTGTA TCCAACTACT ACCATGATTC AGTACGATCG TTCGCCCTTC GGCATATCGA CGCTCTGTCG CATGCACGGG AGTGCTGTGT ACCGAGCAAC GATTCCAGGC TTTGCCTCGG TGATCTTCAT TGTTCTCATT CACGAGCTTT GGTTGGAACG TCCTATTTAC CAGGAGAAGG ACAACGAGCT AGGTCATCCC TACGCCATTG GTGTTTTGGT TGGCACCATT ACCTTTCTAC TCGTCTTTCG CACTCAACAA GCCTACGCAC GGTACTGGGA GGCATGCTCG TCCGTCTACC AGATGATGAG CAAATGGATG GACGCCTCTA GTCACACGGC CGTCTATCAT TTACAGTGCG ATCACTACGA CCACATCAGA CCGCCTTCCT TTTACGATTA TCCCCACTTG AATGCCGAGT TTCTCACGAG AGACCGCGAA ACTCACGATC CCAATTCAAC GGGCGTGATG GAGTCGATGC GCATGCAAAA GCGGGCGACG ACCAAATCGA TCGAACAAAT CGAAACTTGC CGACTCGCCC CTGCCAAGCC GAAAAGAAAG GTTTGGGAAA GACGATCTTC GGATACCAAA TCCGAGAGTT CCAGTCGAGG AGAGGATACG CCCGTTCCGT TGGAATCAGA CCCGCGTCTA GACGGCAACT GGGGAGCGCT CTTCGACGAC AATCGTGCTA CGTTCTTCGA TCCCAAATTC CCCGACAGTA TTGATAAAGC TGGCTTTGCA AGTTCGCAAG GCGGAAACAC TCCACCACTC TTTCTCCAAG AACTGGCTCA CTTAACTTCA CTGTTGACCG GCGTTGCCTT GGCCACGCTC CGCAACGATA TCGAAGGCGC CGAGTCACCC TTGGACATTT ATGAAGCGGG ATCGCCCTGG CCAGAAGTAG ATCCGGACAA GATGGCCGAC GCCACATTCT TCGGCGCAGG CTTTACCGCC ATGAAGATTT CCAATTTCAT CGGCATCGGT CGCACTCCCG AGGAACGCAC CCGTTACAAC GCCTCGCGCC CGCTGGCCGT CCTTGGTGGT GTCTCCGCCG CAGAAATTCG ATTTCTGCAA ATGGCTCGAG GCCCCTACGC CAAAACACAG CTGTGCTGGT CCTGGCTCTC GGAATTCATC ACTCGAGAGC ATTTGGCTGG TTCCACCGGG AATGTTGGAC CGCCCATTAT CAGTCGTATC ATTCAGTTTC TGGGCGACGG GATGATCTAC TACAACCACG CGCGAAAAAT CACGTTTATT CCGTTTCCGT TCCCTCACGC CCAGCTTTCC GTAGTCTACG TGTTGGTCAC AGTTCCGGCA GTGGCCTTTC TAATGGATCA GTACACGGAA AAGCTCTGGG TGGGTTGCAT TTTGACCTTT TTGACCGTGA CCGCATTGGC GGGTATTCAC GAAGTCGCCC GAGAATTGGA AAATCCGTTT AGGAATGTCC CCAACGAGCT CCCTCTAGTG ACGATGCAAG CGCAGTTCAA CGAAGCTCTC TTGACCATGT ATGCCGGCTT TCACCCGGAC AACTTCTGGA AGGAGGACGC CGATCGCTAC TCGAAAAAGA AGAAGCGGAC GACTGCAAAA AAATGTGTGC AAAGCAATGG 
CGTCAATTCG TCGGAATCGT GTCCACGACA GGCCCCAGAA GGGGAGGCAC CGGAGACGAG GACAAACCAT GAAAAGTCTC TGGACGCAAA GATACAGGAA TTGCTGGAGA AAATCGATCA GCAAGGGTCG GAACTTGCTA GATTACGGGC GACGGTGGTT TCCGGCATAA AAGATCCCGA GAGTCAGCCG GTGGAATATC ACCGGCAGGC GGAGGAGCAA AGTGTTAGAT AGTGAAGAGC TCACTCGACA ACGCCAACAG TCGATCTTTG TGTATAGTAT ACTATTTTCA CTCACGGTCA ACTCTTTTTA CACCACTAAG TGTAGCTAGC AGAGTTGAAT CCATGTTTGG CCGTTCGTGG GTACAGTGAT TAGTATTGCC TCTGCTATCG TTTACCAATG GCATGTAAGA CATTATACGC ACCCACATTG CTACAGTCTC TTTCTGTCGC AAAGATCTCA CTGGACATTT TTCTAGCTAA AAGTGCTCGG AATGGAAATT TTCAGAACAT GTACGTACCT GCCATTTGAC GAAATTGACA AGTGAACTTA GACGGACCCG GAAGGTGTCG AAGCAGGTCA TGGTGGATTT TTTTGGAAAT TCCGTTGGGT GTTATCACAA CTTTAAGTTA CATTAGTGAA CAACGCAGAT TCTTACAGTA CATGGCACGA TGATACCTTC TGTTTCAGCT TTTGCCAATC TCGGGAAGCA TTTCTGGGCT ACTGCAACGG GCCTATTTCT GGCGCTAAGC GGTATGGCAT CTTTGTTCCA CGAAGGGCAA ATTATTTTCA GTTTAAGTAG TGATAGCCCC TTCGCTGCGA CACACGGCGT CGATGAGCTT CATCCAACTT CAGCAATATC TGAGATGAGG TCTGCTTCTG ATAGACAAAT GCAACAAAAT AAGACGGAGA CTGAGCTCTT GGTTTCATAC AGAAACAGGT TCGAAGTAGT ACCTGAAATC ACTCGACCTT TAGTGGAAAC AATCCCTGGC GATTCGACGG CACCGACCAG CCGTGCTTTC AACACAAGTC AAGCGGACCG GGCTGATATA AAAGAAGAAC CAGGTCGAAA GCAAAATTGG AGTCGCATGA TCATACCCTG GCCTGTTTTT GTTTTAAGTT TACCGAAATC TGGGACATCC TCGATCTCGA GTTATTTCAA CTGTGGACTA AAGCATAGGC AGTCTGCTCA TCATTGGGGG AAGATGAACA GTGGAAAACA AAATAAATTG GGATTTTGTT TTCTGGATAA CGTCAGGGGC AACAGGCCAA TGCTAAATGG CTGTGGAAAG TACAAGGTCT GGGTGGATGC CGGGGTGCCG TCCAGTCGGG GGAAATGTTT CTACCCCGGG ATGCATGGTC TTGACAACAT TGTAACTAAT TATCCTAACG CAACCATAGT CCTCTCCACG AGGGAAGCCC TGAACTGGGT GCGCTCTGTC AGAAAGTACG CGGGAGGAAC CCTGATGGAC AAGTGGCAAC GGAACTGTCC CGACTTTCCA AACGCAAACT CGACAGAGTT GGAGTGGGCG GTCTTCTACG ATGGCTACAA CGATTCCATC CGGAAATTTT CTATTGCGAA TCCGTCTCTC ACATTGGTTG AAGTCAACCT AGAAAGCAGC TTAGCCCCAA GTGTGCTCAA AGAAAAAGTA GGTTTCCGAA AGAATTGCCT TATGCATTGT CGACCTCATA CAGGCTGTGA AAAACTGAAT GATACCTTTC CAGTCGGAAT CCATAGCGCG ACTAGAGATA CATCAGTACA TTCGGTGGAT GGAGCCAATT AATCAAGACT TCCAA
|
Protein sequence | MIQYDRSPFG ISTLCRMHGS AVYRATIPGF ASVIFIVLIH ELWLERPIYQ EKDNELGHPY AIGVLVGTIT FLLVFRTQQA YARYWEACSS VYQMMSKWMD ASSHTAVYHL QCDHYDHIRP PSFYDYPHLN AEFLTRDRET HDPNSTGVME SMRMQKRATT KSIEQIETCR LAPAKPKRKV WERRSSDTKS ESSSRGEDTP VPLESDPRLD GNWGALFDDN RATFFDPKFP DSIDKAGFAS SQGGNTPPLF LQELAHLTSL LTGVALATLR NDIEGAESPL DIYEAGSPWP EVDPDKMADA TFFGAGFTAM KISNFIGIGR TPEERTRYNA SRPLAVLGGV SAAEIRFLQM ARGPYAKTQL CWSWLSEFIT REHLAGSTGN VGPPIISRII QFLGDGMIYY NHARKITFIP FPFPHAQLSV VYVLVTVPAV AFLMDQYTEK LWVGCILTFL TVTALAGIHE VARELENPFR NVPNELPLVT MQAQFNEALL TMYAGFHPDN FWKEDADRYS KKKKRTTAKK CVQSNGVNSS ESCPRQAPEG EAPETRTNHE KSLDAKIQEL LEKIDQQGSE LARLRATVVS GIKDPESQPV EYHRQAEEQM ISIASAIVYQ WHLKVLGMEI FRTSFANLGK HFWATATGLF LALSGMASLF HEGQIIFSLS SDSPFAATHG VDELHPTSAI SEMRSASDRQ MQQNKTETEL LVSYRNRFEV VPEITRPLVE TIPGDSTAPT SRAFNTSQAD RADIKEEPGR KQNWSRMIIP WPVFVLSLPK SGTSSISSYF NCGLKHRQSA HHWGKMNSGK QNKLGFCFLD NVRGNRPMLN GCGKYKVWVD AGVPSSRGKC FYPGMHGLDN IVTNYPNATI VLSTREALNW VRSVRKYAGG TLMDKWQRNC PDFPNANSTE LEWAVFYDGY NDSIRKFSIA NPSLTLVEVN LESSLAPSVL KEKAVKN
|
| |
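The Start bp, End bp, Gene Length and Protein Length fields above can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive (an assumption, although it reproduces the reported 3595 bp), and the helper names `span_length` and `min_coding_bp` are hypothetical, introduced only for illustration. It also compares the gene span with the minimum number of nucleotides needed to encode 947 residues plus a stop codon.

```python
# Values copied from the record above.
START_BP = 224341
END_BP = 227935
GENE_LENGTH_BP = 3595
PROTEIN_LENGTH_AA = 947


def span_length(start: int, end: int) -> int:
    """Length of the closed interval [start, end], assuming 1-based inclusive coordinates."""
    return end - start + 1


def min_coding_bp(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode protein_aa residues, optionally plus one stop codon."""
    codons = protein_aa + (1 if include_stop else 0)
    return 3 * codons


computed_length = span_length(START_BP, END_BP)
coding_bp = min_coding_bp(PROTEIN_LENGTH_AA)
# Bases in the gene span that are not needed to encode the protein.
remaining_bp = GENE_LENGTH_BP - coding_bp

print("span matches reported gene length:", computed_length == GENE_LENGTH_BP)
print("minimum coding bp:", coding_bp)
print("remaining bp in gene span:", remaining_bp)
```

The gene span is longer than the coding requirement, which is consistent with the record marking the gene as spliced. How the remaining bases divide between introns and untranslated sequence is not stated in the record.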