Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATR_44189 |
Symbol | |
ID | 7204105 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011671 |
Strand | + |
Start bp | 1237524 |
End bp | 1242470 |
Gene Length | 4947 bp |
Protein Length | 1560 aa |
Translation table | |
GC content | 47% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002186212 |
Protein GI | 219113257 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTTGACCATG AGAATATACT TTACCAGCCC AAGACAGTGT TTTACATAGC CACCCTGATC TGGTCTACAA CCGACAGGTG GATAGAATTC GCATCTAGTA TTTCAAATCG TCCATTGCTT TTCTTCAAGC TTCACGTTGA ATTTCCTACC AGAATGCCAA CTCGTGCAAA TGAGAACCAA TCGCTCCAAG AGCTGTTTGA CTCTGTGGTT CAACTTATTC GAGAAACGCC CACTCCGGAT GATTCCGCTC ACCAGCTATC AACCGGCGAC AAACTGCGTC TGTACGGTCT CTACAAGCAT ATTGAAGCGA GTGCTGCGTC TGACAATAGT GACAAAACGG CAGTCGACGA GGAAGCCGCG CCTTCTATTT TTCGTGTGGA AGCCTACGCC AAGTACCAAG CCCTGAAGGC TTGCACGGGA TTATCTCGGG AAGAAGCGAT GCGGGAGTAT ATTTCGTTAC TCAGCGCACA AGAAAATTCA TTGGGTGAAA TTTGTCGGGA TTGGTGGCAT AGCTCTAACT CTGTTGCCAA TGACGGCAAA AGTTCTTTTG CGAAAGAAGT GGAAACTCCG ATTGACCCTA TGCTAGAGGA GGATACGAAG GCTTCGAAGC TGGCCGAATC CAAGGACGTG AAACATACTG AAATGAATAA AGCCTCGCAT GTCGAACCAG CCGCCGTTTC AAGGCCGAAA CCATCGAAAC ACTTTTTCGG TGTGTCGCCC TTAATTCCAC GTGGCCAATT GGATATTAAG TATCGCGATC TACTTTATGC GAGTCGCCAT TGCGCTACAA ATATGATTCG CCGATCGACT ATGCCATGCT ATCAACACTA CGAGCGAAAA ATAAAAAATC AGTGGATCAG GGGTATGGAA GGTGATGGTC AAAGTCCTCA AAACGTTGTA GTTGGTCTGG CAGTGCGATC TTTGTTGGAC TTGTACCTAT TGAGTCGGTC CTTCCCAGAA GGCTCGCAGA TAATTATTTG TCCGCCGATA AATGTTCCAG GTATGCTACG AGTGCTCAGG CACCATCGAC TCGAAGTGGT GGGCGTGGAC TTGCCGCCGT CGGACGAAAC GACCAGAAAT ACAACAACCA CCATATCTGT CGATATCGAG GGGATCGAAG CGGCAATTAC AGATAAGACG GTTGCCATAC TAGTGGTTCA TCCGTTCGGA ATGGTGTCGG CGTCAAATCG TGACTTTGAG CGGATAAAAA CACTTGCCGA TCAGCATAAG CTGGACGTCA TGGAAGACTG CGCCGAAATA TTTACAGGTC TGGGGTCACT TTCGTACAGG GGAAGTCCTC AGGCTGATGT GGTCTTCGTC TCCTTTGGTC TGATTAAGAC TTCCACTGCC TTGGGAGGTG GCATAGCGAT GGTGAAGAAT ATAAAAGTAG CAGAGACCAT GAAGCGATTG CACTTTTCTG TTTACCAATA TCAAACGAAT GCAGAGTACT TCGGAAAAGT GCTGTATGCC CTCTGTATAC GCTTCGTGAC GGATTTTCCT TGGATGTGTG GCATGATACA TCAGCTGTGC GTAATTTTCC GATTGGACTT CGATTATTTC GTTACCTCAC TACTGCGTGG ATTCGGACGC TCACCTATGG ATTCCAGTGG GACGAAGTTT GATCAAGCAA TACATCAGTT TCGACGACGC CCATGCGCGG CTCTCTTAGC CTTACTTAGT TATCGCTTGC AGGAAGCAAA CCAACATGTA CCCTCGGTAT TGCATAAAAA GGATCAATGC CTAAAGATCA CTCGCTTGTT GAAAAGCACG ATCCCAAATG TCAACCTTCC GGCCCCAACC CCATACTGCA CGAATAGTAA TTGGCTTTTC CCAATCGTGT CGGAGACGCC TGGGAAACAA AGCACTCAGC TCGGAAAGCT TGGATTTGAT GCCACGCAAG GATCATCGCA GTTGTGTTGT GTGTCTCCAA ATTGCAAGCG AGCGGAAAAG CTCATGAAGG CGTTGTTGTA TCTTCCTGTC TGCGGTAAGA AGCTCTGTAA CCTTGAAATG AAACGGCTAG TGGATGGACT TCGCACTTTT CCGTGCAATG GTGATGAGCC CGTACAACCA GGTTTCGGCT GTAATGTTGA TTTCTTGTTG AAAACAAGAT TGAGTCTTAG TATGGCACTC TGCATATTTT TTGTTTTCTC CCGAGATGTT GCCTTTCTCA TACGACAGAT TCATCTGGGG ATTGTGGCGC TGGGGATTTT CCTCGTTGTT TGTATCGCAT CGTCGAAGCT TCTGCGATGG TTGGTCGCTG ACTACTATAT CAATTCTTCG ACCGCAGTGG GGAAGTACAT AGACCTGTTG GGTCAACATC CAGATTACGA AAGCCATCGA GACATTTCCA ACCATTCCAC TATTCCTGGA CTTCAAGCTG TCCGCACAAG TGTTTTCCAG TCAAGTCCAG CTCTCAGGCT TCCACAGTGC TCTCCCTGTG AACGAAAAGT GATACTTACT GGGGCGACTG GTTTTATTGG ATCACTTGTT CTTCGAGATT TGCTCCTTCA TCGAAAAGTG TTGGGGATTA AGAAGGTCAT ACTCATTTGT AGATCAAAGC GTGGAATTTC GGCTCAGGCT CGAATCGACA CATTGCTCGA AAATACAGCT GTGTACGGGT TCCTAGACAA AACTGAGAAG AGCGACTTAG TAAAGGTTAT TGAAGGTGAC GTTACAAGAC CCAATGCTTT ACTGTGTAAA ACTGATCTCT ACGATGTCCG CAACGACGGA AGTATATCCC ATCTGATTCA CTGTGCAGCG TCGGTGAGCT TCACACAAAG CCTTCCTGCT GCTGCCACAG CGAATATATC CTCCCCACTA TACTTGCAAG ACCTCGCAGC ATCTCTTGCC CATGAAAAGA CACATTTCGT GCACGTCAGT ACTGCGTTTG TGCATGGCGG TTTGAGCGGG ACGGACGACG AGCCACTCTC GGAAAGGTTG TTTCCCCTCG GATCTTTCGA TGCTAATGAT TTATACAGTT CAATGCAAAG TACGGAGTTC CTCGCTTCCA AGGCTATGCG TGAGCTTCGT TTTCCCAATT CTTACACGTT TAGCAAATGT GTTTGCGAAC ATTTACTTGT GAAAAACAGC AAGGTTCGGA CAACAATTTT CCGACCGAGC ATTGTTGGTC CAGCATGCGA GATGCCTTTC GAAGGCTGGG CGGGAGAAAG GCCGACTACT CTAGTTGCTG CTGCATGTCT GTACCTCTCG TACCAGTGGA ACCTTTGGAG CTTTGGCCCT TATCGAGTAT CGTGTATTCC AGTTGATGTG GTGTCAAGAT TCTTGTTATC CAGAGCGTTT GCCGAAGGCG GCTTTCGCGA TGTTGTAGGT GTTTACGACT CGTCAAGTGA CGAAGACTTT GAAAAGGTTT CTTGCGCTTC CTCTGCTTTG GCCGATACTG TGTATGCAGG AGATCCAGAG TCAAGCAATT TACCACTTTA CACGATACAC AATGCGACTT GGGATTCAGC CTCTGCACCT AGTTCTACCT TCACTTGGCT GGACTATGCC AGCACTGTAA CTCAGGTTGG ATCTTTGTTT GGCCATTTTG GTCGAGCAAC AGCATACATC GGCTTGATTC TCTCGACACA GGTGCTCTCT ATACTGAACC CCTCTCTTGA GCTATATACT AGAATTCATC GGAGTATTGT CAAAGCTCCT CTCCTTTTCG TTGAAACAGT ATTTGCATAC TTGAGAGTAG ACGCGTCAAA TATCAGGAGA CTCCTTTCCT TTATTGATCT CCCTCTTTTG TTCTTTCCCT TTATGAATAC GAGCTTTCAC TTCCGCAGTC GGCTTGTTGC CAAGGATTTT GATGGCCAAC GATACGCTCT CAACTGTGTC TTGGCTGCGC ATGTGTTTCT AGCCACCACG AGTGAATGCC GTCGGTCTCA GAACCCCTCA AAAGAAAGGA ATACCTATCC AACCTTTTTC CTTCTCGGGG GGCGTATCCA CGAACCGGTT CTCTCGGATT CGTGGTGGGC TTTAACTCAA CCAAGGGGAT CTTTAGTCAT CAGATGCATT GGATTCTTGG CAAAGAAGGT TCTGAGATTG TGTTTTAACG AGGTTTCCGT TGATCTTTTT TCTTTCGCTG AAGCGATCCG AGAAGCCGAG AATACGGGAC GACCTATTCG CATTGTCCTT ACCCCTACGC ACCGGTCGGT CTTCGACTTC ATCCTTCTTA GCTTTTTAGC CTTTTCGTTG CCGGAGCTTC AGGTGGATAT TCCTTTCGTC GCAGCCGCGG AGGATTTTCG GCAACTTCCT ATCATAGGAT GGTTGTGTTC TTGTGCCCGA GCTATTTTCA TTCGAAGAGG CACTGGTCAA GTTGATCCTG ATTCAAATCA GCAGATCCAG GCAATTGGTA TTCATCGACA TGGACCGGTC CCTGTCATGG AAGTCTTCAT TGAAGGAACT CGTAGCCGTG ATGGACGGTT TGCGAAACCA AAGACTGGTG TTTTGAAGTG CCTTCATCAA AGTGGCATAG ATTCTTTGAT CGTACCAGTT GCAATAAGCT ATGAAGCTAT TCCTGAACAG CATTATATGG AAAAGGAACT CGTGACAGGT GCTTCGCTTA AGATGTCGAC GCCCGGTGCT TTGCAATGGC TGTTGGTAAG TACTCTCCAT GTTACACTTT GAGCGTTGCG TATGTGGTTG AAAGGTTCGA CGCTTACTTC TTTTGAGCAC AGGATGTATT CGACGGAAAG AACAGCTTTG GGAATGTCCT AGTAAGTGCT GGTGCCCCGC TCGTCATGAA CGCTGCCGAA AAGACGAATT TCAAGGAACT GGTCTGGAGT ATTCAAGGAC GACAACGAGA TCTTATGTAT ATCTCGCGCT ATCATGTAGA GGCGATATCC CGTCTGCTCG ATATTGATTG CGATACAGTC GAAGGCGGTA TCAAGGCCCT GAATATGAAC TACTGGTCAA GTCATCCTGT TGAATCTAGA CGCTTGA
|
Protein sequence | MPTRANENQS LQELFDSVVQ LIRETPTPDD SAHQLSTGDK LRLYGLYKHI EASAASDNSD KTAVDEEAAP SIFRVEAYAK YQALKACTGL SREEAMREYI SLLSAQENSL GEICRDWWHS SNSVANDGKS SFAKEVETPI DPMLEEDTKA SKLAESKDVK HTEMNKASHV EPAAVSRPKP SKHFFGVSPL IPRGQLDIKY RDLLYASRHC ATNMIRRSTM PCYQHYERKI KNQWIRGMEG DGQSPQNVVV GLAVRSLLDL YLLSRSFPEG SQIIICPPIN VPGMLRVLRH HRLEVVGVDL PPSDETTRNT TTTISVDIEG IEAAITDKTV AILVVHPFGM VSASNRDFER IKTLADQHKL DVMEDCAEIF TGLGSLSYRG SPQADVVFVS FGLIKTSTAL GGGIAMVKNI KVAETMKRLH FSVYQYQTNA EYFGKVLYAL CIRFVTDFPW MCGMIHQLCV IFRLDFDYFV TSLLRGFGRS PMDSSGTKFD QAIHQFRRRP CAALLALLSY RLQEANQHVP SVLHKKDQCL KITRLLKSTI PNVNLPAPTP YCTNSNWLFP IVSETPGKQS TQLGKLGFDA TQGSSQLCCV SPNCKRAEKL MKALLYLPVC GKKLCNLEMK RLVDGLRTFP CNGDEPVQPG FGCNVDFLLK TRLSLSMALC IFFVFSRDVA FLIRQIHLGI VALGIFLVVC IASSKLLRWL VADYYINSST AVGKYIDLLG QHPDYESHRD ISNHSTIPGL QAVRTSVFQS SPALRLPQCS PCERKVILTG ATGFIGSLVL RDLLLHRKVL GIKKVILICR SKRGISAQAR IDTLLENTAV YGFLDKTEKS DLVKVIEGDV TRPNALLCKT DLYDVRNDGS ISHLIHCAAS VSFTQSLPAA ATANISSPLY LQDLAASLAH EKTHFVHVST AFVHGGLSGT DDEPLSERLF PLGSFDANDL YSSMQSTEFL ASKAMRELRF PNSYTFSKCV CEHLLVKNSK VRTTIFRPSI VGPACEMPFE GWAGERPTTL VAAACLYLSY QWNLWSFGPY RVSCIPVDVV SRFLLSRAFA EGGFRDVVGV YDSSSDEDFE KVSCASSALA DTVYAGDPES SNLPLYTIHN ATWDSASAPS STFTWLDYAS TVTQVGSLFG HFGRATAYIG LILSTQVLSI LNPSLELYTR IHRSIVKAPL LFVETVFAYL RVDASNIRRL LSFIDLPLLF FPFMNTSFHF RSRLVAKDFD GQRYALNCVL AAHVFLATTS ECRRSQNPSK ERNTYPTFFL LGGRIHEPVL SDSWWALTQP RGSLVIRCIG FLAKKVLRLC FNEVSVDLFS FAEAIREAEN TGRPIRIVLT PTHRSVFDFI LLSFLAFSLP ELQVDIPFVA AAEDFRQLPI IGWLCSCARA IFIRRGTGQV DPDSNQQIQA IGIHRHGPVP VMEVFIEGTR SRDGRFAKPK TGVLKCLHQS GIDSLIVPVA ISYEAIPEQH YMEKELVTGA SLKMSTPGAL QWLLDVFDGK NSFGNVLVSA GAPLVMNAAE KTNFKELVWS IQGRQRDLMY ISRYHVEAIS RLLDIDCDTV EGVILLNLDA
|
| |