Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_18067 |
Symbol | |
ID | 7197400 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011670 |
Strand | - |
Start bp | 229139 |
End bp | 232949 |
Gene Length | 3811 bp |
Protein Length | 1091 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002177888 |
Protein GI | 219112273 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.437528 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTTCTTGCAT CGCCACAACA AATTGCTTTA AAAGCAACCC ATACGGTTCC CCTCCGTCAT TCATCGTACA CGATCGAGCT TTGGATCGTA CCCTTTTTCC TCTGTATACC CTCACTGCCA GTCAGAAGCC AATAGACTTT ATTCGAAGTC CTCTAAATCC TGCCATAAAT TATCTCACCT CCGACGCTTC ACTTGAGGTG TGACGACCGT TGTGATTCAT ACTTGGTGAC CGACCTACGT CATTGAAACA TTTTCTTAAA AATTGCGTTG ATTCAAGAAT GGACGCGGCT CAGTTAGTTC AGGTGGAAGC CTTGTGCGAA ACGCTCTACA CGGGTGTGGC TGCAAATTTT GATTCCGAAA CGGTCACTCG TAGCGAGGCA CAGCAACGGT TGCTGAGTCT ACAGTCGAAT GCTGAGTACA TTCCGCAATG TCAATACATT CTGGATAATT CCAAATCACA GTATGCCCGT CTGGTAGCAT CGAACAGCCT CATTGAACTT GTAACGATAC ATTGGAATTC CTTCACCGTC CCACAACGTA TAGATATCCG GAACTATGTC TTAGGCTATC TGGCCAACAA CGGACCTTCT CTGCAAGACT TCCTGGTATT GTCGCTGATA AAACTGGTTT GCCGCATCAC CAAATTGGGA TGGTTCGACG ACTCGGCGCA TAGGGAACTA GCCGACGATG TGACCAAGTT TCTTCAAGCA ACTGTAGATC ACTGCATTCT TGGCCTCAAA ATTCTCAATC AACTTGTCGA CGAGCTGAAT ATTCCCACCA GTGGTCGAAC CTTGACGCAA CACCGCAAAA CGTCCGTCTC TTTCCGGGAT GTCTGCTTGC TCAAGGTCTT TCAATTGGGA TTGACCACAC TGAAGCAGTT ACAAACTGGT GCAATAAGTA CGTTGCTTGC AAGACATTCT TTTTTTGTTC AGCTAAAGCA GATCTCATCT CTGTTTCTTA CCACTTTTTT GCTTATGTTT ACCTTAGGCG CTGATCCTCG ACAGCAAGCA ATATTAGGCG AACAGGCGCT ATCGCTGACG GTACGGTGTC TCAACTTTGA CTTCATTGGA ACAAACCCGG ACGAATCGAC CGAAGACGTA GGCACAATTC AAGCACCTAC CTCGTGGCGA CCACTTCTAT CCGACCCGGC TACCACAGAA TTGTTGTTTG AATTCTACGC TAACACGGAG CCCCCTCAAT CATCTAAAGC TATGGAGGCA TTGATACTTT TGTCCTCAGT GCGTCGATCT CTCTTTCCAA CAGACAAGGA CCGGGCCGTT TTTTTGGGCC GCCTCATCAC TGGCATTCGC GAAATGATGT CCAATCGTAC CGGTCTACAA CATCAAGACA ACTACCATCA ATTCTGTCGT CTTCTCGGGC GCTTGAAAGC GAACTACCAA CTCAGTGAGC TTGTCAAGGC GGATGGCTAC CTCGAGTGGT TGGAGTTGGC TGCGACATTT ACTGTGCAAT CGGTACAAAA CTGGCAGTAC AGCACCAATT CGATTCACTA TCTATTGGCA TTGTGGGCAC GTTTAGTCTC AGCAGTACCC TATATTCGTC CTGAAACGGG GGCGCGTGGA CACGTCGCTC ACTTGGAAAA GCAGGTCCTT CTAGTTGCCG AAACTTACAT TGATAGTATG TTAGGTTCGT CGGAAACAGT TATACGCAGC GACGGTGCGC TTGAAGATCC GTTGGACGAT GATGGGTCTT TGAAAGAACA GTTGGATCGG CTCCCAATCA TTTGCCGCTT TCAATACGGT ACAGTTGCCA ACTTGATTCT TAACAAGTTC GACCCGTTGC TGAACTCATA TCAAGACATT GTGGCTAAGC TAGGGTCAAG CGCGACCAAC TCAGCCCCGC CTGATGTGAT GCTGCGCGTC GAGATCATTG AAGGGCAGCT CACGTGGTTG GTTTACATCG TAGGGGCGAT CGTTGGTGGC CATTCGTGGT CGTCTACCCA CATGGCAGAT GGTGAAGAGA CTATCGATGC CAGTCTCAGT AGGCGCGTGT TGCAGTTGGC GCAAGGAATG GAATATAGAC TGACATCGTC AAACGGGGTT GGACGTGCCA ATGCTCGCTT AGAGCATGCC CTTCTATGCT ACTTTCAGAA TTTTCGTCGC GTATACATGT TTATGTGGGA TCAAATGGCC GGGGCAAACT CTTCAAGCCC TATTGACACA AACGGCACGC TATCTGTAGT TGCAATGATG ACATCTAAGC TGGACTCAGG CGGGACTTCG ACGAAGCAGA AAATCTATCT GCGTATGTTT GAGCACTTGG GCATGGGTGA TCATACAGCG GTTGCCAACC TGATTGTCAC CAAGATTGGT AATAACTTAA AATTCTGGCC CGAAGACCAA GATCTCATTG GAAAGACCTT GGACTTGCTA CACGACATGG CTCAAGGGTA TAGTAGTTCA AAACTATTGC TCACCCTGGA GACAGTCCGA TTCCTAGCTC ACCATCACAC CGAAGAGCAC TTCCCCTTCT TGTCGATGCC TGGGAATTCT CGCCAGCGGA CGACTTTCCA CGCAACTTTG ACACGACTAC TACTGTCGCC TTCTGGAGAA GAAAAGTTGG GTTTGACGTT CGAACAATTT CTAGAACCTA TTGTGGTGAA GCTGACACGC TTGGAAGGGC TTTCACCATC CGATCTGCGA CAAGAGCAGT GTAGGCAACC TTTAATAGGA GTCTTTCGAG ACCTTCGTGG CATTGGTGCA AGTCTCCACA ATCGCAAAAC TTACAGTGCG CTTTTTGACA TTATGCACCC GCATCATTTG CCATTGTTGT CCAAAGTTGC TGACGTGTGG TTTGACCAGT CGGATGTCAC GGTCAGCTTA CTTCGCTTTC TGCAAGAGTT TTGTCATAAC AAGGCCAATC GTGTCAACTT TGATCAAAGC AGTCCAAACG GTATTCTCCT TTTTCGCACC GTCAGCGATG TGGTGTGCGC GTACGGAAGT CGCATCTTGT CCTTACCGCC TCCAGTGGCG AACGATCCAG AAGTTTACAA AAAGCGCTTT AAGGGCCTTG CTTTAGCACT GAACGTGCTT AATTCCGCTC TCGGCGGCAA CTACGTTTGT TTCGGCGTTT TTGAATTATA TAACGACCGG GCTCTTGAGA ATTCGTTGGA TGTAGCGTTG CGCTTATGCT TGACGATTCC ACTTGAAGAA ATCAATGCCT ATCCGAAGGT AAGCAAGGCG TACTATGGGT TCATCGAAAT TCTGTTTCGA AACCACCGAA GAACTGCTTT TGCCATGGAT ACGAATATAT TTATGCAGAT CATGGCTTCG GTGCATGATG GATTGCAATC GACTGACGCC ACAATCAGTG CCTGCTGTGC AAACACCATC GACCATATGG CATCATTCTA CTTCACGAAC CAAGGAAAGG ACAAGCTAGA AATGCGAAAC CTTAGCAAGG TATGTGATGT TTGCAAACCT TGTCTTCTCC AGCTAGCTAG CGTTTCTAAG ATTTTTCTTC GATTTTTTTA CAGCACTTGG CAGCCCAGCC CAATCTGTTT TCCAGTCTAA CGATGACACT GTTTAACCTG TTGCTTTACG GTCCACCCCA ACATCATTGG GCAGTGATGC GTCCGATGCT CAGTTTGATG CTTGCCAGTG AATCAGGCTT TGCTGCATAC AAAGATCATC TTCTCAGTAC CCAAGCTCCT GAAAACCAGG CAAAATTGAA CGAAGCTCTG AACAAACTCC TAGCAGATGT CAGTCGAAGT CTTGATAATG CCAACCGTGA TCGATTTACT CAAAAGCTGA CTGCTTTTCG AGTTGCAGCC AGGAGTTTTT TGACGCTGTA G
|
Protein sequence | MDAAQLVQVE ALCETLYTGV AANFDSETVT RSEAQQRLLS LQSNAEYIPQ CQYILDNSKS QYARLVASNS LIELVTIHWN SFTVPQRIDI RNYVLGYLAN NGPSLQDFLV LSLIKLVCRI TKLGWFDDSA HRELADDVTK FLQATVDHCI LGLKILNQLV DELNIPTSGR TLTQHRKTSV SFRDVCLLKV FQLGLTTLKQ LQTGAITILG EQALSLTVRC LNFDFIGTNP DESTEDVGTI QAPTSWRPLL SDPATTELLF EFYANTEPPQ SSKAMEALIL LSSVRRSLFP TDKDRAVFLG RLITGIREMM SNRTGLQHQD NYHQFCRLLG RLKANYQLSE LVKADGYLEW LELAATFTVQ SVQNWQYSTN SIHYLLALWA RLVSAVPYIR PETGARGHVA HLEKQVLLVA ETYIDSMLGS SETVIRSDGA LEDPLDDDGS LKEQLDRLPI ICRFQYGTVA NLILNKFDPL LNSYQDIVAK LGSSATNSAP PDVMLRVEII EGQLTWLVYI VGAIVGGHSW SSTHMADGEE TIDASLSRRV LQLAQGMEYR LTSSNGVGRA NARLEHALLC YFQNFRRLDS GGTSTKQKIY LRMFEHLGMG DHTAVANLIV TKIGNNLKFW PEDQDLIGKT LDLLHDMAQG YSSSKLLLTL ETVRFLAHHH TEEHFPFLSM PGNSRQRTTF HATLTRLLLS PSGEEKLGLT FEQFLEPIVV KLTRLEGLSP SDLRQEQCRQ PLIGVFRDLR GIGASLHNRK TYSALFDIMH PHHLPLLSKV ADVWFDQSDV TVSLLRFLQE FCHNKANRVN FDQSSPNGIL LFRTVSDVVC AYGSRILSLP PPVANDPEVY KKRFKGLALA LNVLNSALGG NYVCFGVFEL YNDRALENSL DVALRLCLTI PLEEINAYPK VSKAYYGFIE ILFRNHRRTA FAMDTNIFMQ IMASVHDGLQ STDATISACC ANTIDHMASF YFTNQGKDKL EMRNLSKVYF SSIFLQHLAA QPNLFSSLTM TLFNLLLYGP PQHHWAVMRP MLSLMLASES GFAAYKDHLL STQAPENQAK LNEALNKLLA DVSRSLDNAN RDRFTQKLTA FRVAARSFLT L
|
| |