Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_37484 |
Symbol | |
ID | 7202482 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011681 |
Strand | - |
Start bp | 249194 |
End bp | 252136 |
Gene Length | 2943 bp |
Protein Length | 929 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181687 |
Protein GI | 219122717 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTTGCGG ACGAGCTCCA AAAGAATGCC GCCCAACAAA GGGCTGCCGA AAACCGCCAA CGGCGCTTGA AAATCAAACA GCAACAAGCG AAAGCTTCGT TCGCATCCAC TTCACCGACA AATGCTTTCA AAGCCAAAGA AACAAGTACA GCGACGTCCG TTACTATTAC GGATGCGTCG ACGCTACCGG TTAAGACGGA AACAAACCAG ACAATCCCCT TTTCCTCCGG GACTTCGATT TCTGCCGTGA CCAAGGCATC GCAGGAACGC GCCCGACGAG CTAACCAAGT GAGAGAACAC CAGCGTGCCG TCCAAATTCA GAGCAGGGCT CGGGGATGCA TCACTCGCGA TCGAATGCAA CGAAATATTC GTGCCGATCT CGTCAGTAAA ATGTACGACT TGAGGTCCGT CCGCGATCTA CTTTCGCGGT CCCAGAACAT CACTACGTAT ATCCCGCCTC CGGCTACAAC AACGACACTA GTCCGCAGCC TCTGGTTCAT CACATCCCGT CGTGAAAGTC AGGGCCATAC TTCTGTTCCG TGGGGAAGTA GGCGAGTGAT CGTGTGGAAC GATCGCCAAG ATGTCATCTT GCTGTCGCAG GTTTTACAGT ATGCCGTAAC TCCGGGCTTG CAAAGCCCGG ACGAAAACGT CAACCCTTTT GCATGCTGGA ATTCTTCGGA AGAGGGAAAT TACCGGATGA AGTTTGTGAT GAGGATCATT CTAGTGGCCC TGGTTGATCC AAGTGTGGAG CCGTTGGTTG GGAATGACGT GTTTGATGCC TGTCAACAGT GTTTGAAGGC TGTCATGGCC GTACCGCGAT CGATTTCTTC TCTGCAGGCT TTGTCAGATT GCCGCGTTAC AGTTTTTCGA ACATCTTGGG ATTGGTTGTT GCCAAACCAA ACGGAGGCAG GGCCCCGAAC ACGGCTTACC ACTACGACCA CAGCCTCGCA ACTTCATCAA CCTTGTGCAT ATTTCTCATC CCCCTTGGAT ATGCTGTCAA TTTGGCGACA CTATTTACTC TTTACAGTTG CCGGTCCCAA ACCTATTCCA CCCATGCTGG ACTTGAATCG AGAAACCTGC GTGTCTCACT TACGTAAACA ACGTATAGGA GTGTGGATCC GTAACGTTTG TGATGCAGTG GAACAAGCGA GTGATCACGA TGCAAGGGGT GACTTTTTGA TGATTCGTCT CGTTCGCGAG ATTTATACTA TTCCGCTCTT GACGTGGAAA GTATCAAATG AGCTCTTGGC ATACTGGGTC ACATCGTCGG GGACAAAAGG TTGTCCATTC GTGTCCGCCT TGCGACTTTT TGGCGATAAA GGCGACGGCT TGCTTCAAAA TGACGGCGTC GAAAATTTGT TTCCGTTCGA CGATGTGCCC ATGACGGTAT GTCCTGCCAC GCCGACACAA TGCTTCTTAG CCAATGTGGT CCAGCTAGGC CTTCTATGTC CTACGCTGAA TGGTACTGAC AGTCTTAAGC TGGACTTTGG AGCTGCCGTG GTGTTTTTTA ATCTCATTAC AGTTTTGGTA CAAGCCATTC CTTTAGCTAC ATTCTCCTCC CGTGACTCGG CCGTTGTCTG GGTTGACGGA GTAAATGGTC ATACGATCCC AATTGTCTTG TCGAAGGTCA TTCAAGACCA GTGCAGGGCT ATGTATGGTG GATTCCTTTG TCCGTCGCAT CTTTCAAATC GCCCTGGATC CGCAAGTACT TGGTACAGAA AACATCTTGT CGACCAAGAG CGATAAGGAT TTAAAACACG AGAAGGACAT GCTGGAGGTG GGAAGCTCGT CGGCAGCAAG TTTGGCAGCC AAAGAAGCAC GTGTGGACCG CAATAAGAGC TTTTGGAACT CTTCAAAATG GGCGCGGAAG CTAACAAAGG GCATGTCAAG CCTGCTGGTC GGTGAGGACG CCAAAAAGCG AGCGGCCAAG AATTTGAAAC CGTCGTCTTT AAGGAATCAG TCTACGGTTT CGCGCAACCT AGCGGAAGGG GTGGAAGGTG ATTGTAGCGA TATCGGGACC ATTTTCTCAA CTAACATTGT CCCACGATCC GACTATACAG TGACATTTTT GTTTTGCTTG TGTCGCTGCT TCTCTGTTGT CGTGGCACGA TGGGGTGGCG CTGGGAACGA TGATATGCTT CTTAGTCAGA ACAGGGAATC CGCCAAGGGC GAATTCCCCA AGGCTTGCAT AAGAGCGGAA CCATTCGTTG TTATTATTTT GAACGCCCTG TGCTTTTCTA CGCCATTTGT GCAATGTGCG TGGGGAATCA TGCAGTCAGA CCGTCGGATT GCATCCCAAA TTCACAGCAT TGTCGAGATG GATGAGGGCA AATCTTTCAT ACGGTGTTTG GACATGCAGA CTGGCCTTAC TGGGATATCT AGTGGAATTA GCGACTTAGA CGGGGCAGCG TTGCTGTTTA TGTTTGCTGT AGTGTTGTCA CACACCTTGA TTATTACAGA CGACGTTGAA ATCCACGACA TGGACCGCCC TTTGCCCAAA CATCAACTAA GACGTGTCAT TCAGCTTCTC AAAAAGCTCT TGTATCGGGC TTGCTGCATC GATTCGACAA GTCTTTCTGT TCATTCTAAT TACTTTGGAG TAGCTCTGAT ATCGGCCTCG TCAAGGGCCA TGCGTGATTT GTATGATCGT TCAAGTCGGA GGCCAATCTG TGTACCCAAG CTATGGCTGC TGCCAAACCT GTTGGAAAAG GATCTCTCGA AATGCAACTG CCATGCCGAA TATGTAGCGT TGCTCTCGAC ACCCGTGTTG CGTATGTGTC CGTTTCTTGT TTCTTTTAAG CGAAGACTTA AACTCTTTGA ACGAATCGTG ACTACGAATC GAGTAGAAAT TCAAGGAGAG GTAAGTTACT GACTGTGAAT ATCTCTTTAC CTTGCAAGCA GAAATGTCTG ACAACGGTAA GGGCCTTCAA TATACTGTTT TAG
|
Protein sequence | MFADELQKNA AQQRAAENRQ RRLKIKQQQA KASFASTSPT NAFKAKETST ATSVTITDAS TLPVKTETNQ TIPFSSGTSI SAVTKASQER ARRANQVREH QRAVQIQSRA RGCITRDRMQ RNIRADLVSK MYDLRSVRDL LSRSQNITTY IPPPATTTTL VRSLWFITSR RESQGHTSVP WGSRRVIVWN DRQDVILLSQ VLQYAVTPGL QSPDENVNPF ACWNSSEEGN YRMKFVMRII LVALVDPSVE PLVGNDVFDA CQQCLKAVMA VPRSISSLQA LSDCRVTVFR TSWDWLLPNQ TEAGPRTRLT TTTTASQLHQ PCAYFSSPLD MLSIWRHYLL FTVAGPKPIP PMLDLNRETC VSHLRKQRIG VWIRNVCDAV EQASDHDARG DFLMIRLVRE IYTIPLLTWK VSNELLAYWV TSSGTKGCPF VSALRLFGDK GDGLLQNDGV ENLFPFDDVP MTVCPATPTQ CFLANVVQLG LLCPTLNGTD SLKLDFGAAV VFFNLITVLV QAIPLATFSS RDSAVVWVDG VNGHTIPIVL SKSDKDLKHE KDMLEVGSSS AASLAAKEAR VDRNKSFWNS SKWARKLTKG MSSLLVGEDA KKRAAKNLKP SSLRNQSTVS RNLAEGVEGD CSDIGTIFST NIVPRSDYTV TFLFCLCRCF SVVVARWGGA GNDDMLLSQN RESAKGEFPK ACIRAEPFVV IILNALCFST PFVQCAWGIM QSDRRIASQI HSIVEMDEGK SFIRCLDMQT GLTGISSGIS DLDGAALLFM FAVVLSHTLI ITDDVEIHDM DRPLPKHQLR RVIQLLKKLL YRACCIDSTS LSVHSNYFGV ALISASSRAM RDLYDRSSRR PICVPKLWLL PNLLEKDLSK CNCHAEYVAL LSTPVLRMCP FLVSFKRRLK LFERIVTTNR VEIQGEKCLT TVRAFNILF
|
| |