Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_45165 |
Symbol | |
ID | 7200209 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011674 |
Strand | - |
Start bp | 376679 |
End bp | 380104 |
Gene Length | 3426 bp |
Protein Length | 381 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002179414 |
Protein GI | 219117239 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.00000792411 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CCAGGACTAT GAAACACTTC GACCGTGCTT AGGTTGGGTC TCTGCCGAAA CCGTCCGAAA AACCCTCATG GTCACCACGC AGCATGCCCG AGAAGTCTAC AACGTCCCGC TACGCAAGCA TTTCAAGTCA CGTTTCCCAG CCTTAGACGT GCATCGTCGC AACGAGCCCG TCGCCACTGA TACAATCTGG TCTGACACAC CCGCTGTGGA TAATGGCGCC AAGTTTGCAC AACTCTTTGT CGGCCACCGA TCTCTTGTCA CTGATGTGTA CCCGATGAAA ACCGATAAAG AATTTGTCAA CACCCTCGAA GATCACATTA GATACCGTGG CGCAATGGAT AAACTGATCA GTGATCGTGC TCAGGTTGAA ATCAGCAAAA AGGTCACTGA CATTACACGG GCTTATAACA TCGACCAATG GCAGAGCGAA CCCAACCATC AGCATCAGAA TTTTGCCGAA CGTCGTATTG CCACTATTGA AGCAAACACC AACAATATTC TCAATCGTAC CGGTGCCCCT GATTCCACTT GGCTTCTTTG CGTCACGTAC GTTTGTTGCG TTTTTAACCA TTTGGCGCAT GAGTCCCTCG ACAACCGTAC ACCCCTCGAG ATCTTATCTG GTTCCACACC TGATATCAGT GTTCTCCTTC AGTTTCATTT TTGGGAACCC ATTTATTATC GCCTCGAAGA TGCGACATTC CCTTCTGATG GTACTGAGCA ACGAGGACAT TTTGTTGGCA TCGCGGATTC CGTGGGAGAT GCACTTACTT ATAAGATCCT CACTGACGGC ACCAACAAAA TTCTGTACCG TTCTAGTGTT CGTTCTGCAA CCATCCCAGG AGAAACCAAC CTACGCCTTA CACCACAGGA TGGGGAGCGT GGTCCCAAGC CCATTAACTT TATCAAGTCG CGTAGAACCG AAAATCAAAA TTCCTATGCC ATAAAGGAGT TGCCTGGTTT CACACCCGAT GACCTCATTG GTCGTACGTT CCTCACCGAT ACTCGTGATG ATGGGGAGCG TCTGCGGGCA CGAATCACTC GAAAAATACT TGATCCAGAC AAACCTTCGG ATATCAAGTT TCTCGTCGAA ATCAACGATG GCGAACACGA CGAAATTCTA GCTTACAACG AAATCCTAGA CAAAATCGAG ACGAATCTAG ATCAAGAACT TCATGATGTG GATCGACAGT GGCGTTTTAA AGACATTGTT GCACATGAAG GCCCACTCTT ACCACGTGAC AAGAACTACA AGGGATCTAG ATACAACGTA CTTGTCAATT GGGAGACTGG GGAGTCCACT TATGAACCAC TAGATGTTAT CGGTGCTGAT GATCCGGTTA CCTGTGCTAT CTATGCCAAG AGTCAAGGGT TACTAGATAC ACCTGGGTGG AAGCAGTTCA AGCGGTTGGC GTCTAGAGAT AAACAGGTCC AGAGGTTTGT AAACCAAGCA CGCCAGCATT ATAGCAAGGC TACACCGGTT CACAAATTTG GATACCAGAT CCCTCGCGAC CACAAGGATG CAATTGATAT TGACAAGTCT AATGGCAACA GTAAGTGGCA AGACGCGGAG AGCACGGAAA GAGCTCAGTT ACACAAATAT GATACTTTTG TTGACATTGG TAAGGCCATC ACAATTGGTA AGACTATAAC AAATGCACCC ATGGGCTACA AGAAGATTCG TGTTCATACC GTGTACGATG TGAAACATGA CGGCAGACAC AAGGCCAGGA TGGTTGCTGG AGGGCATCTA ACACCTGTCC CAACCGAAAG TGTGTACTCC GGAGTGGTCT CGCTGCGTAG CCTTCGAATC GTTGTCTTCC TTGCCGAATT AAACGGACTC AAGTTATGGG GAGCCGACAT AGGCAACGCG TACCTCGAGG CCAAGACCAG GGAGAAGGTA TACATTGTTG CAGGACCTGA GTTTGCCGAA CTTGAAGGAC ATGTGTTGAT CATTAACAAG GCTTTGTATG GATTACGGAG CAGTGGCTTA CGTTGGCATG AACGTTTCGC GGACACCCTA CGCGATCTCG GTTTTATCTC TAGCAAGGCG GATCCTGATG TTTGGATGAG AGAAGATACT GGTGTCTATG AATACATTGC AGTTTATGTG GACAATATTG CTGTTGCCGC CCATAACCCC GAGGGAATTA TAAACCAGCT CAAGGAAAGA TACAAATACA AACTCAAGGG TGTGGGTCCA TTGGTGTACC ATCTTGGGTG CACTTTCGAA CGAGACAAGG ATGGCACCCA TTCTTACCAC CCAAGGAAGT ATATATCTCG TATGATGGAA CAATATGAAC GAATGTACCA GGAGCACCCG AAAAGTACGT ATCTCCCTTG GAAAAAGGTG ACCACCCAGA CTTGGATTCA ACACCGGAAC TCGACATGAA TGGTATCAAG CAGTACCAAT CCCTCATTGG TAGTCTGCAG TGGTTGATCA CGCTTGGACG TTTCGACATA GCTACAGCGG TCATGAGTAT GTCTAGATTC AGGGTTGCAC CTCGTGAGGG CCACCTTGAT CGGCTTAAAA GGATGTATGG TTATATCAAG AAGATGAAGA ATGGGGCAAT CAGAATTCGA ACCGACGAAC CCGATTACTC AGGGCTTCCA GACAACTCTC GCGACTGGGC AACTTCTGTT TACGGGAACG TTAAGGAACT ACTTCCTCAA AACGCTCCGA CGCCCCTTGG AAGACATGTT AAGTTGACCA CCTATGTCGA CGCGAACCTT TACCACGATA TGATAACAGG ACGCTCAGTC ACAGGGGTGT TGCATCTCAT CAACCAGACT CCAATTGAGT GGTACTCCAA ACGTCAGGCT ACAGTTGAGA CTGCTACGTA CGGTTCTGAA TTTGTTGCGG CAAGGATTGC GGTCGAACAG ATTATGGACA TCCGTACCAC TCTACGTTTC CTTGGAGTAC CCATCCTAGG CAAGTCCATA CTGTTTGGTG ATAACCAATC AGTTATCATC AGCTCCACGG AGCCACAGTC ACCAATCAAC AAGCGACATA ATGCACTGTC TTATCACAGG GTACGAGAGG CCATTGCCGC CGGTATTGTT GACTTCCAGA AAGTCTTGGG CGCCGAGAAC ATTGCCAATA TACTCAGTAA GCATTGGGGT TTCCAACAGG CATGGCCTGT TCTAAAGCCT ATGCTGTTCT GGCAAGGAGA TACTTCAAAA TGTGAGGTAA AGTCTTCCAA TTCATCAAAG CAGGCCGGTG GGGAGTGTCA CGATATACAC CAAGGCGACA TGTCTCTGGG ACAAGAGACG CCTAACACTG TCTATGGAAG TGTGAGTCAA GGTGGGAATC GTGGTGGGAT TGTTTCCTTT ATGTCGGCCT TATTGTACGA CCCGATGTAT ACGTGGTATG ACCAGAGTCG TCCCACATCG GAAGCCACGG AAATTGAACC CAAGGATGGA GAATAG
|
Protein sequence | MVTTQHAREV YNVPLRKHFK SRFPALDVHR RNEPVATDTI WSDTPAVDNG AKFAQLFVGH RSLVTDVYPM KTDKEFVNTL EDHIRYRGAM DKLISDRAQV EISKKVTDIT RAYNIDQWQS EPNHQHQNFA ERRIATIEAN TNNILNRTGA PDSTWLLCVT YVCCVFNHLA HESLDNRTPL EILSGSTPDI SVLLQFHFWE PIYYRLEDAT FPSDGTEQRG HFVGIADSVG DALTYKILTD GTNKILYRSS VRSATIPGET NLRLTPQDGE RGPKPINFIK SRRTENQNSY AIKELPGFTP DDLIGRTFLT DTRDDGERLR ARITRKILDP DKPSDINVSQ GGNRGGIVSF MSALLYDPMY TWYDQSRPTS EATEIEPKDG E
|
| |