Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_45529 |
Symbol | |
ID | 7200725 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011675 |
Strand | - |
Start bp | 462545 |
End bp | 466058 |
Gene Length | 3514 bp |
Protein Length | 926 aa |
Translation table | |
GC content | 47% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002179850 |
Protein GI | 219118138 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.733997 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CTACCACTGA GCAGCACAGT GGCAAGGGAA AGAAAATAGA CATTGTACAT GCTTCGCAAA AACTGGTTCA TTGAGTAATT ACAGTGCTTT CTTGTGGACA CGCGCAGTTC CCTTGTCAGA TCAGAATCTC TATACAGACT GTAGATAAAA GTGCCCGACC TGGATGATGT ACAGCAAGAG AACTGGCGCT ACATGGCTAG TGACACTATT GTTTGGTAGT AATATCGGCA ATCTCGTCTC GGCGGAAGTC CCAAATCCCA GCATCCCGCA TGGAATCTAC GGCGGAGCAA CGAGGACGAC TCGTACTGCT GCCACAACCT CCAAAGGTGA TGTATCCACG GCGACGTACG AGAGAGCCGC AGAAGCGCCA GATCAGCCGA GTGTCTTTGT TGGATCATCT TCTTCAGTCA AACCCAAGAA ACACGAAGAT ACTTTGCATG TTTCCAAACG AGATGGGCGC CTGGAACTCT TGGACTCGAC CAAGCTGCTG CAACGATTGA CGGATTTGTC TGATGGCTTG GACATGCGCT TTCTGAATTT GGTCGCCTTG ACGGAATCTA TTGTGCGAGG CATGTATCCC AACGTGACAA CCCACGAAAT TGACGTCTTA GCTGCGGAAA CCGCTGCCAG TCTAGGGACG CAGCATCCGG ACTATGGCCG CCTGGCGGCT CGAATTCTCA TTACACAGAA TCACAAAACG ACTCCGACGT GTTTTTCCGA GGCCATCGAA ACTTTATACA ACTCCGGTAA AGGCTTTATC GACCATAAAG TTGGGGAACT AGTTCGTCGA CGCGGTCCCG AGATTGACTC CCGTATTGTG CACGAGCGTG ATCTTGAAAT GACTTACTTT GGCTACAAAA CGTTGGAACG CGCCTATTTG CTCAAAAAGG ACAACGGCAA CCAAGTCTTG GAGCGGCCTC AGTATCTCAT GATGCGTGTG GCCTTGGGAA TTCATTGTAC AAGTAAGATG AGCTCGCGAT CGGAAGACGA CTGTCTGGAG GCCGCCTTTG AAACGTACGA TCTCATGAGC CGGGGCTTTT TTACCCACGC CTCGCCAACA CTCTTTCACG CGGGTACCAC CCACCCGCAG CTATCGTCCT GTTTTTTGGT GCAAATGAGT GAAGACTCCA TTAATGGCAT CTATGATACA TTAAAGCGCT GCGCTGTGAT ATCCAAAGCC GCCGGCGGTA TCGGGTTGTC GGTGCATAAC ATACGCGCCA GAGGAACGCC CATCCAAGGA ACGCGCGGTG TTTCCAACGG TCTAGTCCCC ATGCTCCGGG TATTTGATGT GACAAGTCGG TATGTTGATC AGGGCGGCGG CAAACGCCCT GGGGCTTTTG CCATTTATCT AGAACCATGG CATGCCGATA TCTTTGATGT TCTAAGTTTG AAAAAGAATC ATGGAAAAGA GGAACAGCGA GCAAGAGATT TGTTCTACGG TCTATGGATT CCAGATCTTT TCATGAAACG AGTCGAAGAA GATGACGTCT GGAGTCTGAT GTGTCCTCAT CAGTGCCCTG GTCTTGCACA TTGTTATGGT GCGAAGTTTG AGGCTCTATA TCAGCACTAC GAGCAGGAAG GGAAATTTGT ACGTCAAGTC CGTGCTCGAG AGCTTTGGGG TGCCATATTA GAATCTCAGA TCGAAACTGG CACTCCATAC ATGCTATACA AAGACACTTG CAACACCAAA AGTAACCAAC AAAATATTGG AACGATTCAA TGCAGTAACT TGTGCACTGA GATTATTCAA TACTCGGATC AAGAAGAAAC CGCGGTATGC AACTTGGCAA GTATTTGCTT GCCGCGATTT GTTGTTTCCG AGCGCGGTAC TTTCGGATCA ATTAGTCCTG AGTCTGGTTC GGCGTTCTTT GATCACGAAG CCCTTCACCG GGCAGCAAAG ATTGTGACGC GCAATTTGAA TTCTATAATT GACGTGAACA GCTATCCAGT AGACGGTGCA AAAACGTCAA ACTTTAAACA TCGTCCAATC GGAATCGGCG TCTCGGGACT GGCTGATGCA TTTCTCCGAT TGGGCTTACC GTTTACGTCT GCGGCAGCAA AAAAGCTTAA TGAAGCTATT TTTGAAACCA TTTATCATGC CGCGCTAGAA GCAAGCGCTG AGCTTGCTGA GAAAGAAGGC CCATACGAAA CATTCGCAGG AAGCCCAGCA AGTCAAGGAA AGCTTCAGTT TAACCTGTGG GGCCTGTCAG ATGACGAAAC GCCAAGTCAC AAGTACGCTT TGGCAAAGGA AACTCCTGTT CCAATTCAGA TGTATCCGAA TAGTAGCAAT GTCTGTGGAT ACGATTGGGA ATCATTGCGT CGGCGAATTG TCAAAACTGG GCTGCGAAAT TCGTTGTTAG TAGCACCTAT GCCTACTGCC TCTACGTCAC AGATCTTGGG CGTCAACGAG TGTTTCGAGC CTTTCAGCTC AAACCTGTAC ATTCGTCGCG TTAAAGCGGG AGAGTTCATT ATGGCGAATC CCCATCTTTT GCAAGATTTG ACTGACTGAG GCTTGTGGAC ACCATCAGTC CGCAACCAAA TGATGCGCGA TGGCGGTTCC GTGCAAAACA TTCCCGATAT CCCCGATCGA TTAAAGGAAC TGTACAAGAC AGTATGGGAG ATCAAAATGA AGGACATCAT TGATATGGGA GCAGATCGGG GCAAATTCAT TGACCAAAGT CAGTCGCTGA ATTTGTTCAT TGCAGATCCG AACATGGACA AACTTACGGC TATGCATTTT TACGCTTGGA AAAAGGGTTT GAAGACTGGA ATGTACTACC TGCGAACAAA ACCGGCTGTA AACGCAATCC AATACACTGT TGAAAAGATA AGTCCTACTG TTATTGTGGG CGGCATCCCA GAAGCTGAGC CCGATGACGT GTGCATAAGT TGTTCTGCTT AGCTAAATCA TAGCGCTAAG ACCATGTACA GTTTACTCAA GTCGTCGCTG CGAAGTACCA ATGAAATATC TTGTTTGGGC TTACTTTGGA GGTCCGTCTG CGTACTGAGG CTGAACGTAC TCTCAAGGCG AATGGCTACA AATTGTAAAG AGATGTGTCG TTTCAGACCG TTTCACAGTG TAAAACTACA TTCGAATCAC TCGCACAGTT GGGGAATTGT TTCAGATTCG AAGCTGCGAT TACCTTAAGA AAAATGTGGA TATTACATGC ACCGACATCT TCACTTTATA ACATATCTCG TTTATTTAGT ACTTCTTTAG ATCAACTTCA AACTTTTTGG CGGCGGCATC GATGCTCCCC ATTTTATTGA CAGTTTTGAT ACCCTTGGTA CTCAGGCGAA GACGGACCTT GCGATTCAAA TCTTCCGAAA AGAATCGCTT CCACTGCAAA TTGACTGCCT GTACCTTCTT GATGCGTCTG TGGGAAAACG TGACGACGCG AGCTTGACGA TTGGGCTTCT TTCCCAACAA GTCGCAAACC CGCGCTCGCA TTACGAGAGT ACTAGCACAA TTCTGACGGT GGATCTGAAA TCGAAAACAA AGAAAGCCAA ATGA
|
Protein sequence | MASDTIVWYI PHGIYGGATR TTRTAATTSK GDVSTATYER AAEAPDQPSV FVGSSSSVKP KKHEDTLHVS KRDGRLELLD STKLLQRLTD LSDGLDMRFL NLVALTESIV RGMYPNVTTH EIDVLAAETA ASLGTQHPDY GRLAARILIT QNHKTTPTCF SEAIETLYNS GKGFIDHKVG ELVRRRGPEI DSRIVHERDL EMTYFGYKTL ERAYLLKKDN GNQVLERPQY LMMRVALGIH CTSKMSSRSE DDCLEAAFET YDLMSRGFFT HASPTLFHAG TTHPQLSSCF LVQMSEDSIN GIYDTLKRCA VISKAAGGIG LSVHNIRARG TPIQGTRGVS NGLVPMLRVF DVTSRYVDQG GGKRPGAFAI YLEPWHADIF DVLSLKKNHG KEEQRARDLF YGLWIPDLFM KRVEEDDVWS LMCPHQCPGL AHCYGAKFEA LYQHYEQEGK FVRQVRAREL WGAILESQIE TGTPYMLYKD TCNTKSNQQN IGTIQCSNLC TEIIQYSDQE ETAVCNLASI CLPRFVVSER GTFGSISPES GSAFFDHEAL HRAAKIVTRN LNSIIDVNSY PVDGAKTSNF KHRPIGIGVS GLADAFLRLG LPFTSAAAKK LNEAIFETIY HAALEASAEL AEKEGPYETF AGSPASQGKL QFNLWGLSDD ETPSHKYALA KETPVPIQMY PNSSNVCGYD WESLRRRIVK TGLRNSLSWA STSVSSLSAQ TFRNQMMRDG GSVQNIPDIP DRLKELYKTV WEIKMKDIID MGADRGKFID QNPNMDKLTA MHFYAWKKGL KTGMYYLRTK PAVNAIQYTV EKISPTVIVG GIPEAEPDDI NFKLFGGGID APHFIDSFDT LGTQAKTDLA IQIFRKESLP LQIDCLYLLD ASVGKRDDAS LTIGLLSQQV ANPRSHYEST STILTVDLKS KTKKAK
|
| |