Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATR_44231 |
Symbol | |
ID | 7204064 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011671 |
Strand | + |
Start bp | 1374525 |
End bp | 1377250 |
Gene Length | 2726 bp |
Protein Length | 814 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002186241 |
Protein GI | 219113315 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.253188 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ACGGTTGTCA CGGAAGTTCA ACCCGGATAA AATGACGACA GCGACAGAGA AAGCGCATGA TCTCGCTTTC CATTTCTAGA ACCAACCGCG TAATCAGAAT TGCATAATGC CCGCAATTTA CTTGTTTCAC CGAAGAACGA TCGTAGGAGG TGATGACCTC CAACCGGCGG CATTCCTCAC GGCTTCTCTA CGCTGTGTTC AACTTGTTTG TCTGCTAGGT CCAATCTTGG CGCACATCCA TGACGAAAGC TCGAGAAATG GCGGCCTTTT TCACTACATC ATGTACGACC CAAGCGACGA TGTTTCATGT CGACACTCGA ACCTTTTTCC TACGCTTTTG ACGCTATACG CCGTTGCGTC GGTTGTTTAC TCATTTGCTT CCATTGCGTT GGAGTGGAGA CTCGCGCACT GGTCAAGCAT AGGGTCACCA ACCGAGACAG AGCCACGTAG CTCCAAAGTC CGTCACCTTT TGGAACTCAA ACTCGTTCCG TTTTCGATCC TTTTGCTTCT CGTTTGGATG TCAGGATTGA GCGCCGTCGC CTTTGCCCCG CTTTACAACT ACTGCGTCGA CCTAACCCAG CAGAACAACT CTGAGATTCA ACGAGAGATG GCAGATGATA TTGACAAATA TTCGGTTGAG GTGTTTACCC CGCTCCGAGT GCATTTATGG TGGCTAGCGC TTGCAATCCT GCTGGTCGGT CAGCTTTCCG AAGTACTCGT GTCGTTCGCC TTTTTATGGC ATTTGTGTAA GCAACCTATG CAGGCTCATA TACTCGAAAA TCCAGAACTT ATGGAAGTAA CGCTACCGTC CCATCGCATG ATCGAAGAAA TGTGGGCTGA TCGATGCGCT GCGGCCTGTC AGTGTTTAGG CTCCGCCAGT TGCTTTATGT TTGGTGGTCG GGAGCTTTTG GGCCAGGCCG AGTTTGGGGA TGTCGCTCGG GCCTTGGCGG ACTATTTGGA TACGGGAGGG ATACTGGACG TTGTGCCCTC CGACATTGTA ACGGGATTCA TGCTTTTACA ACGGATACAA CGACAACGAA TATATAGGGC GCGAGAAGAA GTACTGCAGC ATGTGGAAGT GGCGGCGCGC CTGGAGGGAC AAAGTGTACA GGATAACGCC GGCGACGAGC TCATGGTACC ACTGGACTCC CTACAATCTA CCAGCTACCT GCCATCATCA TCGACAACGT GTCGTGTCGG TGGGAGACAA TCCTCAGTGT TTCGATTTGA TCCTGATGGG ACCTACGAGC GGCGAAATCG AGCCTTATTT GAAAGGCACA ACGTTGACGA AATGAGTGTG CTCGAAGAAG GAACCCGTTA CGCAAAGTAT GCACTGGCAA TCTACACTTG GGTTCTTTAT TTGTACGTCC ACCCATGTTC AGGAATACCT AGGCTTTTCG CTAAGTCTGG GCGTCTCTGT TGTCGCTCCT CCAAAACCGA TCGAAGGGGA GAATCGTCTG TAATACCCCA GCTTGCAGCA AACCTGATCG ATCAACATGG TCGTATTGAA GGTGACAATC TATGCGAAAC CAATAAAGCT GCGCTTTTGC TCACAGTGGG ACTGATGGAA GCCGATTTGA TCTACGCTCA GCTGCGAAGC GGCTTTGCTG ACACTCCTTA CGCAATTCTT GTCGACCATG GTAGGTAAAC AAGTGCCATT TGTACAAAGT GTGACTAGCT CCGAACTGGT GACTAACTCT GTATGCTTTC CCAGAATGGA AGTCGATCGT TGTATCGATA CGAGGGACTT TTAGCCTTGA GGACTGCGTA ACGGACGTCC TCATTGATCC GGAACCGCTT GAACAGCTTG GAGTCGACTT CGGTTTCGAC GCTAAGGATC AATACTGCCA TGGTGGAGTA TTGACATGTG TCCGTAATGT ATACCGCGAC TTGCAGCGTC ACGGCATACT GGACCGACTC CTTCTAGGGG AGCATGCTCG CTTCCCAGAG TATCGGCTGC GATTGGTGGG GCACAGCCTG GGAGCATCAA CATGTACACT TCTATCGTAC ATGTTGCGGG GAAAGTTTGC ATCGATTCGA TGCGTCAACT ACAGTCCGCC GGGTCACAGT CTGACATGGA ATCTTGCAGT GAGTTGTCAT GAATGGTGCA ACTCCTTTGT TTTAGATTCA GATCTTGTTC CTCGTCTCTC ATTCAATGCA ATGGAAATTC TGAGAAATGA GATTCTTTCT CTGATTGGTC GTATCAAGGT ACCCAAGATC GAAGTCGCGA GTAGAGTTGT GAGTGGATCT GGACTTTCCA ATTGTCGTTT TTGTCTGGAT CAAGATCCCG ACGAGCACGC CAATATACTC GAAGATATCA ATGAGATGCT CTATGCTCCA ACGGAACTAC CCGAGTCTGA GTACCAACAT CAACTTGAAA GGTTTCAAAC CGTACAGGAA GAACGACGAC GAAGTCGTGG ACATTTACGC TCACTGCAAC TGTATCCTCC TGGTAAATTG GTGCACTTGG TCAAAATTGG CGAAAGGAAA TCGTGTCTTC ATGGTCTTGC TAAGTGCTTA ACATGTTGTA CAACAAATGC AGGTTCAAAG TATCAGCCTG TCTGGATAGG CAATGACGAT CTGAACGAGA TTGTGGTGAG CCCTACCATG GCAACGGACC ATTTCCCTAA TCGACTTTGT GATTTGCTGC AAACAGTTGC TCGGGAGTAT AAGGTCAAAA CAAGTTGATA AGGTTTTATC GATAAAACAG CCATCTAAAC TGTCTC
|
Protein sequence | MPAIYLFHRR TIVGGDDLQP AAFLTASLRC VQLVCLLGPI LAHIHDESSR NGGLFHYIMY DPSDDVSCRH SNLFPTLLTL YAVASVVYSF ASIALEWRLA HWSSIGSPTE TEPRSSKVRH LLELKLVPFS ILLLLVWMSG LSAVAFAPLY NYCVDLTQQN NSEIQREMAD DIDKYSVEVF TPLRVHLWWL ALAILLVGQL SEVLVSFAFL WHLCKQPMQA HILENPELME VTLPSHRMIE EMWADRCAAA CQCLGSASCF MFGGRELLGQ AEFGDVARAL ADYLDTGGIL DVVPSDIVTG FMLLQRIQRQ RIYRAREEVL QHVEVAARLE GQSVQDNAGD ELMVPLDSLQ STSYLPSSST TCRVGGRQSS VFRFDPDGTY ERRNRALFER HNVDEMSVLE EGTRYAKYAL AIYTWVLYLY VHPCSGIPRL FAKSGRLCCR SSKTDRRGES SVIPQLAANL IDQHGRIEGD NLCETNKAAL LLTVGLMEAD LIYAQLRSGF ADTPYAILVD HEWKSIVVSI RGTFSLEDCV TDVLIDPEPL EQLGVDFGFD AKDQYCHGGV LTCVRNVYRD LQRHGILDRL LLGEHARFPE YRLRLVGHSL GASTCTLLSY MLRGKFASIR CVNYSPPDSD LVPRLSFNAM EILRNEILSL IGRIKVPKIE VASRVVSGSG LSNCRFCLDQ DPDEHANILE DINEMLYAPT ELPESEYQHQ LERFQTVQEE RRRSRGHLRS LQLYPPGKLV HLVKIGERKS CLHGLAKCLT CCTTNAGSKY QPVWIGNDDL NEIVVSPTMA TDHFPNRLCD LLQTVAREYK VKTS
|
| |