Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_49353 |
Symbol | |
ID | 7195758 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011690 |
Strand | + |
Start bp | 11090 |
End bp | 12708 |
Gene Length | 1619 bp |
Protein Length | 470 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184045 |
Protein GI | 219127652 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.00242738 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CTTTTCAAGG ACAAGATCGT CAGATGCAAC GCCAGACTTT GCCACAACGT CAAATGAGGT GACTTACGCA GACTGAGAGT AAACTCCTAA AGCTGGTAGA ATGCGGCGAG ATGCTAAAGC TGTTTATCGC AGACCAGCAT CCAAATCTTG GCATGAAATC CTTGTGCTCC TGACAGCGAC GAGTACATTT GTTTGTGCCT TAGTCAATCT AAGCCACCAG TCCTCATATG CTTCTCAACC CTCACATCGT ACCAGTGTTT GTCAATTGGA ATCTGGATAC GATTTCCGCA GCTTACCACG CGTGAGGCGA GGACCACTAG TGCAAATGCC TGGAATGTAC AGCTCAAGGA GTGAAAGTAC CGCAACTGGA CTTACGGGAC CGTTGAAATC CAACGAAACC GAAAATCCTG TCCTCTACAG CGAGACATTT CAAATCCTTT CGGTGGATCT GCCCGAGATA AGCCCATACT ATCGCCACGA CCGACGAGAT AAACTAGGGG CACCAATCAA TGATACCGCT TCCAAAGATC TGAGTCGTCT GCTCGAACAA CGTTTTCAAG CACGGAAAGC CCGTAACTTT GAGCAGCTAA ACCAAGTCGA AGCCGGGACA ACTACGACGT GCGAGTATAC GATCATCCCC CAATCTGGAC AAGGCTAAAG GAACCTCCTA GAGCCCATTT ACGGAGACAG GCGCGCAAAT GGCTCTGGAA GGCCGAAACC TTGTACGGAC CCCGGGGGCA TCCCTTGGTT CAGGTAGGTA ATTTGATGGA TACAGACTCG TTCGTATGCC CTTTGACGAT TTCGCAAATA CATTCGTCAT TGATGCAACG GGAGCTTGGG CGAATGCAGG GGCAATTCGA TCAAGTGAAT GCAATTCGGT TGGAGTTATT AGTCTACGGT GTTCGTGTTC ACGACGACTT TCGCCAGTGG ACGACTGACC CAAATCACGT CTTTCTTGCG AATACTGCTT CGAAACATTC CCCTGCATTT CCACATGCAT ATCAATGCGA TCCGTCTTCG CAATCGTCGA CATCTCTAAC TATAGATGGG AGCGAAGCAG ACCGTCTGAC GCAACGAATC GAGTTCCTGG TACGCACGAG GGCCGCGGCA CTCTTCCGTG GCGACGACAA AAAAGCTCTG TTCATAGCTT GCAAGCTCTA CATGACTTAC GGAGTTGGAG TCAACGATAC AACGAGGACA TGGTCAATTG GGTCTCGGTT CCTGAAAAGT TACGAAAATG AATGGAAGGC TCCGACCATT TCAAAAATCT CTGAAATGAA GGAAAAAGTG TCGTTTACGC ATGAGCTCTT TCAAATGCGT CGCCATTTTG AGAGCCCAAA CTTCAGACGA AGCCAAAATT CGCATTTTTT TCCAAATGCG ATTGTAGAAA AGCGAGTGGC TTCGATGGTA CAGGAGCGTA TTCACAACCG AGAAGAGGGT ATGTTTTTGG AAGCTGATGC CATCCGTCGG GAACTTTGGT CCACTTACGT AAGTGCATTT TTTTATTTCG GTGCTTTTTG ACAACGCCAA TACTGATTTT CCTTGTTATT CGCCCACAGA ATGTTGGAGT CAATGATCGG CTACAGCAGT ACAGCCTGGG AGGAGTATTT GAAATTTAA
|
Protein sequence | MRRDAKAVYR RPASKSWHEI LVLLTATSTF VCALVNLSHQ SSYASQPSHR TSVCQLESGY DFRSLPRVRR GPLVQMPGMY SSRSESTATG LTGPLKSNET ENPVLYSETF QILSVDLPEI SPYYRHDRRD KLGAPINDTA SKDLSRLLEQ PKPSRSRDNY DVRVYDHPPI WTRLKEPPRA HLRRQARKWL WKAETLYGPR GHPLVQVGNL MDTDSFVCPL TISQIHSSLM QRELGRMQGQ FDQVNAIRLE LLVYGVRVHD DFRQWTTDPN HVFLANTASK HSPAFPHAYQ CDPSSQSSTS LTIDGSEADR LTQRIEFLVR TRAAALFRGD DKKALFIACK LYMTYGVGVN DTTRTWSIGS RFLKSYENEW KAPTISKISE MKEKVSFTHE LFQMRRHFES PNFRRSQNSH FFPNAIVEKR VASMVQERIH NREEGMFLEA DAIRRELWST YNVGVNDRLQ QYSLGGVFEI
|
| |