Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATR_46647 |
Symbol | |
ID | 7204573 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011679 |
Strand | + |
Start bp | 72952 |
End bp | 74401 |
Gene Length | 1450 bp |
Protein Length | 466 aa |
Translation table | |
GC content | 52% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002185620 |
Protein GI | 219120777 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.406601 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CTTATGCTAG TCCAAAACAT ATCTGTTATG AGTCTCGTCG TTCACGATGG CATCTGTACC AACTCACCTA GTAGGAAATA CTGTCATCAC TGATACCAGT TATCAGCCAG TTTCGTCGCT ATTACCTGCA GCGGGAAATC AATTGGCGTC ATTGACTAGC GCCTCGGTGA CGCTGCATAA CGCGGGCGAA CCTCACAGCA TGCATGCAAA TCAGACGAAC ATAGCCACGA GTGCCTCTTC TTTGTCGTGC GATGAGTCAC TTCCCCTCTG TGTACCGTCC AAGAAGCGCA CACATTCCAG TAGGGAAACG GAAGTTACAT CGTCAGCGGA GAGCGAGTCG AGAGCGAGCT CATCTGTAGA ATTTTTTCCC CACACGGCAG AATGTAACCG CAGGATTCCT ACGAGAGGAA TAGATTCCCA GCTAACCGCT CAGGAGCAAA AGCAGAGACA TTATAAGCCG ACGACAATGG TTTTACGAGA CAGTCAACCA CTATCTACGC TTGGCACTAT GCCAGACTTG GGGACAATGG AGCGTCAACT TCACGTCCGT CCAGGAATAA CTCCTGATGT CGCCAAGACT GCCGCCAAGC GTGAGTACAA TCGCCGCAAT GCAGCTCGTG TTCGAATGCG GAATAAAGGA CTCGTGAGCG ATCTTCAGAA AAAAATTGCC AACTTGGCCC AGCACGAGGC CGAGCTCCAA CGAGCCAACG AAGTCTTGCA AGCAAAAGTG GAGGTTTTGG AAAAACAATA TCAGGATTTG CTCCAGGCGC GCAAAATGGA ATACAACGGG GCTATCATAA AGCCTCCTCC GACACAAACC AGTTTGGAGG CTTTGCTTGC TACCGGTCTT CGGCAAGAAA CGAATACTCC GGTAGCGGCT CCGACTCCAC AACTGACGTC TCCTTCGGCA TTGGGAGGCT TGGACGAGAG TATTCTGCAA TTACTGTGGA GGATCGTACT CCAGCAGTAT CAGCAACAGA TGCTGCAAGG TGCTCCTGCA CCCCCTCAGC ACGCCACTTG CGACTCTGTC GAGACCATGA AGCTTCTTCA GAGTCTCCTG GCGAATAGTG CGAATGTGCC TGCTGTGACG AGTTGTCAAA AACCGAGCAA CGATTGGTCG ACGTTAAACC CAGCCATGTC ATCCGGGCTT GCTCCCAAAG TATACCGCAA TGACGCCAGC AACGTTCATA TGCCCTATGC TTACCACCAG ACTCCAGCTC CGGCGACCCA GGTACAGATA CAGGAATCAA TACAAGAGCA ACTAAGAGCG TTGCAAGCTG GAGGTGGCAA AGGTGGTTTC CCATCGTTTG CGCCGGCGCC CATGAGAAAC AGTCAATCTG CGACTTTGCC TGACATACTG CGGGCGCTTT TGCAAAATGG CCAAGCTCCT ACTCCAACGA CACAGACAGA TCCCCATCTG CGGCAGTTCT GGCAGTCTGG GAAATAAATC
|
Protein sequence | MASVPTHLVG NTVITDTSYQ PVSSLLPAAG NQLASLTSAS VTLHNAGEPH SMHANQTNIA TSASSLSCDE SLPLCVPSKK RTHSSRETEV TSSAESESRA SSSVEFFPHT AECNRRIPTR GIDSQLTAQE QKQRHYKPTT MVLRDSQPLS TLGTMPDLGT MERQLHVRPG ITPDVAKTAA KREYNRRNAA RVRMRNKGLV SDLQKKIANL AQHEAELQRA NEVLQAKVEV LEKQYQDLLQ ARKMEYNGAI IKPPPTQTSL EALLATGLRQ ETNTPVAAPT PQLTSPSALG GLDESILQLL WRIVLQQYQQ QMLQGAPAPP QHATCDSVET MKLLQSLLAN SANVPAVTSC QKPSNDWSTL NPAMSSGLAP KVYRNDASNV HMPYAYHQTP APATQVQIQE SIQEQLRALQ AGGGKGGFPS FAPAPMRNSQ SATLPDILRA LLQNGQAPTP TTQTDPHLRQ FWQSGK
|
| |