Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATR_44029 |
Symbol | |
ID | 7204220 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011671 |
Strand | + |
Start bp | 739332 |
End bp | 741131 |
Gene Length | 1800 bp |
Protein Length | 460 aa |
Translation table | |
GC content | 47% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002186120 |
Protein GI | 219113073 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.492048 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CACTTTGTCA ACGAAAGCAT GCAAATCAGA AATTGCCTGC TACATCATCT ACCCATATTT TCAAAATTTT CATTTTCCTA TTACCAAATT GCCGTGCACT TTACACAGGA TTCTGACAAT AAAACCATGT CTTTTCCTTC AATCGAACTT GAGACCCGAC AAAGCAAACA GCGGGTCACG AGGGCGGCAA AGACGAATGA AGTCAATTCC AACGCCCCTA TATTCTTGCG GGTAAGCGAA TGAAGGGAGT AACCATGTCC ACAACCTTCC AATCACTCAA TGATTCTCAA TTCGCTGATA CAGAAAACGT TCACTATGAT CGATAGTTGC GATCCCTACG TTGCGGCATG GTAAGTCTGG TAACAGCAAA TATTGAAGAC TTCATTATTT GAATCTGCCA TGGGTGTGGA TTCGGCAAAA GGAGTTTCTA CCAACCCGCG ACCGCTTGTC TGTAGATTGC AAGCTACGAT TTGCTCGATA TATGTGTGTG ATGAAAACCA GTGTTAGCGC AGACGCCAAC AATATTTCTG CCTAACTGGA ATTCCAAAAC GCAGCGTGAT CTGGGACGCA ATTTTCTTCC AAAGAAATCT CAGAAAATGT TTGTGGAGAA AGTTTTCACC GACAAGATCA CGTACAGTCA GTCCCTAAAC CTTACTCTTT CATCTTTTCA AAAATAGGAG CGACGATGGA TACACTTTTG TGGTCAAAGA CACTGAGAAG TTTGCTTCGG AGGTCATCCC GGAGTTTTTC AAGCACAATA ACTTCTCATC ATTTGTTCGC CAGCTCAATT TCTACGGTTT CCGAAAAATC AAGTCTGATC CCCTTCGCAT AAAGGATGCC GAAACAAATG AGGAATCGAG GTTCTGGAAG TTCCGTCACG AAAAGTTCCA GCGAGGGCGT CCCGACCTTC TCGGAGAAAT TCGGAAGTCG AATCATAATG AATCTGCGGA TAAACGTGAA GTGGAACACC TCAAGAACGA GGTTGACCAT CTTAGGAGCA AACTTGCCAC AATGTCCAGT GACCTGGAGC AGCTCACGGG TGTTGTGGGT ACACTTATGA AAAACTGTCA ACTACATGAT ATTGACTCGA AGAAGCGCAA GATTACGCAA GGCCCTGATC CAGTCCTTAG TTGGCATAAA ATGGAACACG GCACTCCCGA CCTTTCTTCC TTGGAACCAA TGCCCGTGGG TTCCCTGTCA TATGAAGCTG CACTTTTCGA GGATCTCGCA AAGGATCCCA CGATCGATCC GTTTGCCAGT GCCATACATA ATTCTGAACA ATGCGAATAC TTTCCTCGTT CGGTTTCCTT GGAGGGACAC GAGTCACAGG ACGATGAGGC GATGGCTTCT CTTCTTGCTC TCGACCCAGT TGACGAAATT AAAATCTTAC AGAATCCTGA TAGTTCGGGT ATCGGGGTGG AGTTATCTGA AGCTATCAAG CCGGCTGCAA CAGGAACTGA CCCACATCTT ATTGAGAAGC TTCAAATATC CCTGGGAAAT CTACCGAAGG ATATGCAAGA GCTGTTTGTC GACCGCGTGG TTTCCTTTGC AGCAAATCCG GAATGCTTCC AGCGGCAGAT CGACGCAATG ACCTCGTTGG CTACTTCTGC GGCCGACGAA GCCCAACGGC GCTTAATTGC CGCAGGGAAG AGCCCGAGCG ACCCCAAGTG TGTTCCTTTG GCTTCAGCAG TATTGGGGGC TTACCTTACT CGATTTGCGA CGCTTCCTGC GTCCGATCTG CAGTCTATGG AAGCCTCCAG CCATGTGCCG TGCTCGGGTT CTCCTTTCAC ACACATTTGA
|
Protein sequence | MQIRNCLLHH LPIFSKFSFS YYQIAVHFTQ DSDNKTMSFP SIELETRQSK QRVTRAAKTN EVNSNAPIFL RKTFTMIDSC DPYVAAWSDD GYTFVVKDTE KFASEVIPEF FKHNNFSSFV RQLNFYGFRK IKSDPLRIKD AETNEESRFW KFRHEKFQRG RPDLLGEIRK SNHNESADKR EVEHLKNEVD HLRSKLATMS SDLEQLTGVV GTLMKNCQLH DIDSKKRKIT QGPDPVLSWH KMEHGTPDLS SLEPMPVGSL SYEAALFEDL AKDPTIDPFA SAIHNSEQCE YFPRSVSLEG HESQDDEAMA SLLALDPVDE IKILQNPDSS GIGVELSEAI KPAATGTDPH LIEKLQISLG NLPKDMQELF VDRVVSFAAN PECFQRQIDA MTSLATSAAD EAQRRLIAAG KSPSDPKCVP LASAVLGAYL TRFATLPASD LQSMEASSHV PCSGSPFTHI
|
| |