Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_38086 |
Symbol | |
ID | 7202946 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011683 |
Strand | + |
Start bp | 31993 |
End bp | 33969 |
Gene Length | 1977 bp |
Protein Length | 658 aa |
Translation table | |
GC content | 55% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002182147 |
Protein GI | 219123678 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTCTTTT ATTCCTGGTT TGGATCAACG GCGGCTCGAG CGCACCGTCA CTTTCCCGCA CTACCGGAAC CACTAGACGA CGCCGTAGCC ATACACGACG AGCACCAATC CTCTTCCGGT TCCTGCAAGG ACGACCCGCC AGATATTCCG GTACGGAAGA ATGACGAAAC GTGTGTACGT ACCGCCGTCC ATTCGATCAC GATTCCGAAA TGGGACGCTC CGCTCCAATC GGCAGCCCAC GAACCGTCAC TGCACAATCT CGACAATCCG GACCAAAGTG CGCACCAAAC AGACAATAGC GTCACCTCTT GGTGCCGTCA GCACGTACCT TCCGGTGCTT CTTGGAACGG TAGGGCTAGT TGCCGTCACC AGCAGTACAT CACCAATCGA CAGCAACTCC TAACGGAAGA CGAAGAAACG GATTCGTTGT TTCCTTTACA CACGATTCCG TTCGTCTCTA TTGAACTGCG CTTGGAACCC ACCGATCACA GTGACGACGA TGACGACGAC GATGACCTCT TTGGATGTCC TCCCCACGGA CCCCGGACTC GTCAAGCCTA CACGTCATGG GACGCGCCAG ATATTGCTGC AGACGTCGTC TCGGCGACAC CATCCGATCC ACCCCAACGC ACGGGTCAAA TTCGAATAGG CGGAACAAAG TCGATAGAAT TTCCGGACGC TCTGCCAAGT CGTCTTGACG AGACTGTCCA TTTATACAAC GCGAGTCAAG GGTGCCATGC CGACTCACCG CCCGCGCAAT TCGTCTCGAC CATCCCGGCG CATTTTGCAG TCCCCGTCGA AATTACTGTC GTTCCGAACC ACGATCCAGG AAATTGCAGA AAGGATCAGG CCAGCTACAA TGTCTGCTGG AGCTCCCTTA TCGAGGACAA CGAATCGGAT AACCGGTTGG CGGATTGCTC TCTGATGAAC GATCCAACGG AGCCCTTGCG GTATCCATGG ACGAACCTCC CGACACAAAG ATTCAGCGAC CGCAGCACTC GCAACAAAGA GAGTCTTGTC CCGCCACTGC CCGATCCTCT CGATCGATGC GCCGCCGAAT TCTACAGCTT GTTGCTGCGG CACCTCGAAA CCAAACGTGT GGCCGATCAA GCCGCCACTC AGGACTTGCT TGCCTTTTTA CGGAAATATC CCTTGGTTGG CCAAGTACAC TTCCGCTTAC CCGACTTTGC CAGTGCATAT TGCTTGCCAC TGGCTTACTT TGCGGCGATA TCCTCGTTGG AGGGCTGTCA GCTCGCGTAC CGTCTCTATC CCGAAGCGAT CGGGATGGAA GATGACTTTG GTCTCCCTTT GCATTATGCC TGTTACTTGC AAGCCGATGT ACAGGTAGTA TCGTTTCTAC TGGCCCGCTA TGGGGAAGCG GCTAAACGAA CCAATCAGGA ACATCAAACG CCGCTACACT TGGCTTGTCA GGTTGCCACT CCTGGACCGA CGACGAGCTC ATCCTCGAGT ACTAGTTTGG ATCGACTGGT CCGGGAACCC AATCTGCAGG TTCTGAAAGT TCTCCTCGAG CACTACCCTA CCGCTTCTCA ATTGGCCGAT CGGGAGGGGA ATTTACCGTT GCACTGGGCC TTGCAAACCT CGGGCATTTC GTTGTCACGT TGCCAGGCCT TGGCCGCTCC GAAACCGCAC CATCAAACGC TTCGCCGGAG CAACCGGATA CTGGAGAAAC CTTTGCATGT GGCTTGCCGG TTTGGTGTGT CGATGGAAGT CTTGTCTTGG CTGCTCGAGG AGCATTTGGG CGCGGCCAAG ACTACCAACG AAAGGTTTGA AACACCACTG CACGCGGCTG TATTGGGGGA ATCCGAGTCG AGTCGGAACA TGCGGTGGAT GCAAGCGCTG GTCCGAGCCT TTCCTGACGC CCGATCTTGG ACCGACGAGC GGGATGAACG GCCCGTGGAC AGTGCCATAC GAATGGGCGC ACCGGAAGGT ATTGTGTCTT TGCTAAGCGT GGAATAA
|
Protein sequence | MFFYSWFGST AARAHRHFPA LPEPLDDAVA IHDEHQSSSG SCKDDPPDIP VRKNDETCVR TAVHSITIPK WDAPLQSAAH EPSLHNLDNP DQSAHQTDNS VTSWCRQHVP SGASWNGRAS CRHQQYITNR QQLLTEDEET DSLFPLHTIP FVSIELRLEP TDHSDDDDDD DDLFGCPPHG PRTRQAYTSW DAPDIAADVV SATPSDPPQR TGQIRIGGTK SIEFPDALPS RLDETVHLYN ASQGCHADSP PAQFVSTIPA HFAVPVEITV VPNHDPGNCR KDQASYNVCW SSLIEDNESD NRLADCSLMN DPTEPLRYPW TNLPTQRFSD RSTRNKESLV PPLPDPLDRC AAEFYSLLLR HLETKRVADQ AATQDLLAFL RKYPLVGQVH FRLPDFASAY CLPLAYFAAI SSLEGCQLAY RLYPEAIGME DDFGLPLHYA CYLQADVQVV SFLLARYGEA AKRTNQEHQT PLHLACQVAT PGPTTSSSSS TSLDRLVREP NLQVLKVLLE HYPTASQLAD REGNLPLHWA LQTSGISLSR CQALAAPKPH HQTLRRSNRI LEKPLHVACR FGVSMEVLSW LLEEHLGAAK TTNERFETPL HAAVLGESES SRNMRWMQAL VRAFPDARSW TDERDERPVD SAIRMGAPEG IVSLLSVE
|
| |