Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_54405 |
Symbol | |
ID | 7200682 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011675 |
Strand | + |
Start bp | 167932 |
End bp | 170838 |
Gene Length | 2907 bp |
Protein Length | 623 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002179594 |
Protein GI | 219117604 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CTTTCCGTCC AGCAAAATGC CACCCAAGCA CGAAAGTCAA GAAGACCTCA AGATGTCAAC ATCTAAACAA GATGAAGACG AAGTCCGCAC CATTGACTTT CTCGACCACG ACGATGGTAA CCAAGGCAAC GGCTGGGGCC GTGGTATCGT GAAAGACTTC CGCAAGACTG TTGGAACGCA CTGGGTCAAC GAAATGACAA ACTTTAACCA GAAGTCGATT GCTGTTTCCT TTTTCATCTT CTTTGCGGCT GTAGCTCCCG CGATTACTTT CGGTGCCGTC TATTCCAAGG TGCGTTTTCC GACTTGTGTG TGTGTATGGT ATTGGTCATC GCCACGTGTT CGTCTCTCGA TAGTATCGTG TGTTCTTCGG GGTATCACCG AGTCACCGTC ACCTTGACGC TTCCTCACAT CCTCTTGTCT TATCCGCATT TCCATAGACC ACCAATGACG CGATTGGTGC CGTCGAGATG CTCATTGCGA CTGCTTGGTG CGGAATTGTC TACGCACTTA TTGGAGGACA GCCCATCATG ATCAACGGTG GAACCGGTCC CGTTCTTGCC TTTAGTGCCG TGCTTTTTGA TATTGCGGAC AACATGGACG TCAACTTTTT GACTTTGAAT GCCTGGACTG GACTCTGGGT TGCAGGATTC TTGATCATTG CGGCTTTCGT TGATTTGAAC CGTCTAATGA AGCATGCTAC GCGCTTCACC GACGAAATCT TTGCCCTGTT AATTGCGTCC ATCTTCGTGA TTGATGCACT TGGTAGTCCC TTTTCTGATG TAGGTATTTA CTGGTACTTC ACCCGCAGCC ATGATTCGCA TGACGAATTC GAAGACCAGG AAGACTACTC ATACATGGCC ACAGCGTTTC TCAGCGCCGT TCTCTGTCTG GGAACAACCT GGTTGGCCTT CTTCCTGAGG GATATTAAGT TTTCGCCCTA CTTTCCCAAC GATTCTTGGC GCACTCTCAT CTCCGATTTT GCCGTGGTTG CCTCCATTCT GATCTGGACT TTGATCGCCA ACGGACTCTT CGACAATGTT GAAGTGGAGC GCCTCAATGT CCCGGATAGC ATCACGCCGA CTCAAATCTG CTGCACCGCC GATTGCATGA CGTCGTTCCC CGATGACTGT CCCGACATTA CACCGTACGG ACGCCGTTCC TGGATTGTGG ACCTTGGTGC CGTCAACGGA AAGTCCTGGA TCCCTTTTTT TGCCGCCATT CCGGCTCTTT TGGCATTTAT CCTTGTTTTC TTGGATGATG GTATCACCTG GCATTTGATC AATCACCCGA GCAATAAGCT TACTCACGGA GACGCTTACA ATTGGGACAC GGTTGTTATT GCTGCTATGA TCGCCGTCAA CTCTATGCTT GGTCTTCCCT GGTTGGTCGC CGCCACTGTC CGATCCCTCA CCCACGTCAA TGCTCTCGCC GAACGTAGTG AGAACGGCAA AATTATCAGT GTGCAAGAAA CACGCTTGAC GCATTTGGGA ATTCACTTGC TTGTGCTTGC TGCTCTCTTT GCACTGGATG TGCTCAAGCT CATCCCTGTG CCAGTCTTGT ACGGAGTCTT CTTATATATG GGAGTGGCCA GTTTGGCATC CAATCAATTC TTCCAGCGCT TCCTCATGTT TTTTATGCAG CCCTCCAAGT ACCCCCACGA GCCACACACT AAGTACATGG CTCCTAAGCG CATGCACTTG TTCACAGGGA TCCAGCTTGG ACTTTTCGTA ATTTTGACAG TATTTCGATC TATTTCTGTC ATTGCCATTG CTTTTCCGAT TGTCATTAAG GCTTGTATTC CAGTCCGGAT GTACATCTTG CCTCGCTACT TCACCTCCGA AGAACTTCTC ATGATCGATA CCGATGACAG CATCGTCAAC GAATATCTCG AGTACAAGGA GTCCAAAGGC GAGAAAGTAC CCGTCCGTCA TTGTGGCGAA CAAGAGCCCG AGGAAGTTCC GATGCTTCAA GTGACTCAGC ATCCTATCCG AATCGACGAT GGCAGCGATG AAGAAGGGTC CGTCGAACAG GTTTAATGTT TTTCTGTTAT GTAACGGAAT GGAATTTCTA CTAGCTTTCA AAAGAGAAGG CTTAAAGTTT CAGACCGTAG TATTGATTGC CACGTGTCCA CTAGAGCCAT CTAAACGCCT TCCTTCGCTG CGAAAAAATG AATTTGAGCA TCCTTGTAGC CAGTAACCAT TAAAGGCGCT TGACCACCAG TTGGGACATC TCAACATGAC AGTGGTAGTC TCGGTTTGTT GAAGCCGATG CTTACTAGCT GTAAAGAGAC CCTCAATGCT CAAAAGGGAT GCTCTTTGGT ATCAACAGAT ATGTTTTCGA ATGTGCTTAT TAATAGACAA CATAAACGAA AGTACTTTCA AGATACAATT TGTATACAAT GTCTGCAGTT GAAAAAGGTG GACTCACGTC CCATCGTGGC CCAAATCGAT CTACACTGCA ACTCCAGATA GGAAACCGTT TTTCTGAGAG TAGTATGCGG TGGAGAGCTT TGCTTCTCGC CAATACGATT CTCCAACAAC TCTACGCAGA CGAGGTACTA CCGCTTTATG GAGTGGCGTA CATAGAGGAT CACCTTCTCT GTTAAGTTGG ATGCAATCAG AGATAACTGC CTTGGCTCGA TTCCGAAGAG CCGCGTCGCA CGGATCCCTA TCAAGAGCAG CGAAGAGTTT TCTTACGAAA ATTAGAAATC GCTGTCGTTC AGACAAGACT CGTCGGATTG GCATAGGAGA TCTACCCGGA AGGTGGGAGG TACGCTCGGA AATATCGTTT GGCAACATGT TTTACCGGAC TTAAAGAAAA ACCTTCGCTT ACGCAGTCTT TGGTTGGGGC GAGCTACGTA GAGAATTCTA ATGGAGCGTT TGTTGATTAT TGAACTCGGT CTGCTTT
|
Protein sequence | MPPKHESQED LKMSTSKQDE DEVRTIDFLD HDDGNQGNGW GRGIVKDFRK TVGTHWVNEM TNFNQKSIAV SFFIFFAAVA PAITFGAVYS KTTNDAIGAV EMLIATAWCG IVYALIGGQP IMINGGTGPV LAFSAVLFDI ADNMDVNFLT LNAWTGLWVA GFLIIAAFVD LNRLMKHATR FTDEIFALLI ASIFVIDALG SPFSDVGIYW YFTRSHDSHD EFEDQEDYSY MATAFLSAVL CLGTTWLAFF LRDIKFSPYF PNDSWRTLIS DFAVVASILI WTLIANGLFD NVEVERLNVP DSITPTQICC TADCMTSFPD DCPDITPYGR RSWIVDLGAV NGKSWIPFFA AIPALLAFIL VFLDDGITWH LINHPSNKLT HGDAYNWDTV VIAAMIAVNS MLGLPWLVAA TVRSLTHVNA LAERSENGKI ISVQETRLTH LGIHLLVLAA LFALDVLKLI PVPVLYGVFL YMGVASLASN QFFQRFLMFF MQPSKYPHEP HTKYMAPKRM HLFTGIQLGL FVILTVFRSI SVIAIAFPIV IKACIPVRMY ILPRYFTSEE LLMIDTDDSI VNEYLEYKES KGEKVPVRHC GEQEPEEVPM LQVTQHPIRI DDGSDEEGSV EQV
|
| |