Gene Information |
Locus tag | PHATRDRAFT_47599 |
Symbol | |
ID | 7202656 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011682 |
Strand | + |
Start bp | 237527 |
End bp | 239251 |
Gene Length | 1725 bp |
Protein Length | 567 aa |
Translation table | |
GC content | 54% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181869 |
Protein GI | 219123100 |
COG category | |
COG ID | |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGATTCCGA GGTACGCGTA CGGGATCGCG GTCCTCCTCG GGGCGGATCA AGCCATCGCC AGCACCACGC CAACCGGCAT TCGTGCCGTA CCATGGGCGA CGCCACACTC TCACAACAGA ATGATTGATC AAGCTCACCA TTTGCTGATG CAGGCACGTG TCCGTGATCT CTGCGAGTAC GCGACGAGTA GTTATCCCAT TTTCCAAACG GGCTGTCGCA AAGAGGCCCG CGTCGCACCG GACAACACGG CTCGGCCGAT CCCATACGGA ACGGTGCGAC CTTCCGCCCA CGCAGATACA ACCCCAAAGC TCTCCCGTCG ACTACTCGAC TCCGACGAGG CCTTTCGTTG GGATTGGACC AGTCTAATTC TCGCCTTTCT CTGTGTCTTG TGTGCCGCCT TTTGCGCTGG TCTCATTATG GGTATTCTCA GTCTCGACGA GCTCCAACTC CACATTAAAA TTCGGGCGGG ATCCGATCCG GAAGAACAAC GCTACGCCAA CCGGCTTTTG CCACTCGTGC AACAACGTCA TTTGGTTCTG GTGTCGCTCT TGCTTCTCAA CTTTCTGGCC GACGAAGTTT TGCCACTCTG TCTCGACAAC GTCATGCCGA CCTGGATGGC CGTCTTGACG TCCGTCGTCC TCGTCGTTTT TGTTTCGGAA ATCATTCCCT CCGCCGTCTT TATCGGACCC GATCAGTTGC GCTTGGCGAG TCAGATTTCA CCCTTCGCCT ACGCCGTCAT TTATCTATTC TATCCCATTG CCTATCCTAT AGCACTGCTC CTCGACTATC TCCTCAAAGG TGAAGACGAA CTCGGCAACC AGTACAATCG GGGCGAACTC TCCGCACTGG TACGAATCCA GTACGAAGGC CGCCTGGCGG CCAAGCGCCG GGAACTCAAG GAACGACGCA TGGAACAGGG GATTGCGGGG CTGGACGACG ACGAGTCCCA ATTATCCGAT ATTCCACCCT CGATTACCTT CCAGCACGAC ACGCATTCTA TTCAGACTAC CGAAGTCAAC ATGATGCAAG GCGCACTGGC CTTGAAAACC ACCAACGCTC GCGACGTGTG TACCAAAATT CGGAAAGCTT ACACCGTCAT TGACAGTATG GTGTTGGACA GTGGCAATGT GGCCCGTATT TATGGAGTGG GTTATAGCCG CGTCCCCGTC TATCAACGCA ACCAGCGGAG ACCGAGAGAT ATCACCGGCA TTGTTGGTAT TCTACTAACC CGACAACTAA TCTTGATTCA GCCCGAACAC CGCCGACCCG TCTCGTCGTT GCCTCTGTAC CAGCCCGTGT GTGTTGGACC GGAAGCCAAC ATGATCGAAT TGCTACAAAT GTTTCAGGGG GGCAGTGCCG GGAACAAAGG TGGGCACATG GCCCTCGTGT GTGAGCGTCC GGGGATCGCG ACAACCGCCC TGGACCAGAA AAAGGCCATT CCTCCGGAAG CCGGCGTCAT TGGTATCATT ACCATGGAAG ACGTAATTGA AGAATTGTTG CAGGAACCAA TTTACGATGA AGGCGACCGA GAAGAACGGG AAGAAATGGA AAGAGCCGAG TGGGCCTTTC GCAAATGGCG TTTGTTTGTC AAACTCCGGC GTCGCCAGCG CGAGCTCTTG ACAGAATTAG AAAGCACTGA AGGCACGCCC TTACTGACCA ACCACAAAAT GTACAATACT TCGACCATTT TCCGGCCGGA ATAACCTAAA CCTAGTTGCT AGAGA
Protein sequence | MIPRYAYGIA VLLGADQAIA STTPTGIRAV PWATPHSHNR MIDQAHHLLM QARVRDLCEY ATSSYPIFQT GCRKEARVAP DNTARPIPYG TVRPSAHADT TPKLSRRLLD SDEAFRWDWT SLILAFLCVL CAAFCAGLIM GILSLDELQL HIKIRAGSDP EEQRYANRLL PLVQQRHLVL VSLLLLNFLA DEVLPLCLDN VMPTWMAVLT SVVLVVFVSE IIPSAVFIGP DQLRLASQIS PFAYAVIYLF YPIAYPIALL LDYLLKGEDE LGNQYNRGEL SALVRIQYEG RLAAKRRELK ERRMEQGIAG LDDDESQLSD IPPSITFQHD THSIQTTEVN MMQGALALKT TNARDVCTKI RKAYTVIDSM VLDSGNVARI YGVGYSRVPV YQRNQRRPRD ITGIVGILLT RQLILIQPEH RRPVSSLPLY QPVCVGPEAN MIELLQMFQG GSAGNKGGHM ALVCERPGIA TTALDQKKAI PPEAGVIGII TMEDVIEELL QEPIYDEGDR EEREEMERAE WAFRKWRLFV KLRRRQRELL TELESTEGTP LLTNHKMYNT STIFRPE