Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATR_33153 |
Symbol | |
ID | 7204272 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011671 |
Strand | + |
Start bp | 196183 |
End bp | 197613 |
Gene Length | 1431 bp |
Protein Length | 443 aa |
Translation table | |
GC content | 47% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002186012 |
Protein GI | 219112857 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCGACCG GGACAAAAAA CAGCTTCTTC GCCACCACAG TCTGCGTTGC CTTGTTTTGC TTTGTAAGAG CCCTTACCTC TTTGGGTACT GTCCAGAATC TCCAAGTCTT CGATACCGAA CTCAAAGACA TAGCAGAAAG CGTTCATTCA CTTTCGGTCC ACGGGACCGG TGCTGACAGC AACAACTCAT TGCCAAAGGC AAAACCTGAT GATGTCTACG TATTAGAATC AAAGACCAAG GCTGCAGTCC GTGAAAGTAG CCTCAGGGAA AGCACTCTGG TCACCAATGC CGGTATCGAA TTTCGGAAAC CAATATGGAG TAACGCTTCC AGCTTTGACA ATGGCGCACC GAGCATTCTT GTCCAACTCA ACGGAGAATT GGCAAACTAT CTCGGATTTA TTGCCAAGGC TTTTGGACTG GTGTGGTGGC TGGAGCGGGA GTATGGTGTG AATCCTACAA TTGTGCTGAG ACATCAACAG CATCCCAAAT GGGTTGGTGC TCACGCGGAT GTTACTCGAT GTTTTCCGTA CTTGAGAGAC TTTAATTTTG GGGCCGGAAA TACTCGGGAT ATAAGCAAAG AACTGAGTGT TCTGAGTCAA TCTCATCAAC AAAGCAATGG TACGGCTGAA AGAGTTGTTG ATATCAGGAG CGAAGTGCCA TACGACAAAA CAATCCAGTC CTTTTTGAGT CTCTACGCGA AGAGCCACAT CCACATTGGG GGAGAAAAGA GCCGTATCAA CATACCTTTT CTGACCACTA AGCAAATGAG TTGCAGAGAC CTCATAGTGG ACAAGTACTA TGACGATATT CGAAGAATAT ACCGATTTGA CAAAAGCTGC TGCGTTGATG TACCCGACCC GGATGAATCT GTCTTTGTAG GTAGCTATTG CCAATCAGAT ATGTGCAGAT TTTGGCGTTT CCGTCACTGA CTTTTCTTTG TTTTGCTGTG CTAGCATTTT CGCAACTTTG TAAAGGAAAG GTCTGGACTA CGAAAGCAAC CTGGATATGA AGAGCTTGCT CCGGAGCAAG TGGCGAACGA GCTGTTTGCG CATCTGAATC CGGGCGACAA AGTAGCAATA ATCTCTCGCT ACCCCAATGA CTTCCGAACC CAAATGATTG TGGGCGCATT TGAGAAGCGG AAGATTCGGG CTCGAGTTGT AGAGCCACGG TCCGGGGTGG CAGACTTTTG CTTTTTGATG CATACGCAAA AGGAGATGGT TGGCACGGCT TGGTCTACTT ATTTTCTTTG GGCTGGCCTG CTCGGAAATG CTACAAGCGT ACGACCGTAT ACAGCCATTG TACCTGGTCG GAACAGCAAA ATCGACTCTC ACAATTTTAC GCATCCAGAC CTCAAATCCC GTTTTCGTTT TGAGCACTAT ATCAGCAACT TTACTGCTGG GGATCTACGT AAACCAAAAG GTGAACAATA G
|
Protein sequence | MSTGTKNSFF ATTVCVALFC FVRALTSLGT VQNLQVFDTE LKDIAESVHS LSVHGTGADS NNSLPKAKPD DVYVLESKTK AAVRESSLRE STLVTNAGIE FRKPIWSNAS SFDNGAPSIL VQLNGELANY LGFIAKAFGL VWWLEREYGV NPTIVLRHQQ HPKWVGAHAD VTRCFPYLRD FNFGAGNTRD ISKELSVLSQ SHQQSNGTAE RVVDIRSEVP YDKTIQSFLS LYAKSHIHIG GEKSRINIPF LTTKQMSCRD LIVDKYYDDI RRIYRFDKSC CVDVPDPDES VFERSGLRKQ PGYEELAPEQ VANELFAHLN PGDKVAIISR YPNDFRTQMI VGAFEKRKIR ARVVEPRSGV ADFCFLMHTQ KEMVGTAWST YFLWAGLLGN ATSVRPYTAI VPGRNSKIDS HNFTHPDLKS RFRFEHYISN FTAGDLRKPK GEQ
|
| |