Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_45894 |
Symbol | |
ID | 7200986 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011676 |
Strand | - |
Start bp | 599097 |
End bp | 600296 |
Gene Length | 1200 bp |
Protein Length | 354 aa |
Translation table | |
GC content | 53% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002180271 |
Protein GI | 219119009 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GAAACGTTGT CTCTCAACGG CGAACACGCT GCACAACCAA ACAAACCCCG TCCGCCGTTC ACGGATCGAT ACGACCTCCT CCTCCCACCC AGTATCCGGC GCCGAACCAA CCTTGCAACA CTGAACCTCA AAGCTATGGG TAAGAAGTCA CGTAAAGGAA ACGTCCGCAC GCAAACGCAA CGTCCAGCGG CGGGCCAGGG CAAATCCAAA CATCAAGGAT CCAGCTTGTC GGGTTCCGCG TCCCGGGCTT CGCGAGAAGA TCTTTCGGTC GCCTTTTCCA GTGAAAATGG TACCGATGCT GCGAATCCGA CTACGTCGAA GACGCCTGTC TTCAAGTCAA AGAAACCTAC ATCTTTACCT AAATCTGTCA TGCTTTCTCC GGCTTCCACG GCCAACGAAA CACTCAGTCC AGACGGTTAC GAAAACAACT TCAACTTTGA CACGAATGAG AAAGCACCGC TGCTGGATAT TCCCGTCACC ACGACGACGA CCAACAGCGA CGTCCTAGCG GCGACCACTA ACGTGGCACC TGCCGCTACC AGCGCTCCCG TACTCACCAC CGCTAATACC GTGCCGCCGG AACCCGCCCT GCCAACACCG TCCACCACGA CCAAGACGGA GGTGTTACCG CCAAACCGAG TTCTGATTCT TGTCTCCCAA CAAAGCATGA TACGCACTGT GACAACGAAC CAGCAGAATG CTGTCGTTAT GTTGCACGCG AGCGATATTC CATTTGAGCT ATTCGATGGC TCGGATCCGA CCAACAAGGA TCGGCGCAAC GAACTCTTTG CCTTGAGTGG TAAACGTTGC GTGTATCCGC AGTTTTTCGT GATAGAAGAA AGCAAACCAA AACCTCGTTT CTGGGGTGAC TATGAGACCA TGGAAGTTTC CAACGAGAAT GGAACATTGG CCGAGGATAT TTTTTCCAAA ACCACGGAAA CCGCAGAGAA ATCAAACAAC AACAATGGTA AATCCGTGTG GGATTCCACA TTGCGAGAGC AAGTGAACAA GAGCAATCCA GACAACAAAG CGGTGCTGGA TGTTTCTATA CACGAGGAAC ACGAGGAAGC AAGCAAGTCT GCAGCGGGCC CAGTGTCCAA ATATGTGCCA GTTTCCACTT CTCGAGTCCT GGACTTGAGC GCGCCAGCAC CGGACGAAGC TAAGGCCAAA CAGAAGGATT GCGAATGCGT CATTCTCTAG
|
Protein sequence | MGKKSRKGNV RTQTQRPAAG QGKSKHQGSS LSGSASRASR EDLSVAFSSE NGTDAANPTT SKTPVFKSKK PTSLPKSVML SPASTANETL SPDGYENNFN FDTNEKAPLL DIPVTTTTTN SDVLAATTNV APAATSAPVL TTANTVPPEP ALPTPSTTTK TEVLPPNRVL ILVSQQSMIR TVTTNQQNAV VMLHASDIPF ELFDGSDPTN KDRRNELFAL SGKRCVYPQF FVIEESKPKP RFWGDYETME VSNENGTLAE DIFSKTTETA EKSNNNNGKS VWDSTLREQV NKSNPDNKAV LDVSIHEEHE EASKSAAGPV SKYVPVSTSR VLDLSAPAPD EAKAKQKDCE CVIL
|
| |