Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_41686 |
Symbol | |
ID | 7196494 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011669 |
Strand | + |
Start bp | 1279871 |
End bp | 1281163 |
Gene Length | 1293 bp |
Protein Length | 430 aa |
Translation table | |
GC content | 52% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002176755 |
Protein GI | 219110006 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTAACTT GGGCGCCTGG AAAAGCCCGC CACAACACTC CAATCATCAC GCACGGAGAG GGCGTGTATC TCTATGATGA CAAGGGGACG AAATTTTTGG ACTGGACCTC GCAGGCAGTG TGCTCTAACA TCGGATACGA TCTTCCCGAA GCAGTGATTG AGGCCACAAC CAAGCAAATG TCGACTCTTC CGTTTGTATA CGGTGGCCTT GGTATCTCTG AAGTTCGAGC GCGACTCAGT AAGCTCATGA CGGAACTTCT TCCCGGAGAT TTACAGGGAA TGGTCTTTCC GTCGTGTGGA TCGGAAGCGA ACGAAGCAGC CATCATGATG GCGCGTCGCT ACACAGGAAA GTACAAGGTA ATCAACTGGT ACCGTTCCTA TCACGGAGGG ACGTCCAACT CGCAACAAGC AACGGGGGAC TTCCGCCGAT GGTTCGGTGG CGATCACGTT CCCGGCTTTG TCAAGGCTTT CCACCCCTTT CCACTGTTTT GGGACTTGGC GGGGGCGACG GAAGAAGAAC GGACACAGCA CGCCCTCAAC ATGCTCGAAG AACAAATACT GAACGAAGGT CCCGACTCAA TCGCTATGGT CCAGTTTGAG TCGGTCATCG GCGGTGGCGG TGTGCTCGTT CCTCCCAAGG GGTACATGCA AGGAGTCCGT GCCATGTGTG ACAAGTATGA CATTCTCATG CATTGTGATG AAGTCATGGT CGGGTTCGGG CGCACCGGAG AACTGTTTGG TTTCCAAAAT TACGAAGGAG TAATTCCCGA CATTGTAACT GCTGCGAAGG GCGTGACCTC GGCAGCTCTT CCTTTGTCCA TGACGGCTTG CCGCAAGCAC ATTATGGAAG CCTTTGAAGA AAAACCATTG GGCTGGGGAT CAACTTATGC GTCGCACCCT GTCGCAATGG CTTGCGCTTA CGAAAACATC AAATACTTGA TCAAGAATGA TGTTATTGGA CACGTGCAGC GGCTTGCCCC TACCCTCGAA AGCGAAATGC GCAGATTGAC CGAGAACCAT CCGTCTATTA AGCAATACCG TAGTATTGGT ATGTTCGGTT GCTTCGATGC TCACTTACCG GACGGTTCCA ACCCACAGCT TCAGCACACG GCTATTGACA AAGCCTTTGT CGAGTATAAG AAAGCCTATA CCGCCAATGG CCTCATTGGT CTACTCCGCC CCCCACACAT GCATGTTGCG CCACCGCTCG TTATCTCTGA AGAAGAACTT CTGGATGGGT TTGATCGACA GGACAAAGCT CTCTACGCCC TTGATGACGC TCTCGGCTTC TAA
|
Protein sequence | MLTWAPGKAR HNTPIITHGE GVYLYDDKGT KFLDWTSQAV CSNIGYDLPE AVIEATTKQM STLPFVYGGL GISEVRARLS KLMTELLPGD LQGMVFPSCG SEANEAAIMM ARRYTGKYKV INWYRSYHGG TSNSQQATGD FRRWFGGDHV PGFVKAFHPF PLFWDLAGAT EEERTQHALN MLEEQILNEG PDSIAMVQFE SVIGGGGVLV PPKGYMQGVR AMCDKYDILM HCDEVMVGFG RTGELFGFQN YEGVIPDIVT AAKGVTSAAL PLSMTACRKH IMEAFEEKPL GWGSTYASHP VAMACAYENI KYLIKNDVIG HVQRLAPTLE SEMRRLTENH PSIKQYRSIG MFGCFDAHLP DGSNPQLQHT AIDKAFVEYK KAYTANGLIG LLRPPHMHVA PPLVISEEEL LDGFDRQDKA LYALDDALGF
|
| |