Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_26029 |
Symbol | |
ID | 7198046 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011672 |
Strand | - |
Start bp | 356038 |
End bp | 357711 |
Gene Length | 1674 bp |
Protein Length | 478 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002178487 |
Protein GI | 219115383 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.000382516 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CCAACTTGTT GACCTTCGAA ACAAACAAAA CCAACGCTTT ATACCAACAA CACCTGACAA CACAAATATG TCCGATATCG ACGATAAGCC CACCCCAGTT GAGTCTGGTG AACCCGTAAG TAAAATGATT CCTTGGTTCC TTCGTCGAAA CGAGTGTGTC AAGACCTCAC TGTCAATTTT TCTCTTATTT TTTGCTTTCT CCAGTCTGAG TGGAAGAAGT ACAGCTCCTA CCATATCAAG ACTGATCCTG ACCAGGATGA TAAGGCTAGC GAGATCAGAC TTTGTAGCTT TGCTCGGCCC CACATGCGGG CCTTCCATTG CGCATGGTGG TGCTTTTTCA TCGCCTTTTT CATCTGGTTT GCCATTGCGC CTTTGCTTTC CGAAATCCGT GACGATATTG GCATCACGAA ACAAGACGTT TGGACCTCCT CCATTGTCGG TGTTGGTGGC ACTATTTTGA TGCGTTTCGT CATGGGACCC ATGTGCGACA AGTACGGTGC CCGTGTCCTT TTCATGCTCA TTCTCTGTTT CGCCTCCATT CCCACGGCGT GCACAGGGTT CGTCAACAGC GCTACCGGTC TCGCCATCCT CCGACTTTTT ATTGGAGTCG CGGGAGCTAC CTTCGTTCCC TGCCAGTACT GGTCGAGCCG AATGTTCACA AAGGAGGTTG TTGGAACCGC AAACGCCTTG TGTGGCGGAT GGGGTAACCT CGGAGGTGGC GTCACACAGC TTGTCATGGG ATCAGCCCTT TTCCCTCTCT TTAAAGTCTT TTTCGACGGA GATTCCGAAA AGGCCTGGAG AACGGTTTGC GTGGTCCCGG CTATTGTTGC CATGGCTTCT GGTATCATGG TCTATCGAAT CAGTGATGAT GCTCCCAAGG GAAATTACGA CGAGATGAAG AAACACGGTA CCATGCCGGA GGTTTCTGCA GCTGCTTCTT TCCGTTCCGG AGCATTGAAT TTGAATACAT GGGTTCTGTT CATCCAGTAC GCGTGCTGCT TTGGAGTCGA ATTGACCATG AATAACGCCG CCGCCCTTTA TTTCAAGGAT GAATTTGGTC AGTCAACCGA ATCGGCTGCC GCCATTGCCT CGATTTTTGG ATGGATGAAC CTTTTCGCTC GTGGCCTCGG AGGCTTTGCC AGTGACAAGG CTAATGCCAA GATGGGAATG CGCGGCCGTA TTTGGGTACA GACGATCTTT CTTGCTCTCG AAGGAGCTCT TGTTCTCGTT TTCGCGCAGA CGGGATCACT TGGAGCTGCC ATTGCTGTCA TGGTGTTCTT TTCGTTGAAC GTACAAGCTG CTGAAGGTGC CACCTACGGA ATTGTCCCTT ATGTCGACCC CGCGTCCACC GGATCCATCT CGGGTATCGT CGGTGCTGGA GGAAACACCG GTGCGGTCTG CTTCGGTCTC GGATTCCGTC AACTTAGCTA CGAAAAGGCA TTCAATATCA TGGGATATTC AATTCTTGCG TCAGCATTGA TGTCGGTCTT CATTAATATC AAGGGACATG CCGGCCTCTT TTGGGGCAAG GACGATGTCG TGCAAAAGGC GACCCTCACT GTCCCTGCGC AGGAAGAAGA AATTGAAGCC TGAGCTTGAA GAGGCATGCC CAAAGGGGAT TCTGATTGTC AATTTTGATA TAAAAATTTT CGTAAAACTT TTGTTACTTT TTGA
|
Protein sequence | MSDIDDKPTP VESGEPSEWK KYSSYHIKTD PDQDDKASEI RLCSFARPHM RAFHCAWWCF FIAFFIWFAI APLLSEIRDD IGITKQDVWT SSIVGVGGTI LMRFVMGPMC DKYGARVLFM LILCFASIPT ACTGFVNSAT GLAILRLFIG VAGATFVPCQ YWSSRMFTKE VVGTANALCG GWGNLGGGVT QLVMGSALFP LFKVFFDGDS EKAWRTVCVV PAIVAMASGI MVYRISDDAP KGNYDEMKKH GTMPEVSAAA SFRSGALNLN TWVLFIQYAC CFGVELTMNN AAALYFKDEF GQSTESAAAI ASIFGWMNLF ARGLGGFASD KANAKMGMRG RIWVQTIFLA LEGALVLVFA QTGSLGAAIA VMVFFSLNVQ AAEGATYGIV PYVDPASTGS ISGIVGAGGN TGAVCFGLGF RQLSYEKAFN IMGYSILASA LMSVFINIKG HAGLFWGKDD VVQKATLTVP AQEEEIEA
|
| |