Gene Information |
Locus tag | PHATRDRAFT_31877 |
Symbol | |
ID | 7196412 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011669 |
Strand | - |
Start bp | 1155188 |
End bp | 1156510 |
Gene Length | 1323 bp |
Protein Length | 407 aa |
Translation table | |
GC content | 46% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002177229 |
Protein GI | 219110955 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.00010716 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCAAAG AACAATCAGG ACCTGGTGTC ATTATGAGCG GCATTCGCCG CATTGGTCAT GGTCTCAAGA CTTCATTAGC CATAGTTGGT TTTGCAACAG CTTCCGGTCT TTACATGGAG TACCGGAAGT ATTACCCTAT GGAAGAAGAC AACAATAAGA AGAAAGTTCT GGTGATTCCC TTTCACCATT TACAGTTGAT TGAGAAAGAA AAAAAGAGCA TTAGATCCCA GCTATCGCGT TTCGATGTGG ACACCAAAGA TTGCCCGGTG CAAATGGAAA TCAAGGATCT TGTGGATGTG TTGCACCACG CGGCGTCAGA TCCCAGTATT GTCGCTTTGT ACGGCGTCTT TGGACACGGC TCGACCTTGT CCCAAGCGGG TTGGGCGGAT TTGGAGGAAG TTCGGAACGC GTTACGAGTT ATCCGCGAGT CGCATCGTTG GCATGCGGAG CCCAACCTTC AGCACAAGGC CCAGGTGATT CCAGGAGTTG AGAATAAGCC CATGTACGCC TACGCGGATA CTTTTGCAAG TCTAGGGGAT CCCGCTTACA AAGAGTATTA CTTGGCGTTG ATCTTTACAC ACATTCATAT GCAAAAGACC GGGGAACTCA ACTTGTTTGG TGCCATGTTG CAGCAATTCT TTTTGCAGGG ACTACTGGAG CAGTATGGTA TTGCATTACA CGTCTTCAAG CATGGATAGT ACAAGAATGC GACCAACATG TTCACCAAAG CACGTTTGAA CAAGCCACAT TGTGAGAACG TCTCCAACAT TCTTGAACAG ATCAACAACA ATGTATGCCA AGATATTACC AGATCTCATT CCAAGGCCTT GTTGACGTCT TGGCTCAAGC AGGGTTGTCG GGATGACGTG GATTTGTGGA AGCGCATACA TCAATTGGGG ACGTTTCCAG CTGTGACGGC ATACAAAGCA GGCCTCATAG ACTTCCTGCC CTGGCGCAAC CAAAAGAACA CAAAAAGCAA GGTAAAGCCA CTTGATGGTA CAGGTAACAA AGAGTCTGCA ATAGATGATA TTAAGAACAA ATGGGCATTG CAAGAAACTG ACTTTGAGCA ATTCAAAGCA GACACGGCTG TCAGTCTCCA GGCCTACGCA AAACAAGTTG CAAAGAAGAA ACAAAATGAG CAAGATATTT TCGACCAGTA TGGAACCCAA CATCCTGCCA TTCAAAGTAT TCTTGCCAAA ATTGGCATGT CTTCTGTTGA TGATGGAGAA CCACATCCTC AGAAGGAAAC AATTGCATTG CTAAGAGTCA ACAAAGGTAT TGGCAACTTG ACAGCCCGCA AGCTAGTCAA TTCGATTTGC TGA
|
Protein sequence | MSKEQSGPGV IMSGIRRIGH GLKTSLAIVG FATASGLYME YRKYYPMEED NNKKKVLVIP FHHLQLIEKE KKSIRSQLSR FDVDTKDCPV QMEIKDLVDV LHHAASDPSI VALYGVFGHG STLSQAGWAD LEEVRNALRV IRESHRWHAE PNLQHKAQVI PGVENKPMYA YADTFASLGD PAYKEYYLAL IFTHIHMQKT GELNLFGAML QQFFLQGLLE QYGIALHINN NVCQDITRSH SKALLTSWLK QGCRDDVDLW KRIHQLGTFP AVTAYKAGLI DFLPWRNQKN TKSKVKPLDG TGNKESAIDD IKNKWALQET DFEQFKADTA VSLQAYAKQV AKKKQNEQDI FDQYGTQHPA IQSILAKIGM SSVDDGEPHP QKETIALLRV NKGIGNLTAR KLVNSIC
|
| |
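The coordinate, length, and splicing fields in this record can be cross-checked with a little arithmetic. The sketch below is illustrative only: the helper names (`gene_length`, `coding_length`, `gc_fraction`) are hypothetical and not part of any database API, and it assumes that Start bp and End bp are 1-based inclusive coordinates and that the reported protein length excludes the stop codon.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def coding_length(protein_aa: int) -> int:
    """Nucleotides needed to encode the protein plus one stop codon."""
    return (protein_aa + 1) * 3


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases; spaces between 10-base blocks are ignored."""
    bases = seq.replace(" ", "").upper()
    return (bases.count("G") + bases.count("C")) / len(bases)


# Values copied from the record above.
span = gene_length(1155188, 1156510)  # 1323, matches "Gene Length"
cds = coding_length(407)              # 1224 bases for 407 aa plus stop
non_coding = span - cds               # 99 bases not accounted for by the CDS

print(span, cds, non_coding)
```

Under these assumptions, the 99 bp difference between the gene span and the coding length is consistent with the record's "Is gene spliced | Yes" entry. `gc_fraction` can be applied to the Gene sequence field as pasted (with its spaces) to compare against the reported GC content.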