Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_33814 |
Symbol | |
ID | 7197858 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011672 |
Strand | + |
Start bp | 413171 |
End bp | 414307 |
Gene Length | 1137 bp |
Protein Length | 378 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002178222 |
Protein GI | 219114853 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.000658856 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATCGACG AGGAGGAAAA GCTTGCCAAG ACCTCAATGA CGACGTCCAT TGATACCTCC GCAGCCAACA AACCAGCGAC CGAATCGCCT GAAATCGAAT CGTTACTGAG CCCCGTGCGA CAGAAGAGAG AATGGCTAGA AAACAAGTTC AAGGAAGACT TTTTGGCGAA TCGGCCACGA CTTAGTCCGG GACATGAACT TGTGGAGGCC CGCCACAAAT GGTTACAAGA AGAAGCCCGT CGGAATCGGG AAGCAGTTAT ACGCCTGAAT GACATTGAAG TCTCGCAAGA TGTCCTGGAA GCCAAGAAAA AGTGGCTTAC TGAAGACGAA CGGATCGTCC AGGAAATGCG TGTGCATGTG TTCTCCGAAA ATCCAGCTGG AGATGGCAAC GTTTCGGGTG AACCCGAAGT TGATTTTCGG GCGTATTCGT CTAAAAGTGA GCGACGGGAT TTGCCCAATG ATAGAGAGAC AGAAACCAGC TCTAATGAGG ACGAGCGCGA AGATAACTTT GAGGACTGGG GTAATCTGTT GGAAAAGATT TTTACCGACG AAATGGTCTT TATCGATGAG TCCGCCGATG ATTATATTGA GGTAGCTCAA TCGAGGGCGG AAACACTAGC TCCCATGCCA GGTGGCATGC CTTCATACCC ACTTTTCGTT GAAAGCGACA CGTCGGGAAT CTCGAATAAA AATCATTGTG TTTCAGAAAG TTCAGATGTC CGCAACGAGC GGACGACAGT GGAATGCGAA TTATTTCCGG GTGAATTGGC TTATCCCGAA AAATCGAGGC GGGATCCTGA AAAGACGGAG CTTTCGTCTG GAGAGACAGC TAATCTGGCA GTGCTCAAAG CCGAAAAGAT GACATCTGCG AAGAAAAAAG ACCTAGAGTC TGCTGAATCG TACTTAGCAA TTGCCGATAA AGTCTACTGG GAAAGTATGG CACTTTTGGA TCTCACTAGA GGTCAAATGC ATGCCTCTCC CGCGAAATTG TCGAAAGGGG AAATTTTGTT GTCGACAGAA AAGGCAGGAG TGGATAGTAA CAAGATTCGA GTTGTGGAAA AGGAACCCCT TGTTCCGCTC AACACGGACT TTGGAGTAGT GGAAGCGCGT TGTCTTGAAC GTTGTGTCAT TTCTTAG
|
Protein sequence | MIDEEEKLAK TSMTTSIDTS AANKPATESP EIESLLSPVR QKREWLENKF KEDFLANRPR LSPGHELVEA RHKWLQEEAR RNREAVIRLN DIEVSQDVLE AKKKWLTEDE RIVQEMRVHV FSENPAGDGN VSGEPEVDFR AYSSKSERRD LPNDRETETS SNEDEREDNF EDWGNLLEKI FTDEMVFIDE SADDYIEVAQ SRAETLAPMP GGMPSYPLFV ESDTSGISNK NHCVSESSDV RNERTTVECE LFPGELAYPE KSRRDPEKTE LSSGETANLA VLKAEKMTSA KKKDLESAES YLAIADKVYW ESMALLDLTR GQMHASPAKL SKGEILLSTE KAGVDSNKIR VVEKEPLVPL NTDFGVVEAR CLERCVIS
|
| |