Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATR_33380 |
Symbol | |
ID | 7203992 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011671 |
Strand | + |
Start bp | 741619 |
End bp | 743505 |
Gene Length | 1887 bp |
Protein Length | 456 aa |
Translation table | |
GC content | 47% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002186121 |
Protein GI | 219113075 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.113495 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACAATA CGATGGCTCC CATCATCTAT CAAGATGTCA AATTGAATCG AAAGAGGTCC GGTAAAGGCC ACAAAAAGAA GGAGCTGTGC ACCAACGCGC CAATCTTTTT GAGGGTACGT TCAGATGATA GTCGCTGTCA TGGTGTGATG AATGATGAAT GGTGTTCGTT GTGAAACATT TCTGTAGAAT CGTCCGTTGT TGCTTGAAAC TTCATAGCCA TTGTCACCAA CTCAAGTTGC ACGCCTATCA CCGCCAAGAA TGGGGAAATG TTCATGACCG TGAGCAGAAA TCGGCAGGTT ATGCAAGTAT TGTGTTTGAT TCCTTCTAGA GGCAGTTGCC TTTAAATTTT CGTGAAGAAC TTTCCTGACA TTTGCATCCA TCTCGTATCT CCCCTTGTAA CAGAAAACAT ACACAATGAT AGATACCTGT GATTACTCTA TTGCTTCATG GTAAGTGACT GCCGCAGATT CCCATGGTCT TCGCCGTTGG GAAAAATCGT TCGTTCGCCG AGCACAATTC TACGGCGGCT ACTCCGTAGA AAACATACTG ATCTGGGTTG AACAAATTGT TCGCCGTTAG TGTCATTGGG ACATTCGGCT CAGGCGGATG AGAATCCTTC GCCATGGTTT TTTTTTACTG TTCTTGGATG GTGGCCGACG GTCCAGTTCT AGGATCGGCA ATCCCGAAGC TGTTGTTGGT AACTGAATAG CTCTCAACTA CGTTTACACA TTTTCCTTAT CAGGAGCGAT GACGGTTTTA CGTTTGTTGT GAAAGACACC GATAAGTTTG CTTTGGAAGT CATCCCTGAG TTCTTCAAAC ACAACAATTT CTCGTCATTT GTTCGCCAAT TGAATTTTTA TGGCTTCCGG AAAATCAAGT CGGACCCTCT TCGAATCAAG GAGGCTGAAA TGAGTGAAGA ATCAAAGTTC TGGAAATTCC GTCATGAAAA GTTCCAGCGT GGGCGTCCTG ATCTTCTCGG GGAGATACGC AAGTCCAATC ACAACGAGTC TGCCGATAAG CAAGAGGTCG AGCATCTGAA GGGGGAGGTT GATCAACTCA GAGCGCATCT TTCTGTGATG AATCGCGAAA TTGACAAACT TACAAATGTA GTTGGTAGTC TTATGACGAA TCACCAGATT CAGCAAAATT ACGCATATGA CTCGAAGAAG CGCAAGCTTG TAGATGGTCC GGATGCAGTC ATATCGAACC TAGGAGGTGA AGGAGATGTT TCGGATTCTT TTCTGTTGGC TCCGTTGCCT GTTGAGTCGC AGTCCAGTAG GGCGGCGACG ATGATAAGTG ACCTTGTGAA GGATCCGAAC ATTGATCCTT TTGCGAGTGA GATCCAGCCT GCGCTCAATC AGCCAGATAC ATACGCAGAG TCAAGAAAGG GCCCTAATTT CACCTCGCAA GATGAAGAAA TGCTAGCTTC TCTTTTTGCT CTTGATCACA ATGATGACAT CGATGTGCTG GGGCATAGCG CGCCAGCAGC CACGGTGTCC GGGGAAACGG GTGGCTCAAA TAGCCCGGCA GCAAATCACG CTGACGCGCA GCTTGTAGAA AAGCTTCGTG CTTCGCTTGG CAATCTTCCG AAAGACATGC AAAGACTTTT AGTCGACCGA ATTGTGAAGA GTATCGAAAG TCCCGATCAA TTTCAGAAGC AGATTGACGC AATGACTTCT TTGGCGGCAA CTGCGGCCAG TGAAGCACAA AGACGCCTGT TGTCGTCGGG AAAGCCCCAG ACGGATCCGA AGAGTGTTCC GCTGGCATCC GCTGTTTTAG GGGCCTATCT TTCAGGACTT TCTTCCGTCC CCGATTCTAG TGAGCAGCAC GCGAGAGCTT CGCCTGACAT TGCTCCTGTC GCTGAGAATT CATTGATGCT GCATTGA
|
Protein sequence | MNNTMAPIIY QDVKLNRKRS GKGHKKKELC TNAPIFLRIP VITLLLHVSL GHSAQADENP SPWFFFTVLG WWPTVQSDDG FTFVVKDTDK FALEVIPEFF KHNNFSSFVR QLNFYGFRKI KSDPLRIKEA EMSEESKFWK FRHEKFQRGR PDLLGEIRKS NHNESADKQE VEHLKGEVDQ LRAHLSVMNR EIDKLTNVVG SLMTNHQIQQ NYAYDSKKRK LVDGPDAVIS NLGGEGDVSD SFLLAPLPVE SQSSRAATMI SDLVKDPNID PFASEIQPAL NQPDTYAESR KGPNFTSQDE EMLASLFALD HNDDIDVLGH SAPAATVSGE TGGSNSPAAN HADAQLVEKL RASLGNLPKD MQRLLVDRIV KSIESPDQFQ KQIDAMTSLA ATAASEAQRR LLSSGKPQTD PKSVPLASAV LGAYLSGLSS VPDSSEQHAR ASPDIAPVAE NSLMLH
|
| |