Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_40956 |
Symbol | |
ID | 7198765 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011694 |
Strand | + |
Start bp | 376886 |
End bp | 378475 |
Gene Length | 1590 bp |
Protein Length | 529 aa |
Translation table | |
GC content | 56% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184870 |
Protein GI | 219129384 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGACGACG ATCCGACTGC GGGATCCGTC ACGCTACGCA ACATCGAAGC CTGTATGCAG TTATCCCGCA TGCTACGCAC TTCACTGGGA CCTCAAGGAC GGTGCAAGCT TGTGGTGAAC CATCTGGAAA GGCTGATTGT CACGTCCGAT TGTGCTTCCA TTTTGAAGGA AGTAACGATC GAGCACCCCG CGGCGCAACT GCTTGCCGAG GCTTGTCAGA AGCAAGAAAC GGAGCGTGGG GACAACACCA ACTTTGTGCT CGCCTTTGGT GGAGAAATTC TCTGGCAGAC GTCACAACTC ATTGCCAAAA TGACTTGGCA ACCCGCACCG GAGATTCTCG CCGGATACCA GCGAGCACTG ACGCTCGTCG AAAACAAGTT GTTGCCCGAT CTAGTTTGTG ATGGAGTTAC CGACTTGAAG GACAAGCATC AGCTTCTCAA GATTCTACAG CCGGTACTTG CTTCCAAGCA GTACGGATCC GAACAAACGC TCGCTCCCAT TGTTGCCGAC GCTTGTTTGA CCGTTATGGA CGCGCAGGGA AAGTTCAACG CCGAATCCGT TCGCGTTTGC AAGATTCCCG GTAGTTCCGT TTCGCAATCG ACACTTCTGG AAGGCTACGT TGCCATGCGG GGGCTCGAAA CCGTCGTGAC CAACGCGACT GACGCCAAGA TCGCGGTCTA CGCTTGCGGC TTTGAAGCGT CGTCTACCGA AGCCAAGGGC ACCGTCCTCA TGAAGACGGC CGCCGATTTG CTGTCGTACA ACCGTACCGA AGAAGCAAAA ATGGAAGAAA TCGTGCAAGG GATTGCTGCT TCGGGGGTCA AAGTTGTGGT TACGGGTGGC AATCTTTCCG ACATGGCCTT GCACTTTTTG GATCGCGCCA GTCTGATTTG TTTGCGCATC GGCAGCAAGT GGGAATTGCG TCGCTTGTGC CAAGCCGTCA ACGCCACTGC TTTGGTGCGT CTCGGGGCTC CGACCCCGGA CGAAATGGGG TTTTGCGAAT CCATTCGAAC CCAAGAACTG GGGGGCAAAA CGGTTACGGT CTTTCGGTCG CACGAGACCA AACTCGCGAC GATCTTGTTG CGGGCCAGTA CCAGTTCCGT CCTCAACGAT TTGGAACGGG CCGTCGACGA CGGAGTCCAG GCCGTCGTTC AGGCCGGAAA AGACGGACGA CTGGTGTACG GCGGCGGGGC CGTCGAAATG GCCCTGTCCA TGGCACTCCA GCAGGAAGCC AGTCGTGTTC CCGGCCTCGA ACAGTACTCC ATCGCCGCTT TTGGTAAGGC TTTGGAAATT GTACCCCGCA CGCTGGCGGA AAACGCCGGC TGGGATGCCG TACGCGTCCT GGCGGACTTG AAGGCTTCCC ACGCCCAGCA CGGTACCGAG TCCGTTTGCG ATGTTGGAAT TGACATTGAA CGATACGGAA CCGAAAACGA CGACACCGGT GGCACGTGTT CAATGAAGGA GCGGGGTGTC CTGGATTTGA TGTCCACCAA ATTAGCGGCA TTGCGCTTGG CCGTCGATGC CGCCACGACT ATTCTCAAAA TTGATCAAAT TATTATGAGC AAACCGGCGG GTGGACCGAA ACCGCAGTAA
|
Protein sequence | MDDDPTAGSV TLRNIEACMQ LSRMLRTSLG PQGRCKLVVN HLERLIVTSD CASILKEVTI EHPAAQLLAE ACQKQETERG DNTNFVLAFG GEILWQTSQL IAKMTWQPAP EILAGYQRAL TLVENKLLPD LVCDGVTDLK DKHQLLKILQ PVLASKQYGS EQTLAPIVAD ACLTVMDAQG KFNAESVRVC KIPGSSVSQS TLLEGYVAMR GLETVVTNAT DAKIAVYACG FEASSTEAKG TVLMKTAADL LSYNRTEEAK MEEIVQGIAA SGVKVVVTGG NLSDMALHFL DRASLICLRI GSKWELRRLC QAVNATALVR LGAPTPDEMG FCESIRTQEL GGKTVTVFRS HETKLATILL RASTSSVLND LERAVDDGVQ AVVQAGKDGR LVYGGGAVEM ALSMALQQEA SRVPGLEQYS IAAFGKALEI VPRTLAENAG WDAVRVLADL KASHAQHGTE SVCDVGIDIE RYGTENDDTG GTCSMKERGV LDLMSTKLAA LRLAVDAATT ILKIDQIIMS KPAGGPKPQ
|
| |