Gene Information |
Locus tag | PHATRDRAFT_45621 |
Symbol | |
ID | 7200392 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011675 |
Strand | + |
Start bp | 742342 |
End bp | 743622 |
Gene Length | 1281 bp |
Protein Length | 426 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002179706 |
Protein GI | 219117837 |
COG category | |
COG ID | |
TIGRFAM ID | |
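The length fields in the table above can be cross-checked against the coordinates. The sketch below is a minimal illustration, not part of the original record. It assumes that the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon. The function names are hypothetical.

```python
# Values copied from the Gene Information table.
START_BP = 742342
END_BP = 743622


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1  # drop the stop codon


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1281
print(protein_length(length_bp))  # 426
```

Under these assumptions the results agree with the listed Gene Length (1281 bp) and Protein Length (426 aa).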
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.690367 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCTCCA TTATTAGTGA CGAGTCCTTC CAACTTGTTC GCGCGACGGC TCCTGTCGTC GCCGAGCATA TTGAGGAGAT TACGGGTACG TTTTATCCCA AAATGCTTGG TCGCCATCCG GAGTTGTACC AATTTTTCAA CGAATCCAAC CAACGCGCGG TCCCCGGTCT CTGCCCCGCC GCTAGCGGGG TAGTAACCAC CCGCCAGTCC AAGACTCTAG GAGATGCCGT AGTGCAGTAT GCTCTCAACA TTGATAAGTT GGAAAACTTG AACGAGGCGG TACTTCGAAT TGCCCACAAG CACTGCGCAT TGGGCGTGAA GGCCGAGCAC TATCAGATTG TCCATGACAA CCTCATGGAA GCGATTGGCG AAGTTTTGGG TAGTGCGGTG ACACCGGAAG TCGCAGCCGC GTGGAGTGAA GCTGTCATGG CTTTAGGGAA GATATTTATC GAGCAAGAGC AGAAATTGTA CAACGAAGCC GAAAAAGTAC AGTGGTCGGG ACCGAAAGAA TTCATTATCA CGGATATTAT TGATGAGACC CCCGTTGTGA AGTCATTCCG TATGAAGAGC AAGGATGGGC AGAAGGTCTG CCCCTTCAAA CCGGGACAGT ACCTTAGCAT TTACGAGCAA CCCAACAACA AGAAATATTT TGCTCCTCGT CACTATACGA TTACTAGCCA GCCAGAAGAT GATTTCTACC AGATTACCAT CAAGAAACTC ATTGACCCAG CTGTTCCGGA TGACCGCACT CACGACGGTA TCCTCAGCCA CTACTTGCAT TCCAAGAACG TCAACGATGT CATCAAGCTT GGTCCCATCT TTGGTCCGGA GGTTTTACTG CAGGGGGAAA AATCCCGCGT TGCTGCTTTC ATCAGTGTGG GCATTGGCAT CACACCAACA ATGGGAATAC TCCCGACTGC CGTCAAGGAA CGTCCTCGTA CTGCCGTCTT CCATGGTGAC GTTAACGGCT CAAATCACGT TTCTCGCGAA GCTTTGGAAG AGTTTGGCAA CGAGCAAAGC CTGTTTTCAT ACTCTTACTT CAATCCCGAT GAAGCTGATA CAAAGCTGCA GCACTATTCG GAAGGTCTCT TAACGGGAAG CAAAATTGTC GATAAGTTGA AGGATGCTGG TGTTAATTTT GCGACAGGGA CAGACTATTT CATCTGTGCT GGCCCCACAG TTGCACCAAT TCTGGTCAAC GAGTTACGTG AATTGGGTGT AGACAAGAAG CTTCTACATT TGGAGTTTTT TGGCCCGTTT GTCTCTCTGA TTGAGGAATA G
|
Protein sequence | MSSIISDESF QLVRATAPVV AEHIEEITGT FYPKMLGRHP ELYQFFNESN QRAVPGLCPA ASGVVTTRQS KTLGDAVVQY ALNIDKLENL NEAVLRIAHK HCALGVKAEH YQIVHDNLME AIGEVLGSAV TPEVAAAWSE AVMALGKIFI EQEQKLYNEA EKVQWSGPKE FIITDIIDET PVVKSFRMKS KDGQKVCPFK PGQYLSIYEQ PNNKKYFAPR HYTITSQPED DFYQITIKKL IDPAVPDDRT HDGILSHYLH SKNVNDVIKL GPIFGPEVLL QGEKSRVAAF ISVGIGITPT MGILPTAVKE RPRTAVFHGD VNGSNHVSRE ALEEFGNEQS LFSYSYFNPD EADTKLQHYS EGLLTGSKIV DKLKDAGVNF ATGTDYFICA GPTVAPILVN ELRELGVDKK LLHLEFFGPF VSLIEE
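The relationship between the two sequences above can be spot-checked by translating the ends of the gene sequence. The sketch below is an illustration only. The Translation table field in this record is blank, so the standard genetic code (NCBI table 1) is an assumption here. The `translate` helper is hypothetical and is not tied to any particular library.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
# Using table 1 is an assumption: the record leaves "Translation table" blank.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon.

    Whitespace is ignored, so the 10-base groups used in the record can be
    pasted in directly. Trailing bases that do not fill a codon are skipped.
    """
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases and last 21 bases of the gene sequence shown above.
print(translate("ATGAGCTCCA TTATTAGTGA CGAGTCCTTC"))  # MSSIISDESF
print(translate("GTCTCTCTGA TTGAGGAATA G"))           # VSLIEE
```

Both fragments match the corresponding ends of the listed protein sequence, and the final codon (TAG) is a stop codon.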