Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_39093 |
Symbol | |
ID | 7194811 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011686 |
Strand | - |
Start bp | 314386 |
End bp | 316141 |
Gene Length | 1756 bp |
Protein Length | 552 aa |
Translation table | |
GC content | 53% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002183201 |
Protein GI | 219125885 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACTACGC CAGAGCCACA AGAATCCCAC GCTGCGGCAA AGAATCGCAC TCATCGCCGG CGTTCCAGCG TTTCGGAAGC GCATCAGATT TATTTGCAGT ATCATCGTGA AGCTCATCAC GATGACGGCA GCGACTCGCG CAAAGCGACG CAGATCTCTA CTGACGGAAG CACCAACAAA GCCCCCAAGC GCCACGACAG CACCGAAAGC CGTGATCACA AGCCGGTCAC TAGCTTTCCA ATGTTCCACC GGGTCCAGAA AACCGGCGTT ATTTACGCGA CCTCGCGGGC CGCAGCCCGC GGCTTTCAAG GAGACGGTGA CCCGTCGTCG GAGTGGGCCA ACATGGGTCA GGGTGCTCCA GAAACAGGTC CTTTGCCAAA CGCCCCGTCG CGGAATTTCA CCATGCACAT CCCGGACGCC GAGCTCGAAT ACGCGCCCGT GACGGGACTC ACGCCGCTGC GCGAAAAGGT AGCCGACTAC TATAACTTTT TGTACCGCCA GGGGAAACAG TCGCAGTATA CGGCAGAGAA TGTGTGCATC GTACCGGGAG GTCGAGCTGG CATTACGCGT ATCATGGTAC GGGTACAACT GTATGGATCG CATTGCGTAC AAGTTGTCTC CAATGTATCG TGTGCGTGTT CTGACCGCAC ATTGCTGTTC TCTATTTCTG CAGGCGGTGC TGGGAACGGT TCAGGTTGGG TATTTCACCC CTGATTACAC AGCCTACGAG CAAGCCTTGG GTCTCTTTTT GCGGGTTTCG CCGAGTCCTT TGCTTCACCG GGATGTGACC GAGGCTTGCA TGTCTCCAGA GGAATTTGAC TTTCAGTGCG GTGGGCGGGG ATGTGGTGCC ATTTTGATGT CCAATCCGGC CAATCCCACT GGACAGTCTA TTGAGGGAGA CGACTTGCGC CGCTATGTGC AGACGGCGCG GGATCACCAG ACGGCCATTA TCATGGACGA GTTCTATTCG TACGTTATTT CTCTCGTTCT GGCTTGCGCA ACCGAAGCTG GTTGTCTACC GGGTACTCAC GAAGATTTTT TGTTCGAAAA TGCCATTAGT CACTATTACT ATGACGGCCC AGATCAGCCA TTGGACGGTA ATACGAATGA TCTGCACAGC TGGCCCAAGA CCGTGAGCAG CGCCGCCTAT GTGGACGATG TCGACGAAGA TCCGATTCTC ATTGTCAATG GATTGACCAA AAACTGGCGT TGCCCAGGTT TTCGAGTTTG CTGGATTGTC GCTCCTAAAC CAATTGTGAA AATGCTAGGA TCGGCCGGGA GCTACTTGGA CGGTGGAGCA AATGCTCCGT TACAGCGATT GGCCTTGCCC CTTATGGAGT TGGCATTCAT TCGTCGGGAC GCAATTGCAC TCCAACAGCA CTTTCGACAG AAACGGGACT TTTTGCTACG CAAACTCGAG GAACTCGGAA TCAAAGTCAA ATTCAAGCCG ACGTCGACTT TCTACGTATG GGCCGATTTA TCAGGTCTGC CGCCGCCGTT GAACGATTGT CTCGTTTTTT TAGAGGAATG TACCAAGCAC AAGTGCATAT GTGTCCCGGG TGTGTTTTTC GATATCAATC CGAGAGGGAT TCGGAACATT CGCATGAGCA AGTGCCTGCA TCACGTACGT TTCAGCTACG GTCCGCCTAT GGAAAATCTT ACCAAGGGAA TGGAGTTGAT TTGCCAAATG ATTCAGTATT GGAAAAAGTG TCCCGAGCCG CCTGACGCGT ACGCGACCGA GTCGTTTGGG GAGTGA
|
Protein sequence | MTTPEPQESH AAAKNRTHRR RSSVSEAHQI YLQYHREAHH DDGSDSRKAT QISTDGSTNK APKRHDSTES RDHKPVTSFP MFHRVQKTGV IYATSRAAAR GFQGDGDPSS EWANMGQGAP ETGPLPNAPS RNFTMHIPDA ELEYAPVTGL TPLREKVADY YNFLYRQGKQ SQYTAENVCI VPGGRAGITR IMAVLGTVQV GYFTPDYTAY EQALGLFLRV SPSPLLHRDV TEACMSPEEF DFQCGGRGCG AILMSNPANP TGQSIEGDDL RRYVQTARDH QTAIIMDEFY SYVISLVLAC ATEAGCLPGT HEDFLFENAI SHYYYDGPDQ PLDGNTNDLH SWPKTVSSAA YVDDVDEDPI LIVNGLTKNW RCPGFRVCWI VAPKPIVKML GSAGSYLDGG ANAPLQRLAL PLMELAFIRR DAIALQQHFR QKRDFLLRKL EELGIKVKFK PTSTFYVWAD LSGLPPPLND CLVFLEECTK HKCICVPGVF FDINPRGIRN IRMSKCLHHV RFSYGPPMEN LTKGMELICQ MIQYWKKCPE PPDAYATESF GE
|
| |