Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_43593 |
Symbol | |
ID | 7197469 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011670 |
Strand | - |
Start bp | 905686 |
End bp | 907173 |
Gene Length | 1488 bp |
Protein Length | 448 aa |
Translation table | |
GC content | 47% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002178029 |
Protein GI | 219112555 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.131498 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAACGAT TCTTTGGAAT GTTCGCTCCG CAGAAGGACG ATAATCTCCA GAAGAAAAAT TCTGACACGA CCAGCATGAA GGACGAGGAA ATGTCACCAT TTGTGACATG CCAAAGGGAC CACGCCTGGC CGAGTCTGTA TACTGAAGTT GATGAAATGC TAGAATCCAG CGTACTAGTC TATGCTTTGG CGGAGCTCCG TAGCCTAGCT CGCCAGGGAA AGCTTACAGT GCAATCGGAA AGGGCTCTAG CATTGCCTAT TACACATTCT GAAGTGCTAA AAGTTGTCCA AGGCAACCAG CATGAACTTG CGGATTCAAA ATTTGGCAAG GAATTCTATA TTGATTTGCT GCAGACTATC AGTGATCGGA ATATGATGGC TGGGTCGCCA CTCGAACACA GCGAAGGGAA AGTGCAAAAT GCCACTGCAG CTACAATCGT TGCATTCGAC GATGAGACTG AGAAAGAGGA GCTTGTCTAC ATGATTGAAG TTGATCACGT ACGTGAACGT GTTACCGTAT GCTTTCGGGG TTCGGTAACT CCTCTGGATT GGGCTACCAA TCTTGAGATG TACATGAAAG AAATCCCGAA TCGAATGAAG GCCAATGCAA GTCAGGTGCC TACTGTTAGA GTTCACAATG GGTTCCACGA CTACCTTTTT GAGCCTTCCA ACAGAGGGGC CAAAGGTCCC AACGGTGAGG ATCTATCAGA ATACCAGGAA ATATTACAGG AGCATGTTCT TCCTGTAATC CACAAGCACC ACGACTACAA GGTACGCAAA CACGGTCGAG GCTACGACGA AGACGTTCGA ACTTGGCGAT CCTCATCCGA CTCTCCTTTC CCCTTGGATA CGTAGGTGTA CGTGACAGGG CATAGTCTAG GAGGTGCCCT GGCAACTCTG TTTGCATTTG AGCTCACTTG CGAGCCCGAA GCGACCGTAC CAAAGCCGGT TACTCTCATC AATTTCGCCT GTCCATACGT TGGTGATTCA TCGTTCCGCC TTGCCCATCA AATGCTCGAG AGTCAGGGTA GACTGCGCCA TCTACGCGTC ACCAATCATA AGGACTTAAT AACTACTTTT CCGAAAGTAG CCTTTCGTTG GAACGTCTTT GATCGACGCG CTCATGTCGG TTCTCTCTTT AAACATGTTG GTATCAATCT TCGCATTTTT GAAGGTTCCA AGACGTTTAA GCTTTGCTAT CCCAAGGTCA CCGACGGGTT CTTTTCAGGA GCATGGGATG GAGCTACACG GGGGTGGAGC CAAGCGATTT TTTCGAATAT TATCTGGAAT CCTCTCAACT ACTGGACTTT ACACACGCTG CGCGAATACA ACAAACGTAT GGAAGTCAAT AATTCCGGTT TGCATGCTTT GTTTTTAAAC GATGTCTACG CACAACCTGA TATCGTGGGA AACTTGGTTC CTCAAAAGTA GCCCATAGGG ATCTTTCGAA TTGTAAAGCA TCAACTTAAT TCATGGAAAC TTTTAACT
|
Protein sequence | MKRFFGMFAP QKDDNLQKKN SDTTSMKDEE MSPFVTCQRD HAWPSLYTEV DEMLESSVLV YALAELRSLA RQGKLTVQSE RALALPITHS EVLKVVQGNQ HELADSKFGK EFYIDLLQTI SDRNMMAGSP LEHSEGKVQN ATAATIVAFD DETEKEELVY MIEVDHVRER VTVCFRGSVT PLDWATNLEM YMKEIPNRMK ANASQVPTVR VHNGFHDYLF EPSNRGAKGP NGEDLSEYQE ILQEHVLPVI HKHHDYKVYV TGHSLGGALA TLFAFELTCE PEATVPKPVT LINFACPYVG DSSFRLAHQM LESQGRLRHL RVTNHKDLIT TFPKVAFRWN VFDRRAHVGS LFKHVGINLR IFEGSKTFKL CYPKVTDGFF SGAWDGATRG WSQAIFSNII WNPLNYWTLH TLREYNKRME VNNSGLHALF LNDVYAQPDI VGNLVPQK
|
| |