Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_47753 |
Symbol | |
ID | 7202919 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011682 |
Strand | + |
Start bp | 733471 |
End bp | 736253 |
Gene Length | 2783 bp |
Protein Length | 700 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181966 |
Protein GI | 219123302 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCTTTT CGAACAACAT CCCACCATCT CCAACGCAAC GGATGCTGCG TGCATCGCAG CCGCGCGTAT TGGAGACAAA TTCGGTAGCA AGCTGGAACT CGACTGCTAG TACCATCGTA TCCTCGAGCA ACACGACGAA TCCAGGATCT TCCAACAATA GTTCGGGAAA CTCTGCGGAC ACTATGCAAA CAGTGCAAGC GCTCTTCGGT GGATCCATTC TATTGCTCCT AATGTGTTAT TTTTCAAAAC CTCGGATACC TGGTGCGGAA TACCACCGAG GGGAAATCTA CCGGGCGCAA GCCCTCGAAC GTCTGCGCCA CCAGCGTCGC GAAACCCGTT TAAAGGATCC GAAACAGCGG GCTGAGGCTA TTGACGCAGC CTTGATTGTT AAACGAATCG TCGCCTCGGA CGAGGTAACC GGACAGCTGA TATTGGGGGA GCCAGACGAA GCTATAGACC TGGACAAACA AGGTTCCAAA ATCAGCTATC ACTCTTTGGA AGAAACGGAG GAAGTGTCTA CGTGTGTTAT TTGTCTCGAT GTTTTCCGGG TGGGAGACTT TGTGGCGTGG GGTAAGTTCG CGGGCTCCTT GGCACCGAGG ATCACAGAAG AGAAGCGGGG TGACGACATC CGGTGTCGGC ACGTCTTTCA TCAGGAATGT ATCGCTCCGT GGTTGCAGAA CCCCAAGCAC GACGATTGTC CGGCGTGTCG GGCGATGATA CTACCGGAAC CACCAGAAGA CACATCGGAC GACGCGCTGG CAGAAGGAGG GGCTGTCGTT CAAGGCGCAG CGGATGATGA TCGCAGTCAA TCTTCTCAAT ACAGTCATCA TACAACCGGT ACTAAATCTG TTTTTGTCAT TGTACACGGA CTGATTTCGA GGGTGCGACG CGCCAGTTCT TCCCTCGTGG GCCAGACGAT TGGATTTTGT AGAATGGAAA ATGATCTTGA TTTACATCAA CCATCGCGCC TTCGTCGGGT CTTTTCGATG GGTGATGGAA GGATTCCGGA CAGGTATACC AATAAGAAGT ACCCCGGACG GTTTTCCGGT AGTGGTTGCA TATCGGCTGA TTCACAATTT CACCAAAAAC TAGCGACGAG CAGCGACAAT CCACATCACT GGTCGTCAAC AATCCAACTT CGTCGAGTCG TTTCGGCCGG TCCCGATTCG CCAGCTCGCC GAAGGCCTGC GCCGTCGTTA CAATGGAGAA CACGTAGCGT TTCGGAAGAT GAGGATGAAA CGGGAAGTGA TCAGGACGCG CTGATACCAC CATTACTTCC GCCACCTCCA GTTGGTGTTC CATTTCGACG AGTTTCTTCC AAGGCTCGAT CAACGACCTC GGTTCTAGAA GGCGGGCAAA ATCGGGAAGC GGACCCCGAT GAGAGCGATT ACAATGTGGA CGTACGAAGC GATTTCTCCC AGGCCATTTT TTGGGATAGC GCGCTTGCAC AGTCAATGCT TGCTCAGGAC GATGGCGCAA TTGTCCAGGG CGGTTCCTTC AGACCTTCAC GTATGCCCTT TCGACGCGTA TTTTCCGGAA CAACGTGCTC GGCCTTACGA AGGAACTCTG CCACGAGTAC AGGGAGGAAC CAATGGGTTA GGCCGTCCGT TTCCTGGCGG GATTTGGCAA CGTCGGCAAG CGAAGGCACC GAAGACGATG AAGAGGAAGC CATAATGCTA GAAGCGGTAT AAAAGATACA AGCATAACAA CAGATCAGTC TTTTTCCTTA GTTTTGGCTG ACTAACCAAG TTTGATCGGG TCGTGTCAGT TTGAATCTTT TTTAGAAGCA TGTCATTCAG GATATTTTTT GTTGCTTGAA GTAAGCAGGT TCTTTTTTTT TTCGGGCAAA GCTAAAGACT GCGTCCGCCC TATTACTTCT TGAGCCAATC AGTACGATGT AGTATGGCAA GGCGCTTATC GACGAGCGTC ATTTAAATGA TATTCGCGTT TGGGTTTGGC TGGCTGTCCT ACAACGTCAT ATATCTACTA GAAAACAAAT GTAACAAACG TTCCTTAAAT TCATCGTCAG CGAGAATCAT CGTCTCCATA AGCATCCTGC GCTCTGCCAA GCGCTTGTTG CTTTGTGGCG CTTTCGCTCA CAAGGTTGCT TCAATGGAAA CTTCCTTCGG TAACATTTGA TTTCTTCTAC TGTAGTCAAG GCAATTTTCC TTGTAGTCGA GAACCAACCG ATTCTTATGT ACAGTTGGTT CTCGTACACT TTCGAATCGG GATTGCCCAC TACAGTGTCA GTATCACCAG GTCTAAAAAT AGTTTCCAAG CGACGCCGGT CGACCGGGGA ACGGCTAGCG CCTCCGACGG GAGCACCGTC TTCAACGCTA GATCTTCCAC GAACAGTCCA TCGACTCCTG TTCACAGGAG GATGTACACG GCGGCAGGAA CTCTGTCAAT AACGCACAGC GGAGTGAATT CTAGCGTTTC GGCAGGTATG TACGCGCTTT GGAGGAAAAC CATTGTATGG CTCACCAGGA CTTCTTGCTC ATTAGCCTTG TTGAAAGACG GTCAAGGCCT CCTGGAGAAG AATCTGCCGT CGAAACCGAC GTCACCTTTG AAAAGGTTCA CATGCACCTC CTCTGGTCCT TAAACAGTAT TTTACCATGT CTATCTACCG CTGGAGAAAA ATTCTGAAGC TTTTGCGATC ATTGAGGAAG AACTCAAATA TATGAGTTTG TACGGCGGCA CCCGGCCAGG ATTTGCGTCG GTGACGGAAC GAGTCAACAA AGCAACATTA TATTTGCAAC ATCATGTACC CTCGCTTCCC TGA
|
Protein sequence | MSFSNNIPPS PTQRMLRASQ PRVLETNSVA SWNSTASTIV SSSNTTNPGS SNNSSGNSAD TMQTVQALFG GSILLLLMCY FSKPRIPGAE YHRGEIYRAQ ALERLRHQRR ETRLKDPKQR AEAIDAALIV KRIVASDEVT GQLILGEPDE AIDLDKQGSK ISYHSLEETE EVSTCVICLD VFRVGDFVAW GKFAGSLAPR ITEEKRGDDI RCRHVFHQEC IAPWLQNPKH DDCPACRAMI LPEPPEDTSD DALAEGGAVV QGAADDDRSQ SSQYSHHTTG TKSVFVIVHG LISRVRRASS SLVGQTIGFC RMENDLDLHQ PSRLRRVFSM GDGRIPDRYT NKKYPGRFSG SGCISADSQF HQKLATSSDN PHHWSSTIQL RRVVSAGPDS PARRRPAPSL QWRTRSVSED EDETGSDQDA LIPPLLPPPP VGVPFRRVSS KARSTTSVLE GGQNREADPD ESDYNVDVRS DFSQAIFWDS ALAQSMLAQD DGAIVQGGSF RPSRMPFRRV FSGTTCSALR RNSATSTGRN QWVRPSVSWR DLATSASEGT EDDEEEAIML EALVLVHFRI GIAHYSVSIT RSKNSFQATP VDRGTASASD GSTVFNARSS TNSPSTPVHR RMYTAAGTLS ITHSGVNSSV SAVFYHVYLP LEKNSEAFAI IEEELKYMSL YGGTRPGFAS VTERVNKATL YLQHHVPSLP
|
| |