Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_13664 |
Symbol | |
ID | 7202095 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011680 |
Strand | + |
Start bp | 120210 |
End bp | 121847 |
Gene Length | 1638 bp |
Protein Length | 545 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181140 |
Protein GI | 219121577 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GACAAAAAAT TCAAGAAGAA ACGCCCTCGC GACGCCCGCA TTGACTCGGC GGCAAAGGTT TGCCTCGCGA TTATTCGTGG CGAATCGTGT CCCTACGGGG ACGGTTGCCG CTTTTCACAC GACATGAAAG AGTTCATGGC GAGCCGTCCC GAAGATATAA ACAATAAGAT CTTGGGCAGC GTCTGTCCAT ACATTGAGCT TTATGGGTAT TGTGTCTATG GCGCCATGTG CCGCCTTGGA GCGTCGCACA TTAACATGTC TACTGGCGAA AACATTCGAA AGGACATTGA TGGTCCGAAG CCCAAGCCAA TCTTAAATAC TCTACCGAAG GAAGTGCAAA TTCAGCTACG CAAAAAGTCT TACCCGTTCA AGCACCAGCG CTACAACGAA CGGGGTGGAA AGAATTCCAG GGAGACCACA GCTCCAAAAG ACACCGAGAA AGTGACGGAT GATGCGCTGG AGCCACCATC AAATACATCC GCTGAAGAAG GGAGAGCTAG TTCGTCGATT GATATGTCCC CCCTTCCGGA TCGTACGCGT AAGCTTATCG ATTTTTCCAA GAAAGTATAT GTTGCTCCAC TTACGACTGT CGGAAATCTG CCTTTTCGGC GAATAATGAA AAAATTCGGC GCTGATATTA CCTGCGGCGA GATGGCCGTT GCAACCAATC TACTCTCGGG TCAAAGCTCC GAGTGGGCGT TGCTTAAGAG GCATCCGGAG GAGGACATTT TCGGTGTGCA GATTGCTGCT GGCTATCCGG ACGTGTTTGC GAGAACCTGT GAGTTGATTG AAAACGAAAT GAACGTCGAC TTTGTGGATT TGAATCTAGG CTGCCCATTG GATATTGTTT GCAACAAAGG ATCTGGCGCT GCACTCATGA TGCGGGACAA ACGCCTTAGA TCGTCGGTTG AAGGTATTTT GGAAACTCTC ACATGTCCTA TTTCTATTAA GATGCGGACT GGTTGGGAGA TGACCAAGCC GTTTGCTCAC AACCTTGTAC AGAAAATTCA AAGCTGGGGA CTCGATGGCA TCAGTGCTGT CATGATACAC GGAAGGTCCC GCCTTCAACG CTATGCTAAG GAAGCAGACT GGGACTACAT CAGTATGGTA GCCAAGAGTC AGGATATGTC GCTAACTACA ATTCCGGTGA TTGGGAACGG CGATATATTT TCGTACCAAG ACTTTGAAGA GAAGATCGCT CGCGAAGGGG TCAACTCCTG CGCGATGCTT GCCAGAGGTG CTCTCATTAA GCCTTGGTTG CCAACCGAGA TTAAAGAGCG TCGGCATTGG GACATATCTG CGTCTGAAAG GCTTGACATT CTAAAAGATT TCGCTCGATT CGGCATGGAG CACTGGGGAA CTGACCAGCA GGGAATCAAC AACTGTCGCC GGTTTCTTCT GGAATGGCTC TCCTTCCTCC ATCGTTACTG CCCGGTTGGT TTATTAGAAG TGTTGCCGCA GCATATGAAT CAACGACCAC CCGCCTACAT TTGCGGTCGC AGCGACTTGG AGACTCTGTT GTTGTCGCCG GATAGCTCCG ATTGGATCAA AATATCTGAG ATGCTCCTCG GTCCCGTTCC CGACGGTTTT CGTTTCGAAC CAAAGCACAA GGCGAAGGGT TTCAAATCGG AAGGATAA
|
Protein sequence | DKKFKKKRPR DARIDSAAKV CLAIIRGESC PYGDGCRFSH DMKEFMASRP EDINNKILGS VCPYIELYGY CVYGAMCRLG ASHINMSTGE NIRKDIDGPK PKPILNTLPK EVQIQLRKKS YPFKHQRYNE RGGKNSRETT APKDTEKVTD DALEPPSNTS AEEGRASSSI DMSPLPDRTR KLIDFSKKVY VAPLTTVGNL PFRRIMKKFG ADITCGEMAV ATNLLSGQSS EWALLKRHPE EDIFGVQIAA GYPDVFARTC ELIENEMNVD FVDLNLGCPL DIVCNKGSGA ALMMRDKRLR SSVEGILETL TCPISIKMRT GWEMTKPFAH NLVQKIQSWG LDGISAVMIH GRSRLQRYAK EADWDYISMV AKSQDMSLTT IPVIGNGDIF SYQDFEEKIA REGVNSCAML ARGALIKPWL PTEIKERRHW DISASERLDI LKDFARFGME HWGTDQQGIN NCRRFLLEWL SFLHRYCPVG LLEVLPQHMN QRPPAYICGR SDLETLLLSP DSSDWIKISE MLLGPVPDGF RFEPKHKAKG FKSEG
|
| |