Gene Information |
Locus tag | PHATRDRAFT_14998 |
Symbol | |
ID | 7203731 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011685 |
Strand | + |
Start bp | 71943 |
End bp | 73889 |
Gene Length | 1947 bp |
Protein Length | 582 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002182764 |
Protein GI | 219124971 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
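The length fields in the table above can be cross-checked with a short calculation. The sketch below assumes that Start bp and End bp are 1-based, inclusive coordinates on the replicon, which the record does not state explicitly. The function names are hypothetical and used only for illustration.

```python
# Values copied from the Gene Information table above.
START_BP = 71943
END_BP = 73889
PROTEIN_LENGTH_AA = 582


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def coding_bases(protein_length_aa: int) -> int:
    """Bases needed to encode the protein (3 per residue, no stop codon)."""
    return protein_length_aa * 3


length = gene_length(START_BP, END_BP)
coding = coding_bases(PROTEIN_LENGTH_AA)

print(length)           # 1947, matches "Gene Length | 1947 bp"
print(coding)           # 1746
print(length - coding)  # 201 bases not accounted for by the protein
```

Under that assumption the coordinates reproduce the listed gene length of 1947 bp. The 201 bp that the 582 aa protein does not account for would be consistent with the record marking the gene as spliced, although the record itself does not list intron positions.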
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GAGATTCGCG AGCAAATGGA GCAAGAAGAG CGATCCCGAG CCATTCAGCA AGCGATGGAA ATGTTGCAAA CGACCGAAAC CAATACGGTC GAATCTATGA TGGAAAAGTC CGCTGACAAT GGCGCGGACG TGCACTTTAC CAATTTGGAC TTGCCCAATT TACGCGGCGG GGGTCAACCT TTGCTGCAAA ACGCTAACAT TACCTTTTCT CGAGGTCGAC GATACGGACT CATGGGACGG AACGGTTGCG GAAAGACCAC ATTGCTGACC TTTATGGCTA GTCGACAAAT GGAAGGAGCC GTTCCGAAGC ACATGAATAT GGTCCTCGTA CGTCAGGAAA TCATGGGCAA CAAATGGACG GCCGTCGAAA CAGTTCTCAA GAGTGATGTC AAACGAGAAT CGGTTAAGCG CTTTATTGCC TACTGTGAAG AAGAATTGGA AAAACTGGAT CAAGGCAACA AGAACCCAAC CATCGAAGAT GCCGACGACC GCGGCACGAA CGATGAAAGC AAAGGCAAGA ATGACGAAAG CAAAGGCCGA CAAAAGCTTC GGGAGCGCAA ACGACAAAAC CTGCAAAAGT CTGCTCGCAA GGCAGCGGAA TCTTCCACCA CAGCACAAAT GCAAGAGTCG AAAGATGCAC AGCGACTCAA GCTCAACGAA AAGCTGGGAT TGGCCTATCA GCGTTTGGCA CAGGTCGAAG AAGAGGAAGG CGGGGATCCG GAACCGCGCG CACGCAAAGT ATTGGCTGGC CTCGGATTTG CAAAAGAAAT GCAAGATAAG CCCACTGATG AACTTTCTGG AGGATGGCGG ATGCGGGTAT CGATTTCGTG TGCGCTTTTC GCAAATCCAT CGTTATTGTT GCTCGACGAA CCGACAAATC ATTTGGATTT GGAAAGTGTT CTCTGGTTGG AGCGATATTT GACAACCACG TTTTCTGGTA CGCTTGTGGT AGTCTCGCAC GATCGGCACT TTTTGAACGA AGTGGTTACG GATGTCGTAC ATTTCCATCG CAGCCAATTG ACCACTTATC GTGGAGATAT ATCCAGCTTT GAAGCAGTAC GGGATGACGA TCGTTTGCGG CAACAACGCC AGCGTGAGCA GCAAGAAGCA AAGCGAGCAC ATCTGCAGAA GTACATTGAT TTACACGCAC AAGCCGGTGA GAATGGTGTC AAGGCTGCTC GTCAACGAAA AAGTAAGATG AAGAAGCTTG ACAAACTTGG AGTCATGGCA CAGGACGGGA AGAAGTGGAA GGCGTCGTAC GATGGCGATG CTGAAGAGGT TGAAGAAGTA CTCGACGACG AAGAAGTCAT ACTGAATTTT CCTGATCCGG GGGCTTTCGA TGGTGACATT GTACGTTTGG AGCAAGTCAA GTTTGGGTAT TCAGCCCAAA ATATTTTACT AGAGACTGTC GATTTGACTG TCAATCTTAA GTCTCGAATT GCTCTACTCG GTCGCAACGG ATGTGGAAAG TCAACCTTGA TCAAGCTGGC GGTTGGGGCA TTACAGTCGA TGCAAGGCAA GGTCGTTATC GATCCCGGTG CCAAAATCGA GTACTTGGCG CAGCATCAAC TGGAGCAACT CGATCCCGAC GGTACTCCTT TGCAAACGAT GGTAGACCGA TATCCTGGAG ATCACAGCAA CACTCATATT GGTGAGCTAC GCCGATATCT TGCAAACTTT GGCCTAGGCG GGGAGATCTT GCCCGTCCAA AAGATTCACA CTATGTCGGG AGGTCAGAAA TGCCGCGTTT GTCTGGCTTG CGCTATGTAC CGCAAACCAC ACTTGCTGAT CCTGGATGAA 
CCGACGAATC ACTTGGATCT CGAAACAACA GCAGCTCTAA TTGACGCCAT CAAAACGTTT CAGGGAGGCG TGCTCTTGGT CAGTCACGAC CAGCACTTGT TGACTTCCGT ATGTGAAGAT TTGCTGGTAG TCGAAAACGG AAGAGTG
|
Protein sequence | EIREQMEQEE RSRAIQQAME MLQTTETNTV ESMMEKSADN GADVHFTNLD LPNLRGGGQP LLQNANITFS RGRRYGLMGR NGCGKTTLLT FMASRQMEGA VPKHMNMVLV RQEIMGNKWT AVETVLKSDV KRESLRERKR QNLQKSARKA AESSTTAQMQ ESKDAQRLKL NEKLGLAYQR LAQVEEEEGG DPEPRARKVL AGLGFAKEMQ DKPTDELSGG WRMRVSISCA LFANPSLLLL DEPTNHLDLE SVLWLERYLT TTFSGTLVVV SHDRHFLNEV VTDVVHFHRS QLTTYRGDIS SFEAVRDDDR LRQQRQREQQ EAKRAHLQKY IDLHAQAGEN GVKAARQRKS KMKKLDKLGV EEVLDDEEVI LNFPDPGAFD GDIVRLEQVK FGYSAQNILL ETVDLTVNLK SRIALLGRNG CGKSTLIKLA VGALQSMQGK VVIDPGAKIE YLAQHQLEQL DPDGTPLQTM VDRYPGDHSN THIGELRRYL ANFGLGGEIL PVQKIHTMSG GQKCRVCLAC AMYRKPHLLI LDEPTNHLDL ETTAALIDAI KTFQGGVLLV SHDQHLLTSV CEDLLVVENG RV
|
| |