Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_45356 |
Symbol | |
ID | 7199986 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011674 |
Strand | + |
Start bp | 934987 |
End bp | 937190 |
Gene Length | 2204 bp |
Protein Length | 479 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002179322 |
Protein GI | 219117055 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.486955 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | AACAGTCCTG CTTTGTTGCT AATCCCTTTG TTGTCTTCCC TGCTCTGTCA TGTCTCTGAT TGAAGAGCAA GAAAAAGCCT TACAGGCCAA ACTAAGTTCT TTAGATGGTA TGTTCCCGTT TTCTTTCGAT GCCTACACAA CGCGTTGTAC TACATCTCAT TCTTTTGTAA TTTGCACTAT GACTACCAGG TCCCGATTGG GATAGCGAGG ATGATGACGG GCTTCAGAAA TCAGCAGAAG AAGATGCTGC GTTAGCGAAA ACGAAAAAGG GCAAAGCCGC CGGTAAACAC GGCAAGAAGG GTCGGAACAA GCCTGGAACA GTCTTGTACC TGGGTCATCT ACCTCCCAAC TTTGAGGAAC AAGAAATTTG GAAGTTCTTA CAGCAATTCG GCAAGGTCCT GCACGTACGT TTGGCGCGTT CGAAAAAGAC TGGCGGTGGG AAAGGATACG GGTTTGTGGA AATGCAACGT CCGGACGTGG CCAACATTGT GGCCGATACA CTCTCGGGAT ACATCTTGTT CGGCCAAAAA CGTTTGGTTT GCCACGTGAT TCCTCCCGAA AAAGTTCATC GTCGACTGTT CTTTGCGCAC GCACCAAGGG CCAAGAAGGA AGTTAGCCGG GTGTTGGCTT TGGAAAAAAT AAAGGATATT ACGACACGGC TCATTTCTCG GGAGCGCAAA AAGAGAGAAA CACTGAAGGA ATTGGGTATC GATTACGATT TTCCTGGCTA TGAGGCCTCG GCCAATCAAG TTACAAGCAA GGAGGCAGAA AAATTGGATG CAATGGCAGC AGAAAGGAAC AAGAATGCTC GCAAAGAGTC TGTTGAATCG ATCGGAAGTG AAGGGGCGAA AAGTACGAAA ACGAATCGCG TGGACTCGGT TGCTTGTGAG AATAGTGAAA GTGCCCAGCT CAAAAGGCGG AAGGAATTAA CGGATGATGT GGATCGTTCG GGATCCAAGT TCGCCAAGAA AACGCGCAAA GACTCGATTG CATCGGTTGA GAGCACGGGG TCGAAGACGA AAAAAGATAG AAGGAGAAAG CATTCGGATG GATCGGTGAA TGACGTAAAC GTCGACATCG TGTCACCACC CGATAAAAGT CCGAAGCGAT CAACACGCAA TAAACCAGCC AAGGCAGTGG CTGAAGCGGA CCCGAAAAAA CTCGTAGTAC AGTCGGAAAA GAAACCAAAG CAGAAATTAA ACAAGCGACG CCGATCAACG TAAATAGGCG ACTTCACACC TTTGGGCCAC CTTTTAAGAT TACTATTTTT CGCTTTCGAT CTACGAATGT GAGCGAGTAA CTCACATAAG AATGCACAGC AGCTCACATA TATTGGTTTA AATCATATCA ATGAGACGGG ATTTGTTGAA TGGATGAAGC AGAGATTTTG TGAGCTGCTT TCTGTTAGGT CATAATTTGT GAGGTGAGCG TCTGTCTACT AGAGTAGCGG CTGAGAATGG AGCCGCAAAG GTGATAGTTC GAAGTCAAAC AATACGGTAT TCGAAAGGTA GCCATGCGTC CAAATTCCGA AGGAGCGTTT CCTTGATCGT CTGGTCCGGA CGTAGGTAAT CTGTCCCATT CCGACACCGA TGTGACGCCG AAAGATACCA ATTCTCTTTG ATCAAGCGAG ATGGCAGAAA GCGAAGCCCA AAGCACGTTT TGACACTCAC CCTCTGGCAG AATCTGTTGA GGATAGAAAG GAACACTCGA CAAGTAATTT AGTGTAAGTC ATCACAATCA CCGTCAGCAC AGCACACGCT CTCTCTACTT AGAATAGATA TCAGGTAGCT CCAATGAAAG CTCTCTCTTG CATCGTGGTA ACCCTCTTTG TCATCGTTGT GGATGCCACG TCCGGTCACC CTTCTTCGGC CTTTGGACGG CGAGGCACTC GGGCCAGTAG TAGTAACAGT TCTGCCGAAT CTCATTGGTC GTCGACAGGA GTGCAGGGTT TGGGCCACGA TGAAAGCCGG TCGGAAAAAA TAAAGCAGGT CAAGAAGAAG ACTGACAAGG GCCTAGGCAA GGAGCAAAGA CACAACAATT CCTCCCAAAG GAACCTCATG CAAAAGGTTC CACAAGCAGG CAGCGTCTTC AGGATGGACC CGCTAGATCC CAACCTTGGC GTCAACTAGA GACACGCGCG TCGCAGGCGG ATCTTTTAAC ATACGATTCA ATATTTTATC AACTGAACAT TTCTCTCCGC CTAC
|
Protein sequence | MSLIEEQEKA LQAKLSSLDG PDWDSEDDDG LQKSAEEDAA LAKTKKGKAA GKHGKKGRNK PGTVLYLGHL PPNFEEQEIW KFLQQFGKVL HVRLARSKKT GGGKGYGFVE MQRPDVANIV ADTLSGYILF GQKRLVCHVI PPEKVHRRLF FAHAPRAKKE VSRVLALEKI KDITTRLISR ERKKRETLKE LGIDYDFPGY EASANQVTSK EAEKLDAMAA ERNKNARKES VESIGSEGAK STKTNRVDSV ACENSESAQL KRRKELTDDV DRSGSKFAKK TRKDSIASVE STGSKTKKDR RRKHSDGSVN DVNVDIVSPP DKSPKRSTRN KPAKAVAEAD PKKLVVQSEK KPKQKLNKRR RSTYQVAPMK ALSCIVVTLF VIVVDATSGH PSSAFGRRGT RASSSNSSAE SHWSSTGVQG LGHDESRSEK IKQVKKKTDK GLGKEQRHNN SSQRNLMQKV PQAGSVFRMD PLDPNLGVN
|
| |