Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_23164 |
Symbol | |
ID | 7195764 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011690 |
Strand | - |
Start bp | 40645 |
End bp | 42396 |
Gene Length | 1752 bp |
Protein Length | 489 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184174 |
Protein GI | 219127921 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.137553 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | AAGATCATTG CTTTCTCACG CCAACTGCTT CCGCCGAAGC CAAGAGTACC AATACCAAAC AACTGAGGGA TCCTTCGTAC CATGTCGGAA GACGAAGCCA TGTGGATTGA CGTTTCCAAG GCTCAAGATG GAGGCGTCAA GAAAAAGATC CTGCAGGAGG CACCGGACGG TGCGACGGGA CCGCCTCCCG ACGGTTACGA AGTCACGGCC CATTACACGG GTACCTTGAC CTCGGACGGG TCCAAGTTCG ACAGTAGCGT GGATCGCGGT AAACCTTTCA ACTTTACAAT TGGACAGGGA CAGGTCATCA AGGGATGGGA CGAGGGCTTT GCGTCCATGA AGGTGGGCGA AAAGGCCATG CTAGAGATTC GCTCGGACTA CGGCTACGGC GACTCGGGCT CACCACCCAA GATCCCCGGA GGGGCGACGC TCAACTTCGA AGTCGAGTTG CTCGGACTCA AAGAAAAACG CAAGGAAAAA TGGGAAATGA GTACGCAGGA AAGATTGGAA GTGGCCAATA AGCTCAAGAC GGAAGGAACA GAGCTCTTTC AACAACAAAA ATTCAAGGAT GCGGTTGCAC TGTACGAAGA CGCGGCATCG TACGCCGTAG ACGAAGGTAT CTCGGGAAAC GACGTCCCAG ACGAAGAAAG ACCTCTGTAT GTGAGTTGTT GGTCCAATGC GGCTTTTTGT TACATCAAGT TGAAAGATTG GCCCGAGGCC ACTCGCTCCT GCAATAATGT TTTGGAAATC GACACAGAAC TAGCGTCAAA TGTAAAAGCA TTGTATCGCC GTGGATTAGC GCGAATGAAG CTAGGTCTCT TGAAGGAAGC TAAGGAAGAT TTGATGGCTG CTTACAAGAT CGACGCAGTG AACAAAGATG TGCGCAAAGC ACTAACTCAG CTGAAGGAAG CGGTCGCGGA GAGCAAAAGG AAGGAAAAAG CTGCCTTTGG AGGATTTTTC AACAAAGTCG ATTTCTACGA TGACAAGAAA GGTCCGCTTA TCCCCAACGC CAAAGGAGAT AACCCGCACG TCTTCTTTCA GATTAAACAA GGGGAAGAAG ATCTTGGTCG CGTTGTGATG CAACTATACA GAGATATTAC TCCCAAGACG TCCGAGAATT TCCGATGCTT GTGTACGGGA GAAAAGGGTG TCGGAAAATC CGATAAGCCA CTGTATTTCA AGGGGTCAAC ATTCCATCGC GTCATTAAAG ATTTTATGAT CCAAGGAGGA GGTGAGTAGC ATACATGCTC TCCGAAACGT GAAAAGTGAA GCTGAATGAG TCCCTCACCC TCTTTTGCAT CGGTCGCTTT TCTCCAGACT TTACTGCTGG AAACGGTACA GGTGGTGGTA AGTTGCTTGT CGGCGCAGCA CAGAAAATTT CCATGGGCGC TTTCTCACTA ACTCGTGCTT CTCACTCGTT GCAACGTGAA CAGAATCAAT TTATGGCGAA AAGTTTGACG ACGAGAACTT TGTCATTAAG CACACATCTG GTGGGCTATT GTCGATGGCA AACTCCGGTC CGGGTACAAA TGGTAGTCAG TTCTTTATCA CGTGTAAGGA GACTCCCCAC CTCGACAATA AGCATGTCGT TTTTGGACAC GTTGTCGAGG GCATGGAAGT CATCCAGAAG ATCGAAAACA CTCCAACTGG AGCCAGCGAT AAGCCAGTCA CGGAGGTTAC CATTGAAGAT TGCGGTGAAA TGCCCGCGGA CTATCGCCCT TAGCCTGTGG CAGATTTAGA ATAGGAAGAG TT
|
Protein sequence | MSEDEAMWID VSKAQDGGVK KKILQEAPDG ATGPPPDGYE VTAHYTGTLT SDGSKFDSSV DRGKPFNFTI GQGQVIKGWD EGFASMKVGE KAMLEIRSDY GYGDSGSPPK IPGGATLNFE VELLGLKEKR KEKWEMSTQE RLEVANKLKT EGTELFQQQK FKDAVALYED AASYAVDEGI SGNDVPDEER PLYVSCWSNA AFCYIKLKDW PEATRSCNNV LEIDTELASN VKALYRRGLA RMKLGLLKEA KEDLMAAYKI DAVNKDVRKA LTQLKEAVAE SKRKEKAAFG GFFNKVDFYD DKKGPLIPNA KGDNPHVFFQ IKQGEEDLGR VVMQLYRDIT PKTSENFRCL CTGEKGVGKS DKPLYFKGST FHRVIKDFMI QGGDFTAGNG TGGESIYGEK FDDENFVIKH TSGGLLSMAN SGPGTNGSQF FITCKETPHL DNKHVVFGHV VEGMEVIQKI ENTPTGASDK PVTEVTIEDC GEMPADYRP
|
| |