Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_35058 |
Symbol | |
ID | 7199996 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011674 |
Strand | + |
Start bp | 976209 |
End bp | 978227 |
Gene Length | 2019 bp |
Protein Length | 672 aa |
Translation table | |
GC content | 54% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002179330 |
Protein GI | 219117071 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCGATAG AATCACTGGC GGAAGAATCG TCTTCCGTGT CAGTGACTGA TTTGCGCATC GTTTGCTTGA TTCCTTCCGC CACGGATATT TGCGTCGCTC TCGGATTGCA GGCTGCCATT GTCGCCGTCA CGCACGAATG TGATACAAAC CTTTGGTCGC GAGCGTCGTC GCGTACGGCA GTCAAGATCA TTACAAGAGA CGGGGTGAAC GGGAACGAGA CGTCGCAGGG TGCCATACAC GATCAAGTTG TGGCGAGCTG TCGGGCAAAA GATGAGGCCG TCGGTGATGG GGACATTCCC GCACTGGCGG ACGTGCCTTC CTTGTATCCC ATTTGGGAGG ACGAATTCCG CGAGGCTCTA CAGCTCAACG ACGTTCACGA CAGTAGTAAG CGCCTTGTGA TTACCCAGGA CCTGTGTGAA GTGTGTGCTC CCTCGTCCGA AACCGTTCGT CGGCTTGTGG GCAAGGATGC CTCACAGCCG CCACCGCCAC ACGAGGTTCA CGTTGTGTCC CTCACGCCAC AATCGCTGTG GGACGTTGCC GCTAATATTC TCACCGTCGG CCATGCCTGC GGGGTCCCAC GACGTGCCAA AATCGTTCAC GATGCGTTTC TGAGTAATTT ACAGACACTG GAAACGACCG TTACCGAAGT TCGTTCCCAT GATGGTGCAC CCAAGCTCTT CCTATTGGAA TGGTTGGATC CACCCTTTGA CGGTGGCCAT TGGATTTTGG ATATGATGCA GTTCGCCGGC GTGCAACCCG CCCAACACAA GCACACCCAG AAATCGACAT CGACGACCTG GGCTCAAGTC CGTCAGGCCG ATGCCGATGT GATTCTGGTC GCCTGTTGCG GTTTTGGTCT GGAACGGAAC GTTCGTGATA CTTTCGGTGC ACGCAACCAG TTGCAACAAC TGCGTGCCGC TCGCAATCGT CGCATCTACG CCACCAACGG TGACCACTAC TTTGCCCGTC CCGGTCCTAA ACTACTGCAT GGTGCAATCA TAATGGCGTT GACAGCTTAC GCGGATCAGC CGGAGGTGGT GCAAGCGATT CAGGCTTTGG ACTTTGTCGA CGCGGAACTG GGTGGATATC AAATGGTCGA TGTTCTGGAC CCTACCATTG TACAAGCAAA CAACGATGTT CCCGACATGG AAGACTTTGA CCGCTTGCAC CGCGAAGCCT GCAGCGCCGG TTCGTTATCC TACCCGGACC CCGTTACGGG CTATAAGGTC TTCACCGAGC TCGCCCACCG TCAACGTGGC AAATGCTGTG GTTCGGGCTG TCGGCACTGC CCGTACAATC ACGAAAACGT CAAGAATAAG GCGGGGAAAA TACAACAGCC GGCCATGCTC ACCGCTGGCG ACCAGACGGG TCCACTGGCA CTGTCCAACG GAAACCTGCA CGTGCTGTTC TTTAGTGGCG GTAAGGATTC CTTCCTGGCT ATTCGGGCAT TGACTCGACA AGCCAAACAG ACTGCCCCGT TTGGGTTAGC CCTGTTAACC ACGTTTGATG CCACGTCGCG TATTATTGCG CATCAGGATA TGCCGATTGA TACCGTTGTG GAACAAGCGA CACATCTGGG TTTGGCATTG ATTGGTGTCC CCATACACCG GGGGAGTGCC GAAGGATACG TGACACGAGT TCGCAAGGGG TTAGAGGTAT TGCAGAGCAG CGTTAAACCC CCAAGCAAGG TCACAACCTT GGCCTTTGGC GATTTGCATT TGGAAAATCT GGTGGAATGG CGAAATTCCC AAATCGGATC GCTCGGCTAT AAATTGCAAT ATCCCGTATT CCAGACCGAG TACGAAATAC TGTGGCAGGA TTTGGAAGCG TCCAAGGTAC CGTGTGTCGT ATCGTCATCA ACGGTTGATC ACATCCGTGT TGGGGATGTC TACAGCCGAG AGTTTGCCCA ACGGTTACCG GAATGCGTCG ATCGCTTCGG TGAGAATGGT GAATTTCACA CAATCGCACA AGTTTGGGAA GTAGACCGCA TCACGGCTTT GGGATTTATA GATAGTTAA
|
Protein sequence | MSIESLAEES SSVSVTDLRI VCLIPSATDI CVALGLQAAI VAVTHECDTN LWSRASSRTA VKIITRDGVN GNETSQGAIH DQVVASCRAK DEAVGDGDIP ALADVPSLYP IWEDEFREAL QLNDVHDSSK RLVITQDLCE VCAPSSETVR RLVGKDASQP PPPHEVHVVS LTPQSLWDVA ANILTVGHAC GVPRRAKIVH DAFLSNLQTL ETTVTEVRSH DGAPKLFLLE WLDPPFDGGH WILDMMQFAG VQPAQHKHTQ KSTSTTWAQV RQADADVILV ACCGFGLERN VRDTFGARNQ LQQLRAARNR RIYATNGDHY FARPGPKLLH GAIIMALTAY ADQPEVVQAI QALDFVDAEL GGYQMVDVLD PTIVQANNDV PDMEDFDRLH REACSAGSLS YPDPVTGYKV FTELAHRQRG KCCGSGCRHC PYNHENVKNK AGKIQQPAML TAGDQTGPLA LSNGNLHVLF FSGGKDSFLA IRALTRQAKQ TAPFGLALLT TFDATSRIIA HQDMPIDTVV EQATHLGLAL IGVPIHRGSA EGYVTRVRKG LEVLQSSVKP PSKVTTLAFG DLHLENLVEW RNSQIGSLGY KLQYPVFQTE YEILWQDLEA SKVPCVVSSS TVDHIRVGDV YSREFAQRLP ECVDRFGENG EFHTIAQVWE VDRITALGFI DS
|
| |