Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_47550 |
Symbol | |
ID | 7202622 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011682 |
Strand | + |
Start bp | 89184 |
End bp | 90542 |
Gene Length | 1359 bp |
Protein Length | 452 aa |
Translation table | |
GC content | 53% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181842 |
Protein GI | 219123044 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 35 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTACGACG CTGTCGTCGT GGGACTCGGA GGTGTTGGTT CGTTTGCTTT GCGAGCCTTA ACTCACAGCG GTACCCGTAC AGATACAGAA TTCGTGAGCA ATGACACCAA CACAAAGAAC CCAAAGGGCA AGACGTACTT GGGCATCGAA CGGTTTGCCC GCTGTCACGA TCGTGGTTCG TCGCATGGAT ACACCCGCAT CTATCGACAA GCGTACTTTG AACACGCCAA TTACGTACCC TGGTTAAAGT TTTCCGTAAA AGCCTTCCGA GAACTTGAAG TCGCCCAAAA TGTGTCGCTC TTGCACGAAT CTGGCTGCAT CGTGATGGAG CCAGCCGTTG CTCAAACGGA AGGAGATCCC TCCCAAATTT CCATGCCACC GTACTGCAAG GCGTCGTACG AATCCGCCCT ACGGCACGAC ATTGATGTGG AGTTTCTTGA CACGACGGCC TTAAAAGCAC GCTTTCCTCA ATTTCTATCC GACCACGATA TGGTGGGACT ATTGGAACCC CAGGGTGCGG GCTTAGTTCG GCCCGAACGC TCCATCGAGG CGGCTTTGCG AGACGCCGCA GAGCACGAGG GTGTGAAAAT ACAAGAGCAT ACCCAGGTCT TGTCTTATCG GCAAAAACAA TACCACGATG ACACCGAGAT TGTCGAAGTC GTCATTCAAC GGGACGGAGA AGACGCATCC GAAACGATTC TGACTAAATC GCTCCTGATA GCCGCCGGTG CGTGGGCCTC GACTTTTGTT CCTTCTTGGA AACCGTACGT TGTTCCCAAA CGGCAATTGC AAGGATGGAT CGATGTATCT CATACCGCGG ATGCGTCATT GTACGACGGG GGTAAACTAC CGGGGTGGAT CCTCGTCACA CCATCGTGGC CGGTACCCAT GTACGGGCCG CCGTGTGATC CGAGCGGCGA CGATCCGGCT CATCGCCATT GGCTCAAAGT TGGCTTGCAC GGAAGAGACA TACCGATCGC GGATCTCTCC CAAAATCCCC GCGAAGCATC GGAAGACGAA ATTCAAGAAG TCCGCGAGGC AGCAACCCAG GTATTTACCC GGGACGTCTG GGCCAAGAAC GACGACCAGA AATTTCCTGA TCTAGCGCAA GTAACACCGT GTATATATAC CATGACCCCC GACACTCACT TCGTTATTGG CTCACCGCCG CTTCTCTCTG ATCGGCTTGG AACGAGCCCT GCTCCAAAAT CGTGCGTCTT TGCGATTGCT GGCTTGTCCG GACACGGATT CAAAATGACT CCGGCCTTGG GACAAATGAT GGCGGATTTT GCTAACGGTG TTGACGTTGA AAGCGTTTGG GGAACGTCTT TTTGTTCACC ATTCCGCTTT GGCATTTAA
|
Protein sequence | MYDAVVVGLG GVGSFALRAL THSGTRTDTE FVSNDTNTKN PKGKTYLGIE RFARCHDRGS SHGYTRIYRQ AYFEHANYVP WLKFSVKAFR ELEVAQNVSL LHESGCIVME PAVAQTEGDP SQISMPPYCK ASYESALRHD IDVEFLDTTA LKARFPQFLS DHDMVGLLEP QGAGLVRPER SIEAALRDAA EHEGVKIQEH TQVLSYRQKQ YHDDTEIVEV VIQRDGEDAS ETILTKSLLI AAGAWASTFV PSWKPYVVPK RQLQGWIDVS HTADASLYDG GKLPGWILVT PSWPVPMYGP PCDPSGDDPA HRHWLKVGLH GRDIPIADLS QNPREASEDE IQEVREAATQ VFTRDVWAKN DDQKFPDLAQ VTPCIYTMTP DTHFVIGSPP LLSDRLGTSP APKSCVFAIA GLSGHGFKMT PALGQMMADF ANGVDVESVW GTSFCSPFRF GI
|
| |