Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_31725 |
Symbol | |
ID | 7196022 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011669 |
Strand | + |
Start bp | 777708 |
End bp | 779691 |
Gene Length | 1984 bp |
Protein Length | 629 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002176653 |
Protein GI | 219109799 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTTCCTT CTAGCACGAA GGCAGACAAT AACGATGTTT CGACTACGGC AATATCTACG CATGTATCCG TAGCGGTGGA ACAAAAAGGG GAAGCAAAGT ACCGTGCTTT AGTGATTGAT TCCGGTCCGA TCATTCGTCT CGCTGGCCTG ACGACTCTGT GGCGTCGGGC GGATTCTTTC TATACGGTGC CGGCCGTACT GCAAGAAATC CGAGACGCCA AAGCTCGTCA GCACTTGAAT ACTCTGCCGT TCGAGCTGAA AACACGTGAA CCGAGCGCGG AAGCAATCCA GGCCGTCGTG GAGTTTTCTC GTCAGACGGG CGATTATCCC AGCTTATCGT CGGTGGATTT GCAAGTGCTC GCGCTGTTGT ACGACCTCGA AAAGGAAGGA TGTGCAGATA TGAGTCACGT GGGGAAAACT CCCAAGCGGA CGGTAGGAGT TGGAAAGATT CAAATATTGG CAAACGATGC CGAAGGTAGC ATTACGGCAG CCCGCCTCGA GCCGCAGGTT GATCATTGTG GAATGGACGA CTACTCGCAG CAGGAAGCCA AAGACCAATC CATACACGAC GATGGAGAAA GTACGGTGGA CGAGTATGAT GATGAAAGTG TGACATCTGA AGACTTAGAA CTGGAAAATG TGTCACAAAC AAATGATCTG GTAGTGCCTG CGAGCGCAAA GCCCCCCAAA ACGTGGGCGG CCCTTGTCAA TCCAATTGCC GCGTCGAAGG AATTACCAAA GAATGACAGT ATTGCCAAAC CCGTCTTTGA TCAGGCTCTA CACATCCCTT TTGGTCGGAT GAGCCTACGC TCGGTGGCCG CAAATAACAA CAAGGAGGAG AATGGACAGT TTAGCGATGC AGATAGCGAC CGTGGCTTTT CGTCGGAGGA CGAGGAAAGC GACGATGACT TTGACTCCAA TCAGGAAATT AGTGACGAGG AATGCGATGT ATATATATTG GATCCGGAGG AAGTGGAGCG AAAGAATAAG ACGTTCGATA CTACAGTGCA AGACGAGCAC TTGTCAGACT TTCCAAGTCT CGCTGCCTCA ATCCGTGTTC CATACGAAGA AGCAAATGAT GATGAAAGAG ATGCACAAAG GCAAATTGTG GAAGACCGAC GGAAGCAAAA GTCACTACAG CCCGTTTCAA ATTCCGGGAA GCTTTACAAC TCATTTCGAA AATATACGAA TTTGATGAAA CCCAAGGCTG CCATCAAGAA GAATACCCCA AAGGTGACAT CAACAGTGTC AACACCCGGC GAGAAGTACG ACGAACTTCG TACTCAATCC ACAATCGATA ATACCCAGTC ACGAATTATT GGCGGCACGG CGTTTGCTGG CCAGGACGCC GATTTTGTCG ACGACGGAGA AGGCTGGATC ACTTCTACAA AGGAAATAAA AACCTTACGT GCAGCTGGGA GTTTGGATCC AATGAAGAAC TTAGGCAAGA GTGGTGAGCT GGTAACTACT GCCATGGGGC CATCGGTCGG ACAACGCGCG GCTTGTACCA CAACTGACTT TGCTATGCAA AATGTAATTT TGCAAATGAA TTTGGAGCTT TTATCTGTGG ACGGTATCAA AGTTCGGAAA TTGAAGTCAT GGGTGACTCG CTGTGGAGCC TGCTACAAAG TCTACACTAG TCATGAGAGT TCTGGTCCAC TGGGAAAGCG ACTGTTTTGT GAGCGATGCG GCAGTGATAT GATCCAACGG ATTGCTGCCA GTGTCGATGG CAAAACAGGA CGCCTGCGGC TCCACTTGTC GAAGCGGTAT CGTCACAACT TACGCGGAAC CAAGTATTCG TTGCCGAAAT CGGGGTCAGG CAACCGGTTC CAGGGAGATC TTTTGCTTCG TGAAGATCAA CTTTTGATGG GAGCATGGAA CCAAAAGGTA AAGATGCGCG GTGGTGGTAA ATCCAGGTCA GACGCACAGT CAATGTTTGG TCGAGACATA GCATCGAACG TTGGCTGCCA CACGAGCGCA GTAA
|
Protein sequence | MVPSSTKADN NDVSTTAIST HVSVAVEQKG EAKYRALVID SGPIIRLAGL TTLWRRADSF YTVPAVLQEI RDAKARQHLN TLPFELKTRE PSAEAIQAVV EFSRQTGDYP SLSSVDLQVL ALLYDLEKEG CADMSHVGKT PKRTVGVGKI QILANDAEGS ITAARLEPQV DHCGMDDYSQ QEAKDQSIHD DGESTVDEYD DESVTSEDLE LENVSQTNDL VVPASAKPPK TWAALVNPIA ASKELPKNDS IAKPVFDQAL HIPFGRMSLR SVAANNNKEE NGQFSDADSD RGFSSEDEES DDDFDSNQEI SDEECDVYIL DPEEVERKNK TFDTTVQDEH LSDFPSLAAS IRVPYEEAND DERDAQRQIV EDRRKQKSLQ PVSNSGKLYN SFRKYTNLMK PKAAIKKNTP KVTSTVSTPG EKYDELRTQS TIDNTQSRII GGTAFAGQDA DFVDDGEGWI TSTKEIKTLR AAGSLDPMKN LGKSGELVTT AMGPSVGQRA ACTTTDFAMQ NVILQMNLEL LSVDGIKVRK LKSWVTRCGA CYKVYTSHES SGPLGKRLFC ERCGSDMIQR IAASVDGKTG RLRLHLSKRY RHNLRGTKYS LPKSGSGNRF QGDLLLREDQ LLMGAWNQK
|
| |