Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_40080 |
Symbol | |
ID | 7195774 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011690 |
Strand | - |
Start bp | 105934 |
End bp | 108242 |
Gene Length | 2309 bp |
Protein Length | 743 aa |
Translation table | |
GC content | 55% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184184 |
Protein GI | 219127942 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGGTTCG ATCAACGGAC GATTGTAGTG TTGCTGGTGT CAACGGCGTA CGCGAGCTGG CCTACTACTG GCGGTGGTGG TACGTGCGGA GTCGAGGCGT CGGCACCATT CAATCCCGGT ACTTACTACG GCAATGCGTA CGCCCAGGGA AACGAACGAA ACGTCAACCG CAATGGCAAC AACGCGTGGC CAGCTCAAAC GTCTCCGCAA GAATCCGCGT ACGACAGTTA CGGGCAATCG TCCACCGCGG AGCCTGTAAG CGACTTGCCA CCGCTACCGG AAGGCTGGAG TGAACACTTG GACCCTGCCT CGGGACAACT ATACTACTAC AACGCCAACG ATGGAACAAC CACTTGGGAC CGCCCCTTGC GTTTGGAAGA CGAGGCCAAA GCGGAAGAGG TTCAACCCCC AGAGACCAAT ACGAGACAAA ATCTGAGTGG AGCGGACGAG TCACGAGACA ACGACAACAG GGCGACACGG ATGGCAGAAT CGGACGTGAA GGAAACACAC GCCGACAAAA TGCTGGACGC AGAGTCTCCG CAGGAATCAG ATCATGAACA TAAGCAGCAC TCATCGGAAG ACGCCTGGCG GAACTCTGCA TCATGGGATT CGCCAGGGGA ACATGATCCT GAATCTTCAC CAAAGGAAGC CCCAGCAGAA CCAGAACGCG GCGCAATACC GGAACTTGAC CAACATGGTT GGGAAAATCA GAACATGGCT ACTGCAGATA ATTCAGCGAC CGAGGATTTC CAGCAAGAAG GACCAGGACG GGATACTGGT CATTTTCCTG CTCAGCGACA GCCGGTGGTT GACAATGAGC AGCGTTTGGA GGGCTTTTCT GAAAGTCAGT TGAATGTAGA TCGGCCCGAC CAAGGACCTC CCACGAGTCA TACGGGTAGC TGGGGAGCAC CGCGCTCTGT CGAGCAACCT CGCAGAGAGC AAGACGTTCA TGAACGGCCG TACGAACCAA GGTCCGATCA CGTGTCGGCG GACACATTTG GACGCCATGT ACCGGATGCC AACCGCCCGC AGCCAGAAAA GCAATTACTA CCGGTACAGC ATCAAGACCA GTATGGCGTC AATCCTCGGA GTTCCGTATT CGGACTTGTG CCGGAACCCC ATAATCCACA GCAATATCAA CGCACTTCGC CCCAGCAATC CATTCATCGT GATCCAGAGC AGAGTGTTCC TATCCAAGAA AGGGAACAAT ATCAGGAACG GCCAGGGACT GCGATGCCAC CGCGGCAGCA AATCATGCAG CAACATCCGT ACGGACAGCA GCGTCCAACT CATCCACAAC AATTGCAGCA ACAGGGACCG CCTTCACAGT CACGGGTACT ACCTCCTCAG TACCAACAGC AGCAACCCCA TCCACAACAA CAGCAGCAGC AGTCTCATCA CGGTCAGTAT GGTGGTCCCT ACGGACAACC TTACGGAGGT CCTTACGGTC AATACGGGCA GCCATCTCAC GGTGTCAATA CTCCTCGAGG GTACGGAACG CAACAACAAT CTCAGCAGCC ACAACGACAG TTGATTTCAG AGGACACGAC ACGTCCTGTC AAGGAAGCTT TGGGCCGGAC TTGGCAGAAC ATTCTAGGCT TGAGTAACCG TACGAAGGAA GCCGTCGATC ATGCACGGGA ATCGGTGGTC ACGGGAGCCA AGGGAGCCAG TCAGTCCCTA AGTACCACTA GTGCGAGTAA GTGAAAGTAT CATGTAAAGC ATGCACGCGA TTGCCAAGAT TGTTCTTAAC GTTGATGTTT GTAATTCCCT CAGGTTGGTG GGGACAAGCA AAGAATACGT TCGGATCCGT GTTTGAAAAC GAAAATGGAC AGCCATCGCA ATATTCTCTC TCTGGACAAT ATGGTGGACA AAGAGAGCAA GTTATTCGTG GTCCGCCACC CGGTTATCCG CCACAACAGG GTGGCCATCC TGCTCACGGG CAGCCGCAGT ATCCTCCGCC TATGCACCAC CAAGGTGGAG GGTATCCGAC GTATCCTGGA CAGCAGTACC CTCCAGGCTA CGGGCCACCG CAGCCTCGGT CGGATCCTTC GTATCCGCAC GATCCTCAAT ATCGTCCACC CGCGCAGAGC CAGCAGCCTC CACTCCAGGC ACAATGGGGC CAGCAACAAC ATTATCCTCC TCAATACCAG CAAGGTGGGC CAGGCGGACC AAGAGGGGGG CCGGGTTCTC CACAACAACG GCAGCCGCAG CAAAACGAAC AACGACCACC GCCACGGCCG CAAGGTGGAC CTAGCCCGGA TACAGACGAT CCATGGCAGC ACCCTGGACT CGGAACAGAC GGCTATTAG
|
Protein sequence | MRFDQRTIVV LLVSTAYASW PTTGGGGTCG VEASAPFNPG TYYGNAYAQG NERNVNRNGN NAWPAQTSPQ ESAYDSYGQS STAEPVSDLP PLPEGWSEHL DPASGQLYYY NANDGTTTWD RPLRLEDEAK AEEVQPPETN TRQNLSGADE SRDNDNRATR MAESDVKETH ADKMLDAESP QESDHEHKQH SSEDAWRNSA SWDSPGEHDP ESSPKEAPAE PERGAIPELD QHGWENQNMA TADNSATEDF QQEGPGRDTG HFPAQRQPVV DNEQRLEGFS ESQLNVDRPD QGPPTSHTGS WGAPRSVEQP RREQDVHERP YEPRSDHVSA DTFGRHVPDA NRPQPEKQLL PVQHQDQYGV NPRSSVFGLV PEPHNPQQYQ RTSPQQSIHR DPEQSVPIQE REQYQERPGT AMPPRQQIMQ QHPYGQQRPT HPQQLQQQGP PSQSRVLPPQ YQQQQPHPQQ QQQQSHHGQY GGPYGQPYGG PYGQYGQPSH GVNTPRGYGT QQQSQQPQRQ LISEDTTRPV KEALGRTWQN ILGLSNRTKE AVDHARESVV TGAKGASQSL STTSASWWGQ AKNTFGSVFE NENGQPSQYS LSGQYGGQRE QVIRGPPPGY PPQQGGHPAH GQPQYPPPMH HQGGGYPTYP GQQYPPGYGP PQPRSDPSYP HDPQYRPPAQ SQQPPLQAQW GQQQHYPPQY QQGGPGGPRG GPGSPQQRQP QQNEQRPPPR PQGGPSPDTD DPWQHPGLGT DGY
|
| |