Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATR_44106 |
Symbol | |
ID | 7203867 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011671 |
Strand | - |
Start bp | 1002025 |
End bp | 1004028 |
Gene Length | 2004 bp |
Protein Length | 577 aa |
Translation table | |
GC content | 47% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002186444 |
Protein GI | 219113721 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.287897 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTCAGACCCG CTAGTTTGTA CGATGCAAAT CAGCTTCTAG AGAGACGATA CATTGCCGGA TTACCAACAC GTACAAAGTG TTAATTGTGT CCATGCCTGA GTGTATCTAG TAGATATAGA TACTTTTGGT AGGAAGCTGC CTGCTGCGAC CGCAGAAACG CCTCACTGTC AATCGATATA TCCACAGAAT CGAATGTCTA GTACATCGTT AACCGCTCGG GAAAATTGTG ACACAAACGA TGTGAACGCT GTAGAGTTTG TAAAAGTGGC CATCGTTGGA GCGGGAGCTT CTGGGCTGCA GTGTGCTCAC ACATTAATTC GAGACTTTGG CTTCGCCCCG TCCGATATTG TAATTTTGGA AGCGCGTGAA AGAGTAGGAG GTCGCCTTTA CACCACGATG GAAACAAGAA GAGGCCTGGA TGGAACTTCG TTGCATTTTG CCATGGATCA CGGTGCGGCA TGGGTACACG GAACTGGCCT CGATTGGGAA GCTCCACTGA GTAAAGAAGA TCGCTCCTTC CCTATGAGGA ATCCCATGAT GGCGCTCTTG GAAAAAGCTA CACCTTCAGG CGAGTCCGTA TATGAAAGGC ATTTGAACCC GATCTTTCTA GGCAATCCTT GGATGCGACC CCAAAGTATA GCGCACGGCG CCAATCAGAT CGTTTTGTAT GTAAATGGAC AAGAGCTCGC TAAAGATTCA CCATTGATCT CGTTAGCGCT TAAACGCCAT TACGCTCTTT TGGACCGTGT TTCGGATGTT GGTAACACCA TGTTTGAACA AGGAGAAGGC ATGGAGACGA CAATTCAAAG CGTGAAAGAA ACAATTTCAA AGATTCAAGA CGAGCCAAAT TTTCGATCGG AACTAGAACG TTTGTCCGAG GATGACATGG AACAGGTACT TGCTTTAACC CCTTTTTATC TGCACATGAT CGAGTGCTGG TACGGAAAGG AGACTTCGGA TTTACAGCTC TGCGAGTTTG TCGATGACAA ACTGAATGAC GATAACGCCG ATGAGACATA CACTGCGGAG GGCGACTTTT ATGGACCACA CTGTACCTTG AAGAAGGGTA TGAGTTCGAT TTTGGAACCT TTACTACGAG ATGGCGTGAA CAAGCGGATA CGATTGAAAG AGGAAGTCAT TAAGATATCC AACGAGACTA ACACCGTCCT TCTAAACACG GTCTTAGGGA CGCAAATCAG GGCGAATGCG TGCGTACTAA CCCTCCCAGC TGGTTGTTTG AAAGAGACTG AAGGTAGGTA CAAATTCTTT GAACCTGCAA TGAGCGCGAG CAAGCTTGAA GCAATCAGTC ACATGAGCAT GGGCAGCTAC AAAAAAGTTT TCTTAACTTT TGATCGTATA TTCTGGCCGA AGGAAGAGGC GTTTCTGGGG ATGATCCGTA AAAGCTCTTT CCAGACGTCA GATGAGCCGC CTGGTAACTG CATGCTTTTC GACAATTTAT GGGCGCGAAA TGATATTCCT TGCATTGAAG CTGTCCTGTC TGGATCTGCC GGAAGCTGGG CCGTCGGAAA AAACGACGAG ATTATTCGAG ACCACGTTCT TTCATTTATG AAGGATGCCA TGGGTATCGC TGACGAAATT TCGTCATATT GTCAAGACTG TCAAGTCACC CGCTGGGAAG AAGACCCTTA TAGTCGAGGC GCGTATTCAT CGATGTCACT TGGAGCGTTG AATCGGCACG TGGAAGAATT GAGAAATCCG GAATGGGAAG GACGCCTCAT ATTCTCTGGG GAAGCTACAG TCACAGAGTT TGCAGGCAGC GTACATGCGG CGCTCTTTAG CGGACGCAAT TCTGCCGAGA AAGTCAACGA ATATTGTACA CTCGTAGAAG CGAAATTATG TTGCTCTCAG CTAGATGATG CGGCTGATAA GATTGGATTC CTAAAGCCGT CGAAACTCAA TTGGTAGCAA CTCGTTTGAA AAGAAAGGTG CCTTTCCAGT ACTTCCATTA GTGATAAATG TATACTAGCT GGCAGGTTTA TTTC
|
Protein sequence | MSSTSLTARE NCDTNDVNAV EFVKVAIVGA GASGLQCAHT LIRDFGFAPS DIVILEARER VGGRLYTTME TRRGLDGTSL HFAMDHGAAW VHGTGLDWEA PLSKEDRSFP MRNPMMALLE KATPSGESVY ERHLNPIFLG NPWMRPQSIA HGANQIVLYV NGQELAKDSP LISLALKRHY ALLDRVSDVG NTMFEQGEGM ETTIQSVKET ISKIQDEPNF RSELERLSED DMEQVLALTP FYLHMIECWY GKETSDLQLC EFVDDKLNDD NADETYTAEG DFYGPHCTLK KGMSSILEPL LRDGVNKRIR LKEEVIKISN ETNTVLLNTV LGTQIRANAC VLTLPAGCLK ETEGRYKFFE PAMSASKLEA ISHMSMGSYK KVFLTFDRIF WPKEEAFLGM IRKSSFQTSD EPPGNCMLFD NLWARNDIPC IEAVLSGSAG SWAVGKNDEI IRDHVLSFMK DAMGIADEIS SYCQDCQVTR WEEDPYSRGA YSSMSLGALN RHVEELRNPE WEGRLIFSGE ATVTEFAGSV HAALFSGRNS AEKVNEYCTL VEAKLCCSQL DDAADKIGFL KPSKLNW
|
| |