Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_41241 |
Symbol | |
ID | 7198977 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011696 |
Strand | + |
Start bp | 283077 |
End bp | 284930 |
Gene Length | 1854 bp |
Protein Length | 617 aa |
Translation table | |
GC content | 45% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002185166 |
Protein GI | 219130006 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.5011 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACAAGGC TGGAGAAACT GCAAAGGACA ATTATTCACT TCGGAAGTCT GAAGACAACA GAAGCGGTGG AAGACGATAT CGTTTGTGGG ATCGAGATCA CGCCGTGTGC GAGTGATAGT CTATCAACAC TTTCTACGAT TCCTATATCA AGCAATGAAT CGCAGGCCAT TAGTTGGTCG GATGCGAGCC AATCGACTTC TGATTCCTTT CCACAAAATA CTTATTTGGT TCATTCTTTC TCTTGCGAAA ACGTAAATTG CCCGTCATGG GATACCAGCA ACTCGTCAAA AGATGTAGTA TTTGACGGTA TCTGTGATGA ATATTACGAA TTTTCGACAA AAAGAAACCA TTCTTTTATT GGACCTTTGA CACCGAAAGA ACACCAGTTT AACACTTGGC AGTTGCGAAA TCGTTCTGAA AACCCCAATT TTTGCCTTGG CCGCGGCTAC GTTTTACCAG CACAATATGA ATGTTCTTCA GATGCATATC TCTCTGAGCA CGGTCTACTT CATTCATTTG CAGAACGTGA CAACTGTCCA GACTTTATTA ATCTGGTAGA GTCAGAACCT GAGAGGCTCC ATGTGTATCT TACACAATCG ATTGCCAGTA CGCTGTGCCA GTTGACGACC AGGGCCTCTG ATACACCAAC TATTTCTAAT GATTCGACAA GACAAAAACT CGCCTTGCAA CCTCAGAATT CGGCATATAC GCAGCAGGAG TCGTTGGTAA TTTCACTGCA AGAGAACACG AATACGATTA TGATACTACT TTCGGGACCA CTAGTGCACC TCTCTTCGGT GATGAACATA TTGGAGGCAG GAAGGATTTC GAGTCGCAAA ATGTCCATTC ATAGTTTTTT CTACCAGCAG AATATCGCCA AAATGTTGAC ATGTGATTCA GTTCTCAGAA GTGCAGTTGC TTTGAGAGTC CAACTGAAAA GGTGGTCCAG GCATATCACC AATAAGGAAA GCAATTCTAA GGAAACCGGC TCTTGTCTCG ACAGTGATGG AACGAAAATT ATCTGCACAC CAAAAATGCT TCCATTGGCT TACGCATATA TCAGCGAAGG CCAAAAAAAT GACGTGAACT TGATTCAAGA CGTGGAGCTG AAAGCCGTGC TGAATATAAC GGATGGGGGA AATAGTGCGT CGGCTTTGTT TCCCACCTTC TCTAGCGATT CGATCCTGGT GTGTGAGTCA GATGATGGGC TCAGTGAAGA ACTGTATGCC CTGGACTCAA TTGGAAGCAG ATTGCGCGAA GAACTTGAGA ATGCAGAGCA AACACTGAAA TATGCAGCTC AGAGCGCTCA TGCCAAAGTA AGCGGTCCCC CAGCTCCGAC GATTCACGAT GATGGTGATC CCTACAGCTA CCACCTTGTT GAAATCAAAC AAAAATTAAA AGCTTTGGAA CGAATCGAAC GTAGCCTTCG GAAAGCAATG TCCCGTGCAG ACAAGATCAT TTGCCAAGTC GCCTCTGACG ATGATGCTTT AGACGATACG ACAGTGTCGA CCGATGTTAG TTCAACTGAG TCGAAGTCCC CAGGGCCTTC AGTCAAAAGC AGTCGATCGG TAAGAATGCC CGGGACAGAT GGCCGTAAGG TTCACTTCGC AGCCGAACAT CAAGAATTTG TTTTCGTCAC CGACACCAGT TATAGTTGGG CAGGGAGTGG AGAAGACGGA CCAGAAGTAG ACGAGTCGAG CGGTTTTCTT GGTAAGCTAG AAGACGTCTA CTATGCCTGT GAAGACATTA TGGACGAAAT GGCGTTCTCC TGCACGAGAT TTTTCGAAAG CAATCGGCCA AGCGCAACAT CATCGACTGT ACCGATTCGG AGAAGTACGA TACACTTTGT ATAA
|
Protein sequence | MTRLEKLQRT IIHFGSLKTT EAVEDDIVCG IEITPCASDS LSTLSTIPIS SNESQAISWS DASQSTSDSF PQNTYLVHSF SCENVNCPSW DTSNSSKDVV FDGICDEYYE FSTKRNHSFI GPLTPKEHQF NTWQLRNRSE NPNFCLGRGY VLPAQYECSS DAYLSEHGLL HSFAERDNCP DFINLVESEP ERLHVYLTQS IASTLCQLTT RASDTPTISN DSTRQKLALQ PQNSAYTQQE SLVISLQENT NTIMILLSGP LVHLSSVMNI LEAGRISSRK MSIHSFFYQQ NIAKMLTCDS VLRSAVALRV QLKRWSRHIT NKESNSKETG SCLDSDGTKI ICTPKMLPLA YAYISEGQKN DVNLIQDVEL KAVLNITDGG NSASALFPTF SSDSILVCES DDGLSEELYA LDSIGSRLRE ELENAEQTLK YAAQSAHAKV SGPPAPTIHD DGDPYSYHLV EIKQKLKALE RIERSLRKAM SRADKIICQV ASDDDALDDT TVSTDVSSTE SKSPGPSVKS SRSVRMPGTD GRKVHFAAEH QEFVFVTDTS YSWAGSGEDG PEVDESSGFL GKLEDVYYAC EDIMDEMAFS CTRFFESNRP SATSSTVPIR RSTIHFV
|
| |