Gene Information |
Locus tag | PHATRDRAFT_36547 |
Symbol | |
ID | 7201703 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011678 |
Strand | + |
Start bp | 722209 |
End bp | 723627 |
Gene Length | 1419 bp |
Protein Length | 313 aa |
Translation table | |
GC content | 46% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002180886 |
Protein GI | 219120289 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
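The coordinate and length fields above can be cross-checked with a small sketch. It assumes the Start bp and End bp values are 1-based and inclusive, which the record does not state but which matches the reported Gene Length. The function names `gene_length` and `min_coding_bp` are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def min_coding_bp(protein_aa: int) -> int:
    """Bases needed to encode the protein plus one stop codon."""
    return 3 * (protein_aa + 1)


# Values copied from the record above.
span = gene_length(722209, 723627)
coding = min_coding_bp(313)

print("genomic span (bp):", span)
print("coding bases incl. stop (bp):", coding)
print("remaining bases (bp):", span - coding)
```

The genomic span is longer than the coding bases required for a 313 aa protein, which is consistent with the "Is gene spliced | Yes" field. Attributing the whole remainder to introns is an assumption, since the record does not list exon boundaries.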
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.0000000358149 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCGTGGT TTAGTAGCCG TCGCGCCAAG CCAGCCAAAG AGAGCGCGTT CAATACCGAC GACAAACCAC AAAATGCTTT TGGAGGATTC GGCGCCGTGG CGCACTTTGA CGACGGTCCC TTGCGAGACC CTCGGATGGC CAACGACTTT CTCGAAAGGG GAACGAAAAA CGAAAATATT CATACGCATT CGGACACCAG TAGCGACGGT GACGACGCTG GTTCCGAGGC AGAGTCGGGT TCTCTCAGCG ACGATTCCTT CGTAGGAGGT CCGCTCATGT ACGAGTCAAC GTGTTCCGAC GATTCTATTA CTGTAGACAC TGCCGATCCA GATTCTCAGG ATTGGAATCT ACGAAAAGCC AATAACTTTC TGCAGGATTT TTACGACAAG GAGGACCTGG AACGGGTGTC GCAAGAACAG CACCGTTTAA GGTCGACTGA CGACGAGAAC GATGCAGAAT CGGCATCGTC TTCCGACCAG TCTTCGGAAG AAGGCGAGTC CGATTCGGAA ACGCTAGAAA AATCCCACTC TCCTCTGTAT AAAACATGTA AAGATGCCGA TATTTGTCAC GAAGACCAGG ATCTGGGTAC GGCCAAAATA GACACTGCTA CTACCTTATC AAGCCACGAT GCAAAAAAAT TTCCGATAGC CCCGGAAATC TTTGACGAGG GATATGGCGC GGAGCATGAT GGCTATGATG TTGCTCCCAA CGAGTCCCAG ATACATGAAT GTTCTGAAAC AGAGACGAGT TCGCAGCATG TAGTTGTCCA CGATTTTTCT ACAGCAAGCG CCTTGAGACC GGCTGTAGCT GAAGCAAGGT CTTTCCGCAA CCTTGACAAC TCATATANNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNGAAAGAAG CGAGGTCGTC TGACTGTNCT GTTTCGCTTG ATGCCGGCCG ACGCGAAGGA TCGTCAGATG AAGAAAATTG ACAGATCTGG CGAAAGAATT GGCCACTTCA CAGTCACTGT CAATTCGAAA CCGAGACTGT ACCCACGCCT ATCAAATTGC CTACAACATT CTTTTGGCCG AGTGGTTTGA TAGCTAGACA GTCAGACAGG CCTTAGAATA GTCCATGGAG CGAAAGTCCC TGTTGAAAGG TTATTCGCTT CCGTTTGCGA AGGTCACCAG TGTGGCGTAT CCGCGGGCAG GCTTGTTTCC TTGTCTTATC TTCAGACTGT AGCTCTAGTG CTCGTCGAAT CACAGGTCAA CCCAACGAAG TTGTGAAGCC ATTTCCATGA AGAACCTAAG ACGAGCCAGG TTAATGGATG CTTCTACAGG ACAAATGCAT GAGTATTGGT CCCGCACGAA AAGGCATAAA GACCAATGA
|
Protein sequence | MSWFSSRRAK PAKESAFNTD DKPQNAFGGF GAVAHFDDGP LRDPRMANDF LERGTKNENI HTHSDTSSDG DDAGSEAESG SLSDDSFVGG PLMYESTCSD DSITVDTADP DSQDWNLRKA NNFLQDFYDK EDLERVSQEQ HRLRSTDDEN DAESASSSDQ SSEEGESDSE TLEKSHSPLY KTCKDADICH EDQDLGTAKI DTATTLSSHD AKKFPIAPEI FDEGYGAEHD GYDVAPNESQ IHECSETETS SQHVVVHDFS TASALRPAVA EARSTQRSCE AISMKNLRRA RLMDASTGQM HEYWSRTKRH KDQ
|
| |
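The gene sequence is printed in space-separated blocks of ten bases and contains a run of N (uncalled) bases. A minimal sketch of how such a field might be cleaned up before analysis follows. The helper names `ungroup` and `gc_fraction` are hypothetical, and excluding N from the GC denominator is an assumption: the record does not say how its GC content value was computed.

```python
def ungroup(seq: str) -> str:
    """Remove the whitespace between blocks and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C among called bases (A, C, G, T); N is ignored."""
    called = [base for base in seq if base in "ACGT"]
    if not called:
        raise ValueError("sequence contains no called bases")
    return sum(base in "GC" for base in called) / len(called)


# Two short fragments copied from the Gene sequence field, not the full gene.
head = ungroup("ATGTCGTGGT TTAGTAGCCG TCGCGCCAAG")
gap = ungroup("TCATATANNN NNNNNNNNNN")

print("head length:", len(head))
print("head GC fraction:", round(gc_fraction(head), 3))
print("uncalled bases in gap fragment:", gap.count("N"))
```

Applying the same two helpers to the full Gene sequence value gives a length and a GC fraction that can be compared with the Gene Length and GC content fields.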