Gene Information |
Locus tag | PHATRDRAFT_39552 |
Symbol | |
ID | 7195365 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011688 |
Strand | - |
Start bp | 109217 |
End bp | 110554 |
Gene Length | 1338 bp |
Protein Length | 445 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002183676 |
Protein GI | 219126881 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
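The coordinate and length fields in the table above can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the original record. It assumes that the start and end coordinates are 1-based and inclusive, and that the reported gene length includes the terminal stop codon; neither convention is stated in the record itself.

```python
# Values copied from the Gene Information table.
start_bp = 109217
end_bp = 110554
reported_gene_length = 1338    # bp
reported_protein_length = 445  # aa

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the unspliced CDS ends with a stop codon, which is not counted
# in the protein length.
codon_count, remainder = divmod(gene_length, 3)
protein_length = codon_count - 1

print(gene_length, remainder, protein_length)
```

Under these assumptions the computed span matches the reported 1338 bp, the length is a whole number of codons, and the protein length matches the reported 445 aa.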
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCAAGCT TACAGGAATT TGACCGAAGC ACCTTGCTGC GACAACCAAG TTCCTCAGGC TGTCGGAGGA GGACCCCTTC GACTTGGCTG ATGTTCACAT TTACAGTTGG TAGTCTGTTT CTGAGTCTAC TCACGGAGAA TTCGCTGGCG GAGGCCGCTT TGGTGAATGG CAACCCGCTG ATACGAGGGA ACCCCGTCGA GGGTACAACA GCACGCCAAC GTCGTCAACA AAATGACCAG AACGGGGACA ACCAAAACAT CGGTGTGGAG CATACTCCTG TTGTCTCCGT GCTAGAGGAA ATTTCTGCCG TGTCGCTGAC GTTTTCAGTG GCCTTTCCAG ACGACGAATA TTCGGCTGTC ACACTCGAAA CGCGGGGAAA TGCTGCTCAG CAAGGTGTAC TGGAGGCTGT CGTGCAAGTC TTGTGTGAAT CGGTTCGCGT TGTCCACGGA AGTGCAAACG TTTGCGATCC TGAACCCCAA GCACGGTTTT TTCAGTCCAA CTATGACGTT TCCAGTCTAG ACGCCGATCC GAACGTTATT GTGACGACAG TGAGTAGCGC GAACATGTCC TGGAGTACTT GGGATGTTAC TTACACCGTC CAAGACTTGG GCCAAGATAT GTTCAATATG ATACCACCAG AGAAAAGCGA CAATGCTTTT CTGGTCGCCC TGGAAATGTT ACAGGAAGGT ATCCAACTCG CTTTGGACAT TTCCATTATG GAAGAAGCAT TCAATATGCT TCTGCGAGAT TCGGTCGAAA CACGGGCCGT CGCTAGCCCG ATTGGTCAAG AAGAGACCAT CTTCGCCTCT CTCCCGGAAT CCTCCTCTAT TTCGATCAGC GACTTTACTC CTCGCATGTG GAATGGACTG CGATTTGGTG GAGTTGCCAT TCTAAGCCTG ACTTTCCTTG GTTACGCTGC GTTAGTGTAC CTAAGTGCGA ACCGACGGAA AAATATCCTG TTGCAACTAG CTCATCAGCG AAAAATGGAT CATATCGTGC TGCAAACACC AGACGGAGTC GACGCGCTGT TGGCACAGTC GGCTCAATTT TCTATGCCAC CTCATCTGCA GAGTCAACCG GCAATAGGGG ATTCCAACGA TTCGGAAGGA GAAGACGACG AAGAGGAGGA CAACGACTCC TTTCGGCTAC CAAGCCATCT GCGTCTGCCG AGTCCACCTC CAACGCCATC TCGCTCGTCT CCACGAGACG TCGGATCAAA AGTAGTGTAT TCTCAGCGAA CTTTGACTCC ACCGTCAATT GCAGCGCTTT CGTCGCCAGC TCTGGGTTGG ACACATTTCG CAAACAACAT TTTTGAAAGC GATATTGATG AGCCATGA
|
Protein sequence | MASLQEFDRS TLLRQPSSSG CRRRTPSTWL MFTFTVGSLF LSLLTENSLA EAALVNGNPL IRGNPVEGTT ARQRRQQNDQ NGDNQNIGVE HTPVVSVLEE ISAVSLTFSV AFPDDEYSAV TLETRGNAAQ QGVLEAVVQV LCESVRVVHG SANVCDPEPQ ARFFQSNYDV SSLDADPNVI VTTVSSANMS WSTWDVTYTV QDLGQDMFNM IPPEKSDNAF LVALEMLQEG IQLALDISIM EEAFNMLLRD SVETRAVASP IGQEETIFAS LPESSSISIS DFTPRMWNGL RFGGVAILSL TFLGYAALVY LSANRRKNIL LQLAHQRKMD HIVLQTPDGV DALLAQSAQF SMPPHLQSQP AIGDSNDSEG EDDEEEDNDS FRLPSHLRLP SPPPTPSRSS PRDVGSKVVY SQRTLTPPSI AALSSPALGW THFANNIFES DIDEP
|
| |
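The gene and protein sequences above can be spot-checked against each other by translating the first and last blocks of the gene sequence. This is a minimal sketch under one loudly stated assumption: the record leaves the Translation table field blank, so the standard genetic code is assumed here. The names `CODON_TABLE` and `translate` are illustrative and do not come from the record.

```python
from itertools import product

# Assumption: standard genetic code (the record's "Translation table" field is empty).
# Codons are enumerated in TCAG order, with the third base varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 0; whitespace is ignored, '*' marks a stop."""
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3  # drop any trailing partial codon
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First 30 bases and last 18 bases of the gene sequence, copied from the record.
first_block = "ATGGCAAGCT TACAGGAATT TGACCGAAGC"
last_block = "GATATTGATG AGCCATGA"

print(translate(first_block))  # MASLQEFDRS
print(translate(last_block))   # DIDEP*
```

The two results agree with the start (MASLQEFDRS) and end (DIDEP) of the listed protein sequence, which suggests that the gene sequence is given in coding orientation even though the Strand field is "-".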