Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_42552 |
Symbol | |
ID | 7196258 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011669 |
Strand | - |
Start bp | 395498 |
End bp | 397188 |
Gene Length | 1691 bp |
Protein Length | 526 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002177082 |
Protein GI | 219110661 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CAATCGCTCC ATGGACCAGC AAGTTATCCA AGAAGTATGA ATGGTTGTAA TGGAGGTTAT CCTACCTTAT CTCCGTACGT TGGCAACGAT TTCGGGGCAA CGCCTCGAAC CACGCACATG AATAGGTCAC CTCCACTTCA TCATGTTGTG CACACCGGCG GGTCTTCACC GGACCACTTG ATCCCATATT GGAAGCCCAC ACATAACCAG TATATTTCTT CCCCTGTACC ACGTCCTTTA AAAGAGTCGT CTCCAATAAC CAACATTCAA TCCGAAAGTG AAAGAGACTT CTCCTCGCCT CGGACTCTCC CTTCCTATTG TTCGAGGTCT TTCGATAGCC CCCCGGATCC TAACACGACG GAACGCTTGA TGTCATCTAC CATGCAAAAG CGCGTCTCGC CCAAGGAAGC CCGAAACAAT GTCCCCGAGA CGAGTGTACC CTTGAAAAAG CGAAAGACTG TGATGCAGAT GCATCGCAAC CCCGTGGTAT CACCTTTTCA CGCCAGTCCA ATATCTCATA GTAGTAAGGC GGTCGCCTCT ATCTCACTGT CTTCTAATCG AATCCCATCA TACGACAGTC GGGTATCATC ATACGACTCT AAAGACGGAC AAGCCCTTCT AGAAAGTTCC AAGATCGTGG ATCCGACGGA CATTAAGGCG GAGGAACCTG GAATGAAAGA CTTTCCTAAT GTCTTACATA GCGTACTTTC AGATTCCGAA TTTGCCGGAA AAGTTGTGCA ATGGCTGCCG CATGGGAAGG CTTGGAGGAT TGTACGATTT GACGCTCTCA GGAAGCTGGT GCTGCCGAAG TTTTTCGCCA ACCTTCGCCA ACCTAACAAT AACGAAACAA CCGGTTCCAT CGACACTTTC CTCAAGTATC TTTCTTCCTG GGGATTCGAG GAAGTTACTG ATGGCCCTGA CGTTGGTGCA TACACTAATG TGGTAAGTAT CTAATTTCAA GTTTTCAAAT GGGAGAGATA ACGTGGACCC TCATACCAAC TCCATTCTTG TCGCAGCTCT TCCGACGTGG CCTTCGTCGG CTTTGCTCCG AAATGAAGTT CAAACCTTGG GGAAAGGAAG ACTCAATACA GATAATTGAA TCATCGAAAC AGCCTCAATC GATTCTTCGT GTGCCTTCGC TTGCGTCAAC GGTGGATACG TTGGAATGCA CCTCAAGCAA AGAGATGGAT GGTCAGTTTG TTCAAATGAA CCCTTCGGGT AGCTCAGAGC GTCCGGAGTC GTGGCGCACC AACCAGTGGG AAAGATCTCC TGATAATCGA TTTCTCCGAC AGACACCACC GTCTGAAGCA TGGCCATGCA ACTATCAATC TGCCGTTTCG AACAGCTTTA AAGGGTCGGA ACAATCACGG AATATTCAAT ATTCTCCAGT GCGTATACGT TCCTCTCGTG GAGCGCCTCG CACTTTGAGC AGAACGAAAG CACCCGCTCA GTATCAGAAC CACCCACAGC TTCAAAAGCG TCCATGTGCC TTCCCAGTAT CTAATCGTGG CCGTGGAAAG GTATGGAGCC CTCGTCCATT CTCTCCGTCC GTTCACCCGT CGACATCTCC AGTTTCAATT GAGACAAATG TTATGTCAAT CGGTCAATCA CGCTCGACTC TCAACCGTAA GGAGCCACTG GCGTCCTCAC CGGAAGAAAC GATACGAGGA GAAAACGTCA CTGCTGTGTA G
|
Protein sequence | MNGCNGGYPT LSPYVGNDFG ATPRTTHMNR SPPLHHVVHT GGSSPDHLIP YWKPTHNQYI SSPVPRPLKE SSPITNIQSE SERDFSSPRT LPSYCSRSFD SPPDPNTTER LMSSTMQKRV SPKEARNNVP ETSVPLKKRK TVMQMHRNPV VSPFHASPIS HSSKAVASIS LSSNRIPSYD SRVSSYDSKD GQALLESSKI VDPTDIKAEE PGMKDFPNVL HSVLSDSEFA GKVVQWLPHG KAWRIVRFDA LRKLVLPKFF ANLRQPNNNE TTGSIDTFLK YLSSWGFEEV TDGPDVGAYT NVLFRRGLRR LCSEMKFKPW GKEDSIQIIE SSKQPQSILR VPSLASTVDT LECTSSKEMD GQFVQMNPSG SSERPESWRT NQWERSPDNR FLRQTPPSEA WPCNYQSAVS NSFKGSEQSR NIQYSPVRIR SSRGAPRTLS RTKAPAQYQN HPQLQKRPCA FPVSNRGRGK VWSPRPFSPS VHPSTSPVSI ETNVMSIGQS RSTLNRKEPL ASSPEETIRG ENVTAV
|
| |