Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_49514 |
Symbol | |
ID | 7195736 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011690 |
Strand | - |
Start bp | 475071 |
End bp | 476693 |
Gene Length | 1623 bp |
Protein Length | 540 aa |
Translation table | |
GC content | 56% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184256 |
Protein GI | 219128092 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCGCCCC AAGAAACCAC TGAAGCACCC GAGACCCCAC CTTGCCCGCT TTCTCCTCCC GTTCCTTGCC AGCCTCACGA CACCAGCGAC GACTGTCTTG ACGACGATGA TCCTGTAACC GATCTTGACG ATATCGGCAA CGCCACTGTA AATAGTATCA CACGGGAACC GCCTTCGCGT AACGTCACGT TGACGCTAGC TTATACGGCC TGGGCCTTTG CGGGGCGTTC CATCTGGCAA CAAAGCGTTC TGGCAACCTG GGTCTTTTTG CTGCAGGGAA AACTGCAAGA CGTCGGTTAC GTTACCGCCA TTATGGGGAT AAGTCAACTC GTCGTCTCGA TACCTGCCGG TTTCTTGGCC GATACCTACC GTCGTGATAC TCTGCTGCGG ACCGCTTCGG CAGTGGGCCT ACTCGCCATC GCAACGACTC TGTTGGCCTG TCACCAGCGC ACCTTTCACT GCCTTGTCAT AGCCCTGGCC GTGTGGGGAT GTTGCTGGGG AATCGCCAAT ACCGCGCTCA GCGCACTCTT TGCCGATTCG ATTCGGGACG GGGAACGATC CTACTACTTT ACCCAGCGCT CCGTACTCAT CACCCTGGGG AATACAACTG GTCCGATTGT GGCACTCGTA CTGTTTAAGC TACTGGGAGA CCACTGGACC ATCCAAGACT GCGCTGCCGT CATGGCCGTT GGACAAATTG TATGCTTTCC CGCCATCGTC TTGCTCTGCT TTCTTAGCGA TGACTACATC CCGACGGCAT CCGACGCGGT GGATCCGCCC GACGAAACAG AGCCAGCTCG CGACGCATTT CTCCCCGCCG GGACTCCTGC CAATCCCGTC GATACGAACC AGGCTCTCAC ACAGCCTCTT TTGGCCGTAG ACACCGTGCA CACTCCCGTG CCTTACGTCT ACGGCTTTCT GCCACCCACC AGGGCTGTTC CCATTCTGGT AGCGCTGGCC GACATTCTCT CCGGTCTGGG ATCGGGCATG TCCATTCGAT ACTTTCCCAT CTTTTTCGTC CAGAATCTCG GCCTCGGACC CGTCCACGTA CAATTGCTTT ATATAACGGC ACCTTTGCTA CAGGCCAACC TGATGCGACT GGCACAAACA CTGGCGACCC AATTCGGGCG ATGCCGTGTA AGCGTCGCGT TCAAATGCGT AGGCGTGGCC TTTATGTTCC TCATGATTGC CTCGTACCAC TGGCATCTAC CGACCTTTTT GGTTTGTACT CTATATATTT TGCGAACCAG TTGTATGAAC GCCACCAGTG CCTTGACCCG CAGTATGCTC ATGGACAACG TCCCCCCACA CGAGCGCGGC AAGTGGAGTG CCCTGGAATC CGTCAACATG TTCAGTTGGA GTGGATCGGC CTTTGTGGGC GGCATGGTGA TTGGCTGGGG AGGGATACTG CCGTTGTTCA TTGTCACAGC TTGTGGACAG CTACTAGCTA CTCTCCCACT CGTGGCACTG TTTGGATACG ACGTGGCCTC GCCCGAAACG AATTACAACA ACAACGGTGT GCATTGGTTG GGTTGGATGT TCTCGGCGCG TTCCGTAAGC TACATACCCG ACAGTGACGA CGACCAGAGT TCATACGGCT CTCGTCAGGG GGTAGTAGTG TAA
|
Protein sequence | MPPQETTEAP ETPPCPLSPP VPCQPHDTSD DCLDDDDPVT DLDDIGNATV NSITREPPSR NVTLTLAYTA WAFAGRSIWQ QSVLATWVFL LQGKLQDVGY VTAIMGISQL VVSIPAGFLA DTYRRDTLLR TASAVGLLAI ATTLLACHQR TFHCLVIALA VWGCCWGIAN TALSALFADS IRDGERSYYF TQRSVLITLG NTTGPIVALV LFKLLGDHWT IQDCAAVMAV GQIVCFPAIV LLCFLSDDYI PTASDAVDPP DETEPARDAF LPAGTPANPV DTNQALTQPL LAVDTVHTPV PYVYGFLPPT RAVPILVALA DILSGLGSGM SIRYFPIFFV QNLGLGPVHV QLLYITAPLL QANLMRLAQT LATQFGRCRV SVAFKCVGVA FMFLMIASYH WHLPTFLVCT LYILRTSCMN ATSALTRSML MDNVPPHERG KWSALESVNM FSWSGSAFVG GMVIGWGGIL PLFIVTACGQ LLATLPLVAL FGYDVASPET NYNNNGVHWL GWMFSARSVS YIPDSDDDQS SYGSRQGVVV
|
| |