Gene Information |
Locus tag | PHATRDRAFT_42927 |
Symbol | |
ID | 7196184 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011669 |
Strand | - |
Start bp | 1538671 |
End bp | 1539925 |
Gene Length | 1255 bp |
Protein Length | 383 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002177316 |
Protein GI | 219111129 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
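The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only. It assumes that Start bp and End bp are 1-based and inclusive, which matches the reported gene length, and that the gene span includes the stop codon. The function names `span_length` and `coding_length` are hypothetical helpers, not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1538671
END_BP = 1539925
PROTEIN_LENGTH_AA = 383


def span_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive coordinate range (assumed convention)."""
    return end - start + 1


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode a protein, optionally counting a stop codon."""
    return 3 * (protein_aa + (1 if include_stop else 0))


gene_length = span_length(START_BP, END_BP)
cds_length = coding_length(PROTEIN_LENGTH_AA)
# Bases in the gene span that are not accounted for by coding sequence.
non_coding = gene_length - cds_length

print("gene length (bp):", gene_length)
print("coding length incl. stop (nt):", cds_length)
print("non-coding remainder (nt):", non_coding)
```

Under these assumptions, the gene span is longer than the coding length implied by the protein. That is what one would expect for a record marked "Is gene spliced: Yes", since the difference would correspond to intronic sequence.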
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.00363535 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTGATGA ACACGAAACT AGCACCAAAT GTTGTCCACT GCATGCTTTT TGGCATATCA AGCTTTTCCC TGATAACGAC TACGAAAGGA TGGCAAGGAG GATTTTCCAT TCAAAGGCCA CGGAAATTCT TGATCGTCCC CAAGACGTTG CCGCAGAAGA CCAGTCCTCG GTTTCTCCAA ACCGATGTAG TGGTGCAGCA AAGCGCTCGA GACATCGAAG ATCCCATATC AAATATACAT AATGGAAAGC GCCTATTGCG TTCGGAAAAA ATTATGTCCA TGTTTACGAA GAATGGAAAA TCCGAAAGAG AAGCAGTGGT TCTGCAACAA CTACGACAAG AGAATGACCT GCTGCGGGTT GCTCTACAAC GAGCCGAAGC CGAGAACGAG CGGCTGCATA GACACTACGA TAATGGAAAT CGTATTATTT TGGAAAGCTT TGAAGGAGAA GGAAGATTTC GAAGAGCCGA TGACGGAGTA ATGTCGGACA TTCCTATGAC ACTCACAGGA GAAGAAATGC TCACGGAAGA AGCTTCACAG TGGTGTGATG AATTGGAAGA TGATGCCTGT CCGCTGGAAC CGACCATTTC GTTTGGAGAG GCACTACGAG ATCGAGCTTA CTGGTTGGTG GGGCTTTTGA TCATGCAATC ATGCAGCGGC ATTATTCTGG CACGGAATGA GGTTCTACTG GCCAATCACC CTGTCAGTGA GTAATTTATG TTTGCTGCTG ATTTGATCAA CAGACCCCTC ATTGTCGTGC GAGCTGTTCA TTTATTCTGT CCTAACAGTA GAGTGTTTTC ATTTTTTAGT TATATACTTC TTAACCATGC TGGTGGGTGC CGGCGGAAAC GCCGGCAACC AAGCCTCGGT CCGAGTGATA CGGGGGCTTG CTCTCGGTAC ACTGAACGAA AAGACACAGG GCCAGTTTTT GTCACGAGAA CTCAAAATGG CGTGCGCACT TAGTGCTATT CTCTCGGTAA CTGGGTTTGT CCGAGCCATT GCGTTTCGGA CGCCCTTCTC CGAAGCGATC GCCGTTACAA GCGCGTTAGC ATTGATTGTT TTCTCCAGTG TGTGTCTAGG GGCAATTCTT CCACTGGGAC TGAAAAGGTT AGGCGTCGAT CCCGCGCACA GCTCCACGAC TATTCAAGTT ATCATGGACA TTCTCGGCGT CGTCATTGCT GTAGCTGTTT CCAGCATTTT GCTCGACAGT CCGCTAGGGA TTCTTCTCAT TTCTAGACTT GGTGGGGGTT CCTGA
|
Protein sequence | MVMNTKLAPN VVHCMLFGIS SFSLITTTKG WQGGFSIQRP RKFLIVPKTL PQKTSPRFLQ TDVVVQQSAR DIEDPISNIH NGKRLLRSEK IMSMFTKNGK SEREAVVLQQ LRQENDLLRV ALQRAEAENE RLHRHYDNGN RIILESFEGE GRFRRADDGV MSDIPMTLTG EEMLTEEASQ WCDELEDDAC PLEPTISFGE ALRDRAYWLV GLLIMQSCSG IILARNEVLL ANHPVIIYFL TMLVGAGGNA GNQASVRVIR GLALGTLNEK TQGQFLSREL KMACALSAIL SVTGFVRAIA FRTPFSEAIA VTSALALIVF SSVCLGAILP LGLKRLGVDP AHSSTTIQVI MDILGVVIAV AVSSILLDSP LGILLISRLG GGS
|
| |
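The sequences above are displayed in space-separated blocks of ten characters, so they need to be joined before any computation. The following is a minimal sketch of that cleanup plus a GC percentage calculation. The helper names `clean_sequence` and `gc_percent` are hypothetical, and how the GC content field of this record was computed by the source database is not stated here, so a result from this sketch may be rounded differently.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for display grouping."""
    return "".join(text.split()).upper()


def gc_percent(sequence: str) -> float:
    """Percentage of G and C characters in a sequence."""
    seq = clean_sequence(sequence)
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / len(seq)


# The first two display blocks of the gene sequence, used as a short example.
example = "ATGGTGATGA ACACGAAACT"
print(len(clean_sequence(example)))
print(round(gc_percent(example), 1))
```

To apply this to the full record, paste the complete Gene sequence value into `gc_percent` as a single string.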