Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_39772 |
Symbol | |
ID | 7195206 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011688 |
Strand | + |
Start bp | 670083 |
End bp | 671381 |
Gene Length | 1299 bp |
Protein Length | 432 aa |
Translation table | |
GC content | 58% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002183652 |
Protein GI | 219126831 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.140327 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGCAGCC TCGACGGTCG CTACGGTAGA CGTATCTTTG GGAGTCTTTG GCCACATCAA CCCCAAGCGT ACGCTTCCGT TACTACGGAG ACGATCAACT CGAAGAAGAG GGAAGATTTT ACAGCCAGAA CCCGTTGGGC GGTACAAATC TTCTCACGAG ACCAGCGATT GGCCCATCTT GGAAGCTCGT CCTTGCCGCG ATTGTTCACC GTGCGAAACT TCACCACAAC GTCCTCCTCC ACGCCGAAGC CGCCCCCGCA AAACAGCGAC GACAATCAAA CGGACAACCC GACAAACGAG CCCGACACCA CGTGGGATCT TACCACCAAG AGTCTTGGCC AAGTCATTTT TTTGAATTCC TCACACTCTG GAACTATCTT GCTGGCGAGT CTGGCCTTGG GCGATCCATA CTTGGCTTTT TTGGCAGGAG TCGGCACCGT CACAGCCAAC GCCACGGCCC GGAATGTCTT GCAACTCGAC GCACAAGTCT ACGGTAACGG CTTGTGGGCC TACAACGGTG CCTTGGTCGG ATGTGCCACG GCCGTCTTTG TCGCGCCCCT CGCCGACAGC GCCAACACAG TGGCAATGTT GAGTCCCGTA CTGACGGGCC TCGCGATTAC CACCACCGGA GCGGCGGTGT CAACGGCTGT CACGGCTAGC CTGTCCCGCA CCATCACGTC CATGCCCCAG TGGACTTGGG CCTTTAACGT TGTGGCTTTA AGTATTTTGT TGCGCACGCA GCCCTTGCGT CCAACATTCG ACCCCAACAG CGTTGGCGCG ACGGAAATGG TCGCCCCCTC GACAACGTCT CTGCTCGAGC TCGCCACGGG GCCACTGAAG GGTGTGTCGC AAATTTTTGT CGTCGAATCG GCATGGACAG GAATCGGAAT ACTCGCTGCC ATTCACGCCT ACTCCCCACT CCTGGCGGGA CACGCTCTTC TCGGCAGTAC CGTGGGCATG CTCACGGGAA CAGTCTGTTG CAGTGACGCA GCAACGGCGG AACTCGCCGC CGGCTTGTTT GGTTTCAACG CCGCCCTAAC GAGTCTGGGA GTTGGAGTCT TTTTTCGTAA CAACACGGCG GCTTGGGTCT TGTCCGGTAC GGGAGCGGTC GCCACCACGG TTCTGTTTGG GGCCCTACAA AATGTGCTCG GTGCCTTGCA CAGCCCCTGT CTGACCTTGC CCTTTTGCAT CACCATGTCG GCGTGCTATC AATTGGGTGG GAGAGTGGGC GGGCCGGAAG GTGTCGTTCC GAGCCTTGAA CTAGCGACGA ATCCGCATTC GCCGGAACAA AACCAGTAA
|
Protein sequence | MRSLDGRYGR RIFGSLWPHQ PQAYASVTTE TINSKKREDF TARTRWAVQI FSRDQRLAHL GSSSLPRLFT VRNFTTTSSS TPKPPPQNSD DNQTDNPTNE PDTTWDLTTK SLGQVIFLNS SHSGTILLAS LALGDPYLAF LAGVGTVTAN ATARNVLQLD AQVYGNGLWA YNGALVGCAT AVFVAPLADS ANTVAMLSPV LTGLAITTTG AAVSTAVTAS LSRTITSMPQ WTWAFNVVAL SILLRTQPLR PTFDPNSVGA TEMVAPSTTS LLELATGPLK GVSQIFVVES AWTGIGILAA IHAYSPLLAG HALLGSTVGM LTGTVCCSDA ATAELAAGLF GFNAALTSLG VGVFFRNNTA AWVLSGTGAV ATTVLFGALQ NVLGALHSPC LTLPFCITMS ACYQLGGRVG GPEGVVPSLE LATNPHSPEQ NQ
|
| |