Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_37848 |
Symbol | |
ID | 7202651 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011682 |
Strand | + |
Start bp | 217165 |
End bp | 218374 |
Gene Length | 1210 bp |
Protein Length | 350 aa |
Translation table | |
GC content | 56% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181866 |
Protein GI | 219123094 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACATCCC TAGTTCATGC CAATACACGA CCACCGTTCT GGACACGACT GACAGTGAGT CTCCCAATCG AACAAATCAC ACTTTGGCCA AGTAACAGTA ACAGTCAAGT TCAAAATTCG CCACGAATCC GACACACTTG TGTCTCCGGT GTCGCTCTGG CTCTCCGTCT TCCTTCACAC TTTCGTTCTA ATGAACGCGC TCTACGCACA GTCTAGAAGA AAGATATGGG TCGGCGACGG ATCAACGGAC GCGTCGCTTC AGGCGAGCCC CGACGACAAC GGCATCATCG TCGCGCTCGC TTCCGGTAAA CCCACGGTTC CGTTCACGGC GTATCAAACT GTGGATGGTA GGGATACGCC ACAAGCTTTT TATTTTGTGG CGACCGTCAC GGCGTTGGAA GGTCATCTAT CGGTTGGCGT TGCGACGCCT GCACGCGTCC GCCACGGCTA CCAAACGCGC GGTTTGTTCT ACAACGGCAA CCTGACCAAC GGAGCAGCCG CCTTACGGAC CTCCGTCGGT CCTTACGTAC AAACCGGTAG CCGTGTGGGG ATCTTGTGGG AGTCTTTCGT TAAAGACGAT GAGTCTACCC GTATCCGTGT GGTCGTGTAC GTCAACGACG TTTGTGTCGG ATTGGCCTTT GACGTGGCCG CGGAACGCGA CGGCGTAACG TATTGTCCTT GCTTGCACGT GACGGGGAAG GCCACCGTTC GACTGGAGTT TCCGGAGACT GTACCTACCG TGCGCACTCG GCAATCTGTA CCACCTCCCG TGGCGTCCTA CGTAGGCGAG TGGCAACTCG CGCAAGCCTT GGTGGGACCG GAACTCGGCG AACTTACTCT GCCTACGGGA ATGGTGATCG TCGCTCAAAT CACCGTGCCT TCGCCAGTAG ACGCGGGTGA GCGAGACGCC ACTCGTCACG CATGCTACCG ACTGTCCATA AAGGTGGCTA ACACGCTGCA CGCTACTTTG CAACTTACCG GCGACACGTT GGAAGCCTTT GACGGAGTTC GTCTGGTCGG TTCCGTGGCA TCGACCCGGA TGATGGGACC GCCACCGATG CAAGCGTTGG AACAGACCTT GTGCCAGAAT CTTCCCACCG TCACCAAAAT GAAACTGACT ACCCAAGGTT GGCTCTTGTC GGGACCGACC ATGGAATTGC ACTGGGTTCC CCACGAAACG ACCTTACAAC CTGTCACTGC ATATTTGTAG
|
Protein sequence | MTSLVHANTR PPFWTRLTSR RKIWVGDGST DASLQASPDD NGIIVALASG KPTVPFTAYQ TVDGRDTPQA FYFVATVTAL EGHLSVGVAT PARVRHGYQT RGLFYNGNLT NGAAALRTSV GPYVQTGSRV GILWESFVKD DESTRIRVVV YVNDVCVGLA FDVAAERDGV TYCPCLHVTG KATVRLEFPE TVPTVRTRQS VPPPVASYVG EWQLAQALVG PELGELTLPT GMVIVAQITV PSPVDAGERD ATRHACYRLS IKVANTLHAT LQLTGDTLEA FDGVRLVGSV ASTRMMGPPP MQALEQTLCQ NLPTVTKMKL TTQGWLLSGP TMELHWVPHE TTLQPVTAYL
|
| |