Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_50105 |
Symbol | |
ID | 7198910 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011695 |
Strand | - |
Start bp | 52518 |
End bp | 53722 |
Gene Length | 1205 bp |
Protein Length | 360 aa |
Translation table | |
GC content | 54% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002185038 |
Protein GI | 219129737 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 32 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATGATGT CTTCTCGCCA AAGTGCGTCC AGACCCCACT ATAAGCAGTC TAGGATATCA AATCGAGCAA CACCTTCCTC GATGCCGTAT ACGACTAGCT CTAGTCGAAC CGAGGCGACA CCTGTACCAT CGTCTAACGC TCGGAATAGC GTGTCCGAGG CTTCTCTACC CATAACCGGC AGCGTCCCCG ACCTGTCCAA CCTTTGCCTT CCCAGGAAGT CCTATCGTGA CGGGAGATTA GCCCTCGAAT CGTCTCCACT TTCGGCAACG CAGACTAAAG GTCCTCGCAC GGCGCGTTCC ATCTATCCGT CGTTCGATAC GTCGCGCGGG AGCTCCAGCT TAGGGCAAGC GGACGTGACC GGAAGCATTG AAACAAGCGA GCAGTACATA GCAGAGGATA CGGACTTGAC GGTAGACGAG ATCTTCGCTA TGGTGGAAGA GCAGCTACCG CTACACTCAC AATGTCGCCA GGTGATTACA CGCGACGACT CCCATGCACG CTCAGGGACG ACCGGCACTG TGCTAGATCC AAACGATGTT CGCAAACCTG CCCCTCCCGC ATGGGTAGAT ACCGCTACAC ACACTACGCG CACGAATACG CAAGAACGAA GCTTCAAAGC ATCGCCCCAA CCACACTGGG ACCAATCCGA TCAAGATCAA CAACCTCCTT CCCTAGCATA TCTCCAAGAC TCCTCTTCTT CCTTCTCGGC ATACGGTGAT GGCACGACAA GGGAGTGGGA GGAATCCAAG GGCGCTGGTA AACCGGTGGA GACAGTGACT ACCAGGGACG TGGAGGTGAT CCACGTCGAA GTCGAACCGG GCGTGTTTTT ACCTCTCCGG GGATCAGAGG AAACTTTACG GGCTATGGCG CAGGGTACCA CCAGGCTAGT CCAATGCTTA TCCTGCGAAG CCCCGCTAGC ATGTGTCCCC GACTGTCAGA TGGTTATTTG TCCCGACTGT CGCGTCGTTT CTCCCATCCT GGACGAGGAA CAAAACATGC GCAATGCTTT CCTTCTCGCC TCCAAGACTG GTTCACCCTT GAACGGTAGC AGTGTTGGAT TGGGGTGGAA AATGAGAACG TAATAAAACT AGGCATATTG TCTCTGGTTG CAATCGCGTA AACTCTGACG GGTCCAAGAA ACAGAGAGTC GACTCCGTAA CACATGCGAA GTCCACACTA GTTTTCAGTA AAATAGTATG TTTTC
|
Protein sequence | MMMSSRQSAS RPHYKQSRIS NRATPSSMPY TTSSSRTEAT PVPSSNARNS VSEASLPITG SVPDLSNLCL PRKSYRDGRL ALESSPLSAT QTKGPRTARS IYPSFDTSRG SSSLGQADVT GSIETSEQYI AEDTDLTVDE IFAMVEEQLP LHSQCRQVIT RDDSHARSGT TGTVLDPNDV RKPAPPAWVD TATHTTRTNT QERSFKASPQ PHWDQSDQDQ QPPSLAYLQD SSSSFSAYGD GTTREWEESK GAGKPVETVT TRDVEVIHVE VEPGVFLPLR GSEETLRAMA QGTTRLVQCL SCEAPLACVP DCQMVICPDC RVVSPILDEE QNMRNAFLLA SKTGSPLNGS SVGLGWKMRT
|
| |