Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_38809 |
Symbol | |
ID | 7203637 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011685 |
Strand | + |
Start bp | 258591 |
End bp | 259820 |
Gene Length | 1230 bp |
Protein Length | 409 aa |
Translation table | |
GC content | 52% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002182803 |
Protein GI | 219125053 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGGGTCG GAGTGTTCGT AAACTTTTGG GTAGCGATAC GGGTAAGCGA TCGGCAGCCT TCAGTGAGCA CAAGGGCAGA GTCTACGCAA TCGGTAATGT CCATGACTTT CGCCGTGTCG GGATATGACG AATCTAGCTT GATACGTGAT AGATCGTTCG GCGTTTCCGT ATGGAACGAC TCCACCACGC TACCAACTTG GATGAAGGAG TACTTTGATT GGCATCGAAC TCAACGGCAG CTCCTCTTGA ACGAAACCAA CTACGGGCAG TTTACCTTTT TAGTCGTCCG ATGCTTGAAA CACGACTTGA AATGCGGAGG GACGGCAGAT CGGCTCAAGC CTCTACCATT CTATGTGTTG CTCGCTTCCC GTATGCACCG CATTTTACTG TTCCATTGGG AACGACCCTT TTTGTTGCAA GAGTTTCTGG TACCACCTGT CGGTGGCGTA GATTGGCGGT TGCCAACCTC TTGGCGTCGT GACCAATTCT CCGATACCAT CGAGATAAAG AACCTTAAGG ACATAACCCC GTATCTTGCA AAGCCTCACT CGGGTCGATA TCGTAAGCCG TTGCCCGCTA TCGCATGCAT TTTGTATCAG TCGCACGATC ACGGTGCTTT ACAATACAAC CAACTCGCCG TACAGCAAAC GAAGGAAGCA ACCTACGAGG AAGTTTTCCG GGATTGCTGG AATTCCTTTT TTGTCCCTTC TCCACCAGTA CAAAGTCGAA TCGACCAATT GCGCCAATCC CTCGGATTGG TCCCCAACGA GTACGTGGGA GCCCACGTGC GGTCACAGTA TCATTCTTAC AACGGCAACA AGAAGTTAAA AGTTTTGGTA CAAAACGCCG TTGCGTGTGC CTCTCGCCTT CGTACGGAGG TCCGGCAGAG CATTTACGTT ACCGCCGATT CCGAGCGCGC ACTCCAGGTG GTCGGAGAAT CCTCTGTGGG AATGCGCAAT CTACCGGTAG TCCGTCGGAA GGGCGATCGG CCGCCACTCC ATTTGGATCG TGGTGTTGCG TACTTGGCCA AGAGTGCCAC CAATTGGACA CACCACGACG ATCCACGAGC CTACTACGAT ATCTTTGTCG ACTTATACCT GTTGGCAGGG AGTCGTTGTA TTGCGTACAA CGTTGGCAAC TACGGCAAGT GGGCCATCCT CCTTTCGTCG AATCGGTCCT GCACGATAAA TCACGGCAAG ACAACGTGTC GTTGGAAAAT ATTATCGTAG
|
Protein sequence | MGVGVFVNFW VAIRVSDRQP SVSTRAESTQ SVMSMTFAVS GYDESSLIRD RSFGVSVWND STTLPTWMKE YFDWHRTQRQ LLLNETNYGQ FTFLVVRCLK HDLKCGGTAD RLKPLPFYVL LASRMHRILL FHWERPFLLQ EFLVPPVGGV DWRLPTSWRR DQFSDTIEIK NLKDITPYLA KPHSGRYRKP LPAIACILYQ SHDHGALQYN QLAVQQTKEA TYEEVFRDCW NSFFVPSPPV QSRIDQLRQS LGLVPNEYVG AHVRSQYHSY NGNKKLKVLV QNAVACASRL RTEVRQSIYV TADSERALQV VGESSVGMRN LPVVRRKGDR PPLHLDRGVA YLAKSATNWT HHDDPRAYYD IFVDLYLLAG SRCIAYNVGN YGKWAILLSS NRSCTINHGK TTCRWKILS
|
| |