Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_31781 |
Symbol | |
ID | 7196121 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011669 |
Strand | + |
Start bp | 909383 |
End bp | 910522 |
Gene Length | 1140 bp |
Protein Length | 379 aa |
Translation table | |
GC content | 56% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002176681 |
Protein GI | 219109856 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 33 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTACTG GCACTTCGCC GAAGGCTCCA ATCTACAATC AATACAAGCG CAAGGCCCCG TTTTCGTCCT GTTCGAAAGG ATCGGCTTCT GCGGACACCT TTCCAATACG GCAAGAACAA TCCGCTTCCC GAATTCCTCT CACAGAGATA CGGAACCAGT ATGCTTCTAC CTCCACTAAC CGTAGACAGA GACGATTGTC AATCCCCAAT GCCATTTCGG TCCAATCTCC CCTGCTCCCC TCTCCCACAA CTGCCTTGCA GCTGCTACAA CGGCATCAGC AACGAACGCA CACCCTCAGA TTAATACCGA ACGGTTTGGA ATCCTGGTTG TCCTTGGCCC GGCCCGGCAT TTACGAACTC GTGGGGGAGG CCGGAACCGG CAAATCACAA ACCGCCCTTA GCGTGTGCGT CCAAGCAGCT TCCTCGACGA CAGGCCCCTC CCGAGAACCA CCGTTGATTG CCGCACCCAC CACCGGCACT GACGTCTCTT TGCATCTGAT CCCGTGTCGT GCAATATACA TTTCTCTCAA GGCTCAAAAC AATGTCGTAC AAATCGTCAA ACGACTGGAA CAAATGGTCC TCTCCCGGCA GGAACAACGA CCGGCGTCCA CACCTGACCA AATGAAGGCT CCTACCAGAT CTCCCCCTTG CACTATTTTG CAGCGCATTC TGACACGCGC AGCGTGGAAC GCAGAACAAC TCACGCAGGT ATTGGACGAA TTGCCAGTCT TGCTAAAATC CGGCACCGTC CGTGTGCTCG TCTTGGATTC CATTGCCGAC ATGTTCCGCA CCAGTGAGGA CACGGACGGC ACTCGTTCGC AACAATCGTC ACACCACCAT GCGGCTCGCT CGGCCATTTT GTTTGGACTC GCGGCGCGTC TCAAAAAACT GTCGGACGTC TTTGATGTCC CCGTACTCGT CATCAATCAA GTGGCCCTGT CCGGAGTCTG GACCAAGCCC GCCCTGGGTT TATCTTGGGC GCACTGTATC GACGTACGGT ACATTCTAAC CCGACAGGAA CGCGGCGGAG ACGCGGGTGT CGTCTTTGGA CGCCGCGTCA CGTTGGACGC GTCGTCCAGC CATGCAACCG GACAGCACAA GGCCTTTTTT ATACGTGCTG ATGGAGTTGT TGCCGGGTAA
|
Protein sequence | MSTGTSPKAP IYNQYKRKAP FSSCSKGSAS ADTFPIRQEQ SASRIPLTEI RNQYASTSTN RRQRRLSIPN AISVQSPLLP SPTTALQLLQ RHQQRTHTLR LIPNGLESWL SLARPGIYEL VGEAGTGKSQ TALSVCVQAA SSTTGPSREP PLIAAPTTGT DVSLHLIPCR AIYISLKAQN NVVQIVKRLE QMVLSRQEQR PASTPDQMKA PTRSPPCTIL QRILTRAAWN AEQLTQVLDE LPVLLKSGTV RVLVLDSIAD MFRTSEDTDG TRSQQSSHHH AARSAILFGL AARLKKLSDV FDVPVLVINQ VALSGVWTKP ALGLSWAHCI DVRYILTRQE RGGDAGVVFG RRVTLDASSS HATGQHKAFF IRADGVVAG
|
| |