Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_22166 |
Symbol | |
ID | 7203427 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011684 |
Strand | - |
Start bp | 117171 |
End bp | 118431 |
Gene Length | 1261 bp |
Protein Length | 280 aa |
Translation table | |
GC content | 52% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002182618 |
Protein GI | 219124663 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.00107582 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCGGAAG GCTCGGTAAA CAGCGAAAAT GTGGGGGTCG CCTTCGGCTT GGTGATTGGC GCCGGCGCTG CCACAAGTCT CGGGGCGGGG GTCGTATTTG TACCCGCGCT TGTCAAACTG GCGTCGAGAC GGACGTTGGC GGCTGCGTTA GGACTGTCCG CCGGTGTCAT GGTCTATGTA TCTCTCGTTG AAATTTTCAA CGAGGCCAAT CGGCATTTCG AAGAAGCGGG TTTTCCCACT GATGAAGCCT ACCTATACGC GACAATCAGT TTTTTTAGTG GAGTCATTGT GATGGTGGTA CGTTGTATGG ATCTCACGAA AAATATCCAT CATGGGATGC GCGAAAAGAA GATCTCTTCA CAATTGCTAA CACTCTTGTT GATTTATTTA CGGTAGCCGC TCAACTTTTT AGTTTCTTGG TTGCTGGGGG GACATGACGA ACACGAATTC CCGCCATATC TAGAGGATAA GGAAACGCAG GAGTTCTCCA ACGCGGAAAG CGCCGTCCAG GCTGCTGCAG AGGAATGCGG TACTACTGTA ACAGCGTGTC CGTGTTGCTC TGAAGATCCC GCCAAAGAGT TGGAGTGTCT GCAGGAAATG GCGTTGGAAC TGGGAAAAAG GGAGCACGAA CCCGAAGCCG GGTGCAGTGT ATCGGACCAC GAGGATTTCC CGGCGCAGAA AGACGAATGT GTGGTGCAGG GCAAAGATCA GAAAAAGCTG CTCCGGATGA GTATCAATAC AGCGTTGGCG ATTGGCATTC ACAATTTTCC TGAAGGCCTC GCCACGTTCG TGGCGACGCT CGACAATCCG CGGGTCGGCG CCATTTTAGC CGTCGCCATT GCCATTCATA ATATTCCCGA AGGTTTGTGC ATTGCCATGC CGATTTACTA CGCAACGGGC AACCGCTGGA GGGCTTTTGG TTGGGCCATG GTATCGGGCA TGTCCGAACC ACTGGCGGCA CTTTTGGGTT GGGCCGTTCT GGCGAGTTGT TTCACGCAAA CAATGTTTGG TGCGCTGTTT GGTGTAGTAT CGGGCATGAT GGTAATTGTA TCCGTCCGTG AATTGCTGCC AACGGCACAT CGGTACGATC CAGACGATGT CGTGGTGACT TACTCGTTCA TGGCGGGAAT GTTGGTCATG GCGGTCTCGC TAGTGCTCTT TTTGGTGTAA AACTACTGCA AACCGGCACC CCCAAAGCTC TTTCTCCAAA CGAGCGTCGG CGAAATATGC ACAAGTTAAC CGTAAAGCGT GTCTTGCTGT C
|
Protein sequence | MSEGSVNSEN VGVAFGLVIG AGAATSLGAG VVFVPALVKL ASRRTLAAAL GLSAGVMVYV SLVEIFNEAN RHFEEAGFPT DEAYLYATIS FFSGVIVMVH EPEAGCSVSD HEDFPAQKDE CVVQGKDQKK LLRMSINTAL AIGIHNFPEG LATFVATLDN PRVGAILAVA IAIHNIPEGL CIAMPIYYAT GNRWRAFGWA MVSGMSEPLA ALLGWAVLAS CFTQTMFGAL FGVVSGMMVI VSVRELLPTA HRYDPDDVVV TYSFMAGMLV MAVSLVLFLV
|
| |