Gene Information |
Locus tag | PHATRDRAFT_20893 |
Symbol | |
ID | 7201853 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011678 |
Strand | - |
Start bp | 771598 |
End bp | 772830 |
Gene Length | 1233 bp |
Protein Length | 331 aa |
Translation table | |
GC content | 47% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181070 |
Protein GI | 219120673 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.775902 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TGAGATCGAT ATCGACGAAC AAACCTATAC TGCGACTTTT TCGCAATGCC GCACTATGCA ATCCAGAGAA GGTTTTAATG TCCCAACACG GTGACGGGCT TTTCAATACA AAAATTCCCA TATTGCATCG TTGTTTTGCT TCTTCTCCAA GCAAGCCCCA TGCCTCCATC TTCTCTACCG ACACCGTTCG TCTTTCTAAA GTGTTATCCC TGCATACAGA AAACCTGGTG ATCAGCCGTC GAGAAGCGGA AAGGATGATA CGATCAGGAG ATGTGACTCT GGCAGGAAAT GTGTTAACAT CGCCCATGAT GATGCTGAAA GAGGAGGACT TGAATGATGG AGCTTTGAAG GTGAACGGAA AGGTGGTAAA CCTTCGATCA AGTAAGGGTC CTCGTTCCGT TGCAGGAGAA GAAAATTCGG TTCACAAAAC CCGCGTTTGG ATCGCGCATA AACTTCCGGG AGAAATAGTC GCTGACAACG ATCCTTACGA TCGTCCCTCT TTATTGCAGC GATTGATACG AGGTGGTGTC GGCAAAGTTG GCAAGACTCG ACTTCATCTT AAATCCGTGG GACGCCTCGA CATGAACACA GAAGGACTGA TTTTGGTGAC TAATGATGGA AAGTATGCTA GAGAGATGGA GCTGCCGTCA AATAAATTGC ATCGAACGTA TCGCGTTCGA GTACATGGCT TGCTTACGGA CCACAAGTTG GCCCGAATAC GGAAAGGGGT AACTGTGGAA GGTATAAGGT ATCCTCCAAT GAGGATCATA CCCGAAAGCA CTCGGCAATC GCAATCAACA AATAAGTGGC TAAAAGTGAC TTGCACAGAG GGAAAGAATC GCCAAATTCG AAACGTTTTC AAGTATTTAG GATGTAAGTA CCGACCTTGA GCCGCCGCAC ACCTCAGACT CCTTTTACTT CACTTGTCTT CCTCACCGAA ATTTTTTTTA GTAACGGTCA CACGATTGAT TCGCATTTCC TATGGCGATT ATCGGCTGCA AACTATTCCG CCTGGTATGG CAATCGAAGT TCCAGCCAAG CATATCGAGA ATCAAAAACA TCGGGGTCGA TTGATTCGAC CGACTAAGCC CGCACCGAAA CGGAGTGACG AGGCCGAATC GTCGAAGGCG GTGCAGTGGG TCCGACATTA GATTTCGATC TTGAATAGTT GTTTTACATA GAAGTATAAG GGATAAGAGG TCACTTGTAA AATCTAGCAC AAGGCTTCGC GGG
|
Protein sequence | MSQHGDGLFN TKIPILHRCF ASSPSKPHAS IFSTDTVRLS KVLSLHTENL VISRREAERM IRSGDVTLAG NVLTSPMMML KEEDLNDGAL KVNGKVVNLR SSKGPRSVAG EENSVHKTRV WIAHKLPGEI VADNDPYDRP SLLQRLIRGG VGKVGKTRLH LKSVGRLDMN TEGLILVTND GKYAREMELP SNKLHRTYRV RVHGLLTDHK LARIRKGVTV EGIRYPPMRI IPESTRQSQS TNKWLKVTCT EGKNRQIRNV FKYLGLTVTR LIRISYGDYR LQTIPPGMAI EVPAKHIENQ KHRGRLIRPT KPAPKRSDEA ESSKAVQWVR H
|
| |
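The coordinate and length fields in the record above can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the original record. It assumes that Start bp and End bp are 1-based and inclusive, and that the coding sequence needs one codon per residue plus one stop codon. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 771598
end_bp = 772830
gene_length_bp = 1233
protein_length_aa = 331

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: minimum coding length is one codon per residue plus a stop codon.
min_cds_bp = 3 * (protein_length_aa + 1)

# Bases in the gene span beyond the minimum coding length. For a gene marked
# as spliced, these could be introns and/or untranslated regions.
non_coding_bp = span_bp - min_cds_bp

print("span matches Gene Length:", span_bp == gene_length_bp)
print("minimum coding length (bp):", min_cds_bp)
print("remaining bases (bp):", non_coding_bp)
```

Under these assumptions the span reproduces the listed Gene Length, and the gene is longer than the minimum coding length, which fits the "Is gene spliced | Yes" entry without proving it.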