Gene Information |
Locus tag | PHATRDRAFT_47572 |
Symbol | |
ID | 7202797 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011682 |
Strand | + |
Start bp | 147957 |
End bp | 149547 |
Gene Length | 1591 bp |
Protein Length | 407 aa |
Translation table | |
GC content | 55% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181854 |
Protein GI | 219123069 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.0059247 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CACGAGTTCT CCAACACCTC GATACCCAAG TCTCGAAATC CCTCTCCACA AAACATACAA CCATGGTCTT CTCTACCCCT GCCAAAGTCT TTGCCACTGC GCTCGCCATG GGTGTTCTGG TTGCCACCCA GATTGATCTG ACGTCGCAAA AGGTGAGCTT AATTTGGCGT TGCTCGGCGT TGGTGAACGT CTTCCCCTTG TAGATCCGAC CCCGATTGAC CTGACTGTCT ACCAAGTACG TGGACATGGA TCTAACAACA TGTTTGCTAA TCTTCAGAAC GGTGCTGCCA ACGGAATTGC GTCCGACGCC GACGCTGCTT CAAAGCCTAC AGGAGCTTCT AGCCGCGGCT TGAAGGGTGC TCGCCGTTTG GCTACCGGAA ACGTCGAAGG ATACGGAAGG GTCAATGCCA CGTCCGGAAC TCTCGTCAAC GGCTTTGCCG GCCCCGGAGA GAGCTCTCAG TACGCTGCCG TCAAACAGAC GGGATCCTTC GACATCAGAT CATTTGCCAC GGGTCCCAAC GGATTTATTG AAGGATCCAG TAATGGGGCG GTTTACCATA CCGAGCTGAA AACGATTGGT GAGGCTGACT ACTACGGTCA GGTTGAGAAC ACCGGAGCCA CTGTCGAGTC CTACGGTACC GGCTACTTTG CCGGAAACTT CGACACGGAA GGGTCCGCAC CTGCTGGACC CAACGCCACG ACCGCTCCTG GAGCCACGAT GGCACCCACC AGTGCCCCTG TCGCCACGTC GTCCACCAAG GCACCCAAGG CTTCCATGGT ACCCAAGAGT ACTGCGGCAC CCAAGAGCGA GAAGGCTACC CTTTCACCGA AGAGTGCCAA GGGCGACACC ATGAACATCG AGGTGCCTCC CCCCGAGCCG CGCACTGCGA CGGCGGACGA GGGGGCCGTC TCGCCCGACG CTGTGACCGG CTCGGTGTAC TCGGTGGCTG GAGGCACTCT CAAGTCGACT TTCACGGCTC AAGGGAGCAC GTACGTGTAC GCCCCGGCGG AAATCATCGG TGGAGGTCTT GACATTGGTG CCTCGGAAGG ACAAGTTGGC AGCGCAAGTG GTCTTACTTC TGTTGGTGGC GACATCTTTG ACGACATTGC TACCATGATG CCAGACGATG CTACCGTGAG CCCCCTTGCG ACCGTCAGCG AAGGGCTCGC TACCGCGAGT CCTGGAAGCG GCGGAGCTTC TCTTACAAGC TCGTCCTCCG CTTATGGTTA TGCCAACGAA TTCTTTGCGG ATGCCGAGGC CGCTGGTGAT CTGACCCAGA CCGGATCGGC TACTGCTTTT GCCTATGGAC CGAGTGCTGG TGAATACTTG CCCGGTGGTA ATTCGAGCAC ATACGCGGAT TCGTATGCCA CGTTTGGTGT TTCGGGAGAG GCGGCGGCTC CTAGTCCCTA AGCTGTGAAT GTTTGCTGGT ACATCCTTTG ATGTGAAAAA GCGTACGTAG TATGTGATAC TGCTACAATA CACATCTCAT TTCGTCATCA GACCTTGAGC TGGAACAAGA AAGAAAGATG ATGGATCTGT TGTCAATGTA GTAGATATTT AATAGTAAAA AGGAAAATGA TTGATTATCT T
|
Protein sequence | MVFSTPAKVF ATALAMGVLV ATQIDLTSQK NGAANGIASD ADAASKPTGA SSRGLKGARR LATGNVEGYG RVNATSGTLV NGFAGPGESS QYAAVKQTGS FDIRSFATGP NGFIEGSSNG AVYHTELKTI GEADYYGQVE NTGATVESYG TGYFAGNFDT EGSAPAGPNA TTAPGATMAP TSAPVATSST KAPKASMVPK STAAPKSEKA TLSPKSAKGD TMNIEVPPPE PRTATADEGA VSPDAVTGSV YSVAGGTLKS TFTAQGSTYV YAPAEIIGGG LDIGASEGQV GSASGLTSVG GDIFDDIATM MPDDATVSPL ATVSEGLATA SPGSGGASLT SSSSAYGYAN EFFADAEAAG DLTQTGSATA FAYGPSAGEY LPGGNSSTYA DSYATFGVSG EAAAPSP