Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATR_43852 |
Symbol | |
ID | 7204279 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011671 |
Strand | + |
Start bp | 225978 |
End bp | 227889 |
Gene Length | 1912 bp |
Protein Length | 574 aa |
Translation table | |
GC content | 56% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002186020 |
Protein GI | 219112873 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.00767689 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CCATGAACCT CCCAAACAAC ACCCGACCTC GGGGAGCTTC TATTCGTTTC GGTGGGTATC TCTGGCACCC TACATCTCAT TCGTATGGTA TAAAACAAAA GCACCATGAA AGACTCGGGC AAACCGCCCC CGTATCGTCG TTCAGGACCA CACAATAATG AGGAGGAGGA GGAAGGGGGC AACCGAGAGG ACGATCGGAA ACCTCCGGCA CGGGAAATCA GTATACACAC CACACCGAGT ACGACAACAC ATCGCGACAA TGCCGAGTGT GACTCTTCCG TGGACGATTC CGCGCTCACA CCTGCTTACG CCGCCATACC GACGCGTATG CATACGTCGG CGTCGACACC GCAACCCCTG TTGCTGGCTT CCACGCCCGG GGAATCCCCC GACACGGACG ACCGCAAGCA GCGGCCATTG CACAGTTCCG ACTTTCACAA ACCCATCCGT AACAACAACA ACAGCGTGGA CACGGAGTCC TCTACACCGT TACGGAGTCG CACTTGGCTC AATCACGAAC TGTATTTCTG GTCCGGTGCC TCGTTGCTCT CCGTGCTGCA ATTGCTGTAC CTTTGCTTGC CGCTTACCGC CTTGTGGACT CTCTTGGTCT TGGTAATTTC CACCCTGCTC TTTGCGTGGA CGGCACTCCA ACGCTTGCGT CTGGAATACC GGGAACGCAT CACGCAACAC GGATTGGCTG CCTACCTTCC CGAGTCCCTC TCCCACACGC TCACGTCCCA AACCCTGCAC GAGTACCTCA CGGACGACTC GTTCGGGTTG GAATACCGAC ACTTGCTGCT CTACTTTATG CCCGGTCTCT CCAATGAACA GATTGAACAA TACGTTGATC AGTTGCCGCC GCGTCATCGG GAGGAATTGC GACGACACGG GATGGGATAC TTTTTCGGTG ACGGCTTTAT GCGACTCTTG ATGGGCGAGC ACGCCTACCA GACACGTCAG CAACAACAAC AACAAGGCGC AGCCGCACCC ACCAGCTTCT CCGATTTCCC AACAACTCTA TCGGTAGCCA CCACACCAAC GGAAGCAACG GATACCTCCC GTCGTCGACT ATCGTACCAA CGAGACGATT CCACTGCGTC TTCGAACAGT GACTTGGGTC TGCAAATTTC CACCGGCGAT TTGGCGGGGG GTCATATGAA CGACGCGCAA GCCTGGAGCA TGGCTCAGTG GTTGGGTGTG CGATCCCCGT CTTCGACGAT GACACCACCT ACGCGCAGCA ATACGACCAG TACGACCCGT GGGGAGGCGA CGACCGCAAC CGGTACAACA AGTTCCCCCG CTGCCATCCT AGACGAATCC AATCCCGAGG ATGACGACCG CCGTTTGCGG CGCGAGTATG CGGACGAAGA ACGCATCCTG ACGGATGCCT TTTGGGACGC CTACCGTTCC TTGTACGCGT CCGTATGGAC TCCAACGGTA CAGACTGTGA GAGAAAGGAT TACACAGCCT GTTACGAATA TGGTCGTACG CATTGGACTC GGTGCGTTGA CGCTTTCGAG TGGGATTGGC GTTGTCGGCT ACTGGCAAGG CGTGTACGCG CTCCCCTTTC GCCAAACCTT TCCCAGTAGC AGACACCATG GGTCGGCTCG TAATCACGAT AGTCTTGGAC TCGCCTTAGT GCAAACGCCG TGGGATCGCC TACAGTTTCC ATCGTCCGAG TCGTTGTGGA CAACCGCCGT GATGGGTGGC GCTTCGGCGG GAGTAGTCCT GTTTGCTCGT GCATATTGGT CGATCGGGCG GAACACCGAG TCGACAGCGC GAAAGGGCGC GGGTAGCGAA AACGAGCCGA AACAACAACG TGAAAATTAA AAAGACCACA TCGTTCACGT GTTTATCACG GAACTAGTTA GATTCAAGCT CAGTTTTGAT TTGAGTTTTC TAGATTTATT TG
|
Protein sequence | MKDSGKPPPY RRSGPHNNEE EEEGGNREDD RKPPAREISI HTTPSTTTHR DNAECDSSVD DSALTPAYAA IPTRMHTSAS TPQPLLLAST PGESPDTDDR KQRPLHSSDF HKPIRNNNNS VDTESSTPLR SRTWLNHELY FWSGASLLSV LQLLYLCLPL TALWTLLVLV ISTLLFAWTA LQRLRLEYRE RITQHGLAAY LPESLSHTLT SQTLHEYLTD DSFGLEYRHL LLYFMPGLSN EQIEQYVDQL PPRHREELRR HGMGYFFGDG FMRLLMGEHA YQTRQQQQQQ GAAAPTSFSD FPTTLSVATT PTEATDTSRR RLSYQRDDST ASSNSDLGLQ ISTGDLAGGH MNDAQAWSMA QWLGVRSPSS TMTPPTRSNT TSTTRGEATT ATGTTSSPAA ILDESNPEDD DRRLRREYAD EERILTDAFW DAYRSLYASV WTPTVQTVRE RITQPVTNMV VRIGLGALTL SSGIGVVGYW QGVYALPFRQ TFPSSRHHGS ARNHDSLGLA LVQTPWDRLQ FPSSESLWTT AVMGGASAGV VLFARAYWSI GRNTESTARK GAGSENEPKQ QREN
|
| |