Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_50371 |
Symbol | |
ID | 7199147 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011697 |
Strand | + |
Start bp | 111752 |
End bp | 113206 |
Gene Length | 1455 bp |
Protein Length | 443 aa |
Translation table | |
GC content | 55% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002185283 |
Protein GI | 219130252 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.00743157 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CTCTCTACTA GTAGGCGAGA GTACAAAATG GGGGTCTCTC CCCCCCCATG GAACAACCGT GAATACAAAC ACAGACCCAA CCGGAATAGA TACTATGCAC GCCGCGGCTA CCACGGAGAT GCCAACGTCG GAGTCCTTGG TTCCTTCTTC ACCACCATTG TTACCCGAGT CGCGGGGTCC CCACGTCTAC GCTGTTCTCT ACACCAAACA CAAGAGTCAA AAACAAAAAG TGTGGCACGA CGGGAAACTC GTCGTGCCAA TGCCGACCCA CGTCTCGTCG TCGTCGTCAA TCCATCGTGG AATCTTGTAC CCCGCCGTAC CAACGCCCGG TACCGGAGAC GGTGTTCTCG ACGAAATCGA ATGGAGCCAC CACCACTGTC ACTCGTCAGT AGTAATGGGG ACACGACTCG AAACGGAACA ATACTTGGTG GAAGTTCAAG GACCCTGGGT ACCGTCTCAC ACGAACCCCG ACAACAACAA TGACCACCAC GACAAGAACA ACGGGTACCC GTCACGAACG ACGGTTCCCT CGAAAGGCAT GCAAAAACTC TTGCGGCACA AATTTCGCAA GCCTCCCACG AAACTACCCG ACGTACCACC AGCCACCTCG TTTCTACACC GTCGCAAACG GCCGTTGCAA CCGGGCGAAC TGACAAAGAG TCAGCGTTAT TCGGGCCTGT CGCACGTCAC GGAACCCACC GGGACGCACT CGCAACCACC GTCGGCACCA CCCGGACCGC AACAACCACC ACCACCGTCG GGGAGGTTCC CTCCGTCATG GTCGACAATG ATGGATGCTA CCGTTCACCC ACCAGCAACA AAGTCTCTGC CAACACACCG TGTGCCGTCC GATTTCCATA CTCGTCCCAA CGGATTTGAT CCGGCTAGCT TTTACGGGGA AGATGAGGAA GACGACGACG ACAAAAATGA CAACCAAAAT GGAGTAGCGA CGGCCTGGAA TTCCCTTTAT CTAGCAGAGA CAATGCAAAC CTCGGCTTCG ACCTTATCCT CGCTCGGAAC TACTCCGGAA CCTTTTCCTA CAAACCCCGA CAAGGAATCA TTCAACACGA CTCCCGTTGC TGCTGCTGCT GCTGCTACTA CTACTTCCCA AAAAATAGTC CGGCCAGCAA CAGCTCGCGT GAAGGACACT GGTGACAACG GTTCGGGAAT AACCGACGCG ACAGCCAGTG CTGTTCCAAC AAAAAGTAAT TTCGGCAGCA CCGAGTCTAC GGTAGTTCCA TCGAAACCAA CGCGCGAAAC TTTGTCGAAC GCGGAACTCT TGAGCTTATT TGGAGCCGCC CCGGTAAGGA CCGCCGCCAC GTTGTTGCCG GGACGGCCGC AACCATCCGA CATTCCCCCA GACGACCATC ACTCGTTTAC ACTCCCACTC AATGCGGATA GTGAAGACTC GTCCGAGGAC GAATAAGTTA GCGTTACCGA GACGTTCGTC TTTCT
|
Protein sequence | MHAAATTEMP TSESLVPSSP PLLPESRGPH VYAVLYTKHK SQKQKVWHDG KLVVPMPTHV SSSSSIHRGI LYPAVPTPGT GDGVLDEIEW SHHHCHSSVV MGTRLETEQY LVEVQGPWVP SHTNPDNNND HHDKNNGYPS RTTVPSKGMQ KLLRHKFRKP PTKLPDVPPA TSFLHRRKRP LQPGELTKSQ RYSGLSHVTE PTGTHSQPPS APPGPQQPPP PSGRFPPSWS TMMDATVHPP ATKSLPTHRV PSDFHTRPNG FDPASFYGED EEDDDDKNDN QNGVATAWNS LYLAETMQTS ASTLSSLGTT PEPFPTNPDK ESFNTTPVAA AAAATTTSQK IVRPATARVK DTGDNGSGIT DATASAVPTK SNFGSTESTV VPSKPTRETL SNAELLSLFG AAPVRTAATL LPGRPQPSDI PPDDHHSFTL PLNADSEDSS EDE
|
| |