Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_50296 |
Symbol | |
ID | 7199131 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011696 |
Strand | + |
Start bp | 225564 |
End bp | 227586 |
Gene Length | 2023 bp |
Protein Length | 528 aa |
Translation table | |
GC content | 53% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002185152 |
Protein GI | 219129977 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.667552 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GAAGCCAACC AAATCAAAGA AATTCAATCA AAAATATAAG AGGTGGACCA CAACCTCTAT ACAAACGCAA TGTTTCGCGT CGCCAGCCAA ACGCGGACAT GGTTTGGCCA CCACCGTGTC GGATACGTCC CGCTCCCGGG GTCGCGAACC TTTCTTTCCG CGACGCGTAG TACACGCGCG TCCACCAAGA TCCGCAACGG TATTCCTCTC CGCATCGCGT CGCCGCCACG GTGGGCTCCA CTTGTGGGAT ATCTCGTCAG CTGTTCCTCC GGCATCCTCT ACCCCGCAGT TTGGTGCGAT GACCACGCAA TTTCCGGACA CGGTAGCAAA CACCACGCCC CAATCGCACC GATCCATCGG CACGGGACGC CACGACGCCG CAGTTGGCGC CGCCGACTCG CACGGGTTTG GCGAATCCTG AAACGGGTAC TCCAACTCAG TGTGACACTC GCACCCGTCG TGTTGCTGTA TCCCATTGCC CTGTGGTTCC ACGCCGCGCA AGCGGCCAAC GACGACCCGG ACAATGCGAA CGGTGGGGGT ATGTTGGCCA AGGACGCCCG GCAGATTGTA CTCACCGATA CAAAGCCTGC GTCGGGATGG TTGGGATGGT ATCTACAAAT GTGTCTGACC TGCGTGGAAT GGAGTGGCGC CGCCGTCATT AAACTCATGC AGTGGGCCGG ATCCCGCCCG GACCTGTTCG GACACGAATT CTGTGTCGTC TTTTCGCAAC TGCAAGATCA CACCACGCCG CATCGATGGG CCCACACCGA AGCCAAACTG CAAGAAGCCT TCGGGAAGGA TTGGCAAAAC AAGATTCGGC TCGGCGACGT CATTGGTAGT GGTTGTATTG GACAAGTGTA TCACGGGCAA GTCCTGTCAA CAGACAACGA TGGTGGCGGC GCTTTGGGTC AAATGCGCGA CGTGGCAGTC AAGGTGTTGC ATCCTAATGT TCAGTCAGAT ATCGAAGCCG ATCTAGATTT GATGCGGTTG GCAGTCCGGG CCGTCAAGTA CGTCCCCTTC GATGTCTTTG CCAATCTCAA ATGGCTCAAT ATGGAAGGCG TCGTCGACGA GTTTGCTCAT CTGCTACAGT TGCAGCTGGA CTTGCGCCAA GAGGCAGCCA ATCTAGAACG TTTCAATGCA AACTTTAAGG ACGTGCCGCA GGTAGAGTTC CCCAAACTGG TGGAAGGCTA CGTACCCACT AAGAACGTTC TTGTCGAATC ATTTTGTGTC GGTGTGCCGG TTCTCAAGTT CGCCCGGGAG AATCAACACA ATCACAACCT CATGCGGGCA ATTTGCCAAA CCGGAATTCG AGCGGTGTGC AAAATGATTT TTCTCGATAA TTTTATGCAC GGTACGTGAA GCACAGTGAA TGGAACGCAG GGCATTTTTT GTTGCAATCT GGCGGCATAC TTTAACCCCA ACTACCTTCT CATGGTATAC ATTACTATCT ACAGGTGATT TGCATCCAGG AAACGTGTAC ATTTCCCAGG ACGGCAAAAA AATTATCCTG TTTGACGTGG GAATTGTCGC CGAATACTCG GAGGAGGACC ACCGAGCAAT TGTGGACGTG CTAGCTGCCT TTATTCGCAA AAAGGGCCGA GTCGCTGGAC GCCGAATGAT CGCAGATTCC AACAACCGTT TGCGGGGTAG CGGTGATTAT GCGCGCGAGG AAGAAAGGTA CATTGACAAA ATTGAAGAAT TGACCATCAA AGCGAGCGGG AAAGATTACT TCATGGAACA TCTCGGGACG TACATTTCGT ACATTTGTGA CGCAGCGGCC GCGCATCACG TTATGATGAA TCCTTCCTTC ATTTCGGCTG CTCTGGCCGT GAAAGTACAA GAGGGCATTG CATTGGCGCT GGACCCTTCC ATTAATCTAC CCAAGGTTGC TATTCCGGTA ATCATTGAGG CCGAGCGACG ACGAGACGGG CTGATCAAAA GCGCTTGCAA AGTTTTGGGT GTAGATGAAT GGATATCGTC CCACTTTGGT ACAAAACAGA AGACCAATTG TCAGTCACAA TAA
|
Protein sequence | MTTQFPDTVA NTTPQSHRSI GTGRHDAAVG AADSHGVTLA PVVLLYPIAL WFHAAQAAND DPDNANGGGM LAKDARQIVL TDTKPASGWL GWYLQMCLTC VEWSGAAVIK LMQWAGSRPD LFGHEFCVVF SQLQDHTTPH RWAHTEAKLQ EAFGKDWQNK IRLGDVIGSG CIGQVYHGQV LSTDNDGGGA LGQMRDVAVK VLHPNVQSDI EADLDLMRLA VRAVKYVPFD VFANLKWLNM EGVVDEFAHL LQLQLDLRQE AANLERFNAN FKDVPQVEFP KLVEGYVPTK NVLVESFCVG VPVLKFAREN QHNHNLMRAI CQTGIRAVCK MIFLDNFMHG DLHPGNVYIS QDGKKIILFD VGIVAEYSEE DHRAIVDVLA AFIRKKGRVA GRRMIADSNN RLRGSGDYAR EEERYIDKIE ELTIKASGKD YFMEHLGTYI SYICDAAAAH HVMMNPSFIS AALAVKVQEG IALALDPSIN LPKVAIPVII EAERRRDGLI KSACKVLGVD EWISSHFGTK QKTNCQSQ
|
| |