Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_16649 |
Symbol | |
ID | 7198830 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011695 |
Strand | - |
Start bp | 96692 |
End bp | 98321 |
Gene Length | 1630 bp |
Protein Length | 461 aa |
Translation table | |
GC content | 56% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002185047 |
Protein GI | 219129756 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.24052 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | AAACACGTCG GCATTCTCGC CGTGGAAGTC TACTTTCCGC GCGCCTACGT CGCCCAAGCC GCACTAGAGG AACACGTCGG AGTCCCCCAG GGCAAGTACA CCATCGGCTT GGGACAACAA GGCCTGGCCG TTACCGGAGA TGCCGAAGAC GTCAATTCCT TGTGCTTGAC GGTCGTACAC TCTTTGTTGG AAAAGTAAGT ACCCGGACTC GAATCCTTAT CCATATCCGT ATCGCATGAA TACATGCGGT TGGATGAAAG TCGTACGTCA TTGCTAGACC GTCAATCCCA CACTTTCCCT TCCTATCGTC TCTTTTTTGC TGCTCGCTAG ATACAATATC GACCCCGCCT TGGTGGGACG ACTCGAAGTG GGAACGGAAA CGCTCGTCGA CAAGTCCAAG TCCACCAAAA CACTCCTCAT GGATCTATTT CCCAACAATA CCGATATTGA AGGCGCCACC ATTATCAACG CATGCTACGG AGGCACCGCC GCCCTGCTCA ACGCCTTTTT GTGGGTCGAA TCGGACGGAT GGGACGGCCG CTACGCCATC GTCGTCGCCG CCGATATTGC TGCCTACGCA CGCGGCCCCG CCCGACCCAC TAGTGGGGCT GGCGCCGTTG CCATACTCGT GGGACGTGAT GCGCCCTTAT CCTTTGACCC ACGGACCCGA GCCACGCACG CCGCCAACGT TTGGGACTTT TACAAACCCG ACCATACCGT GGAATACCCA ACCGTGGATG GCGCACTCTC GCAGGTTTGC TACTATCAAG CTTTAGAAGA CGTCTACACT CGCTTTACCG AAAAAGTCAC CCTCCGAGAC GGCGCCACGC ACGGTGGAAG ACAATTCACG GCCGAATCAC CCGATTACCT CGTATTTCAC GCACCCTACA ACAAACTCGT GCAAAAATCC TACGCCCGCC TCTTTCTCAT GGACGCACGC GCGCAATACG CACGCCACTC GGCGGAAGAA AAGAAGGACG AATCGACCAG TGCCGATCAG GACGAGCCCG ATCCGCTCGC GCCGTGGTTG GACGTGCCGA TCGCTGATAC CTACAACGAT CGAGCATTGG ACGCAGTCTT GAAGAAGCGG GCCGCAACCA GTTTGCAAGC GCGTCTCGCC GACGCCAACG TGGCCAGTCA ACTCGTTGGT AATACATACA CCGCCAGCGT GTTTTTGGGT TTGGCCTCAC TCCTCGACCA GGCGGGAACA CGGGACGAAC TGACGCCTGG TAAAAACATC GTTCTCTTTT CCTACGGATC CGGAGCCCTC GCAACCATGT ACCGATTGAC CGTGTAAGTT TGGTGACGTG CCTGTGAATC CATCTGCGGA AAAGATCCCG GGCGTTGGCT CACGTTTAGA TTCTTGTTCT TGTTCCTTGT TAGTCGTACT CCCACGAAAC AGTCCCAGTT CACGGTAAAA TCCATGGCCA CCACCATGAA CCTGTCCGAG CGCCTGGCGT CACGTGAGAT GGTACACCCG GGTGAGTTGG ACTACGCACT CGAAACTCGC GCCCGCATGC ATCGAGCTGG TGCCCCGTAC AGTCCCGTCT ATCCCACCGT GGGACGTCTT TTCCCCGGAA CCTACTATTT AAACGGCATT GACGCGCTAT TCCGTCGCAC CTATTCCCGC
|
Protein sequence | KHVGILAVEV YFPRAYVAQA ALEEHVGVPQ GKYTIGLGQQ GLAVTGDAED VNSLCLTVVH SLLEKYNIDP ALVGRLEVGT ETLVDKSKST KTLLMDLFPN NTDIEGATII NACYGGTAAL LNAFLWVESD GWDGRYAIVV AADIAAYARG PARPTSGAGA VAILVGRDAP LSFDPRTRAT HAANVWDFYK PDHTVEYPTV DGALSQVCYY QALEDVYTRF TEKVTLRDGA THGGRQFTAE SPDYLVFHAP YNKLVQKSYA RLFLMDARAQ YARHSAEEKK DESTSADQDE PDPLAPWLDV PIADTYNDRA LDAVLKKRAA TSLQARLADA NVASQLVGNT YTASVFLGLA SLLDQAGTRD ELTPGKNIVL FSYGSGALAT MYRLTSQFTV KSMATTMNLS ERLASREMVH PGELDYALET RARMHRAGAP YSPVYPTVGR LFPGTYYLNG IDALFRRTYS R
|
| |