Gene Information |
Locus tag | PHATRDRAFT_21983 |
Symbol | |
ID | 7203088 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011683 |
Strand | - |
Start bp | 300399 |
End bp | 302134 |
Gene Length | 1736 bp |
Protein Length | 484 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002182364 |
Protein GI | 219124130 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
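The gene length above follows from the start and end coordinates if they are read as 1-based and inclusive. That reading is an assumption, although it reproduces the listed 1736 bp. A minimal sketch of the arithmetic, where the function name `feature_length` is hypothetical and not part of the database:

```python
def feature_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


# Coordinates taken from the record above (strand does not affect the length).
gene_length = feature_length(300399, 302134)
print(gene_length)  # 1736

# Coding nucleotides implied by a 484 aa protein, counting one stop codon.
protein_length_aa = 484
coding_nt = 3 * (protein_length_aa + 1)
print(coding_nt)  # 1455
```

The 1455 coding nucleotides are fewer than the 1736 bp gene span. That is what one would expect for a gene marked as spliced, although the record does not say how the remaining bases are divided between introns and untranslated sequence.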
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.383606 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CCAGTTCCTA TCCAACAAGT AGTACAAAAT CTGATCCGCG TTCGAAGTGT AAAGATATTC CTTAGTTTCA TTATGCAAGG AGGTCAACAA CACAATGCTG CTCAAACGGA GGTGCTCAGT TTGACGGCCG CGAGTCGAGC CCAGCAAGAA GTCCACCAAG CCATGCTACT GGATCTGGAG GCTAAGAAAA TTGCGGCTTC CTTGGATGTG CCCACGCTGC CGGAACAAGT GCGCGCCGCT TTGCGAGAAA TGGGACAGCC CGTCCGTTTG TTCGGAGAAA ATTTGGCCGA TGTGCGCCAG CGCTTGCGCG AAGCCATGGC GCATCAAAAA GTCTCACTGG ATGCGGCGTC GCTCTTTAAG GAAGAAGATC TGACTGGGCA ACGCGGAGCA AATGAAAGAT ACGAAGAAGA GGTGACAAAA TACACCCGTG CGGAGCAGGA GTTGATTGAG GCTCGTCAAG CGATTGCCAA TTTTTCATTG AAACGGGCGG GGGCGCGACT GGAACGGGAA CGCCGACTCC GCTTGCAAGC GAACCGGCGC AAGCGTAAAA TTGACGAAAA ACCCGGCACC AGTACGGAAG TGGACGTGCT CGATGAATCC TGTCACAAAA TGTACCAATC CATTCAGATG ATGGCCCTGC AGGGTTCTCA GTATGGTGAT AGCCGTGTTG TGAGCTGCAT CAGCGCACAA ACTTTGGATG GGATTCCCGT CGTTGCAACT GGGGGTTGGA CTGGAAGCGT TCAGTTATGG GATGGAAGTT CCTCCGCGCT TGAGATCTTA GGGGGCAAGA CCATGTGCCA CGAAGACCGG ATTATGGGCT TGGATACAAT GAAAGTAAAC GAAGACCTGG CAATTATGGC AACAACGTCC ATCGATTTGA CTGCTAAGTT GTACCGTGTG CAGAGCGCTC ACACTGTCAT GTTGGACGAT GCAGGAGCTG TCGATAGCAC CGAGCGCTTT GCCGTAACTG AGCAAGCGGT CTTACACGGT CATCAATCCC GATTATGCCG AGCAGCTTTT CATCCGATGC AACGACATGT CGCAACCACC AGTTTTGATC ATACCTGGCG TTTATGGGAT ATTGAAACTA GCCAGAATAT TCTGCTCCAA GATGGTCATT GGAAGGAGTG TTACGGTGTT GGTTTCCATC CAGACGGCAG TCTATGTGCC ACGACTGATT TCGGCGGAAT CGTACAGGTC TGGGATTTGA GGACTGGCAA GTCTATTAAA CACTTTCTGG GGCATGCGAA GCGTGTGCTA AACGCTATTT TTCACCCGAA CGGCTTTCAA TTGGCCACGG CCGGTGACGA CGGTACGATC AAGATTTGGG ATCTGCGAAG GCGGAAACTG GCTGCATCCT TGCCAGCGCA TTCCAACGTG GTAACCAAAC TACAGTTTGA TGCGTCCGGT GAATATCTGG CGTCGTCTTC CTATGATGGG ACGGCACGGT TGTGGGGCTG CCGTGATTGG AAAATGTTGC GCCAGCTGCA GGCCCATGAA GGCAAGCTAT CGGGAATAGA AATTCTAGGC AGCAACTCCA TTCTAACCTG TGGATTTGAC AAGACGCTCA AACTGTGGCA GTAAAAGTGG ACTTGTGAAA ATTGAACTGA GATTGTTCTA GATCCATTCC TATGTTTATT TCAATGTAAA GCAATCTACT GCCAATGAAT CGGATTTAAT GTAATGTAAA GTAGAAAGCG GCCAGAGGGA GGGGCAATTC CTTTCAGTTA CATAAC
|
Protein sequence | MQGGQQHNAA QTEVLSLTAA SRAQQEVHQA MLLDLEAKKI AASLDVPTLP EQVRAALREM GQPVRLFGEN LADVRQRLRE AMAHQKVSLD AASLFKEEDL TGQRGANERY EEEVTKYTRA EQELIEARQA IANFSLKRAG ARLERERRLR LQANRRKRKI DEKPGTSTEV DVLDESCHKM YQSIQMMALQ GSQYGDSRVV SCISAQTLDG IPVVATGGWT GSVQLWDGSS SALEILGGKT MCHEDRIMGL DTMKVNEDLA IMATTSIDLT AKLYRVQSAH TQAVLHGHQS RLCRAAFHPM QRHVATTSFD HTWRLWDIET SQNILLQDGH WKECYGVGFH PDGSLCATTD FGGIVQVWDL RTGKSIKHFL GHAKRVLNAI FHPNGFQLAT AGDDGTIKIW DLRRRKLAAS LPAHSNVVTK LQFDASGEYL ASSSYDGTAR LWGCRDWKML RQLQAHEGKL SGIEILGSNS ILTCGFDKTL KLWQ
|
| |
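The sequences above are displayed in space-separated blocks of ten characters, so the spacing has to be removed before any computation. The sketch below shows one way to do that and to compute a GC fraction like the "GC content" field. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the database may compute its figure differently (for example over a different span, or with other rounding):

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C in an already-cleaned sequence."""
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First two ten-base blocks of the gene sequence shown above.
example = "CCAGTTCCTA TCCAACAAGT"
cleaned = clean_sequence(example)
print(cleaned)               # CCAGTTCCTATCCAACAAGT
print(gc_fraction(cleaned))  # 0.45
```

Passing the full gene sequence string through the same two functions would give a value to compare against the listed 50%.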