Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_46014 |
Symbol | |
ID | 7200870 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011676 |
Strand | + |
Start bp | 967161 |
End bp | 969353 |
Gene Length | 2193 bp |
Protein Length | 588 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002180153 |
Protein GI | 219118773 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CTCTGTGCAC TTTTGGAATT TTGTTGTTAC GTTGAGTTTG CTGGTAGACG TACGCTTTGT CCTCTTTGTA GTTTACCGAA ATGGGAGTGT GGTCAGAACC TATAAAAACC TGATCTAGAG TCCCCCAAGT TTTGGAGTAG ATATTTGCAG AACTTGGAAA CCGCACACCA AAGGTTGAGG ATCAGATTCT GTGCGTGGCA GTTTTCGTGG GATCGGTCGC ATCAAACCCT TATCATTTTC TGTCCTCGTT AAAGGCACGC TGACAGAGAC ACATGGCCGA CTCTGTTTCG CGATCTCGTG TAAAGGACTC GGACGATATC GAAAAGTCCG TTCGACGGTC ATCCCGTATT CGCCGACCCC AACGAATACC CATCCACCGA CTTTTTGAAG ACGATGATTT GTCCGTGGAA TCCGACTCTG CTTTGAGCCA GGCGGGAATA CACTCGTCAA GACAAAAGTA TAATCGAAAG CGAAAGGCTG GTGAAGGCTT GACGCACGAA AAATCAGAAG CGAACAGTAG TAAAAGAAGA CCCAAGGTGT CTGACGTCCA CGGCGCACGG GGGCCAGAAG TCAGAGTGCA CAAGAGGCTA ATTCCTGTAG AAGACGAACC CGCTTTAATT GAGACGATTA AAAGGGAGCA TGCTGATCTT TTAAGAAGCG ATGAAGCGCT GGAAGTGGAT AGGGCGAAGA AAATTAAGCT GCAGATGGTT CTGGTCAACA ACGAATGGGA ACAACGATTC CTTGAGTTTC TCGTTTTCCA ACGTGTCTAC GGCCATTCTC TCGTGCCTAA AAACTTTATG CCAAATAAGC AATTGGGACG ATGGTGCGCC AAAGTTCGTT GTTGGTATAG TAAGAATGAT TCGAGATTGA CCCCCTCAAG GCAACGGCGA CTTAATGCAG CGGGTTTTAT CTGGAAAGCC AAGAAAGATC CACAATTTTG GAAGATACAA AATCAATGCG AGCAAGCTAT GGATAAGTGG GAAGACTTTT TTGATCGACT TCTTGTTTAC AAAGCTCAAA AGGGCGACTG TTTGGTACCA AAAGAGTATC CTGAGGACAT GGTACGTTTA CGCTATAATG TCAACCACTT TGAAAAATCC AGAAAGGCGT TGCTGATAAG ACCTCTGAAT GCTGCAGACC CTGGCTCGAT GGGTTGCTAA AACTCGAAAG CACTACAAAG CCAAAAAAGA AGGTCGCTAT CACACTTTGG ACGACGATAA AGAAATGAGA CTGGTGGAAG CGGGTTTTGT TTTCAACTCG AAAACTCAGG AGCGTTTGCG ATTTACTGTG CTCAAGCGTT TTGAAGGACG TTGGGAGGAG TATTTCAGCA AGCTTGAGAA ATACAAAGAG CGCTTCGGCC ATTGTGTCGT TCCTCGGCGG TGGAAAGAGG ACCAGTCGTT GGCCTCTTGG GTTATGCGGC AGGTAGGTCC ATCATCGATC CCACTACGTC CTCATGAGTA GCCAGCCAGC CATCTCATCA GATTTTGTTG TTTCTCAAAG CGATGCCACT GGAAGCGCCT TCAACAAGGG CTTCATAGCT ACTTGACTGA AGAGCGATTA GCAAAGCTTG AGGAGATTGG ATTCGCATTT GTGGTTGCTA GGAAAGGAAT GCCGCTACAT ACTGGCCCTG ACCAGGAAGG CAGCGAAGAC GAAGACGAAA GCAATGAGAA TGACTCGGAT GATGACACGG AAAGCCTTCG TTCGGACGAA GTCCAACCAA AGATCTCCCG AAGACATAGA GAGAACGCTG GAATGGGCCT TGAAGGTGAA ACACAGAGAT CGAAAAGGTC AGCCCTCCGA GAATCTGTCT CTTGTGTTCC TGCGGCATCC GTGTCAGTAT ATGTCACAGC GGAAGTAGCT TCGGGAGAAG ATCTTACGGT TAGCGACCCA CATCAAAACA AGGAATCGAA GCCTGAATCG CTGGACCCCC CTTTTTCATC TCATTGTACT GCATCAAATC CCACCGTTCA CTATGATGGA GAGGTGAAAA GCACAAGGAG CTCTTCACAA GGTAAAAATT TATTGAGCGC TTTCCGGAGT TCTTTGGCAG TCAAGAAATC AGAGGAGCCT CCTGTCATCA AAGGCCAATG GCGCTGCGAA TCATGCGGAA AGGACGAATT TAGCGCGTTT GCTGAGTTTG CAGCTCACGA GCACTCTTGC TTGGCGCTAC CAGAATTTGC CACTGTTCCC TGA
|
Protein sequence | MADSVSRSRV KDSDDIEKSV RRSSRIRRPQ RIPIHRLFED DDLSVESDSA LSQAGIHSSR QKYNRKRKAG EGLTHEKSEA NSSKRRPKVS DVHGARGPEV RVHKRLIPVE DEPALIETIK REHADLLRSD EALEVDRAKK IKLQMVLVNN EWEQRFLEFL VFQRVYGHSL VPKNFMPNKQ LGRWCAKVRC WYSKNDSRLT PSRQRRLNAA GFIWKAKKDP QFWKIQNQCE QAMDKWEDFF DRLLVYKAQK GDCLVPKEYP EDMTLARWVA KTRKHYKAKK EGRYHTLDDD KEMRLVEAGF VFNSKTQERL RFTVLKRFEG RWEEYFSKLE KYKERFGHCV VPRRWKEDQS LASWVMRQRC HWKRLQQGLH SYLTEERLAK LEEIGFAFVV ARKGMPLHTG PDQEGSEDED ESNENDSDDD TESLRSDEVQ PKISRRHREN AGMGLEGETQ RSKRSALRES VSCVPAASVS VYVTAEVASG EDLTVSDPHQ NKESKPESLD PPFSSHCTAS NPTVHYDGEV KSTRSSSQGK NLLSAFRSSL AVKKSEEPPV IKGQWRCESC GKDEFSAFAE FAAHEHSCLA LPEFATVP
|
| |