Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_41359 |
Symbol | |
ID | 7199214 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011697 |
Strand | + |
Start bp | 267845 |
End bp | 271091 |
Gene Length | 3247 bp |
Protein Length | 1009 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002185300 |
Protein GI | 219130288 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCAAAG ACGCTCCTTC CCACACAGTA TCCGCAGCTG AAGCTGCCGT AAATAAGACG CATTTCAGGA ACAACACGGC TAAGCACGCT TCGGTGGATG CGGATGCCGT CAGTCACGTT TCTAACTTGA CTGCGGATGA TGCTAGCACC GTTGTTTCGT TTTCCGATCG GGTCACGTCC AGTATTGGAT CCGTCTTGGG ACAAGCCAAG CGCATCTTGG AAGGCGAAGC ACCCATTCTA GAAGATGAGA CTGATGGTGA CGAAGTTGCC GAGAGTCCGT CCAACTTGGT AGTCACGTTG GAACAGGAAA TCGGCAAGTC GCGGAGTGCC CGTTCCCTTG GTGAGGATAA CAACAAGGGA GAAAGCGTGT CCACAGCACG CATGCGTGTC GCCAAGGACT TTTTACGGGA CGAACGCTCG GAGATGACAT TTTCCCGTCG AATCGCTCTG GCCTTGTCTC ACAAGTCCTG GTACAACCCG CGTGCGAAGG AGGAACCAGA GTTCGCCCCC AATGAGATAG AAACACCGGA AGCCTCCACA AGGATCGATT CGCAACATTC GATGCCGGTG CTGCCCGTTG GGGGGCCCAA CATTGAGGCC TATCCCTTTA CCCACAGCCG TAGAGAAAAC CCGAGTCTGG GAAAAGCCTG GGCTTGTACG TATGGTACAC CTTTCGGTCA TCCAATGACA ATGCCCGAGA CACCTTGACA TATGGCTACT CACACCTTGC AATTTTTGTC TTTCTGGCAG ATTTTGAGCA TGTTGCCATG CCTCGCTATG TGGTGGAAGC CAAACTGGAC CAGCGCCGAA AAAACATTCT GCATCGCATC GTTCGGAAAT TCCAAAAAGC CGACAAACAA CTTCAACGGG CGGAGCCAGG CGAAAAATAT TTGCCAACTA AACTATATGG ACCCATCTGC ACACCGCACA AACAGCTTGG TGACTGGGGT CTTGGCTTTG GTCTCTATTT TTCGACGCTC AGGGCAATCA CTGTACTGAC CTTCTGTGCT GGTTTGCTCA ATATACCCAA CTTAATTTAC TTTTCTTCCG AATCCTATAG CAGCGGCCAA GACGGCGTGA TTCCGCTGCT GCAGGGGTCC GCGATTTGTA CCGATACGCG GTGGGTGCCG TGTCCAAATT GCACGTCAGG TGATTTTGAA GCTACTCGAT TCGCTTACGG AACCAACGAT GCTGGTCTCA ACGTGACCTT TGTGTTGCGG AACACCTGCG AGGGTGCCAC AATAGAGCAA GGCTTTACCA ATTACGCGTC CCTGATGCTT ATCATGTTGG GCACAGTGTT TTTGAATCGC TATCTGAAAC GCATGGAGGT TGCCTTTGAT GAAGACGAAC AAACGGCACA AGATTATTCG ATTGTGATCG GAAATCCACC GGGTGACGCG ACCGATCCCG ACGAATGGCG AATTTTCTTC CACGATTGCT TCGATGGTGC CAAGGTGACA GCACTGACGG TGGCCGTGGA CAATGACTTG TTAGTCCGAT CGTTGGTGGA GCGCCGCGAA AAACTGCGAG AAATTGAGAT GATGGTTGAG CCGGGCACTT CGCTGGATAC GCTCACTTTG GCTGGTATCG CTGCCAAGCA GGAACGGGAA CGTAGCGTGT GGGGTCGTTG GAAATCAATG ATTATTCCAG GCATTCCGGA ACTGTTCAGC CGCACGGTCG TCCTGACAGC CAAAGTCCAA GGACTGGCGC AACAAGACTA TCCAGCCACA AATGTATTCG TGACCTTCGA AACCGAAGCT GATCAGCGCC GCGTGCTAAG TGCCTTGTCG GTTGGTAGCT GGGACGTTCA GCGCAATCGA CAGAGCGCCA TCGCTGACCC CAAACATTTG TTTCGCAGTG AGCTCGTCTT GTCGGTACAC GAGCCGGATG AACCCAACAC TGTTCGCTGG CAAGACTTGA ACGAAAAGTT CAAAGATCGA CTCAAGCAGC AATGCCTTAC CACTCTTTGT ACCTTGACAG CCATCATTCT AATTGCCTTC GTCATTTTTC TTGTCAACGA GCAGAGCATA ACGTTTTCGG CGTTTGCGAT TGCCATTTTC AATAGCATCT TTCCTCTTTT TGCCAAACTG CTGACTGGCA TGGAGGCTCA TTCGTCGGAA GGTGGAAAGC AGAGGTCACT TTACTTTAAA ATTGCGGTCT TTCGGTGGGT GAACACGGCG GTCGTGATTA CAATCATCAC TCCCTTCACG TCAACCTTGA CAGACGGTGG CTTGGTGAAT CAGATTTATG CTCTGTTCTT CGCCGAGATT GTTACAACAA ATGCAATTCA GTTGCTGGAT CCTGTTGGAC ATTTTCAACG CCACTTTTTA GCGCCGCGGG CAAAGACACA AGATGCTATG AATCTTTGTA TGCAGGGACA GCAGGTTGAG CTTGCTGAAA GGTAAGAATG CTTGTTTTCT GGTCGTTTAT CTATACTTTT CATCTCGACT AAATTTCCCC TTGATTTTTG CCTACAGATA CACAAATATG ACCAAGGTTT TATTCTTGGC ACTGTGGTAT TGTGCCATTT TCCCTGGAGC CTTTTTCTTG TGCTCCTTCG CTCTTCTTAT CAACTACTTC ACTGATCGGT TCAGTCTTAT GCGAACATGG AAGCGTGCTC CTCAGCTTGG AGGAAAAATT TCATCTTTCA GCAGACGCTA CTTTTTTTCA TTGTCTATCG TAGCAATGGC GCTCGTATCG TCCTATTACT GGTCAGCCTT CCCATTCGAC AATGTTTGCT CCACCGAATT ACCGGTAAAC ACGTCATTTG TTGGTGTTTG GAACATAACA GGGTTCGCCA AAAATGATGA AAAGGAACCC ACTTTCCAGC TATCTTTAGT GGAGGATGCG GATACTTCCT TCTTTTTCTG TGTACAAGAC TTTTTTCGCT ACGAGGCCGA GGAACAAGCG TTTCCCTTTA TACCAAAGTT TCAACGCAGC GGAGAGGAGT GGATGACCAG CGATCAAGAG ACTTTGACGG CTGTCTATGG TTGGACTGTC GTCGCCGTCG CTGCTCTTGT TCTACTCAAG TTCATCCATG GTTGGTTCAG CAGTATTATG AAAATGTTCC GGGGGACTTA TAAGCCTTGC GGCGATGACC AGACTATCAA TTTTAGCGAT GTCCCGTCTA TTTCGGCCTA CGTCCCACAG GTCGTAAGCA ACCTTTTTTC GTATCCCTTG CTGGCTTGCA ATTTTCAAGG AATTGACGAA GATCTTATGG ATTGGAGTGA CCCGGACCGA CCTATAG
|
Protein sequence | MTKDAPSHTV SAAEAAVNKT HFRNNTAKHA SVDADAVSHV SNLTADDAST VVSFSDRVTS SIGSVLGQAK RILEGEAPIL EDETDGDEVA ESPSNLVVTL EQEIGKSRSA RSLGEDNNKG ESVSTARMRV AKDFLRDERS EMTFSRRIAL ALSHKSWYNP RAKEEPEFAP NEIETPEAST RIDSQHSMPV LPVGGPNIEA YPFTHSRREN PSLGKAWAYF EHVAMPRYVV EAKLDQRRKN ILHRIVRKFQ KADKQLQRAE PGEKYLPTKL YGPICTPHKQ LGDWGLGFGL YFSTLRAITV LTFCAGLLNI PNLIYFSSES YSSGQDGVIP LLQGSAICTD TRWVPCPNCT SGDFEATRFA YGTNDAGLNV TFVLRNTCEG ATIEQGFTNY ASLMLIMLGT VFLNRYLKRM EVAFDEDEQT AQDYSIVIGN PPGDATDPDE WRIFFHDCFD GAKVTALTVA VDNDLLVRSL VERREKLREI EMMVEPGTSL DTLTLAGIAA KQERERSVWG RWKSMIIPGI PELFSRTVVL TAKVQGLAQQ DYPATNVFVT FETEADQRRV LSALSVGSWD VQRNRQSAIA DPKHLFRSEL VLSVHEPDEP NTVRWQDLNE KFKDRLKQQC LTTLCTLTAI ILIAFVIFLV NEQSITFSAF AIAIFNSIFP LFAKLLTGME AHSSEGGKQR SLYFKIAVFR WVNTAVVITI ITPFTSTLTD GGLVNQIYAL FFAEIVTTNA IQLLDPVGHF QRHFLAPRAK TQDAMNLCMQ GQQVELAERY TNMTKVLFLA LWYCAIFPGA FFLCSFALLI NYFTDRFSLM RTWKRAPQLG GKISSFSRRY FFSLSIVAMA LVSSYYWSAF PFDNVCSTEL PVNTSFVGVW NITGFAKNDE KEPTFQLSLV EDADTSFFFC VQDFFRYEAE EQAFPFIPKF QRSGEEWMTS DQETLTAVYG WTVVAVAALV LLKFIHGWFS SIMKMFRGTY KPCGDDQTIN FSDVPSISAY VPQVELTKIL WIGVTRTDL
|
| |