Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_45417 |
Symbol | |
ID | 7200535 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011675 |
Strand | + |
Start bp | 97302 |
End bp | 100232 |
Gene Length | 2931 bp |
Protein Length | 976 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002179579 |
Protein GI | 219117572 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAAGAAT TTTCTTCACC CCTCACGAAG CATCCGATCC TGATCGACCT TACACACGAC GACATCGAAT CGCCAGCTGC GGAGGAGCAA GTGCATTCGA CGAAATTGCG GGACAGAGTC TTTCCTGAAC CTAATACACC GGAGGCTTTT GCTTGTTCGA CTACGGATGC GGCGGTATCG CGATCGGCAG CATCCTCCAG GCAAATTGTC ACGCCACCAG TGGCAGGAAC TCGTACTGTA TCCAATTCAC AGGACCCAAA TGTTATTGAT CTCGTGAGGG AAGATTCGGA TTCGATTTCT TACTCTGAAG TGTTGCAAAT CGATAGGAAT GGGGGAAAAG TGAGCTGGTC ATCCGTCGCA GAAGAGCAGA CGGACAACGT CCGCAGCCAA GATCACCGCG ATCGTCTGTC GCGTACGTGT CCTCCACGCC GAGAATATCG ACGATCTAAA CAATGGACGA AAGCCAATTT CAATTTGGGT CAAACCTTGG GAACGCCCAG ACAACGTGAG GGCACGGACC TTGAACTCCA TGATTTCCTC GGCTTTCGTA ACTCACTTGG CCGTGTGTAC TACAAATCCT TTTCTTTGAA GAATGACCTG TGCCAGGGAA CCTTCCGTGT TGGTTGTTCG GTGTTGTTAG CAACAGAAGA ATACGAAGAG CCTACTTGTC ACCGGATCAT ATCCATCTTC CAAGCTACCA GGTCCTACGT GGGACAATGG TATGGTACTC AAGAGATACA AAAACGCAGC CACTGCTATT TTGAGTTCGA ATCAGCTTCA GCTGAGCAAA ACTCTGTTTA TGAAATACCG CAGTGGATTC TTCATCCTTA TCTGAAGGTG GTCAATTGCG TAGCCATAGA TGACTTAGAA AGAGGCAACG AGACCTCCGC AATGCGTCTA CTTCTAAGAT CGGCATCACG AAATTGGCAT CCTAAGCTGA AAGCCGAAGA GTGTACTTTC GTGTGTCAGG GTAGCCCAAA GTCCCGCCAA CTTTCGACGA CCGAACGATG GAACCATGAT TCAAGTGACG ATTCCGATGA ATTCGAGGAC GCCTCTTCCG AACTCGCAAA AGCCGCAGCC ACGAAAAAAA GAAAAAGTGA AGGTCGTGCC GATTCCTGTC AGCGAACCCC TGTCCAGAAA CATAAGATTG CGCGTGGGCG AAAAAAGGCT TCGTCCGTGA GATCGTCTCG CCCAAGACAA AGTTCGGGAG CAAGGAAAAG GCCAGCTTGC CCTATGCAAA AGCAGCGGTG TCATCCTTCG ACAATTCAAA CATCAGACAC CGAGAAGATG TTCATCGAGA ATAGCACTGG GCACGCTGTT CTAGGCAATT TATTGGTACA AAAGATCCGG GCTGGGGCTA CTATGAAACG AGATAGCTTG TTGTCTTATT TTCTGAATCA TCGTTGGCTG CTAGTCGAAA CTCCACTTCA TAATTTTCAA GGGCTTTTTG ACTGTCGCTT TTCAAAAAAC GGTGAGACTT TTGTTCTGAC TTCCGTGAAG GCTCAGGCTG ATACAAGCGT GGGCTCGCTC AACAAGATGG GTTCGCTAAA ATGCGCGCAC TTGATTTACA CCAAAATCTC GGGACCGGGT GTGGCTCCTA TATCTGTTCA AAATCTGCTC TTGCAATTTG GTGACTTTTC CATTCTACCA GCAAGAAAAG TCGTTGCCCG CCTAGAATTA CTTCAGAGTC CTTCTTGTAC GTTTACGGTT GGGCAGAAGA AGCACTATGG TATGTTTTGC CTCCAAGCAA GCGACTTTGT TGTGATGGCC GAAGAAGGAA ATGACGGATG CGGCTTTATT TCGGAAGAGT TGCTGGCATC GTTGTTTGGG AATAGTAAAG CAGCGAAGCA ACTCCTTGGT CCACAGGTTC GAGTGGTTGC TCCTCGACTA GGCATTTTTA AAGGCATGTT GATTCGCAAA CGGATACCAG TGGGTGAGCC TCCGATACAG CTGACACCTT CAATGCGTAA GGTTGGGCCT TCCCGATACT CCGAAAACGA CATTCGAGCA TTTTTGCTTG TTACAAATCA AGGAAAGCAT CCCAGTGTGA ATAATGACGC ACTTGGAAAG CTACTTAACC CTTTGCTTGA CAATCCTCCT CCCTCTTGGA AACAAAACGG TTTTGCCGAG AGAAGTCAGA TGCTTCCTCT CCTCTTGCGT ACTTTGGGCG TTCCAGCAAT TGTCATGGAA CGCTACCAAA GGGAATATTA TTCCCAGACA CGTTGCCGCA TACACCACAC ATTTATGCCA GGGTACGCTG ATCCAACGGG TGCGATCCCA CATGGACATG TGTTTGTGAC AGGAAGCAAA CCGTTTCAGG AGAACCTTCT TTTTGTGACC AGATCGCCTT GCATCTTCCC AAGCGATGGG CGGTTGCTGC CGAACTTGGT GACGAAGCCA AACGCTATGG TAATCGATGA TTGGAACTGG CTCAATTCTC TTCCTTTTGG GGCTCTTATA TTTGCCGACG CTACTCCCGG AATGAAACCC TTGCCAGCAC ATATTGCTAA CGGCGATCTT GACGGAGATC TATACTTTGT GTGTTGGGAT AGCGAAATCT TAAGGAATGT ACGAGCCGAT CCTATTGTGG AGGAACCTCT GACCTTAACG GATGGTGAAG TTGCCAGTAC TCCGCAGGCA AAGATGCCTC CGGAGAATCC CAATTGGTTT GAGGAGGCTC TAGAAATCAT GTGCGATCCT GCTGAATTGG CCGAAATCTC CGCATTTTAT GGAAAGCTTT TCAATCTCGC CTTGAAAGCT GCGTTGAACA ACCCGAACAA TTTGCTGTTG CGGGATCCTG ATGCCATGGA CTACGCTACG GCCTACAATC AAGCGTTGGA CTACCACAAA CATGGTCGTC TTGTTCAACT TCCCAGAAGG CTCCATTCTT CAATTCCCAC ACGCTTTCAC CAGTATCTTG CGAAAACTTA G
|
Protein sequence | MEEFSSPLTK HPILIDLTHD DIESPAAEEQ VHSTKLRDRV FPEPNTPEAF ACSTTDAAVS RSAASSRQIV TPPVAGTRTV SNSQDPNVID LVREDSDSIS YSEVLQIDRN GGKVSWSSVA EEQTDNVRSQ DHRDRLSRTC PPRREYRRSK QWTKANFNLG QTLGTPRQRE GTDLELHDFL GFRNSLGRVY YKSFSLKNDL CQGTFRVGCS VLLATEEYEE PTCHRIISIF QATRSYVGQW YGTQEIQKRS HCYFEFESAS AEQNSVYEIP QWILHPYLKV VNCVAIDDLE RGNETSAMRL LLRSASRNWH PKLKAEECTF VCQGSPKSRQ LSTTERWNHD SSDDSDEFED ASSELAKAAA TKKRKSEGRA DSCQRTPVQK HKIARGRKKA SSVRSSRPRQ SSGARKRPAC PMQKQRCHPS TIQTSDTEKM FIENSTGHAV LGNLLVQKIR AGATMKRDSL LSYFLNHRWL LVETPLHNFQ GLFDCRFSKN GETFVLTSVK AQADTSVGSL NKMGSLKCAH LIYTKISGPG VAPISVQNLL LQFGDFSILP ARKVVARLEL LQSPSCTFTV GQKKHYGMFC LQASDFVVMA EEGNDGCGFI SEELLASLFG NSKAAKQLLG PQVRVVAPRL GIFKGMLIRK RIPVGEPPIQ LTPSMRKVGP SRYSENDIRA FLLVTNQGKH PSVNNDALGK LLNPLLDNPP PSWKQNGFAE RSQMLPLLLR TLGVPAIVME RYQREYYSQT RCRIHHTFMP GYADPTGAIP HGHVFVTGSK PFQENLLFVT RSPCIFPSDG RLLPNLVTKP NAMVIDDWNW LNSLPFGALI FADATPGMKP LPAHIANGDL DGDLYFVCWD SEILRNVRAD PIVEEPLTLT DGEVASTPQA KMPPENPNWF EEALEIMCDP AELAEISAFY GKLFNLALKA ALNNPNNLLL RDPDAMDYAT AYNQALDYHK HGRLVQLPRR LHSSIPTRFH QYLAKT
|
| |