Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_31525 |
Symbol | |
ID | 7196692 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011669 |
Strand | - |
Start bp | 284490 |
End bp | 287886 |
Gene Length | 3397 bp |
Protein Length | 1104 aa |
Translation table | |
GC content | 47% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002177057 |
Protein GI | 219110611 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACATTGT CAAACTACCC CCGCCATCCG TCGCCGCTGG GTGGATTGAT GGTGCCTAAA TTCAACACAA TGGTTTTGTA TTGTACAATT TTAGTCATAT TTATTGCGCT GAATATTGTT TTGGCGGGGA AGGGGCCGGT TTCGCAACGA CGCGTCGCGT CCTCGGCGCC TCTTTTTCAA TTTGACTTGC AACCGACAGG ATTTCTAAAA GAGGCTCGTC AACGCCGCAG ACTACAGAGC AACTCGTCTT TCGTCATACC GGATGAATGG CTGCGACCGG AACGAAGTGC CGTTTATGCC GGTATTCCTG ATGCCCGTGA TCCACGTTAC CGACACAACC GTCAATTGCA AGAGTTTCAT GACCTCCGAC ACTTGAGTCG ATACGAGCAG GCTTATCGGA CCCAAAATAA TATTGATTTA CGAGAAAAAT GGGATGAAGA ATACTCTTTT GAAGACGAAC AAAAGGAGAT TCCCAAAAGC GCAGAGCTTC GCAACCAGAC TGCAGAAAAT CGATTGCGTT CACACTCCGA CAAAAGGAGA AGGACACAGG AGGCTGCTCC CGTTGCTGGT GGACAATACA ACAATTATCA GGCCGTACCC TTGGCACAAG GCTATGGAAC GCACTACGTT AATGTCTGGG TGGGATCTCC CTTTCCGCAA CGAAAGACGG TCATCGTCGA TACCGGCTCG CACTACACTG CTTTTCCCTG TAATGGATGT CAAAATTGCG GTTCGACGCA TCATACCGAT CCTTACTTTG AACCAAAAAA GAGTGCATCG TTTCATCAAC TCCAGTGTGA CGAATGCCGG GATGGTATCA CATGTCAAGA CGGGGAGTGT AGGTTCAGCC AGTCCTATAC GGAAGGCAGC TCATGGGACG CCGTACAAGT TTTAGATCGA TTCTATTGCA GTGGTTCCGA TATTATCGAC TCCGTTTCTT TAGAAGACCA ACGAAACTCA ATCGACTTTA TGTTTGGATG TCAAAAGAGT ATGACTGGCT TGTTCATTAC GCAGCTTGCG GATGGAATAA TGGGAATGTC AGCACACCAG GCGACCTTAC CGAAACAGTT GTACGACAGA CACATGATTG AACACAATAT TTTCTCCATG TGTTATCGTC GAGAGCTGGG CACAAGCAAG CGTGGTGTCA TGGCAGGTAG TATGACAATT GGAGGAATAT CCACCAACCT CGATACCAGC CCCATGGTGT ATGCCAAGAA CATGGCCAAA ATTGGATGGT ATACAGTCTA CGTCAAGAAC ATTTATATTA GACAAGGAGG CGGACAATCT GCCAAGAGTG TGGATCCTGA TCACCGTACG ATCAAAGTAA AGATGAATCC TGCTGTTCTT AATAGTGGTA AAGGTGTCAT TGTGGATTCG GGAACAACCG ATACTTACCT CAACAAAGAT GTAGCTCCGG AGTTTAATAT GGCTTGGCGT CAGGCAACTG GTCAGTCGTA CTCTCATCTA CCGATGAGAC TCTCGCCGGA ACAAATTCTG GAGCTGCCTA CTGTTCTCGT GCAATGTCAT GCCTACAGAG AAAATTTGGA TCCGTCAATC GAGGGTTATG AAGATATTCC TGGGTACGCT GGTCGTTTGG ACCCATCGTC CCCCAATGAT TTGCTCATTG CGATTCCAGC CACCAGCTAC ATGGACTTTT CTCCGATCAC GTCCATGTAC ACCAGCCGAA TCTATTTCAG TGAAACATCT GGCGGAGTCT TGGGAAGTAA CACTATGCAA GGGCACAACG TTGTCTTCGA CTGGGAAAAC GGCCGTGTTG GGTTTGCAGA GAGTTCTTGC ACGTACGACA AGAAATCCGT GCCGGAAGTT GCACAGGATA ACGGATACTC AAAGGACTGC ACAGTGCACG CTCCAATTCT ATCTACGCCA TGTATTGATA CCGTGCACCG GGAAATTTGC GAACACGCCT CCTCGAATAT TGCTCTCCTG GGCAACGAAA CTTGGACTGG TATTGTCGAG AGTGCTGGCA GCAAAGAGGG TGTTCAGTGT ACTGAAGTAG CAAGAGAATC TTCATCAAAG AGTGTGTTCC AGAACTCTGA CGTTGACTGC AATGGCAAGG GAACTTGCGA AGAGAAGAGA TCGTGTCAAC TTACATGCGC CGAAGCCATA GTAGCTGCGA ATGTGTCAAA AGCGCCTATT TCTGAAAGCA TCAGATATGA CTGCGGAGAT TCTTTGTGGA GCACATGCGA TCATGGGTGC GAGCAGACTC GGATTGTATC AGCCGCTCAT ACAGACGGAA TCTGCCACGA AGAGCACAGA TTCTCCCGAC CTTGCCATAT CGAAGCTTGC GCCCGTTCAG ACCCTTGTCT TGTCCCATTT CTCATTCATA CAGTTGTTGG CCTTCAGGGA ATATCGGTTT CAAAATGGAC AGGATCCTCG GAAAATACTT TTGTTTCAGC TCTGACAAGT GTGGCACGTG CACTTAATCC ATTAGAAACA TTTGGCGAAG GTGATGTGAA TGTGCTGCTA GCTATTCCAT GGCATGTGGA CGAAGACGAT CCAGACCAAG GTACTCATGT ATCCAAACCT ATTGGAACAA AAATTATTCT GGAGATTTCA ATTTTTAACA ACCTTTCCAA TGCTACATCC ACTGTAACCA GCGATACAGA CGATTCATCC GTCAAAGGAA TACTGTGGAA TATTACCGAG CGCATAAAGA CGCGGCTACC CGATACGATC TGCAACTCGG ATGATATGTA CACGCTTGCA AAGAAAACTC TCTCCATAAA AAAGCGTGTC TTAGAAAGTC AGCTTTTCAT TGGTTCGCTT ATTCACGAAA TGGAGAGAAT AGAATTATCA GACCCAATCT CAGCGGCAAT TTCGATGTTT TCGCCACTTT TCCATACAGT TTCCCTGGAG AGCGAGAATG AAAGCCGAGT TGTATCTTCT TGGACGATCC AGACAACGAT CGATGATCAG ATCAACTACT TCGTAAGTGA AGCAATAGAA AAAGTCTTTG CTGAGTGCAA TGACATCAAC CTCACTGCAG TCAACTTGAC ATTTTCCCAA ATAGGGACCG CCCAGGCCAA TTTGGCATAC AATGTTAAGC TTCATTCATG TCGTGTTGTT GACGATGATC TTTTTTTTAA TCCTGACATC AGCTTGGACA CTGCTCGTGT CTATGTATGA ATATCTACTG GAGCGCGGTT GGTTTCTACG AGTAAGGAAA GGCCGACATC GATATTCTCC AGCTAAGGCC TGTGACGACT CACAGACAGC TGATCAGGAG CTCGAACTGG CGGATGGCGG TAATCTGGAA ATGACGGTAC AAAACACTGA TTTTCAAAGC CGAGGCGGAA CTAAAGCCAT CAAGCGAAAG ACATCTCCAG CCGATCGTGA AGGCACGATT GTGAAGAATA TTTCGAGAAC GACCTAG
|
Protein sequence | MTLSNYPRHP SPLGGLMVPK FNTMVLYCTI LVIFIALNIV LAGKGPVSQR RVASSAPLFQ FDLQPTGFLK EARQRRRLQS NSSFVIPDEW LRPERSAVYA GIPDARDPRY RHNRQLQEFH DLRHLSRYEQ AYRTQNNIDL REKWDEEYSF EDEQKEIPKS AELRNQTAEN RLRSHSDKRR RTQEAAPVAG GQYNNYQAVP LAQGYGTHYV NVWVGSPFPQ RKTVIVDTGS HYTAFPCNGC QNCGSTHHTD PYFEPKKSAS FHQLQCDECR DGITCQDGEC RFSQSYTEGS SWDAVQVLDR FYCSGSDIID SVSLEDQRNS IDFMFGCQKS MTGLFITQLA DGIMGMSAHQ ATLPKQLYDR HMIEHNIFSM CYRRELGTSK RGVMAGSMTI GGISTNLDTS PMVYAKNMAK IGWYTVYVKN IYIRQGGGQS AKSVDPDHRT IKVKMNPAVL NSGKGVIVDS GTTDTYLNKD VAPEFNMAWR QATGQSYSHL PMRLSPEQIL ELPTVLVQCH AYRENLDPSI EGYEDIPGYA GRLDPSSPND LLIAIPATSY MDFSPITSMY TSRIYFSETS GGVLGSNTMQ GHNVVFDWEN GRVGFAESSC TYDKKSVPEV AQDNGYSKDC TVHAPILSTP CIDTVHREIC EHASSNIALL GNETWTGIVE SAGSKEGVQC TEVARESSSK SVFQNSDVDC NGKGTCEEKR SCQLTCAEAI VAANVSKAPI SESIRYDCGD SLWSTCDHGC EQTRIVSAAH TDGICHEEHR FSRPCHIEAC ARSDPCLVPF LIHTVVGLQG ISVSKWTGSS ENTFVSALTS VARALNPLET FGEGDVNVLL AIPWHVDEDD PDQGTHVSKP IGTKIILEIS IFNNLSNATS TVTSDTDDSS VKGILWNITE RIKTRLPDTI CNSDDMYTLA KKTLSIKKRV LESQLFIGSL IHEMERIELS DPISAAISMF SPLFHTVSLE SENESRVVSS WTIQTTIDDQ INYFGPPRPI WHTMLSFIHV VLLTMIFFLI LTSAWTLLVS MYEYLLERGW FLRVRKGRHR YSPAKACDDS QTADQELELA DGGNLEMTVQ NTDFQSRGGT KAIKRKTSPA DREGTIVKNI SRTT
|
| |