Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_43406 |
Symbol | |
ID | 7197425 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011670 |
Strand | + |
Start bp | 363062 |
End bp | 366848 |
Gene Length | 3787 bp |
Protein Length | 1209 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002177605 |
Protein GI | 219111707 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.738221 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCTCGTG GGCCACAATT CTTGGTGTCA GCAGGAAAGG TGGAGAGCCT TATCATCGAC AGTACACAGT ACACAAGAGG ATATAGATGA GGACTTGATC GAGCTCGGCC GAAGTGCTCT GGCTGGATAC TTTAACTTCC CTCTGGATGA CTGGCAGCTT AAGGCGGGTG GAGCGATTTG TCAGGGCTGC AACGTAATTG TATGCGCCCC GACTGGTGCA GGGAAAACCG TGGTAGGAGA AATGGCGCTC CTACATGCAT TCAACGGAGG TGACAAAGGA ATTTACACGA CGCCGCTCAA AGCGCTCAGC AACCAAAAGT ACTCCGAACT GTGTGGTACC TTTCCAAAGC AGGACACAGG TCTATCGACA GGAGATATAT CAATCAACAA AGGTGCTCGT ATCACAGTCA TGACGACCGA AGTATATCGC AACATTGCCT GGCGCTCGTC CACACCGACG GCAACACTCA TGGGTACAAA CGAACTTTTG GAGAATACCG TAGTTGTACT GGACGAGTTC CATTACATGG GACAACCTGG GCGTGGAGGA GTCTGGGAGG AATCCATTAT TACTAGTCCC TCACATACCC AAATCGTAGG ACTTTCGGCA ACATTGTCGA ACGCTGCTGC ATTGGCGGCG TGGATGGAAC ATGTGACGGG ACGTAGAACA ATCCTGGTCG AGGTTCCGGG ACAAGAACGA CCTGTGCCAC TGCGATATCT GTTTGCTACT AAGGAAGCGC TTTATCCCTT GTTTAGGGAT CCAGACGCAG GTCCCGGTGC GCCCAAGGGG CTACTGGGAT ATCGGGGAGA CGGCGATCTC CCGTCAAACC GTCAATCAAC CAAAAAAGCA AAAGGATTTA CTCATATTAT GGACGGTGAT GATGTTGATG ACGATGAAAC AAATATTAAG ATACCACGTG GTCTGCAGAT AAATCCCGCT CTTAAAGCTG CTGCAAACAA GCGAATGCAG AAAGTGAATC GTGCAATTGA AAGGCAAAAA GTACGCCGTC ATTTGTCCCC ACAAGAGGAC GAAGATGACT GGGGCGGCCG AAAGCGGAAT CAGTCGTCAC GGAAAATGTC CCCACGAGAC GAGCGGAAAG AGCGCGAACG ATTGTTGAAA AATGAAATGC GCCGGTCCGT GCCGTCTTTA CCTGCTATTT TGAATAGACT AAAGCAAAAG GAGCTCCTTC CGGCAATATT CTTCATCTTC TCTAGAGCTG GTTGTGACGA CGCTGCCAGA CAAGTATATC AGTACATGAA GGGTCCGCGA GACCCGAATT GTCTATTGCA AGACGAAAGG GAACAGCTTG GCATCAAGGA AGAGCAAAGA TACGGAACCA AAAACGGGAA GTCACGTCAA AGAGCAGTTC AGAGACGAGG TGACCTCGTC GAGGATTCCG ACGGGCGGAC ATTTCGATCT AAAAGCAATT TTATCAGTGA ACAGACCCTG AATGCGTTGG AAATGTCGAC GCCACTGGCG GACGACGAAT TTGACGAAAG CTCACCGCTA TCACCCGATA ATTGGGATTT TTTTTCGAAA GCTGGGCTTT TAAGCTACGC CGAGGTTCGA GACGTTGCAT CCAGAGTGTC GCGCTTCAAT GCAGGCAATC CCGAAATTGC GTTTGAGGAT GAAGTGATTG AGCAGTATCT TTTCGGCGCT GGCTCTCACC ATGCTGGGAT GCTCCCAGCT CACAAATCGT TTGTTGAGAT CTTATACCGG AACCAACTAA TGAAAGTCGT GTTTGCAACT GAGACTCTTG CAGCTGGTAT TAACATGTAA GTATTTAGTT TTTGGTCATG AATAATCGAG ATCAAGATCC TGATCCTTTC CTTAATCAGG CCAGCGCGAA CGACAGTCAT TTGTGCACTT GCTAAACGCG GTGATAATAG TGCGATGAAC CTCCTGGAAA CATCAAATTT GCTTCAAATG GCGGGTCGTG CGGGTCGTCG GGGAATGGAT ACAGATGGAA CTTGCGTAAT TGTTGCCACG CCTTTTGAAA ATCATGATGA GGCTTCCAGG ATCCTTATTG ATCCAGTTAA ACCAATTTCA AGTCAATTCA GCCCCTCGTA TTCTCTCGCG ATCAATTTGA TTGCTCGAGG AGAAGGGAAG CTTGATGTTG CGCGGCAATT GGTCAGTAAA TCTTTCGCAA TGTGGGAAAA GCGTAAAGTC GAGCAGCACG TTGTAGATGC TGTGGAAAAT CATGGGGATG ATGTAAGCCA GGCTCTCAAA GTGTCTGCGC AAGACCGATT CATGAAAACT TTGGTAGATG CTCTTCAATC ACAGGTGGAT CAGAGAAGTG CTAAGTTTGA TGTAAACAGA GTGGAATCTC TTCTGGGAAT CCTTACAGAT CGCGACTCAT TGAAAAAGAC TTCAAAATCT TTCGTCGGTG CTACCAAAAT GTTCGAGCTT GAGTATACAA CACTTTCGTA TTTACAGAAG GAATACGACG CGCTCCGAGC TAGGGCTGAG ATCGAAGACA GTGACTTCCT CAGTGAAATT ATTGCTGAAG ATACCAAGGA CTTAATTAAT CAGATTGAGA ATCAAAGAAA GCGCGTCGAG ACTACCGAAA AAGAAATTGG GAAGCATCCA TTCTCTATCA TTACGGGTAT TGCAAATCAA ATTATGGAAG ATGCTGCTTT TCCTGAATCA ATCGTATTGA ATAAGGCACT TGGCAGTGCA CGGGAGGGGC ACAAACGGAC CGAGTTGTCG TCATTGACAG CTCAGGAATT GTCTCAGTTT TCAAAGTCAG CCATCATAGT TACTCGTAAA ATGCGGAAAG TCAAAACGGC CAATCCAGAC GTTGAAGATC TTTTTCAACA AGCTGAAGAC CTAAGAAACG ATTCGTGGAA TGACATGCTT TCCATTACCA AGACGCTGGT CGCATACGGC TGCTTATCAA TTGAACATAC TTTAGAAGGG GGCGAGTCGT ACGAAGACCA GCAGTACACG ATATCTCCAG CCGGAATCAA CATTGGTATG CTCGGCTTCG AAAACTCATT GTGGGCGCTG GTTGCCATGG GTGGTGCTTG GGACGTTGTG GGTGCATCGG CCAAACTCGA CGATTTCCGT ACTGCCATGG AAGATTTTGA CAATGACGAA GACTGGTACC AAAATCGAAA GGGTAACGAT GTTCTTATTC CGCCAAAAGT TGTTTCAATT CCTATTTCGC AAAAGGAAGC CGATACGCTT AATGGCTTGC TCCGAGCCAT GGACCCCAGC GAATTGGCTG GCTACGTGGC CAGCATAGTG ACAGACAACT CTCGTGGGAA TGGTGCACCG GTTGTTCAGC TCTTCCAAAA TTTAACCCCC CTTCAACAGC GCGTCATTCA AAGTTCACTT GTCTCTATGG AGCGCCTTGT AGAAGTGCAG AAACTTTATG GTGTAGATGA AAAGACACGA AGCTGCATAC TCGACATTTC CAATTGTGAA GTTGTCACAG CTTGGGCCTC TGGTTGTTCC TGGCAAGAAG CTCTGGAGAT TTCGGGGTCG CCACCAGGAG ATTTGGCCCG TACTCTTTCA CGCGTATTGG ATGCAGTCCG ACAACTAGGG AACATGCCGT ACAGCCCCAT TCGGAAACAG GAACTGTTGG ATGGTCCTGT CGTCTGGGAC GTTTCGCGGG GATTGCATCC CGAAATTCGG CGGCTATGTC GAGACGCTGC TCGTTTGATC AACAGGTACC CAGTCAAGGA CCCCTTACAA TTTGAAGAAT TGGATGACGA AGATGTCGAG ATATTGGACG AGATAGAGCA ATTTTTCGAG GAAGATGAGA TTGAGTCAGA GGCTTAG
|
Protein sequence | MSRGPQFLVT QEDIDEDLIE LGRSALAGYF NFPLDDWQLK AGGAICQGCN VIVCAPTGAG KTVVGEMALL HAFNGGDKGI YTTPLKALSN QKYSELCGTF PKQDTGLSTG DISINKGARI TVMTTEVYRN IAWRSSTPTA TLMGTNELLE NTVVVLDEFH YMGQPGRGGV WEESIITSPS HTQIVGLSAT LSNAAALAAW MEHVTGRRTI LVEVPGQERP VPLRYLFATK EALYPLFRDP DAGPGAPKGL LGYRGDGDLP SNRQSTKKAK GFTHIMDGDD VDDDETNIKI PRGLQINPAL KAAANKRMQK VNRAIERQKV RRHLSPQEDE DDWGGRKRNQ SSRKMSPRDE RKERERLLKN EMRRSVPSLP AILNRLKQKE LLPAIFFIFS RAGCDDAARQ VYQYMKGPRD PNCLLQDERE QLGIKEEQRY GTKNGKSRQR AVQRRGDLVE DSDGRTFRSK SNFISEQTLN ALEMSTPLAD DEFDESSPLS PDNWDFFSKA GLLSYAEVRD VASRVSRFNA GNPEIAFEDE VIEQYLFGAG SHHAGMLPAH KSFVEILYRN QLMKVVFATE TLAAGINIAM NLLETSNLLQ MAGRAGRRGM DTDGTCVIVA TPFENHDEAS RILIDPVKPI SSQFSPSYSL AINLIARGEG KLDVARQLVS KSFAMWEKRK VEQHVVDAVE NHGDDVSQAL KVSAQDRFMK TLVDALQSQV DQRSAKFDVN RVESLLGILT DRDSLKKTSK SFVGATKMFE LEYTTLSYLQ KEYDALRARA EIEDSDFLSE IIAEDTKDLI NQIENQRKRV ETTEKEIGKH PFSIITGIAN QIMEDAAFPE SIVLNKALGS AREGHKRTEL SSLTAQELSQ FSKSAIIVTR KMRKVKTANP DVEDLFQQAE DLRNDSWNDM LSITKTLVAY GCLSIEHTLE GGESYEDQQY TISPAGINIG MLGFENSLWA LVAMGGAWDV VGASAKLDDF RTAMEDFDND EDWYQNRKGN DVLIPPKVVS IPISQKEADT LNGLLRAMDP SELAGYVASI VTDNSRGNGA PVVQLFQNLT PLQQRVIQSS LVSMERLVEV QKLYGVDEKT RSCILDISNC EVVTAWASGC SWQEALEISG SPPGDLARTL SRVLDAVRQL GNMPYSPIRK QELLDGPVVW DVSRGLHPEI RRLCRDAARL INRYPVKDPL QFEELDDEDV EILDEIEQFF EEDEIESEA
|
| |