Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_49765 |
Symbol | |
ID | 7198348 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011692 |
Strand | - |
Start bp | 207314 |
End bp | 211566 |
Gene Length | 4253 bp |
Protein Length | 1283 aa |
Translation table | |
GC content | 47% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184585 |
Protein GI | 219128784 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCGAAT CTCGATCCTT TCCAAACGGA AAGAGGCGAC ACTCCATGTA TGGACGACAC CAGAGGCCGA AGTCAGCGTA TGGACAACGC AGTACGAATA CAGGCGACAG CGCGCCCCCA CAAGAGGGTC CGTGCTCTCC CACCAGCAAT GCAAACGATC ACAGTAACGG AAAAGTACCC TCCATGAGGG ATGATCGCGG TATTAGTTTC TTACAAGACA TGGCTGGTCC TTCAAAAACT GGGCATCCGG TTGTCACCAT GCCAAACTGT ACGGTAGTGG AGTCTGCGCA GCTGCCTGTT GTCAACAAAT TGCATTCCAG AAAACGTTCT CGTAGAGAAG GTAACGTGGA GCCACTACAG CAAGCGCGAT CACAGCAGCA AGCTACGGGA AGTCGCCGTC TTGTGTCCAA CTTTCCGCAA CCCGGCATGG CGGCCGTCGG CAATTCTAGA GATGGAATCG AATGGGCTGA TAGACAATCG GGTCTCGTCT CGGAGGAGAA GGTGCGAAGG ACCGCCCAGT CAAATCAACC CATCCCTCGT AAGAAGAAAA CCCACAGTGA CTACACAAAG AAATCGATTC AAGAGCAACA TCGACCTCAT AGGGGCTCGC GTTTGGATGT GAGGTTCATC GTCCCCAAAC TGAAACCCCG CCACAACTGC CCCAAAGCTC GCGTCGAAAG CATATGCGAT GGTGACTCAG ACAATACTGC TGATGAAGGA ATTTCCGCTG TACAAACCTG CGAGACTCTC CCGAAAAGTA TCGACGTTCA TACTAGCGCC AAAAGACAAC CCTCCCGAAC TAAAACGCAG CTCACTAGCA GTTGTCCATC CGCCGACTCA GTCACATCCA GCGGTTTAGA AGCTAATGCT GTGGAGTTGC CTCTCCTCAG TTGTAAGGGA ACGCCTGTTC CCGGGAAGAG GTCGTCAAGC AAGCTTTGGA CTTCCTTGGA AAGACCGTCA CAGTCATCCA AGACAGACGA CGCCGGTACC AAGGAAAGCC CAGTCGACCT TGCGACGGCC GCGATCAAAC AGCAACATAG TGAAGCGGAG AAGAATGAAA ACGAACACAA GCCATCGAAA AAACTTTTAT GGAGGGAAAT GTCCGACGAC GATGACGACT ACACAACAAG CCGCAATCGA CAAGCCCGTC ATTCTTTAGC TGTCGTAAAC ACCATTCTCC AAAGCGGCCT CGAAAGTGAT GTTGTACCTA CGAAACCTAC CAATCATTCT GCCGTGAATT TCGATGCTGG TAGTGACCAA GATCAGCCAT CATCTTCTTT TCTTTCCAGA ATAGTTGACA GAGGACGCAA AGGTATAGCG CGCACAAAAC TTGGCTTTTT GAATACTCTG CAGTCCCAGA AGAAAGAGAA GAAACTTAGC TTATCGAAGC AATTGAATCC AGATGCACAA AGTACGTTTG ATCAACAATT TTTAGCTCCA ATGACTGCTT ACCAGCTCAC TTTCGATTGT CGTACCTCCA ACTATTACAT CTAGGCAAGA CTTTGCAGCA CAGAGGCGTA TCTTCTGACT GCAAAAGAAA ACCAATCAAA GGAAAAAAGA TTACGATAGG CAATTCTTTT TCTCATAAAG GGCAAGAAAG CAGCTTACTC GACCAGACTC CGGAGCGAGA CCGGAAAGGT GCGTTTCAAG ACCAGTTCCT GGTCGAACCA GCAGACCAGG ATGGACAAAT CACCATCGTC TAACACTGCT GAATTGACTT GACAGATCCA TTCTTCGAGT GCAAAGGAGT CGGCCTCACG AGCTTTTCCG AGCATACTAC ACCCTTTAAA ACAAAACTTA ATTTTTGTAA AAGCCAGCCT CGCAGCAGCG AAAGACTTCT GCTAAAGAAG AAAACGAAAG AACCAGAGGT TGTGGAAATT TTGGATGACT CTGAAAGTGA AATGACAAAG GTATGGGTAC TTTGATGACT TGATCGGAAT CGTCAGTGTG TTTTTCCATA CTTATCTTTA TTTTAACTCG CCCTCATCCA CCACTCTATA GTCTCCGCCT ATGGCCACTC GCAGCGTCAA GCGTACGCTT AATTGTGCTG TCATTCGCAT TGCGATCGGA ACAAAAGTTT TCTGTTCCGG TTGTCAGATT CGCCTCAGAG CTACTTGCAA GCTAATGCTC GAATTCGAGC CACACAAGAA AACCACGCGC AAAGGGACCG AATCCCTAAA CAAAGCTACT CTGGAATTGG ATCTGCAGCA TGATTTGACG GAATGCAAAT ACTACCTCAA TGAAGAATTT TCAGCGGAAA CCAACGAGGG CGAGGAAGAC GCCGTTGGTA GTTTTTTGGC AATCAAAGTT GTGCCAAATA ACCAAAATGG CCTAAAAATT TACTCCAACA GCTATCAACC AGACAAAGAA AACACCAAAA GGATGTTTAT CCTCATAGAA TTTCGGGAGA ACAAGGACCT CCGGGACTTA GTGGAAACCA TCCAAACAAG CCGACCTCCT GCTATGGATC TTTTCTTCAA TGATCACTCC AAGCTCGACT CTGCTTCTGC CGAAATATAT TGCCAGCCGC TAATTGACGA TTGCAAGCGA GAGGCAAGGC ATCGTCAAAA AAGTATGCAT TCGTTGTCTG TTCATAGGAA GAATAGTTTT CTTGCTGGGC GGAAGGAGGA TGACCTTTTG CTCATCTATC CCTTTGATGC CGACGAAAAG ATCTTCGACC AGGCCGCTGT GCATCTGACA GAGGCAAAGT ATGATAGCCG TTGTGATTCC AATCTGCAAG TCTGCGCACT TTCTGAGTCA TCAATACCCT CACCTGAGCA AAAGTCAGCT GATTGCGACG GTGGAAATGC CGACATGAAC AACATGAACA ACCGATCTCA TTTGGCAATC ATTCGCGTAA CCGACTACGA AAGGCTAGTG ATTGAGGACG AATACCTTAA CGATACCCTA ATTGATTTTT GGATGCTATG GTAAGCTTAC TTAAGCGAAC CGACGGCCCT GCTTGATTTT TCCTTACACC GTTGTCTTCG TTTACAACAG GATTTCGCGG TTTGACGACT TGTCAAAGTT TCATGTCTTT TCTTCACACT TTTATACGAG TCTTTTCGAA GATGGATCAA TTGCCGTCAC AAAATGGACT GAGCGGAAAG GAATTGACGT TTTCGACAAG AAATTTATTT TTGTTCCAAT CAACAAAAGT CTGCATTGGT CGCTATGCGT TGTAGTCAAT CCGGGACAGA TTCTCCAGCA CCCTGATCTT CGTGGGAAAG ACGAGCATCT GGATGAGTCG AGTCCAATGC CTTGTATCCT CTTTCTCGAC TCTCTCAAAG CTCACCAAAA GACACAGGTC GCCCATCGTA TTCGTCAATG GCTCAACTCG GAATGGCAAC GGCTGCACAA GTCATCGTCG ATCCCCAACC CGTTTCAATC TAAGACGATG CCCGTTATTG ATCCTAAAAG TACGTCAGCA TCTGTTTCTA AACTGGATTC GACCATGTTC TCATACATTC TTGTTGCATG TCGACGTAGT TCCTTATCAA AACAATTCCT GGGATTGCGG AGTGTTTGTT TGCCGGTACG CATTTGCTCT TTACAAACTC CGTAATATTT CATTTTCTTT CCAGGAGTAC CGACAGGACT GCTTTCGCTC GCTGATCACG AACAGCGAAC CGTTTCAGTT TGACATGAAG GACATTACTC GCCTTCGCCA GGAAATGAAA ACTTTAGTCG AAAATTTGTG TACTGCCTAC GTCCCGTGGA AATCCGAGCA AGACCGGAAA GCAAAAGAAG ACAGGCGGAA AGCCAAAGCA GTAGCAGAAG AGACAGTTCA AGTGGACAAC AATGTTCCTG AAGACCCGGA TAGCTCTGAA GGAAACGAGG TGGTTGTAAA CAATGAGGAC AAGGTTGATA CAGCCAATGC AAGGTGGAGC GCTGTCGTTA AGGGCATTGA AGCAAATGAA AACAACATAA TCCAAGATTC ATTATTGATT GTCGATGACC TTAATGATGA CGACAATAGT AGGATGCATG AAACGTCACA GTCCGGAAGT AAGGAAATGG TCTTTGGTCC AGATTGGACG GAAACATATG ACCCTATGGA GGTGGATGAA GATGAGGCAC ATTCCTACGC GACTCAGGAA GGCCTTACGG TGGAAAGAGT CAGTCCAGCT GAAGGCAGCA TAGTGAAGGA TGTCGAAATG AAGGATGATT TGCATACATA CAAAGCTCAA GACCTTGTAG ATCCGTTGGA TACAGAGGCA GTGGATGACG ACGTCGACAT CGACTTTGTC TAA
|
Protein sequence | MSESRSFPNG KRRHSMYGRH QRPKSAYGQR STNTGDSAPP QEGPCSPTSN ANDHSNGKVP SMRDDRGISF LQDMAGPSKT GHPVVTMPNC TVVESAQLPV VNKLHSRKRS RREGNVEPLQ QARSQQQATG SRRLVSNFPQ PGMAAVGNSR DGIEWADRQS GLVSEEKVRR TAQSNQPIPR KKKTHSDYTK KSIQEQHRPH RGSRLDVRFI VPKLKPRHNC PKARVESICD GDSDNTADEG ISAVQTCETL PKSIDVHTSA KRQPSRTKTQ LTSSCPSADS VTSSGLEANA VELPLLSCKG TPVPGKRSSS KLWTSLERPS QSSKTDDAGT KESPVDLATA AIKQQHSEAE KNENEHKPSK KLLWREMSDD DDDYTTSRNR QARHSLAVVN TILQSGLESD VVPTKPTNHS AVNFDAGSDQ DQPSSSFLSR IVDRGRKGIA RTKLGFLNTL QSQKKEKKLS LSKQLNPDAQ SKTLQHRGVS SDCKRKPIKG KKITIGNSFS HKGQESSLLD QTPERDRKDP FFECKGVGLT SFSEHTTPFK TKLNFCKSQP RSSERLLLKK KTKEPEVVEI LDDSESEMTK SPPMATRSVK RTLNCAVIRI AIGTKVFCSG CQIRLRATCK LMLEFEPHKK TTRKGTESLN KATLELDLQH DLTECKYYLN EEFSAETNEG EEDAVGSFLA IKVVPNNQNG LKIYSNSYQP DKENTKRMFI LIEFRENKDL RDLVETIQTS RPPAMDLFFN DHSKLDSASA EIYCQPLIDD CKREARHRQK SMHSLSVHRK NSFLAGRKED DLLLIYPFDA DEKIFDQAAV HLTEAKYDSR CDSNLQVCAL SESSIPSPEQ KSADCDGGNA DMNNMNNRSH LAIIRVTDYE RLVIEDEYLN DTLIDFWMLW ISRFDDLSKF HVFSSHFYTS LFEDGSIAVT KWTERKGIDV FDKKFIFVPI NKSLHWSLCV VVNPGQILQH PDLRGKDEHL DESSPMPCIL FLDSLKAHQK TQVAHRIRQW LNSEWQRLHK SSSIPNPFQS KTMPVIDPKI PYQNNSWDCG VFVCRYAFAL YKLRNISFSF QEYRQDCFRS LITNSEPFQF DMKDITRLRQ EMKTLVENLC TAYVPWKSEQ DRKAKEDRRK AKAVAEETVQ VDNNVPEDPD SSEGNEVVVN NEDKVDTANA RWSAVVKGIE ANENNIIQDS LLIVDDLNDD DNSRMHETSQ SGSKEMVFGP DWTETYDPME VDEDEAHSYA TQEGLTVERV SPAEGSIVKD VEMKDDLHTY KAQDLVDPLD TEAVDDDVDI DFV
|
| |