Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_37493 |
Symbol | |
ID | 7202485 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011681 |
Strand | + |
Start bp | 276885 |
End bp | 280820 |
Gene Length | 3936 bp |
Protein Length | 1289 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181522 |
Protein GI | 219122376 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCGTTTC GCGCGTCACG ACGGATGGGG TCCGGTGTCG TCACGGCACC GTCCGGATCC GCCGCGTCGC CGTTGACCGA ACTCAATTAC CTGTTGAGTC GGTGGGAACA ACGGATTGAA CGCGTCCGCC AAGCGCAGCA GCGACAACGG GAGCAACCAC CGCCGCCGTC GCCGGCGTCG ACGTCGCGGC CGCTGTTGGG ACGATTCGGC TTTGGACGCA AAGCGCCGGT GACGCCCACA TCCGAACACA CGGGGGATTC ATCCGAATTA ACGAGTCTTC CGATGGAGAC TCCTGTCGTG TCGCCCCTGC AGGATGTGGT GCCGTCGACC CGGGGAACAA CGCACGTGGC GCAATTTATT GATTTGCCGG ATGCGGACGA TTGTGTAGAA GATTTGCGGA GACTCGCCGA GCTCGTCGTG ATTGGGGAAA ACTTTGTGAC CACACTGCAA AAGAAAAAGG ACACTTTGCG GTCGAAAGAA GGATGGGGAG GCGACGTCTT TGAAGACTTG TCGGAACGGG AACGGGAAAT GGAAATAGCG GAGGAAGATG ACCGTCTGCA GCTCTTTGAT ACATTCTTCG AACGCGAGGC CCTGGAGCTT ATCGTCACCC TCTTGATGGG CCGTGCGTTT CACTCTACCC GTCAAGATAT CCAACCAGAT GTAGACGCGA ATGAGTCGAA CGACGACTCG ATCGAACACG ATTCTTCTCT GGCGGTACCG CCTGATCCTT CGGACGAAGT CTGGTTGCCG TCCCTATACA TTGCGACCCA GGCGATTCAG TCCATTTCCA TTCTGATCCA GAACGTATCC CGGGCCACGT CGCTTTACGT CATTCTCTCC AATAACCACG TCAACACCTT GATCAACTTT CCACTCGACT TGTACGCGGA AGCGGAGCAA GGCCGACAGC ATGATGCCAT CCGGGACGGC ACACTCACGC ACATGTTCAA AAGTCCCGAA CTCGCGGAAC TGACCACCCA CCTCGTTACA CTGTTGAAAT CCCTAGCCAT GCGCATGAAC GCCGAAACGC TACAGTTCTT TCTCAAATAC CCGACCGAAC ACCTCGTGGA CAGTCACGCG GGTCCGGCGT CTTCATCGCA CTTTGAACCA ACAATCGGTA GTGTTGATTC CGTGGAGGAC GACGACGACA ACGAGAAGCA AGGATCGGAA GGACTGCCAA CGTACCGAGT CGAATTTCCT CTCTACGAAC GCGCCTTGGA ATTCTGCGCC GGTCACCACG ACTCCTTTGT GCGCGTGACC GCCATGAATA TTTGCCTGAA TACCTTACAG TTGACGACCG TATCGCCCAA AGAAGACGCC GGCGAAGAGA CCATGGATTC CGTGAAGTCT CCCGACGGTG TTCTGCACAA CTCCAAGGCG TTGCCTTTTC GGGAGCGTTT GGCCATTGCC CAGCACGCGT GTACACCATC TCGGGTGGAG CGGTTGGTCG CACCAATCTT TGGCAAACTG GCGGAGCGAT GGAACTCTCT CGAAGAACAC ATTCGAGAAA TGGACACACA TAAGAATCGA ATGGGGGCTC ATGACGGGCG GAACGAGAAA ATGGCCCAGG CGCGTGAAAA AACCCGACGC GAACGATTGC TGCGGGGCTT TCAAGAGAAA GCTGACAGTT TACAGGATGA GTTATTGTTG TTGGAAGACG TCTTCAAGGT AAGCAAGATC GGACCTCCCA TATACCTAGT ATTTTCCATA AGACTAACCT TGCTTTCCTT GCAGGTCGGT CTTACAGTGT TGAATGAGCA AACCATTGAA ATGATGCTGG CCACGTTCGT TTATCCATTG CTCCTCCAAC CGCTACTGCT TTATTATCAA CGTTTTGACG GTGCCGTTGA CCCGGGAAGG AACGACAAGA TTGAACATCC CTTTAGTGGA TATGGAGGAG ATTTTAGTCA CGCTGAAACC ACCCTTGCTG CAGTATCAAG TCCAGCAAAG ACCGCGCTGT TTACTCTCGC TTCTGTGTTT CACTTTATGA CGAATCCTCC CTTGCTTCGT CTGGTTTTTG CTGCACTATT TCACCCGCTG TCACCCGACT CCAGCACAGT CCCTACCGTC AGAAGCAATT TGGAAGTGGC GGGTCAAGGC AGCAACCATC AATGGACGAT TCGACTAGAT ATGAAGCGTA CTTCAGACGG CCCACTATCG GACGATCGAA CTACGTATGA CTTTGGTACG AAGCCATCTA ATAGAAGGAT AGCCAGATCC GAGATGCCAA CTTTAGATAA AATCGAAGAA AGCGAGGAAT GCATCTTTGT TCTTTCTCCG GCACTGGCAG AGGTGCTTGA ATTTCGAGGC GGCGATGACG CTCTCATGGA GCGGACGCGA CCGAATCCGT ATCGCAAATC CCTGCTTGAA TGTTTGAATG TTCCTTACGA AATGGACGAA GTACGTCAAT TAGCCGTCTG CGCATTCGAT GTAGCCTTGT CAGTATTCGA TCCCAGGTTT ACTTCCGATA TTGTTTTTGG TACGGATGTG AACACGAATA ACAAGATCGT TCTTGAGGAT CACACGCTAG ATGCAGAGCA AGCTCAACCT GATAATGATA AAGGAATCTG CGGGAGCGGT ACTGATCGTT CATGCCACTC ATCGCCTATA AAAGAAAGGC TCATTGGCGT GAACCCAATT GGTGAAGTAG TCACTGCACT TTGTGGAAGT GCTATCTTCG CCCCAAAGGA CAACTTTGGA GGGTTCAATA TTGAATTTGA CAGCGTAGCT GCTCACGCAT TAATTAACTG CGTTCGCCGA AACGACACAG CAATGAGGAT ATCAGCAAAG CTCGTCGAAA CGCGACGACG ACAGTCATCA GTTTTCATTG CCCACCAGGT CACCAACCTA CAAGAAATGA CTGTGGGTGG AGCGACTTTG TTCTTAAGTG GTTCGCCTGC AGCAGATGAT CCGAACTTTG AAGAGCAGAT GCGGGGCAGA ATCACGAATT TACTATTCTT TGAGTCTATG GACGAGCCAG TCAAACTTCC TGCCGTTGAG GGTTTCGTGG AGCTGCAAGG CTCTAGTACA AAGGGTGATA AGGAATTCTC TCTTTTAATA TCTTCAGCGA GTACCTTTAA AGATATGTGT GGACGAGTTG GCAATTACCT TTTGAGCGAG CTGGATAACG GCGACAACCA CTACGATCAA CAAGTACTCG AAGGCTGCCA AGCTAGTGCA AGAGCTTTAT TTCACATCGA CGCTTTCTCT GCCTTTCTTA AAGCTTTAGC GACTAATGGA GGCGCCTCTA TCTACCACGC TGCACCTCAT GGACTCGCTA TTTCTGCCTC TGGGGGTTCC GTCGACATAT CCGATTCATG CATTTTAGAG ATGAAGCGTC AAATTTTTGC CCCCCTCTCG GCAAACATCC TTAGTGCAAT CTTTTTCAGC GAAAACGAAA TTCCAAATCT TCCCATTCCC GGTTCAATAA TTTCGCTTGT TGGCAGCTTG GCGATTCCAT GCGTCTGCGA GATTCCGGCA TCGATGTCTC ATCTCTTCTT GCAAAATGGT TCGATGATTA TTTCCGAAGG CGTCACATGG CAATCACTGT ATCTTGCCTT TCAGCATAAT TCTCTTGTGT TCGCACAACC TCAACCAGAC GGCGAAGCAG GAAACGGGAG GGCGGTATCG TCCTGTCTTT TGGAACGCCT CTCCGTCGTT GTTGACCAGT CACCGGATCC TTCCTCGCCA GCTCGCCGCC TCATACTATC GTACAAAAGC TTTGACTCGG ATCCACCGCC TCTGTTTCTA TTTGACGAGC TTCCGAAGCG CGCGAAGTAC GGCCCCTTCT CTAGAATCGA GCCTTTCACT AGTACTCTGG ATGTGTGGTT TGAAGACCAA CGTGCCACTG AGCAGGCGTT TCAAATCTTT ATGCAGACTA TATTTGCCGC CAAGGCACAA GGCGGCCTAG CAATTTTCCA ATTCCTTTCT TCCTAG
|
Protein sequence | MSFRASRRMG SGVVTAPSGS AASPLTELNY LLSRWEQRIE RVRQAQQRQR EQPPPPSPAS TSRPLLGRFG FGRKAPVTPT SEHTGDSSEL TSLPMETPVV SPLQDVVPST RGTTHVAQFI DLPDADDCVE DLRRLAELVV IGENFVTTLQ KKKDTLRSKE GWGGDVFEDL SEREREMEIA EEDDRLQLFD TFFEREALEL IVTLLMGRAF HSTRQDIQPD VDANESNDDS IEHDSSLAVP PDPSDEVWLP SLYIATQAIQ SISILIQNVS RATSLYVILS NNHVNTLINF PLDLYAEAEQ GRQHDAIRDG TLTHMFKSPE LAELTTHLVT LLKSLAMRMN AETLQFFLKY PTEHLVDSHA GPASSSHFEP TIGSVDSVED DDDNEKQGSE GLPTYRVEFP LYERALEFCA GHHDSFVRVT AMNICLNTLQ LTTVSPKEDA GEETMDSVKS PDGVLHNSKA LPFRERLAIA QHACTPSRVE RLVAPIFGKL AERWNSLEEH IREMDTHKNR MGAHDGRNEK MAQAREKTRR ERLLRGFQEK ADSLQDELLL LEDVFKVGLT VLNEQTIEMM LATFVYPLLL QPLLLYYQRF DGAVDPGRND KIEHPFSGYG GDFSHAETTL AAVSSPAKTA LFTLASVFHF MTNPPLLRLV FAALFHPLSP DSSTVPTVRS NLEVAGQGSN HQWTIRLDMK RTSDGPLSDD RTTYDFGTKP SNRRIARSEM PTLDKIEESE ECIFVLSPAL AEVLEFRGGD DALMERTRPN PYRKSLLECL NVPYEMDEVR QLAVCAFDVA LSVFDPRFTS DIVFGTDVNT NNKIVLEDHT LDAEQAQPDN DKGICGSGTD RSCHSSPIKE RLIGVNPIGE VVTALCGSAI FAPKDNFGGF NIEFDSVAAH ALINCVRRND TAMRISAKLV ETRRRQSSVF IAHQVTNLQE MTVGGATLFL SGSPAADDPN FEEQMRGRIT NLLFFESMDE PVKLPAVEGF VELQGSSTKG DKEFSLLISS ASTFKDMCGR VGNYLLSELD NGDNHYDQQV LEGCQASARA LFHIDAFSAF LKALATNGGA SIYHAAPHGL AISASGGSVD ISDSCILEMK RQIFAPLSAN ILSAIFFSEN EIPNLPIPGS IISLVGSLAI PCVCEIPASM SHLFLQNGSM IISEGVTWQS LYLAFQHNSL VFAQPQPDGE AGNGRAVSSC LLERLSVVVD QSPDPSSPAR RLILSYKSFD SDPPPLFLFD ELPKRAKYGP FSRIEPFTST LDVWFEDQRA TEQAFQIFMQ TIFAAKAQGG LAIFQFLSS
|
| |