Gene Information |
Locus tag | PHATRDRAFT_49989 |
Symbol | |
ID | 7198774 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011694 |
Strand | + |
Start bp | 52983 |
End bp | 58383 |
Gene Length | 5401 bp |
Protein Length | 1498 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184819 |
Protein GI | 219129278 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
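The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates (which matches the listed Gene Length) and that Protein Length excludes the stop codon; the function names are illustrative and not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Span of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def coding_nt(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode a protein, optionally counting the stop codon."""
    return protein_aa * 3 + (3 if include_stop else 0)


# Values copied from the Gene Information table.
start_bp, end_bp, protein_aa = 52983, 58383, 1498

span = gene_length(start_bp, end_bp)   # genomic span of the gene
cds = coding_nt(protein_aa)            # coding nucleotides incl. stop codon
non_coding = span - cds                # remainder not accounted for by the CDS

print(f"span={span} bp, coding={cds} nt, remainder={non_coding} nt")
```

Under these assumptions the span is 5401 bp, as listed, and the 1498 aa product accounts for 4497 nt of it. The remaining 904 nt are consistent with the gene being flagged as spliced.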
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGACAGT CCGAGTCCAC AATAACTTCC CCCCCCAAAG GCACCTCGAG CAATCCACTC GAAGACGGCG GAGCGTCCGC GTCCTCCTCC TCCTCCTCCT TCACCACTCG CCAACCCCGG ACCGCTACGA ATCTCACGAT GTCGCCCCCC CTTTCTCCTC AACCTCGTCC CAGCGGAGGA ACCAATTCCC ATTGGGAAAC TTCCGAAACC GGCGCCTCCC AGGGTTCGGA CGGGGAAGTC GTCTGCCACG CCTGCGGCTT TGAAGCCTCT TACGTTCCGG AAGGTATGTT GCACTGTGAA CGCTGCCGGA ACGTTTCCTA TTGTTCCCTA CATTGCCAAC AGTGGGATTG GACGTCGGGC GGACACTCCG ATCTCTGTGT CGACGCTCGG TCAACGGCTC ACGAAGACAG TACCACCGGG AGTGCCAGCA ACAACAACAG TGGCAGTGGA ACTACTCGTC CCTCGGAGGA TTCCACCCTC GTCTCCATGG ATATAGCCAA CGTATTCGGA CCCCGAAGTG TCGGATCTGC CTCCACTCCC GCACCCGCCC CCGTGAATCG AGGTGTCCAG CGGAGACCGG AAACGAAGGA AGGCGGCCTC GGGGCCTACC TCCACGCGCA CACCCCGCGT ACCACCCCAC CAACTCCACT CGCATCCCGG TACAATTCTC CCAACGACCA GGACCTCGAC GACGATGATC CGAACGAGGG GTCCATTGTA TTAGTATCGG ACTCGGAAAG TACCGACATT CTCGGAATGA TTCAGGAAGA ATCCGAAGGA AACGAAGAAC CGGAATTGCA AGGTTCCTGG CGATCACGGG ACAATCCGTT CCACAAATCC GCTTTGGGAG AAACCTCGTA CGACTCGGAA CGGGACGAAC GGGAACTCAT TCTCGCCACG GCCCCCGCTG CGACGGTCCG GGACGCCCAC GAACACGCTC AAAACAACAA CAATAACACC GTCACCGCAA CGGCATCTTT CGCCAGCCCT CAACGAGATT CGCTCAAGGC CTTTCGGGCC GTAGCGTCCG AAACCACCAA CGTAGAGAGC CACGGCAAAC ATAGCCTCAA AGGCTTTCGA CACGCCTACG ATGATGCCCC GTCGAAAGAA AAGATCGCTT TCTCACACAC ACTCATCCGC CACAGTGATA CGGCCGATGA CACCATGAGT ACCACAGGAT CTCATCCTGG CCAAGTGAAT GAGGCTACTG CGAACAGGAC AAACCAGAAC GCGACCGCCG TCACGAATTT GGCCCTCAAA ACTAGCATCA ACAAAGCTTT GCGAGATTTC GAACGCTTGT ATGGAGAGGA AGCCGCACAG CTGGCTGTAT TGCAACTCAC CCAAGGTCTA ATTACTGAAG ACGACGTTAT TGAACAAGCC AGTCAAAGTC CGGACGAGTC AGAACAGTAC AATGAAACCA CAACCGAGTC TGGGGACTCA TCCCCAGCAG ATCCGAAAAG TTCACCAACG ATTGTAGACC AGAGCCTGTC AAGTTGGGGC TTGTCCGGCA TAGCGTCGAC TGAGGGGTTG GACAAGCCAT CCGCGCACAG CACATCGTCC CTGATCACAG AAAATATGTC TCGGGACTCC AAAAATGCTG CTACCTTATC GTCATCCTCG TCGTCGCAAC AGAATGCGTT GGCCTTTGCA AGTCCTCATT CCGTTGGGGC CACCACCACA GCCGCATCTA CGGCTCCGCT TTTGACACCC GCTGAGCCAG AAGACGAAAG TGCTTGCAGT GACGGATCCT TGTGCACACC TACTGAATCG TCGGTACAAG CAAAGCCATT TACGGTCCAT 
ACCCCCCGGT ATCTTCAGTA CCGCAATTCC TTGTCCAAAT CTACCGCCAA AAGTGGTTTG CCAGGAACGA CAGCTGTAAG AGTTGCACAC GACAGTGATA CAATTAAAAC CCAGGATACA AAGAATCGTG CAAACCGACA AAAGGAGCCT ACCCAGAACG GAGCTGCCTT GGTCGGTGAA ATAATAGAAG GTGCCGCTGC CGTGGGCACC AAAAGGACAC TTAGCCCCAA TGACCCACCA GAGCAGCTAA GGGAAGCGTC ACGAGTCGTG CCGCCTCCTG CGATTGTGGC GACTCCGACG GAGACGTCTA ACCAAGATTT TCCACCGTCT ATTCATGAAG AGAAGATCGC ATTATCCATG CCCCGTTACC TGACGTATCG CTCGTCGCTG GCCAGATCAG TCGACAGAAA AAGCTTTTCT GTCCAGCTTT CTGCGCAAGA ACTTGAATCC TATGAAGTTG GCCAGTTAAG AATGCCGGGC AATGGAAAAC ACAATAATTC CGATGTTCCG GCGGTAGTCG CTACGGTTCA GGACAACATA CATGAAACGG AAAGGGCCAA GACGGCGACG GTAGCTGCCG AGATGTCGAG CAATTACGAA CTGGGGAATA ATGAATTCAG GCAATCTTTG TCTCCATCGC AGGTAGAAGT TGAGGCTAAT AAGATGAGCG ATAGTACAGG ACTTGCCGAT GCTATTGGCG GAATTGCAGC CTTGGGGTCG GGGGCTGTTG CTCTGGCAAG CACGAAGAAA AATTCTGACA ATAACCAGGT AATATCCAAT GTGTTGCTTG CAGATTCAGA GCTGGGTCTG GAATCCCAGA TGGTGCCGAT CATGCGAAAA CTGTCACCGA ATGCTATTAA GGAAGGGGCA CCGTCTCGTG CGGAGAGCTT TTACTCTCGC TATCGGGCCT CGTTGGCTCA GAGACTTTCC CAACTCTTTA TTGTAGAAGA TGCGATGTTG TCTGGCGATG ACTCGGACGA TTCTTTGAAT GCCGACGAAG AGCGAGAACT CACAGACCAG CTTTCTTCGT ATTTGGAGAA GGGGAATTCC AAAAGAACCA TTGAAGAGAA TGCATCTCGA CCCGGCGCTT TGGGACACAA GAGCTACGAT GGAATGAACA GCTCCTGGAG TGGTTTCGAC GAAGAAAATT CTGTGGACGC AAACATCGAC GGCTCTCGCA GCAGCGAAGT CTGCGAGCGA AAATTTCAGA CAGCCTTGGT TACTGACGCG GGTGCAATCA ATTTGCGAGA AGCTCGAGAA AGCGCTCGAG CTGAACAAGC TCGAAAAATT GAGCTTGCCA GATCAAGCTC CAAACAAGTC ATGTACCAGT CCGTATCTCA AGACTCGAAA CCCTCCAAAA TGACAGAGAG GCGCGCAGCT CCTCTCACAG TAACCAGGAA TCAAGAAAGT TATAGTGAGA AGCGGATAAG CCGAAGCAAT TCCAGCTCTT CTGTAGAAGA TGCTGAGACT GCGTTATCTG CAAAACGAAG AGCGCCAGCC CCCATCAGTA GTCAATGTTT TGCTCCGAAA GTAGCAGAGT ACAGAAATCG GAAACGTTGT GCGATGCTCG GCTTCATTTT CCTTCTCGTG GTACTTCCTC TCGCAATTGG ACTTGGGGTT GGACTTCGTG GAAGCAATAA AAATCGCTCG ACTAACTTTC TACCAGACAC ACAACCCCCA GCAGGATCAA ACCCAACTCC CTCGCCTGAC ACGCAAGAAC CTACGAATTT TCTGCGTACG CGGGCGCCCT CGCAGTCAAC GGACGATACA CCAGGATCAC CGACCATTAA TGCTCCTACG CAAAGCCCAA 
CAGCTCCACG AATGGAAACC CCCATCGTTT TGCCAATTGA ATCTCCTTCA CCTTCCAATC TCAACCCGGT ATCTTCGTTG GTTCCCTCCA TCGCGCCCAC TCCAACAGTT TCTCTATCAA ACGCTCCTAA TATCATGGAT TCATCCCAAG CCCCAACGGT ACTGTTGCTA AACCAAGAGC TCTTCAGGAT GCTAAGCGAT TTGTCTGAGG ACAATGGAGC AAGCATCCTT CGCCCCTTCA CACCGCAGCG CCGAGCGTTC GAATGGCTTG CATCTACTTC AGACCTCGAC ACTTTGTCCA ACACACGAAA GGTTCAGCGA TTTTCTTTGT CTGTGTTCTT TTTTACTTCG AATGGTAGCC TTTGGCGTAA CAATTCTGGA TGGCTAACAG AAAGTGATGA ATGCACATGG TACTCGAGGT CTGGGCGCAC AACTTGCGAT GGCAGTGGAG TATACCTGCA CTTGGAATTG GGAGACAACG ATGTGGCAGG AAGAATTGCC ACAGAAATTG GTTTGTTGAC AGGACTTCGA CGTTTGGACT TGACAGGTGG TAGTGGAAGT CGCTTGAGCA GCACACTACC AACGGAGCTC GGAGTACTTT CCGACCTTGA GTTTGTGAGT TTCCGGAACA ATTCCATATC TAGAAGCATA CCGATCGAGT TGGGTCAGCT TACGAGACTC CAGCATCTTG ACTTGAGTAT GAACGTGCTT AGAGACTCGA TTCCGACAGC CTTTGGTCAG CTGGCAGCCT TAGAAACGCT TGATCTAGGA CACAATACTC TTTCCGGCTC CATCCCTACT GAGCTCGGCC GGTTGTTGAC TGCGCGAAGT ATCAAGCTGA ACAATAATAT TCTTACTGGT GCATTGTCCA CTTTTATTGG TCAACTTTCT GAATTGGAAT TGCTTAACCT TGCAACGAAC CAGGTGTCGA CCATCCCGAC CGAGCTCGGA CAGCTCACCA GCCTTGCATC TTTTGATTTG CATGAGAATA GGGTGAGGGG TCGTTTTCCT ACGGAAATTG GCTTCTTGAC CCGTCTTCAC TTTCTGGATC TCAGTAACAA TGCCTTTTCT GGCACTCTAC CCACGGAGAT TGGGCTATTG CAAAATACTC TTCGCCAACT GAACCTTTCG AATAATCGAT TTTCAGGGGA AATCCCTATT GAAATAGGAA ACCTCGTAGG GCTCTTCAGC CTGCAGATGC AATCGAATCG ATTCATAGGT TCCGTTCCTG AAGAATTTGA TGGACTGCTA TTGATTACTA CTATCCGCAT TGACAACAAT GATCTCTCTG GCATGGTGCC AGAACAAGTT TGTGACCACT TCTCGAATCG ATTACCAAAG TTTTACTTGG ATTGCGGAGG TAGTCCCGCC AAGTTGTCTT GCCCGCCAGG AACTTGTTGT ACTTACTGCT GCGAAGAAAG CACTGGATGC GAGTGCGTGT ACGCTGGTAC CAGCTTTCAA TTCTTGTGTT AATGCAACAA AAGTGCGAGG GTTGCTACCT AACGTAGCTT GAATTTTGTT TTCGCTTTCC CTATCCTTCA GGAATAAATG TGAAAATTTG TAAACTGAAA ATGTAAGTTT TTTTATTTCG ACGCTTCAAA GAAAACGACT CGAGACCTTC AAATACGCAC CAAAGGGAGA AGCAAAATAG TCTTTACAGT AATACTACTT TAGTTCTAGA GTAAGGGTCA GGATCGTTGG TGCGCTTGAA CCACTAATGT CTAACAAAAA AGCAAGCCTT GACATTAAGG TTGACTGTGA AGTGGACGAT GATGGAGCTA ATTACGAGTG A
|
Protein sequence | MGQSESTITS PPKGTSSNPL EDGGASASSS SSSFTTRQPR TATNLTMSPP LSPQPRPSGG TNSHWETSET GASQGSDGEV VCHACGFEAS YVPEGMLHCE RCRNVSYCSL HCQQWDWTSG GHSDLCVDAR STAHEDSTTG SASNNNSGSG TTRPSEDSTL VSMDIANVFG PRSVGSASTP APAPVNRGVQ RRPETKEGGL GAYLHAHTPR TTPPTPLASR YNSPNDQDLD DDDPNEGSIV LVSDSESTDI LGMIQEESEG NEEPELQGSW RSRDNPFHKS ALGETSYDSE RDERELILAT APAATVRDAH EHAQNNNNNT VTATASFASP QRDSLKAFRA VASETTNVES HGKHSLKGFR HAYDDAPSKE KIAFSHTLIR HSDTADDTMS TTGSHPGQVN EATANRTNQN ATAVTNLALK TSINKALRDF ERLYGEEAAQ LAVLQLTQGL ITEDDVIEQA SQSPDESEQY NETTTESGDS SPADPKSSPT IVDQSLSSWG LSGIASTEGL DKPSAHSTSS LITENMSRDS KNAATLSSSS SSQQNALAFA SPHSVGATTT AASTAPLLTP AEPEDESACS DGSLCTPTES SVQAKPFTVH TPRYLQYRNS LSKSTAKSGL PGTTAVRVAH DSDTIKTQDT KNRANRQKEP TQNGAALVGE IIEGAAAVGT KRTLSPNDPP EQLREASRVV PPPAIVATPT ETSNQDFPPS IHEEKIALSM PRYLTYRSSL ARSVDRKSFS VQLSAQELES YEVGQLRMPG NGKHNNSDVP AVVATVQDNI HETERAKTAT VAAEMSSNYE LGNNEFRQSL SPSQVEVEAN KMSDSTGLAD AIGGIAALGS GAVALASTKK NSDNNQVISN VLLADSELGL ESQMVPIMRK LSPNAIKEGA PSRAESFYSR YRASLAQRLS QLFIVEDAML SGDDSDDSLN ADEERELTDQ LSSYLEKGNS KRTIEENASR PGALGHKSYD GMNSSWSGFD EENSVDANID GSRSSEVCER KFQTALVTDA GAINLREARE SARAEQARKI ELARSSSKQV MYQSVSQDSK PSKMTERRAA PLTVTRNQES YSEKRISRSN SSSSVEDAET ALSAKRRAPA PISSQCFAPK VAEYRNRKRC AMLGFIFLLV VLPLAIGLGV GLRGSNKNRS TNFLPDTQPP AGSNPTPSPD TQEPTNFLRT RAPSQSTDDT PGSPTINAPT QSPTAPRMET PIVLPIESPS PSNLNPVSSL VPSIAPTPTV SLSNAPNIMD SSQAPTVLLL NQELFRMLSD LSEDNGASIL RPFTPQRRAF EWLASTSDLD TLSNTRKVQR FSLSVFFFTS NGSLWRNNSG WLTESDECTW YSRSGRTTCD GSGVYLHLEL GDNDVAGRIA TEIGLLTGLR RLDLTGGSGS RLSSTLPTEL GVLSDLEFNK FVTTSRIDYQ SFTWIAEVVP PSCLARQELV VLTAAKKALD ARINVKICKL KIVRVRIVGA LEPLMSNKKA SLDIKVDCEV DDDGANYE
|
| |
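The GC content field can in principle be recomputed from the Gene sequence field. The sketch below is a minimal illustration, assuming the sequence is pasted as space-separated blocks of ten bases as shown above; `gc_percent` is a hypothetical helper, and the example uses only the first block of the sequence rather than the full gene.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases, ignoring whitespace between blocks."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# First ten-base block of the gene sequence, used here only as a small example.
first_block = "ATGGGACAGT"
print(f"{gc_percent(first_block):.1f}%")
```

Passing the full gene sequence, with its block spacing and line breaks left in place, would give the whole-gene value to compare against the listed figure.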