Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_49135 |
Symbol | |
ID | 7195607 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011689 |
Strand | + |
Start bp | 21263 |
End bp | 25648 |
Gene Length | 4386 bp |
Protein Length | 1305 aa |
Translation table | |
GC content | 55% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002183793 |
Protein GI | 219127128 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.272482 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCGACGG ATGCCGCGGC GGAATTGTTG CTGTCGGGAG CGGCAGTCTT GTTGCTAACG CATTCATCCA GGATCATCCA AAGGTCGACG CGGGGCGCAC GTTGGACGTC TCCCTACTCC CCAGCAACAA GCAACGCCAC GACCTCCATT GTCTTACTCT GGGTTGCGGC ATCCTTACAG CACAACGCAC TGGCAATCCA GACCAACGCC GTCTTTGCGT CTTGCCTCCT GGTAGTAGTC GTCGGTCCCG TCGCTCCCAC TACCCGAGTA TTACGCACAC CCGTGACCGA CGCAAACTTT CCCACGGCGA CAGCCGGCTA TGCTCCGACG GCCGTCTCAC TGCCCTCCCG GTACCAGTGG TGGGCCCGTA CGAGTGTGTC TGTCACGCCC ACGGCACTCT GGACCAGTTT GTTCTTTGAC ACCACGACAC GTCCCTGGTA CAACCCGATC AGCGCAACGG AACCCGTTGC GCGAGTGTCC GTCGCCTTGC TTTCTGGAGT CGCAACGTGG GTGATGCTCA TGGCGTGGCA CTATTACCGT TACCGACGAA GGAAACCTCC TGCGCAACTC CACTACAGCG ACGCGGGTTC CACTTTTGGG ACTCCTTGGA ATGTGACGGA ACATACCAAC AAGCAGAGTA TGGAACAACA TTGGTACCAA TGGACCGTGT ACGAAACAAT AGTCTGGATG GAAGACGTCG TTCGGCAACA GCAACAGCAA ACGAACGATG ACTATGAAAT TCCTTTCGAC GTGGCCACAT TCTTGGGACC GGAACAAATT CACGGTTACC AACTGCCCCA ATTGACGACT CGGGATTTGA CGGTCTTAGG TATTCCGATG GATACCGCCT TGGGCATCAC CAAGGCTATT GTACAATTAC TACAGCAGCA TCCGGGGCCC AACGGTAGCC TTTTCTACGG CAACCAATCC GACTCGACTG TATATCCGAC CATTCGTACC ACCGACTGGC TCGCACAACA CGATGCCGAA TACAATTGCG TGAACATCCG TGGCCCCCCA GTCCAAACTT ATCCGCAGCC GACGTCGTTG TCAAATTTCT ACAGCGCAGC CCCAACACAC TACCATCCAA AGGATCAGCC GCAACAACAA CAGCAGCAGC AGCAACAACA ACGGCCATTG ACTCGTGAAA CGGCCGACTT GCCCGACGAT GCACAAGAAC GAGCCCAACG GATGATGCGG GATAGGTATG GTCTAGAATT GCCAGCCTTT CGCACAGCGA CGGTCTGCAC CGACTCGGGT GGAGACACCG TCTCACCTGG GACGTATCGA AACGACGACT CGCAGACCGT CCCCCCAACC ACGGGAGCCG TTCCTCCGGA CGGTCCAGTG CCCTCCTCCT CCGCACCCGT CCATCTGACT CAGAGCAATA GCAACGGCCG GGCTCCCCCT TCTCCATCAG CAACGACACT ACCGTCCGAT TTTGTTGCCA ACTTGCCGCC ACACATCGCC GACATTCTCC GTCAGAAACC CGATTTGGTC GCACAAGTTT TGCGGTCGCA CCGACGGGCC ACCCATAGGA ACGATGGCGC TCAAGCTGTA GTTCCACAGT CTTTCCACGA GACGTCGCTA CCGCTAGACT CTTCTGTCAG GGAAACACAC TCGCCGGGCG CGGACAACAA TGACGTTGGG GATGTGGAGA GCACGCCCCT GCTGGCGGAT ACGCCTAGCG CTTGGGGTGC CGACAACAAC CAAGAAACCG CCGAGTTGCT ACGCAAGCGA CGCACTCCTA TCTATTACAA ATCATTGCAC TAATTGTAAT ATTTTATATT TTGTACCGTT ACCGACCGAC CGACCGACCG ACCGACCGAC TGATCATCAA TCACGCCAAT GGCTGGGACT ACCGGTAGCT TACAGCTATA GGCATGAGTC GCCCTTGTCT CACGATCCAG AATCGTAGAG TAGCCACCTA TTGGTGGTAC ACGCAACGGT CGTGACAGTG AGTCGTATTC GTAGACGATC TCGACAACGT ATCTTACGTG AACCGTGACG CAGCCGTTTT AGCTACTATG GTAGGCTGTT TCAACGGAAA CTTGCTATAG TTAGAATAGC CCTGTGGCAC GGAGGTGACA TTTCGGATGC CGCTCCCCGA CGAAATGGGA CTCGACGGGG AACGGATTGT GGAAATCCCA ACGCCGGGTT CGGGTGCACT CCCGGAATGC GACCCAGGGT GCGAGTACGA CTCCCATCGT TGTTGACAGT GAAAGAAGGA AGCCGTCCCG GAACAGAGAG TGTTCTCCCT ACCTATTGCC TACCCAACAG TAACGTGAGT GCAAAGGAAC TTACTTACCT ACCTACTTTA GCAACTCTCT CTTACTCTGT CTCCCCTTAT CGATCGACCT TGTGCGGGAC AGGTTTCGTT CCGGACCAAA CCTGCTCTTT CGTTACCAAT CCACACACAC AGGAAGGGGT GTACTTTTGG GTCGAGCAAT CTCACCATGG GGGACAATTT ACTCGACGAT TTACTCGACG ACATTTTGAT CGATACGGGA GCCGGGACGG AACATGATGA TCTACTCGAT CACTTGTTGG GGGACGACAA AGACGGGAAC GCAGCAATAC CAACGTTGCA GGAGACGCTC AATCGTCGTT CCCATCATTC CAGCAATGGA GACAATACGG CCGCGCACGA TGACAACGAC GACGACATTA ACGAAACGGG TCACGACGGT GATAACATCT CCTTGCCGAC CGAAACGGAA TCCGGCATCG ACAGCACACC CGACTCTCCG TTGGAAGACT TGTCGGCATC CACCGAGGAA GAGATCACCT TTGGACCAGA CGAGACCATT GTTCCAGCGA ACCCTACGTT ACGCACGGAA CCTTGGAACG CTCCGACACC GCCGTCCACA CCGTGGAAGA CGACCGATTC AAAGCACCGT CTTCGACGAC GCGCCCCCGA AGACGTGACC TTGGCCCAGA GTGACGCCTT TTTGCGGCGT GGACTCCGCA GCGGAAACGA CACCACCGTA CGTGCCAAGC ACCGTGCGCG GAAAATAAAA ATTCCAAGAG CAGAACAAAC GCGAAAGGTA ACCGTCCCAC TCACTCCCCA CTTTCACGGG CGTTCCGTCG CCCAACCCGT CGGACGTAAC GCATCCCGCA ACCAGGCTAC GTGGGCGCAG AGCTCCCGAG TTTTTAGCGA GGGACTGCGG GACGGGAAAG ATAACGTACC TACCGCAGCC AAGGATCACG ACCCCAACCA CAATCGCCTG ACGGTACCCG TCGGCCCCAA ATTCCACGGA ACACCGCACG AAATCCACAA ACCCCCCAAA TTTCACGAGG ACTTAAGCAT GGCCGAAAGT GTGGCTTCGT TTGGTCGTGG ACTCCGTCGG AGTACCAAGA ACGTTGGCAC TTCGCCCCCT AAAGCGAGGC ATCGCGGTGT AACGGTTCCC GTGGAGCCCA AGTTTCATTC CACTACGCAC TATACCCGAA ACCGCCCAAA AAGCGCGGAA GAACAAGAAG CAGAGCTCAT GGAGTACTAT AAAGCGAACC CTATCAAGGC AAATCCGTTA CCGAATTACC TCGCTGCCAG TGGTGGCAAC GCTCGTCCAC ATCACCGATC CAAGCAAGTA GACGCCGCTA GCTATGTCCA GAAGGACGAA CTCGACGCAC TGGAGTGCCG AAAACAGTTC AAGGCACGAC CCATGCCACA CTTTTCGGAA GAAGCGTTAC TTTCGAAAAC CCACGAACGT CGACCGCTGA CGCAAACGGA GCCCTTTCGC TTTCGCGTCG CCCGTTTGGA CTCGCCCACG CGTAAAACTA ACGCAGCACC TACTTCCCCC GAACGCAGTG TCGACGACAA TGTCGCGGTG GCATTCAAGG CTCGCCCCGT TCCGAAATCG ACCTACGTAT CCCCACCACG CAAAACCAAG ACCCCGCGGC CTTGTGTTCC CAATTTGACG CCCCAGGTAC TCACACATCC ACGCTCCTAC GATCCGGCAA GGGATGCCTC ACGACACATG CAGGCCGACA ATCGCGTGCG TCAACAGGCG CAAGCCAAGC AAAGTCGGCA ACGGGATCAG CATTGGCAAG ATATGCAACA TGCCATGCAA AGCGTAAACA TGGGAAAGTC TACACCTACA ATCAAACCAT TTCAACTTGA ATCCGTGACC CGTCACGAAG CCTACCAAGC TGAGTTAGCC ATCAAGCGAG CACAGGAACA ACGGGAACTT TATGAAAGGG CGCAATTCAA AGCCCGACCA GTGGGATTTA CAAAGCAAGC GTAGGCATGG GGACCGCAAC ATGACAACAT TCAGAAGGGG ACATAGAATC GCCGGGGGTT CAAGGTGACG CACAAGTTAT TCGCTGTATG GTACTCTTAC ATGACTAGGT AGAAGAGTTG ACGGTAAATG ATAATCGATG CCTTTG
|
Protein sequence | MSTDAAAELL LSGAAVLLLT HSSRIIQRST RGARWTSPYS PATSNATTSI VLLWVAASLQ HNALAIQTNA VFASCLLVVV VGPVAPTTRV LRTPVTDANF PTATAGYAPT AVSLPSRYQW WARTSVSVTP TALWTSLFFD TTTRPWYNPI SATEPVARVS VALLSGVATW VMLMAWHYYR YRRRKPPAQL HYSDAGSTFG TPWNVTEHTN KQSMEQHWYQ WTVYETIVWM EDVVRQQQQQ TNDDYEIPFD VATFLGPEQI HGYQLPQLTT RDLTVLGIPM DTALGITKAI VQLLQQHPGP NGSLFYGNQS DSTVYPTIRT TDWLAQHDAE YNCVNIRGPP VQTYPQPTSL SNFYSAAPTH YHPKDQPQQQ QQQQQQQRPL TRETADLPDD AQERAQRMMR DRYGLELPAF RTATVCTDSG GDTVSPGTYR NDDSQTVPPT TGAVPPDGPV PSSSAPVHLT QSNSNGRAPP SPSATTLPSD FVANLPPHIA DILRQKPDLV AQVLRSHRRA THRNDGAQAV VPQSFHETSL PLDSSVRETH SPGADNNDVG DVESTPLLAD TPSAWGADNN QETADRFSYY GRLFQRKLAI VRIALWHGGD ISDAAPRRNG TRRGTDCGNP NAGFGCTPGM RPRVRVRLPS LLTVKEGSRP GTESVLPTYC LPNSNQLSLT LSPLIDRPCA GQVSFRTKPA LSLPIHTHRK GCTFGSSNLT MGDNLLDDLL DDILIDTGAG TEHDDLLDHL LGDDKDGNAA IPTLQETLNR RSHHSSNGDN TAAHDDNDDD INETGHDGDN ISLPTETESG IDSTPDSPLE DLSASTEEEI TFGPDETIVP ANPTLRTEPW NAPTPPSTPW KTTDSKHRLR RRAPEDVTLA QSDAFLRRGL RSGNDTTVRA KHRARKIKIP RAEQTRKVTV PLTPHFHGRS VAQPVGRNAS RNQATWAQSS RVFSEGLRDG KDNVPTAAKD HDPNHNRLTV PVGPKFHGTP HEIHKPPKFH EDLSMAESVA SFGRGLRRST KNVGTSPPKA RHRGVTVPVE PKFHSTTHYT RNRPKSAEEQ EAELMEYYKA NPIKANPLPN YLAASGGNAR PHHRSKQVDA ASYVQKDELD ALECRKQFKA RPMPHFSEEA LLSKTHERRP LTQTEPFRFR VARLDSPTRK TNAAPTSPER SVDDNVAVAF KARPVPKSTY VSPPRKTKTP RPCVPNLTPQ VLTHPRSYDP ARDASRHMQA DNRVRQQAQA KQSRQRDQHW QDMQHAMQSV NMGKSTPTIK PFQLESVTRH EAYQAELAIK RAQEQRELYE RAQFKARPVG FTKQA
|
| |