Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_45764 |
Symbol | |
ID | 7200785 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011676 |
Strand | + |
Start bp | 177548 |
End bp | 182359 |
Gene Length | 4812 bp |
Protein Length | 1603 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002179989 |
Protein GI | 219118433 |
COG category | |
COG ID | |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAAGCCT CGCAAGCCCA ACTGCAGCCA CAACAGGCGC CGCCGGTTGC CGCCCCGCTT CCGTCAGCGG CGGCGCAACA AGGATCGGCG GCGGCGCCGG CAGTGTCGCA CTACTCATCA CAGCACTTGT CACAGGTGCC GGCTGCGCAA CCCGGTCTGG CGCCGCAATC GCAAGGAGTT GTTCATCAAC AGCGCCCAGT GTCGCAAGGT ATAAACTATA CGACGCAAAC GTCACAATCG CAGACGCCGG CTCAGCAGCA GGCTCCCCCG CAACAGCATT TCCCTCACCA AGGACTCAAC GGCGGATGGC AAAGCGATAA GGACTATCAA GAGCGCCGGA AAATGATCGC CAAGATTGTG CATCTACTGC AACAGCGCAA GCCGAATGCT CCGCAGGAAT GGTTGAAGAA ACTTCCGCAA ATGGCGAAAA GATTGGAAGA GTCGCTGTAT CGCACAGCAA CGTCCTTTGA AGAGTATAAC GATGCCAACA CGTTGAAGCA TCGTTTGCAG CAGTTGGCGG TGAACATTGG CCAAAAGACC AAGAAGCTGC AACAACAGCA GGCGTTGTTG GCTCAACAGA GACTACAGCA GCAGCAACAA CAACAACAGG TTCGGCAGCA GTCTGATACA ACTCAGTACA CTCCCCAGGT ACCGCCATCT ACTTCACAGC AAGCATTAAT ACAGCCGCAG ACGCAGCAAG GTCTAATACC TCAAGTGAAG AGCGCCGCGC CACCTGCGTC ACAAGTACAA GGTCAACGGA TGGTAAACAT GTCGGAAATC AACCCGATAA TGGGACAACC GACCCAACAA CAGCAAAGCG CCCCAGCTCC ACAACCGCCG CCTCTCCAGC AAGTGCAGTA CACACAAAAG CCGCCCGTGG CCCCCATTTC AGCCCCCACT CCTCAGGCTC CCGCACCGGC CGCAAACGGC CCCAATGGCC AGGCTTCTGG TCGACAAGTT TCGGATCGTC AACAGGTGTT ACGCCATCAA CAGCAACGGC TGCTGCTACT GCGACACGCT GCCAAATGTC AGCATGAAGA CGGAAAATGC CCAGTAACGC CGCATTGCGC TGGGATGAAG AGATTGTGGA AGCATATTGC CGAATGCAAG GATCAAAAAT GTCTTGTACC GCATTGCGTC AGTTCCCGGT ACGTGTTAAG TCATTATCAC CGATGCAAGG ACGTTCGTTG TCCAGTGTGT GGCCCAGTAA GGGAAGCCAT TCATCGAAGC CACGAAAAGC AGAAACAGAT GCAGGCCCTA AAACAGCGAC ATCAGCAGGC CGTACAGCAA CAGGGCCAAC CTCAGAATGC GACTTCAGCG CCCGCCGCTA TCGGTGCTTT GCCAGTCCCC GCTCCTCCTG GACATAGTTT GGAACCTGTT ACCAAGAAAC AGCGCACCGC GCCCATTACA GCTTTGAGAG CTCCGATCAT GCCAGTTCAG CGACTTCAGC AGCCACCAGG TACTCGTCCG GCCGTTTCGC ATCCAACCAC AGTAAGACCT GGATATACCG GAAGTCAACC TCCCATCACT TCTGGTCCTG GTGGTCCACC GGTTGCGCAA GTACCTGGCC TAGCGTTTGC GAACGGACAA GTAGTAATGC CGAAACATTC AGGACCAAAG CCACAAGAAG ATCACACTTT GATCAACTGC TTCTCTGTCC AGCAAATTGA GACGCACATA TCTTCCTTGA GCAATGGGTT GGTCCTGCCT CCGCAGAAAT TGAAAACGAA AGGATTGGAC GCTCTTAAAA CGCTGCAGTC GCACCAACAT GCGTGGGTAT TCAACACTCC AGTGGATCCC GTGGAACTCG GCTTGCCGGA CTACTTTGAG GTCATCAAAA AACCAATGGA TCTAGGGACA ATAAGGAAGA AGCTCGAAAA TGGCGTTTAT CAGAGGCTGG ACGACTTCAA AGAGCATGTA CTGCTTACAT TTGATAACGC CATGATGTAC AACCCGGAGG GTTCGGTTGT GTATAACATG GCTAATGAAA TGAAGGTAAA GTTTCAGAGC GACTTCGTAA AGCTCATGGA ACAACTGAAC GCCGAAGAAG ATGTCAAGCG AAAGAACGGG GAGGCCTGTT GTTTATGCGG ATGTGAAAAG CTGCTATTTG AGCCTCCTGT ATTTTATTGC AACGGAATAA ATTGCCCTTC GAAGCGAATT CGGCGAAACA GTTATTACTA CATTGGAGGG AACAACCAAT ATCACTGGTG TCACCAGTGC TATCAAGAAC TCCGCGACAA TTCAACCATT GATTTAGGCG ACCTTTCCGT TAAAAAAGAA AGTCTCGTGA AGAAGAAGAA TGACGAGGTG CACGAAGAGA GCTGGGTACA ATGCGATCGT TGTGAAAGAT GGGTTCATCA GATTTGTGCT TTATTTAACA CTCGGCAAAA TAAGGATCAG CGATCCGAAT ACGCTTGTCC GAAGTGTACA ATTGACGAAC GAAAGGCAAA AGGCGAGCTT GAGGCAAAAT CGTCAACTCC GATGGCAGAG GACCTCCCTC GTACCAAGCT GTCCGAGTAC TTGGAGAATC ATGTGCGTGA GAAGGTCGAT GAGTTCGTTG AACAGAGGTC GCAGGATATG GTTGTTGCTC AAGGTTGCTC TATTGAAGAA GCCAGAAGCA AACTTAAGAT GGGAGGTGCA ATCACTATCC GACAGGTAAC TTCCATGGAC AGACGACTTG AGGTCCGAGA TAGAATGAAG CAACGCTATG CATTCAAAAA CTACCCGGAA GAATTCAATT TTCGGTGTAA ATGCATCGTT GTCTTCCAGA ATTTGGACGG CGTTGATGTT GTTTTGTTTG GCCTTTACGT ATACGAGCAT GATGAGAAAA ATCCTGCCCC CAACAAGCGG GCCGTCTATG TGTCCTATCT CGATAGTGTT CATTACATGA GACCACGTGA TATGCGTACT TTCATTTACC ACGAAATTTT AATATCTTAT CTTGATTACG TCCGGAGGCG TGGATTTTCG ACTGCTCACA TTTGGGCTTG TCCGCCGCTT CGCGGAGACG ACTACATCCT TTACGCAAAA CCAGAGGACC AGAAGACCCC GAAAGACGAT CGATTGCGTC AGTGGTACAT AGACATGCTG ATTGAGGCCC AAAGGCGAGG GATTGTTGGG AAACTTACCA ACATGTACGA CCTCTATTTT TCCAACGAGA AAAACGATGC AACGGTTGTC CCCTACATGG ATGGTGACTA CTTTCCTGCT GAGGTTGAGA ATATCATCAA GGATATTGAG GAAGGCAAGA CGGGAAAGAA AGGCAGTTCG CAAGGCAAAA AGAAAAAAGA AAAAGCCAAA CAGAAGAAGA AGTCAGGTCG TGGCGGAACT CGGTCTACGG GATTGGATGA AGACGCTCTT AAAGCGAGCG GATTTCTGCC ACCCGGTACT GATTCAAAAA GTCTAGAAGA AGGCGCTCGA GACTACGTCA TGGTGAAACT TGGTGAGACC ATCCAGCCCA TGAAGGAAAG TTTCATTGTG GCTTTCTTAG GCTGGGAAGG GGCGAAAGAG GGAGACATGG TTGTTCCCAA TGAGATCCAA GAGCACCGTG ACCTGCATGA GATCACTTGG AAACTTAAAA GCAGTAGCAC CAAAGCTGAT ACAGTGGAGA CTATCGAGAA CGAAAGCGAT AGGCAACAGG ACGCCGAGAT CAAAGATTCT AGGGATAAAA AAGGGGACAG TTCGATAAAG TTAAACGGTA CTACTTCAAA GAAGCCGGAT GACACGTCCT CAAGCTCAGG AAACATCGAA GACACTGCCA GCACACATAG GGCTCATGTT GACACACCGA TGGAAGGGAT TGTAAAAAAT GAATTTACCG AAACCAATGG AATTTTGCAA TCCTCACCTC AAGAGAATAA AGACTCTGAA TCCATCAACG CTCCTGCGCT TCGTGTTGGA ACTGAGGCTA TTGATCGCCC GGATGCTCCG CAGTCCGCGA TAACTGCAGC ACCAAACACT ATTTCTATCC GAGAGGGAAA ATTCGCTGCT ATGGCGGCCC GGAAACGTGA TAGAGAAGGG GAGCCGAAAG AGCCCGAGGA GGTGGAAAGT ACAAGTGAGA AGACGAAGGA AGAAAAGCTG ACTTCCATAA CAGTGACTGA TAGCAAGGGC CGTACTGTGA AAGTTTTGGA TGACGACGAG GAGGAACTTG ACTGCGAGTT TCTAAACAAT CGACAGGCGT TCTTAAATCT ATGTCAAGGA AATCACTACC AGTTTGATCA CCTGCGCCGC GCAAAGCACT CCTCCATGAT GGTTTTGTGG CACCTTCACA ACAGGGATGC ACCAAAATTT GTGCAGCAAT GTGCGACTTG CTCCAGAGAA CTTCTTACCG GATATCGCTT TAATTGTCCT ACATGTGGGG ATTTCGATCA GTGCCAAGAC TGCATTTCCA ACCCGAAGGT TCCTCGGCAC CCGCATCAGC TCAAGCCTAT TCCGGTGGCC AATGCGCAAC AAAACGAATT GACGGAAGCG CAACGCAAGG AACGACAGCG CAGTATCCAG CTTCATATGA CTCTTTTGCT GCATGCTGCT ACGTGTAGCT CGCCGAAGTG TCCGTCAGCC AATTGTACAA AGATGAAGGG TCTTTTAAAG CACGGCGCGC AATGCCAAGT GAAGGCCACT GGCGGTTGCA ACGTATGCAA GAGAATATGG GCTTTACTGC AAATTCATGC TCGTCAGTGC AAAGCGAAGT CTTGCCCTGT TCCGAATTGT ATGGCAATCC GTGAAAGAGT TCGCCAATTG AAAAAGCAAC AACAGGCGAT GGATGACCGT CGTCGCCAAG AAATGAATCG AGCTTACAGG GGGAAGCGCT AA
|
Protein sequence | MQASQAQLQP QQAPPVAAPL PSAAAQQGSA AAPAVSHYSS QHLSQVPAAQ PGLAPQSQGV VHQQRPVSQG INYTTQTSQS QTPAQQQAPP QQHFPHQGLN GGWQSDKDYQ ERRKMIAKIV HLLQQRKPNA PQEWLKKLPQ MAKRLEESLY RTATSFEEYN DANTLKHRLQ QLAVNIGQKT KKLQQQQALL AQQRLQQQQQ QQQVRQQSDT TQYTPQVPPS TSQQALIQPQ TQQGLIPQVK SAAPPASQVQ GQRMVNMSEI NPIMGQPTQQ QQSAPAPQPP PLQQVQYTQK PPVAPISAPT PQAPAPAANG PNGQASGRQV SDRQQVLRHQ QQRLLLLRHA AKCQHEDGKC PVTPHCAGMK RLWKHIAECK DQKCLVPHCV SSRYVLSHYH RCKDVRCPVC GPVREAIHRS HEKQKQMQAL KQRHQQAVQQ QGQPQNATSA PAAIGALPVP APPGHSLEPV TKKQRTAPIT ALRAPIMPVQ RLQQPPGTRP AVSHPTTVRP GYTGSQPPIT SGPGGPPVAQ VPGLAFANGQ VVMPKHSGPK PQEDHTLINC FSVQQIETHI SSLSNGLVLP PQKLKTKGLD ALKTLQSHQH AWVFNTPVDP VELGLPDYFE VIKKPMDLGT IRKKLENGVY QRLDDFKEHV LLTFDNAMMY NPEGSVVYNM ANEMKVKFQS DFVKLMEQLN AEEDVKRKNG EACCLCGCEK LLFEPPVFYC NGINCPSKRI RRNSYYYIGG NNQYHWCHQC YQELRDNSTI DLGDLSVKKE SLVKKKNDEV HEESWVQCDR CERWVHQICA LFNTRQNKDQ RSEYACPKCT IDERKAKGEL EAKSSTPMAE DLPRTKLSEY LENHVREKVD EFVEQRSQDM VVAQGCSIEE ARSKLKMGGA ITIRQVTSMD RRLEVRDRMK QRYAFKNYPE EFNFRCKCIV VFQNLDGVDV VLFGLYVYEH DEKNPAPNKR AVYVSYLDSV HYMRPRDMRT FIYHEILISY LDYVRRRGFS TAHIWACPPL RGDDYILYAK PEDQKTPKDD RLRQWYIDML IEAQRRGIVG KLTNMYDLYF SNEKNDATVV PYMDGDYFPA EVENIIKDIE EGKTGKKGSS QGKKKKEKAK QKKKSGRGGT RSTGLDEDAL KASGFLPPGT DSKSLEEGAR DYVMVKLGET IQPMKESFIV AFLGWEGAKE GDMVVPNEIQ EHRDLHEITW KLKSSSTKAD TVETIENESD RQQDAEIKDS RDKKGDSSIK LNGTTSKKPD DTSSSSGNIE DTASTHRAHV DTPMEGIVKN EFTETNGILQ SSPQENKDSE SINAPALRVG TEAIDRPDAP QSAITAAPNT ISIREGKFAA MAARKRDREG EPKEPEEVES TSEKTKEEKL TSITVTDSKG RTVKVLDDDE EELDCEFLNN RQAFLNLCQG NHYQFDHLRR AKHSSMMVLW HLHNRDAPKF VQQCATCSRE LLTGYRFNCP TCGDFDQCQD CISNPKVPRH PHQLKPIPVA NAQQNELTEA QRKERQRSIQ LHMTLLLHAA TCSSPKCPSA NCTKMKGLLK HGAQCQVKAT GGCNVCKRIW ALLQIHARQC KAKSCPVPNC MAIRERVRQL KKQQQAMDDR RRQEMNRAYR GKR
|
| |