Gene Information |
Locus tag | PHATRDRAFT_49177 |
Symbol | |
ID | 7195625 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011689 |
Strand | + |
Start bp | 134455 |
End bp | 138915 |
Gene Length | 4461 bp |
Protein Length | 1336 aa |
Translation table | |
GC content | 56% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002183822 |
Protein GI | 219127188 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.116329 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGCACGC AGGCTGGCTA CTGGCGAGGC TGTCGGCGAA ACGGGAAATC ACGCACGAAT GTCCGGTCGT TGGTCGAACG AGACATTTCG ATCGTGGTTG GTTGGGAGGG GGCGGTGGAA TGGAATGGAT CGTACGGGAC CCCCCTCCCC ACCCGCGGGA CGGTGCCACC CCTAGAGGGA CCAACACGTG TGTCGTCCTA CGTCACAGGA CACGCAACAA CGACACATTC CCTCGAAACG AATGACCGGA CCGTGCGGCG CCCGGTTCAC TACCCAAACT CGTTCGCTCG CTCGGACGGA CTCGAACGAT TCCCAAACGG TGCTTCCAGT TACGGTACTC GTGTGGGTAA GCGTATTTGT ATGTTTGTAA GTGTGTATAG AGATAGGTAC GTGTGTGTGT AAGAGTGTGG GTGTGTATGT ACATTACCCA CCATCGGATG GATCGATCCG TCCATCCCCA CTCCCTACAC ACGTTCCTTC CGCACACACT AGTAGTTCAC CAAACCCTCC CTTCCTCTGT TCTCTCCGTT GGGTCTCTGT CTCGAGTTGT TGGTAATCCA GTACTCGTGT GTGGTCGCTG GTAGTAAGTA GGGACTCCAA AATGTCGGGT TCCCCCACGT CCCGGACTCC GGGACGCACC GGAGCCGGAG CCGGCGTCCG ACAGCGATCC AACCACAACC ACAATAATAA CAATAACGGT AACAGTCAAC CGGGACGATC CTCACGCAAC CACAACGACA CGCGTCGAAA CGCGTCCCGA TCCAACAACA ACAACAACAA CAGTAGCCGA CACCGCCACA AAGGGTCATC CAACCCATCT TCCGACGGGA CGCAGTTGCT GCGTGACGAG GTACGTCGCG TGGACCAGCC ACGAGTGTCA AAGGACACGG TAGGCACACA CATATACACT CACACACACA CACACACCCA ACAACAACAA CCACAATAAT CCATTGCAGG ATGTTCTCAT TGCCTTTATG CCGTACCTCA AAGACAGTCA TTCGGAAGCG GCACGAATAC ACCAAGCGTA CCAGTCGGGC AGTGATCAGC GGGCCTTTCT GCAAGCGGCA CGGGCCTTTT TGGAACGACA ACGTCGCACT GCTCCGTCCG TAGCACCACC AGCGGCAACA CCGGAACCAG CACCAGCACC CACCACGTTG CCGGCGACCA GTGCCGACGC TACGCTCTTC CCTCATCCGC CCCTTCTCCA ACCGACCCCC TCGGCAATGG ACCAGTCTCC ATCCGATGTC CACACCATGC TGTTTGGACC CACCAGTGTG TCGCAGCATT CTCTCTTTTC CCCGTTACAC CCGACGCAAA CCGAATACCG TCCCGAAGCC CTCGAAGCGC GAGGACTCGA TGAAGCCTCC CGTAGCCAAC GTCTACCACC GCCCGTGCCC ACCAATCGTC CCAACCCTCC ACACACGCAC CACCACCTCA ACAACAACAA TAATACACTC TCCATGCGCT CTTTAGAACA AGGACTGCAC GACCTTTTGA ATGATGAGCC CGTGGCGTCT CTACCCGTAC AGGGAGCATC CATCTGGGGT GCCACTCCGG TGGACAGAAT ACCCCACGGT CCTCCGCCTC CGGTGACTCC GCCCCGAACA GACCCGCTCC CGTCCCACGC GTACCCCTAT AACACGGTGC AGTCGACTGC GACTCCCTGG TCCAGTCCCT CCCGTACCCC AATCCCACCG GGATTGGCTC CGGTCTTCTT GGCTCCCACT CCGTCCCCGG CGGTACCGCA AACCGCTGCC GATCCCCCCA GACAACAAAC ACCGAAACTG TCCGAAGTCC 
CACCCAAGAC CGCCAAAAAG GAAGCGCCCC CTCCGAAACG CTGCTGGACG TATTTTCACG AGCAGCCCGG TCGTTTGAAC GTGGACGGAA CATCCGGAGA AGACTCGAGC GTCGTTTTGC CTCTGGCTCC CCGCTCGGAA TTGACGGCCC ACTGGACGTT GCCCTTGACC TACTTGCGCC AGTACGCACT AGAACATTTG GGTCCCCGCG TAGGCTTGGA AAAAGTCCTG GCCGGACTCT CTGTGGGTCT CTTTCGACGG GGCTGCACCG AAAACGGCTC CCAGGCGTCA ATCATTTCCA AATGCGTCCT CGGTGAGAAC AACCGCACCT ACAAATTCTG GAAGGACGCC GCATCCGAGT CCGTCCACGG AAAAGTCCCC TTTTACTCTC CCCGGACGCC CGGTCACGTC ATACTCCGTA TGTACTGGGA CAAGAATCCC CTCTACACAT TGGCCATGGG GCCCACGCTG CTGGTCCGGG TCACCGAAAA CGAATTTGAC GGCAGCGTCC GCTTTATCCT CAGCAACTTC AAGGCCAAAA AATCGAATCC GACGTCTTTG TCCTCGCTCC ACTCGTTGGC GTCGGTACTG GAGACGCCTT TGAGTCGACC GAACGAGTCG GCGGCTCGCG CAACGTGGGG TTGCGTGCAA GAGGCCCGCA AAGTCCTGGA CGCTTGCGCG GCGGAATACG CCAAAACCAG TGCCAAACTC TCCACGTTGG AAGAGACCGT GGACGAACTC AAGAAGCGTG TGGAGGCCGA GGAAACTTCC TCGCCGAGTG ATCGTTCTCT TACGGACGGC GAAAGCCGCG ACGGGGCGGA TCCGCAGCTC GACTCGGATC TTACTTTGGA CTTGCGGGAA AAGACCAAGA CCTTGATGGG TGGACGGGCC TCCTGTGAAC GCAAATGGAG AGATTCCCAA TTGGCCTTTG CCAGTATCCT CCGCGCCGTC GTGAGCAATT CCACCCTGTC CGTACTACTG CGTCGGGACT TGATTACGAA AATGCGGATC GAATACGAAC TCTGGTGTCC CCTGTCGGAA GAATTTGCCA TTCCCAGTGA CACGGCCAAA ATGTGGTACG AGCCACTGAA GGATCTGCCA CACACCATCA CAGCGGACGA TTTTCGTTTC TACGCGCAAG TACGCACGAA AATGCAAATG CGGACTCTGG GGTTCGACCC CAATCTCGTT TCCCTCGAAG ATATTCTTTT CCCGCCCATT CGCGGCAAGA CCAAGCAGAG AGTCATGGAT GCGGGCGCGG TTGGCGTTTT CAACAACGTG TCCGCGGCCA TGGGTCAATA CTTTCAAAAT CTCTACATGG ACGAGGACCG TATCGTGCGT CAAAGACAGA TGATACAAGA ACGTACCCAG CACTGTGTTG AGGAGTGCGC CGCCTTCCCG CTGGGGACCA AGGTAGTCAT CTTTGGTTCG TCGGCGAACG GTTTCGGATC TCCCAAATCA GACCTGGACA TGTGCTTACA ACTTCCCGAG GGCTCCAGAT TGAACCACGA AGCAGGTGGG GAAGCCATGG CAAAATTGGC GCAGTACTTG GACACTTTTG GCATGAAGAG TGTGGATACG GCTCGTCTTA CCGCTCGTAT TCCCATCGTC ATGTTTCAAT GTCCCAACCC CATGTCCACA GGAAACGGTG AGGACGATCT AATTGAGTGC GATCTGTCCA TGCACAATAC TCTGGCCGTT CTGAATACGG CACTCTTGCG GACCTATGCG GAAATTACTC CCGTCACTCG TGTTTTGGCT GCCATTATCA AGCGATGGGC GAAAGCCCGC GACATCAACA ATCCAGCTCG 
ACACACTTTG AGTAGCTATG GCTACATCAT AATGCTTCTA CACTTTCTTT CCTATCATAA ACGCAACGGC AATGGCCTCG TTTCGGCGGT CGCTCCGCCG GAAGGAAACG CGAGCTCGCG GAAGGAGTCC GACCCGTCGT CGACTCCGCT GCTACCCAAC CTACAGTGGA TGGATCCCGC GTGGCCCAAC TTTCCTAAAG GAACGCCTTA CAAAGAGCTT CGCTCGCTAC CGAAAGACAT AAAGGAACAT CCACTTGAAG AGAAGAAGAC CATTAATGCC TACTTTTACA AGCCCAGTAC ACCCAACGAT AAGGCTCTTT TGCAAATGCT TTTTCCCGGT CAGGATCTTT CTCTTGCCAT TCTGTTGGCC TCCTTTTTCC GATACTATGC ATACGAATTC GACTACAAAC GACAAGTTGT AAGTCTACAC TCAACGGCCT CCCGTGGTGT TTTGGAACGC GAGGTCAAGG CCGAATTGGA TGGGTGGAGA AACTACAGTG CCGCCTTGAC TATCGAAGAT CCGTTCGAGA CCTTTTACGA CGTGGCGCAC GTTTTGCGCG GTGGATACTA CCACCGCATT CGGCGCGAAT TTACCGTGGC GTATTCCAAA ATTGCCGACG CCTCTGCTGG ACGTACAGGC AACTGGAATA AGGGAGATTT GCGATCAATG AGTGGGGAAG AGCTGGTTGA CTGGATATGT GAACCCGTGT CGAGTGAATC CAACGATACC AATGCCCCTT GAATATAAAA AATATACATT TGTAAGCGAG TGTGTCGGGC CCAGTCGGAT CAGCTTTCCC GAGTACCCAA AATAGATGTT AAAAATTGCG CAATCTTCCT ACAGTTAATG T
|
Protein sequence | MGTQAGYWRG CRRNGKSRTN VRSLVERDIS IVVGWEGAVE WNGSYGTPLP TRGTVPPLEG PTRVSSYVTG HATTTHSLET NDRTVRRPVH YPNSFARSDG LERFPNGASS YGTRVEIVSR DSKMSGSPTS RTPGRTGAGA GVRQRSNHNH NNNNNGNSQP GRSSRNHNDT RRNASRSNNN NNNSSRHRHK GSSNPSSDGT QLLRDEDVLI AFMPYLKDSH SEAARIHQAY QSGSDQRAFL QAARAFLERQ RRTAPSVAPP AATPEPAPAP TTLPATSADA TLFPHPPLLQ PTPSAMDQSP SDVHTMLFGP TSVSQHSLFS PLHPTQTEYR PEALEARGLD EASRSQRLPP PVPTNRPNPP HTHHHLNNNN NTLSMRSLEQ GLHDLLNDEP VASLPVQGAS IWGATPVDRI PHGPPPPVTP PRTDPLPSHA YPYNTVQSTA TPWSSPSRTP IPPGLAPVFL APTPSPAVPQ TAADPPRQQT PKLSEVPPKT AKKEAPPPKR CWTYFHEQPG RLNVDGTSGE DSSVVLPLAP RSELTAHWTL PLTYLRQYAL EHLGPRVGLE KVLAGLSVGL FRRGCTENGS QASIISKCVL GENNRTYKFW KDAASESVHG KVPFYSPRTP GHVILRMYWD KNPLYTLAMG PTLLVRVTEN EFDGSVRFIL SNFKAKKSNP TSLSSLHSLA SVLETPLSRP NESAARATWG CVQEARKVLD ACAAEYAKTS AKLSTLEETV DELKKRVEAE ETSSPSDRSL TDGESRDGAD PQLDSDLTLD LREKTKTLMG GRASCERKWR DSQLAFASIL RAVVSNSTLS VLLRRDLITK MRIEYELWCP LSEEFAIPSD TAKMWYEPLK DLPHTITADD FRFYAQVRTK MQMRTLGFDP NLVSLEDILF PPIRGKTKQR VMDAGAVGVF NNVSAAMGQY FQNLYMDEDR IVRQRQMIQE RTQHCVEECA AFPLGTKVVI FGSSANGFGS PKSDLDMCLQ LPEGSRLNHE AGGEAMAKLA QYLDTFGMKS VDTARLTARI PIVMFQCPNP MSTGNGEDDL IECDLSMHNT LAVLNTALLR TYAEITPVTR VLAAIIKRWA KARDINNPAR HTLSSYGYII MLLHFLSYHK RNGNGLVSAV APPEGNASSR KESDPSSTPL LPNLQWMDPA WPNFPKGTPY KELRSLPKDI KEHPLEEKKT INAYFYKPST PNDKALLQML FPGQDLSLAI LLASFFRYYA YEFDYKRQVV SLHSTASRGV LEREVKAELD GWRNYSAALT IEDPFETFYD VAHVLRGGYY HRIRREFTVA YSKIADASAG RTGNWNKGDL RSMSGEELVD WICEPVSSES NDTNAP