Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_47698 |
Symbol | |
ID | 7202705 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011682 |
Strand | + |
Start bp | 566491 |
End bp | 569191 |
Gene Length | 2701 bp |
Protein Length | 852 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181934 |
Protein GI | 219123235 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.389088 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCGGATC CGGAACGCAC CAAATCTCCT CTGCGACCTC ATTCTTGCAA AGCAGAGACG GGCGACCAAA CATCGCAAAT GCACTCTCTC CTAAGCTGGA TGCAGCGCAG TACAGAGTCA TCGACGCCCA CTACTGTCGC GAGTACGCCG GATTGCTCCG ACTCTACCCA ACAGCAGACG CCACCTTCTA TTTACGATCT GCAAGATCGT TACGTCACCA CCGGCTCTCT CGCAATACCG GAGCAAACGG AAGACTACGA TGATGACGAT GACGTCATGT TGGAGGAGCG CTTTGGCGGG CGCGATCTCG ATCGAGAGAT CTCTCCGGTT TGCCGCACCG GCCTGACGGT GCGGACGAGG TCTTTACCCG GTTCTCCGTC CGAAGTAGAC AACTCCTGGG AGTCGCGGGA AAGCTACGCT TCCTATCCCA TTGGACGTAA ACTGCGCTCG CGTCGCAAGA GCTCGACGCA GGGTATTGTC CTCAAGAATC GTCGCCGTAG ACGATCCAGC CGTGCTTCCC GAGACGATGA TGACAATTCG CTCACCACCT TGTCCTTTTC GGATGATCCA CCCTCTCCGA ACAACGATTC TTTGCCCCGA ACAATCCCCT ACCGCAGAAT GTCTTCCCGC CGCTATTCAC CCGCGCGCTC GCTGGGGGAG TCCACCACAT TTCGGAATTT TGTTTGGAAA TTGGGGCACC CACGAGGTCT CCTATGGCTC ATGTGCGTTA TTGGACTCGT CAGCATCAGC ATGACCTACA TGTCGACGCG GAATATGGCG GGCCGTCCAG GAGAAATTGG AAGGATTCAA GTGCTTTACG CTGGAAACCA CCATGTCCCC CTGAGGACGT CGCGGCTGAA AGGAGGTGGT GTATCGGGAA ATTTCTTGCG TGGTTTTCCA GCGCCAACAT CTACGTCAAT CGCCGCCAGG AAATCTGTGC CGGAACAGGG ACGGAAGTCG GCGGTCACTT TGAATCATGC CATTGTGGAC TCCCGCAAAA TCTCAGAAGT AGGAGTTGAT GAACACCTGG CGAAAAACTA TGCTTTGAAG AAAAATGATG ACCGCAAGCA CCTTGAAGTG CATAAATCGA AACATGACGA AGGCAAACAT CACCACAGCC ATGACGAACA CAAGCATTCT GGGAATATGA AGGACAAGCA TCACATTGAG CACAAGAGTA CGGATAAGAA TGATGATCGC AAAGATCCGG CAAATCCAGA AATACAGCAT GAAAAACGCC ATGACAGCAG TGCTACGAAC ACTGAACTGG AAAATCACAA TGAGCCGAAA AACTCGCTTA CGTCCGTAGC AATGCCGTTG GTTCCTCTCA GCATTCCGAA ATCCGACTAC GAGGCTCAAG ATTGGAGATT GTACAGGGCA CCTGCCCACA GTCCCTCAAA CGTTACGCAC CACAGGATGG TATTTGTTGA TCCATCGTTG GCTCACGCCC CTCTTCGTCA CCGTAAGGTT GTGTCGTACC CATCCGATTA TACCGACCCG ACTCAGCTAT ACTCCATCCT GGATTCTGGC GACGAACGCA TCAAGCGCAT GGAACCCCGG GATCCATACG TGCAGGGCGA ATGTGTGCCA ATGCAGCCCT GGCAAACAAC ATACCACCCC TTATGCAACG GCGTACACGA GCTGGGTATC GATCAAATTC TCGGCGAAGA GACCGGTAGT GATATGCGTC TGTTTGGAAC AAAAGGATTT TGGCGTAACG CGTGGAGAGC GGATATTTTG AATGGACATT CGCATTTGCA CGAACGAGAT ACTATTGTTA TCAAAACTCT CAAGTATGTA TGCCCAGGAA TAAATTGCTT GATTGTTACT TACTGGATAT TTTACTAACT CCTCGATTTG TCTTCAGGCT ACAACATAAC TTTGAAGAAG CGCATTTTGA GCATGATCGT ATTGATGCGG TCGCGATGGA GCGCTTAACA TCGTCGCCGC ATGTTATTAA TGTGTTTGGA TTCTGTGGAC ATACGGTCAT GACTGAATTC GCTGACGGAA AGCGTCTTGG AGAATTGGCC GATCGAGCCA AGAAACAACC GTTGGAACGG CTCAAGATCG CTCGCGACAT TGCTGAGGGT CTTGCTGATG TACACGGCAT TGATGGAGAT GGCAATGTAT CTTTTGTTCA CTTGGATATC AACCCTGCTA ACGTGGTCAG CATAGGAGGC CGTCTGAAGT TGAACGATTT CAATATTGGT GTTCCAAGGC GCTGGAATAC TACGTCAAAT GAACCTTGCG GTTTCCCAAC ACAATATCCG AACGCGCAAT GGCGGTCACC GGAGGAGGCT CGCCAAGAGG AAAATCTGAC AGAAAAAGTT GATATCTTCT CCCTGGGTCA CATATTTTTT AGAATGATTT GCGGTCATGA GCCCTGGAGT TCATTTGAGC CGGGTGGCAA GCCCTCCGCC GACGAACTAC ATGAGAAGGT TGGAAGAGGT GTTTTACCCA CAATTCCTAC GAATGTTTTG GAGTCGAAAG ACCCCGAAGT TGTCGCCATT CGAAACGCGA TGATCCAATG TTACACTTTC ATTCCGTCCG AGCGTCCAAG TGCGAAACAA ATTGCCCGAA ATTTGCAGAA AGCTCTGGAC AAGGCCGAAC TAGTACAAGA GGCTCGGCCT TAAATCGAGA CAAAAGTGCG TGATATCTTG GCCATCAAGT AAACGGTTAA TAGACAGACT TTTCCTATGT G
|
Protein sequence | MPDPERTKSP LRPHSCKAET GDQTSQMHSL LSWMQRSTES STPTTVASTP DCSDSTQQQT PPSIYDLQDR YVTTGSLAIP EQTEDYDDDD DVMLEERFGG RDLDREISPV CRTGLTVRTR SLPGSPSEVD NSWESRESYA SYPIGRKLRS RRKSSTQGIV LKNRRRRRSS RASRDDDDNS LTTLSFSDDP PSPNNDSLPR TIPYRRMSSR RYSPARSLGE STTFRNFVWK LGHPRGLLWL MCVIGLVSIS MTYMSTRNMA GRPGEIGRIQ VLYAGNHHVP LRTSRLKGGG VSGNFLRGFP APTSTSIAAR KSVPEQGRKS AVTLNHAIVD SRKISEVGVD EHLAKNYALK KNDDRKHLEV HKSKHDEGKH HHSHDEHKHS GNMKDKHHIE HKSTDKNDDR KDPANPEIQH EKRHDSSATN TELENHNEPK NSLTSVAMPL VPLSIPKSDY EAQDWRLYRA PAHSPSNVTH HRMVFVDPSL AHAPLRHRKV VSYPSDYTDP TQLYSILDSG DERIKRMEPR DPYVQGECVP MQPWQTTYHP LCNGVHELGI DQILGEETGS DMRLFGTKGF WRNAWRADIL NGHSHLHERD TIVIKTLKLQ HNFEEAHFEH DRIDAVAMER LTSSPHVINV FGFCGHTVMT EFADGKRLGE LADRAKKQPL ERLKIARDIA EGLADVHGID GDGNVSFVHL DINPANVVSI GGRLKLNDFN IGVPRRWNTT SNEPCGFPTQ YPNAQWRSPE EARQEENLTE KVDIFSLGHI FFRMICGHEP WSSFEPGGKP SADELHEKVG RGVLPTIPTN VLESKDPEVV AIRNAMIQCY TFIPSERPSA KQIARNLQKA LDKAELVQEA RP
|
| |