Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATR_46872 |
Symbol | |
ID | 7204427 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011679 |
Strand | - |
Start bp | 675231 |
End bp | 677680 |
Gene Length | 2450 bp |
Protein Length | 670 aa |
Translation table | |
GC content | 52% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002185929 |
Protein GI | 219121409 |
COG category | |
COG ID | |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.916479 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CGTTGACCGA ACTCAAAAGA CCTGCACGAC AATTTTTTGC AGCAACCACC AAAAACACAA AGTAGGAAGT ATCCTATCGG ACAAAGGCAA TACTATTCAC CGCATGCAGA GGAACGCTAT GCATTCTCCC TCCAGATCAT CGCGGACAAC GGTGTCATCC AGAAATCATA GACAGCGGAG CTGCTGCAGT GTCGCGACGT TCGCCCAGTG CCTCGTGTTC GCTTCGCTAC TCCACCAAGG TGCGGAAGCG CGCCTTCACG ATGCCGGCAA CCGTTCGAGG CGAGCGTCGG CATCCAACGA AACGGGTGTC GATCTCGTAG ACGTAGTGAT CGGTTTCGAA AAGGGCGAAG TACAACGCGG CGGAATCAGT GACTTGGAGT TGCTGGAGCG CGGTGGATTG CTCGGACTCA CCGCAGACGG CGCGCGTTCG GATTTTCGAC GCACCAATGC CGTGTCGATG CAGATTCCGG CTAGTGAAGT GAACAGTCTG CGTAGTACGA CGGGAATCTC CTTTGTAGAG TACGATGCCA AGGCTTTTAT GCACGAAGAT GAGATCGTAA CAACCCGTGC TACTCAGGAA ACTAATCTTC CCTGGGGGCT CGCAGCGATC CAAGCCAACA GCAAAAAGAT TCCGTTACCC TCACCCAGTA GCGACTGTTT CAAGATCTGT ATCGTTGATT CAGGTCTTCT TGTGGAACAT CCAGACATTG TAAGTCGGAT TTGTTTCTTG TCAACAGAAC TTTTTGTTGT TGAAGTTCGA TTGCTCACCA TTGTTTTCTA TGGCACTCTT TTTTAGCCGT ATACACGAGG AGCTACGAAC ATTGCCGGGG AAGAGTTCAA CATTGGAGCA CTTAGTAAAT GGGATGAACC TATTGCAATT GCCAACCATG GAACACACGT TACGGTACGT ACTTTTGTTT GGCCATTGAA ATACTCCAGA TGTGACACGC ACGCTTCGCA AAAAATCTGA CTCAAAACCT CGTATTGCTG CATTTTCAGG GTACCATGTT GGCTACCGGC ACTCGCAACC CGAGGGTGGT TGGTGTAATA CCCAACAACC AGAATATTTG TCTGCTCGTG GCTCGTGTAT TTGGTGACGA AGTCAACCCA GGTGCCTCCA CTTCGACTAC CGATCGTGCG GCGGAATGGT GTGGAGACAT GGGTGCTAAG GTTATCAACC TTTCCTTCGG TAGCCCCACA TTTTCGACCA ACTCGGCCGC CATCTACAGA AGCTTGCGAT CCGAAGGAGT TCTGTTGGTT TCCTCGGCCG GAAATGCACA AGACAATTCC AAGTCGTACC CGGCATCATA CTCGGACGTT ATTTCCGTTT CATCCGTCGA CAAAGACCTA CAGCGATTCT GGACGTCCCA ACACAACGAC CAAGTGGACT TGGCCGCCCC CGGAGTCGGC ATTGTATCGA CCGTTCCCGG TATGGGTTTG TTCGACGAAC TTGGTAGAGA GTACAGCACG GCGCTCATTG ACTACTCTCC TTTTATTACA TCTCCCCTTA CTGCGGGTGT GGTCCAATGT GGACTAGGCA CAGAGCCGTG CAAGGATGCT GAAGGGCGGA TCTGTCTAAT ACAAGTTGGT TCCAACGCAA TAGACGACAA GGCTTTTCAT TGCGAAATGG GTGGCGGCGT TGCAGCCGTT ATTTATAATA CTCAGTTTAC AACCGCTCCG ACCCGCGGCG CTATTAACAG CTTTGTTTCC ATCCCCGTCA TGGGCGTGAG TTACGCAGAT GGCCTTGCAT TGTTATCGAA AGAATCTATT ACTGTCGACC TACAAGTCCC AAGCTACACC GAAAAACGCG GCACCAGCAT GGCCGCCCCG CACGTTGCGG GAGTTGCCGC CAAGATCTGG GCCGCGCGAC CTGCATGTTC CAATAATCAA GTACGCGACG CTCTGGAAGC TACCGCGAAG GATCTTGGCG ACCCTGGTCG CGACGACATG TACGGACACG GACTCATCCA AGCGGAAGCC GCCTACGAAT ATCTTCTTGC CTTGCCAGCC CCTTGCGGAA TTGGCACTAC CGGTACGCCC AACACGATCC AAGGCTCCGA TAGTGATAGC ACTCCTACGG CAAGTAGCGG TGGGAACACT CCTACGGCAG GTGGAGGCGG AAGCACCCCT ACCATTGATC GGGAAACCTT TCTGGCAAAC TTGGCCAAGG AGGGGACCCG ATCACAGGCG AATGATAAGA ATTCTTTGCT CAAGTCCATT CAAAATAAAA CAGGCCGGGT CCGGGGTGGC GGAGGTGTTC GTCGCCGGTT CCTGAAGGGT AGCAAGGCGA AACAATAAAA CAAAGCACCG GTGCAACCGT TGCAATTGAC GATTCCGGCA AAATTTAGGC GCCAGTGTTT TGCCGGAATG TTACCACTTA GCGTCCTCTC TTATTATATC CATAGATTCT TTCAAGAGGA AATAACACAG TAAGATCATT TTTTCCTCTT
|
Protein sequence | MQRNAMHSPS RSSRTTVSSR NHRQRSCCSV ATFAQCLVFA SLLHQGAEAR LHDAGNRSRR ASASNETGVD LVDVVIGFEK GEVQRGGISD LELLERGGLL GLTADGARSD FRRTNAVSMQ IPASEVNSLR STTGISFVEY DAKAFMHEDE IVTTRATQET NLPWGLAAIQ ANSKKIPLPS PSSDCFKICI VDSGLLVEHP DIPYTRGATN IAGEEFNIGA LSKWDEPIAI ANHGTHVTGT MLATGTRNPR VVGVIPNNQN ICLLVARVFG DEVNPGASTS TTDRAAEWCG DMGAKVINLS FGSPTFSTNS AAIYRSLRSE GVLLVSSAGN AQDNSKSYPA SYSDVISVSS VDKDLQRFWT SQHNDQVDLA APGVGIVSTV PGMGLFDELG REYSTALIDY SPFITSPLTA GVVQCGLGTE PCKDAEGRIC LIQVGSNAID DKAFHCEMGG GVAAVIYNTQ FTTAPTRGAI NSFVSIPVMG VSYADGLALL SKESITVDLQ VPSYTEKRGT SMAAPHVAGV AAKIWAARPA CSNNQVRDAL EATAKDLGDP GRDDMYGHGL IQAEAAYEYL LALPAPCGIG TTGTPNTIQG SDSDSTPTAS SGGNTPTAGG GGSTPTIDRE TFLANLAKEG TRSQANDKNS LLKSIQNKTG RVRGGGGVRR RFLKGSKAKQ
|
| |