Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_50435 |
Symbol | |
ID | 7199250 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011698 |
Strand | - |
Start bp | 58226 |
End bp | 60356 |
Gene Length | 2131 bp |
Protein Length | 527 aa |
Translation table | |
GC content | 57% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002185422 |
Protein GI | 219130542 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 34 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CTGGTTCTTT GTATAATAGT ACCTACCGGT GCCAACAACG ACAACAACAC AACAACAACT TCAAGATTGG GTTCCAGTTG ATTCAAAAGG TAACAAGTAC TCTGCCATTG TAGGTCGTAG AATAGGTGAG GCGCAACCTA CCAAACTCCA AACTAGTAAC AACAACACGT TGTCTTGGGA GCCAAAGAGT CTTTTGGGGG TTTTTCTCTC TGATTCGTCT CGAGCTCGGG AGCTTCTCCG TTGTACTCAC GCTAGGGGGT AGAAAAGGAA TTCTTCTTTC CTGTCGTGAA CAATGGCGAC ATCCACCAAT CGGGAACATT GGTCGCATCC TCCGGCGGTA CACCCTGCCC GACCCCCCGT ATACGACTCA CCGTACGAAC GTCGCGAAGG TCCTCCCCCT ACGTACGCGA CACCCTCGCC GTACCAGCCG AGAGGTCCGC CTCCGGACCA TCACTACTAC TATCCACAAC AACAACAACA ACAACACATC CACACGCATC CCGGGGTCCA CCCCGTCCCA GCGGATGGTC TCACGGAGGA GGAGGAGGAG AACCCTACGA ATCCTACGCT GGGGGACCTG CGGCGTACCG GGGATCCTCT GCCGCCACGT ACGGCGCACC GTCCCGGAGG GCCGCCTACC CTCACGTGCC GGTGCGTCCA CCCCGACCTC ACGACTTTGC TCCGCCGCTT TCGCGTTCCG GTTGGAGACC TCCGGTAGCC ACGGAACGAC ACGTGCGTGC CAACACCAAT CCTCCGCACA AGTCGCCTCC CAAAGCGGGA GATACCGGTT GTGCGACCAG TGCCTCTACC AATCGTGTCA GTAGCCGTAC CAAGAAAAAG GATGCGGATC CCCTGGCGCT GCTCGCCAAG GTGAGTTCCA CCATGGAAGA CGACGAGGAC GAAGAAGAAG TAAAGGAAGC TTCCCCTACA TCCCCTTTAC AGCGCCGCAG TCAGGAAGTC CCCGTGGCGA CCGCTACGCC CAAGTCCGAT CCGTCGACCT ACGTTCCCCG ACAGCCAGCG TACACGTACG GTCCCAAGCC CATTACGCCC ACCGGAAACT ACTCTCCCTA CGATGACGCT CCACAGTACC ATCATCACCG GGGACCGGCC TATCCACCAC CACAAACCCG GGGATCGTTG CCCCCTCATC CGTATCCACC GCCTTCCTAC CTTCCGCACG CTCGAGTCGG TCCTACGGAA TGGGACACGG GACGTCCCGT CATGGTGGAG CGTACCTCTT TTGATAGCCA CGACAGTGGG GACTACAGTC GCGGAGCGCC ACCACGCTAC TACTACGATG GTCCGGACTA CGGACCGAGT TCGCCGCCAC AGCACTACTA CGCTCGCAGT GGTTCGTACC CTCCCTCGTA TGGTGGTCCA CCACCCCAAT GGGGATACGA AGGTCCGCCG TCACACGCTC CATTCTACCC CCCGCGGGGT CCCCCCACCG CGCACGGCGG ACTTTACGAG CGACCAGGTC CGGAGCGAGA CGCGCCACCA CCCCCGTACC AGTCCTCGAC GGCCGCACCC TACACATACG TGCAACAACC AACCCTGGAA GAGAAAACGG TTCTGCGCAA AAAGTTTTCG TGGAAACATT ATCCAGAGGT ACGTTTCGCC GCAATAACAT ACCGTTCGTT CAGTTTACTC GAGTGCGAAG GTACACTGCT CACACCATGC GCTTGCTTAT GTTACCTCTT TCTCGGGACA ACAGCTGGAA CGATTCTTGA TCTCGCATCG CGACGAGTAT TTGGAACATT CCAGTAAGAA CTACACTGCG GAACAAAAGA CGTACAACAA CTGGTTGACC AATCAGTTGC TCCATTTGGC TGCCCAACAC AACTACCAGT TTGATCCGCA AGCCTTCACC TTTGTTGCCA TTCGCGATCG CATTCGTTGC TACTACAAGT CCTACGTCCA GACGGCCCGG AAACGAGGAC TGCCGCTGCC GACTACTCCA TCCATTAAAA AGACCAAGTC GCCACGCGAC GGCGACGAAG AAGTGTCTTT GTCGACGCCG CCAGTGGAAT CCAAAGGTGT GACGTCATCA GGCAACAATG AAGGCGAGAC TGGACGAGCG GCGAACGACG CGATGACAAC CGATCCTCGG GAGGATTCCG GAACCGATTG A
|
Protein sequence | MATSTNREHW SHPPAVHPAR PPVYDSPGWS HGGGGGEPYE SYAGGPAAYR GSSAATYGAP SRRAAYPHVP VRPPRPHDFA PPLSRSGWRP PVATERHVRA NTNPPHKSPP KAGDTGCATS ASTNRVSSRT KKKDADPLAL LAKVSSTMED DEDEEEVKEA SPTSPLQRRS QEVPVATATP KSDPSTYVPR QPAYTYGPKP ITPTGNYSPY DDAPQYHHHR GPAYPPPQTR GSLPPHPYPP PSYLPHARVG PTEWDTGRPV MVERTSFDSH DSGDYSRGAP PRYYYDGPDY GPSSPPQHYY ARSGSYPPSY GGPPPQWGYE GPPSHAPFYP PRGPPTAHGG LYERPGPERD APPPPYQSST AAPYTYVQQP TLEEKTVLRK KFSWKHYPEL ERFLISHRDE YLEHSSKNYT AEQKTYNNWL TNQLLHLAAQ HNYQFDPQAF TFVAIRDRIR CYYKSYVQTA RKRGLPLPTT PSIKKTKSPR DGDEEVSLST PPVESKGVTS SGNNEGETGR AANDAMTTDP REDSGTD
|
| |