Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_36381 |
Symbol | |
ID | 7201788 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011678 |
Strand | - |
Start bp | 319836 |
End bp | 321779 |
Gene Length | 1944 bp |
Protein Length | 515 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002180983 |
Protein GI | 219120492 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGACGAAG ACGAAGGACA GACTATTCAC AGTGACTGTC GCAAGTCAGT CGCTGGCGAT CGACGGTACT CGAAATATTT ACAAATTGTA AGTTCAAATG TACACTTTTG AAATATAAGA ATCTACTGCC ATTCCCATGG TCGTAACTTC ATTCGTTTGG TGGTATCTTA CAGTTAATGC TGGTCGGACG GAACGTGATT CGTACCCGAG AGTATCATTT TCCACGCCAC GATGGGATTC TCTTTTTCGC AGAATTTTGG CACAAGGCAG GAATACGAAG AAATCGACGG ACCGGGCCGT ACACGAAGCC CACACCGAAG CAAGCTCGTG ATTGCGACAC TTTCCACGTT GCTAATTGCT TTTGCTATCG TCACTCTAGC TACCAGACTC GTCGGCTTGC CCAGATTGGT GGCGGAATCG CAAGATTCGT TGACGAATCT GGTCCCACTG ATCGCTCCGG AACTCGACTA TGGAGAGGAA AGCAACGAAG AACGATTCCA GCGATGTCAG GGAATCGACT GGGAACAGGC GTGTCGGAAT TTGGCGGGAG GGGCTGGGGC ACGCCGGAGA CGCCTAGTAG TGGAAAACGA TCACCCCGCC GCACCGTTCG TGGCCTCCGA TTTGGAAGAA GCCTACCTCC TCGATGTCGA TGATCCGTCG GTCGTCTACG ATGAGCACTG CCTGCGAGTC TACCGCCTCG ATTTACTCCA AAATATTACC TTCCCCTACC ACGCCAACAA CTTACTCAGG GCTGGCGGAA ACCAGACCAT GACACTCTTT ATCCAACACG GTGCCATGCG GGACGCCGAC AAATACTTTT GCTCGTTCCG ACAACTCATG AAATCGCAAA CGTATCGGCC GTTCGACGAC ATTCTCATCA TTGCGCCCGA TTTTAACTAC AAGCAAGACG TTGGCGTACT CCCAACGGAC GCCTTTTGGA ACTCATCGAA ACCTACGGGA GACTGGAGAG GCGGTGCACA ATCCGATCCC GAATGCTGCA GCAGCGATCT GACTCTTTCG AGTTACGAAA TCTTGGACCA CATGCTGCGT ATTTTGACCA GCAAAAAACT ATATCCGCGA ATGGACAAGG TACGTCCATG CTCTTGTACG TGTTTGCTTT GTTCGTCAAG GGCATACCCC ACATCGTAGC GGACTGTTTT GACACCGGGT ATCAAACTTG TCTGACAAAA CACCATGTTC TGCTGTTGCC ATATTTTGTG GTCATTCTAG ATTTCATACG TGGGTCATTC AGCGGGAGCG CAAATGGTGC AGCGTTACGC GCTTACGAGT CGATTGGCGG CCAAACATGA TGCCCATAAC GATGCGGTTG CACTGGAGTT CGTCGTAGCC AATCCCAGCT CGTACGCATA TCTGGATAAT AGGCGATGGA ACTACCATTG TGGGGAATGC CAATGCACCA GGAGTAACTG TACTTGCTCA CAAGACTGTA GTATTCCTCA CCGTCTTGGT GTTCCCACGA CAAAGAAGGG TACAGAGGCG CAATATGAGC AGTGGGTCTG CGCGGACGGT TCGTATAACT CATGGCCGTA CGGAATTGAC CTGGAACGAA AAGAGTATTT ACCGCCGTAC ATTCTAAATG CAGACATGGA ACGTGCCGTC CGACTGTACA GACAACGCAA CGTGATCTAC ATGGTTGGTC AAAATGACAC TTGCAACGAT GGCTTGCCAA CCTGCGATTC GAGTTGCTGG AAACGCCTTG ACTTTCTGCC GGGCGAAGAA CCATGTTTTC GCAACCATAT GGACAATCGA TGCCCCGCCA TGCTGGAAGG CCCCAATCGT CGCACACGGG GTCTCCAATA CATGGACTAT TTGCGAGAAA TCTACGGGAA TCATACACAC GTTCTACATG TGATTGATGG AGTAGGTCAC AACGCCACAG CCATGTTTAG TTCAACTGTG GGTCTGCTGG AACTGTTTGA CTAG
|
Protein sequence | MDEDEGQTIH SDCRKSVAGD RRYSKYLQIN FGTRQEYEEI DGPGRTRSPH RSKLVIATLS TLLIAFAIVT LATRLVGLPR LVAESQDSLT NLVPLIAPEL DYGEESNEER FQRCQGIDWE QACRNLAGGA GARRRRLVVE NDHPAAPFVA SDLEEAYLLD VDDPSVVYDE HCLRVYRLDL LQNITFPYHA NNLLRAGGNQ TMTLFIQHGA MRDADKYFCS FRQLMKSQTY RPFDDILIIA PDFNYKQDVG VLPTDAFWNS SKPTGDWRGG AQSDPECCSS DLTLSSYEIL DHMLRILTSK KLYPRMDKIS YVGHSAGAQM VQRYALTSRL AAKHDAHNDA VALEFVVANP SSIPHRLGVP TTKKGTEAQY EQWVCADGSY NSWPYGIDLE RKEYLPPYIL NADMERAVRL YRQRNVIYMV GQNDTCNDGL PTCDSSCWKR LDFLPGEEPC FRNHMDNRCP AMLEGPNRRT RGLQYMDYLR EIYGNHTHVL HVIDGVGHNA TAMFSSTVGL LELFD
|
| |