Gene Information |
Locus tag | PHATRDRAFT_50171 |
Symbol | |
ID | 7198948 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011695 |
Strand | + |
Start bp | 261348 |
End bp | 263570 |
Gene Length | 2223 bp |
Protein Length | 740 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002185000 |
Protein GI | 219129658 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.00469189 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGGATA GTGCATCCTC GGAAAGTTCC AAGGCTACTA TCGGCGTTAC TCTTAGCAAC GCTATACCGA GTGATGCGTC GTCACGAGCC CGTTCCATTC TGGGCTCCAT GAAGGAGCAC CGGGAAACGT TTTCACGGGT CGTACACCAT GGACATTCCA GCCATGGCGA TGATTTCCGT ATGGCCTCAG CTGTGTCGTT GTCACCCAGT AATCCATTTT CGTCAAAGTC TGAAGATACC ATGCTGACGA TTGCGCAGCA ACTAAACGAT TCCTTGAACG ATAGGCGGCA GCGACTGTCT CGAGTTTCGG GTATCGATGT CGGTAACAAA ACAACTACAG TCGACAGCGC CGCTGGAGAG AACGTGCGTA CAGCTCCGAC ATCCCCAGTG AGTCAACGTT CGGGACACAC GGTTGCAACG GAAGAATCCA CGGTGGCAAC GGGAACTTCG TCACCTGGAC AGTCGTCCTT CGATCCGTCG TATGCGACAA CCCAGCCTCC CGATTCTCCG ACACAACAGC AGCAGAGACT TGGAATAGTT CAGTCTCGGG AACCCGTTGC GATTGCCTCC ACTGCGCAAT TGCCATCGAA CGAGGAAGTT GAGAATCCTG ATATTCTAGA GAATGCGGAC GTCCACGAGA AACATGGCGG TTCCGACAGC AAGGCAGCTT GTGAATCGGT ATCTACCGTC GACGTTAGCT TCGGTCCGGA GACGCAGTTT CGGCATGTTC GCAGCTCCAA GTCGTTTGGT GACAGTGCGC TAGAAAGTCG TAATGCTTCC GTTCCTCGAT CCATCATCAC AACAAGCTCC TCGTTTGGTC ATTTTTCATT TTCATCGTCT CCGTATGTAA CGGACAAGAC TACATCCAAG CAGGAGACTG ATTTGCGTTT CTCTCCTTTG CCCCGGCTGT GTCATATACT TCCTGCGTCT ATACCCCCAG TGCCATTGGA GGATCCACTT TGGGCAGCAG TCGATGATTG TGCGTCTCCC TTGCTGCGTG GATCCAATCG AACGAACGAC AATTTGGATA ACACCGCTGG ACAAACGGGC AATATTCAAG GCTCATATGC CACGCTCTAC CCTAATCAAA TTGAAGCTGA TGCTTGTATT GATGAAATAC ACTCAGCTTC GCAGCGCTTG CAATACGCAT CAGATGATGA CAATCATCCG GCGCAAGGTG AAGCCGAGTA CTCGCAGCAA TCGGCAGCCC GAACGTTACG CAGACCACAC GAACTCGAAA AGAACGTATG TGATTCGACG TCTCCGGATT ACCGAGAATT GCTCAACATT TCAGTCGTTG AAGAGTATGC TGACCGGCGC CGACTTCTCA AAGCCTTGCG TACCGACGAC CGTCGACTAA AACAAGCTCC AACGGAGCTC ACCGATGCCT CCAACGCTGA AGAAGCAACG TCAAATTCGG ATGCGCATGG AATCGGAGGG AGTTCGGAAG AAGACGATCC TTTTAGCAGT GAAGATTCGT CGCTGGATTG TACTACCGAC TCCGATACAT CGGGTTGGGT CAGTAGTAGC GATTCAATTA ACAGCGCGTT ATCGACTCGA GTCGGCAAGC GATCAGTCGA CTTTTCCTCA CGGGAAGGAA AGTGCAGAGC AGAATTGAAA GCGGACTGCG CATCCGCCAA CGTTTCGACA TCCCTGCTAC TATCTTCTAG TAACGACACG GCACAAAAAG TACCGCGGTC GGAACGGTTA ATTTGTCAAG GTGGACTGTT CTGGATAGCT TTACGATCCG ATTGGGATCA GTCTCATCCT GATCGAATTC AAACACTTCC CAAACCGAGC 
TCGGCTGACG TTCTTCACCT CTCGTCAGAT CTTCGTTGCG GCACGCAAGG ATCCATGTAT GACCTCTGTC TGAAATCTGG CTTCAACACG AATGCCAAAA CATGGGAACC TGTTGATTAC GGCACGCAAG GATCTATATA TGACCTCTGC CTGAAATCCG GCTTTAGTAC GAATGCGAAA GCAACGGAAC CTGCTGAAAT TCGAGAAGAG CAATTCAACG AAGCACGAGG TGGTCTTTGG TATTTGGCTA GTCGCTCTCC TCTCAGTAGG GACCGTCGGC TAGAGCCATA TTCGGTTAAA GGAAAACCTC GGACAAGACC TCACGGCGCA ATCAAAATTC AGTCTGACTA CAAAACAGAT CGAGAGTGTG GAGCGAAAGG CACGCTGCAT CTATGCATGA CGTTGCAGCT ACAGGTGAAT TAA
|
Protein sequence | MKDSASSESS KATIGVTLSN AIPSDASSRA RSILGSMKEH RETFSRVVHH GHSSHGDDFR MASAVSLSPS NPFSSKSEDT MLTIAQQLND SLNDRRQRLS RVSGIDVGNK TTTVDSAAGE NVRTAPTSPV SQRSGHTVAT EESTVATGTS SPGQSSFDPS YATTQPPDSP TQQQQRLGIV QSREPVAIAS TAQLPSNEEV ENPDILENAD VHEKHGGSDS KAACESVSTV DVSFGPETQF RHVRSSKSFG DSALESRNAS VPRSIITTSS SFGHFSFSSS PYVTDKTTSK QETDLRFSPL PRLCHILPAS IPPVPLEDPL WAAVDDCASP LLRGSNRTND NLDNTAGQTG NIQGSYATLY PNQIEADACI DEIHSASQRL QYASDDDNHP AQGEAEYSQQ SAARTLRRPH ELEKNVCDST SPDYRELLNI SVVEEYADRR RLLKALRTDD RRLKQAPTEL TDASNAEEAT SNSDAHGIGG SSEEDDPFSS EDSSLDCTTD SDTSGWVSSS DSINSALSTR VGKRSVDFSS REGKCRAELK ADCASANVST SLLLSSSNDT AQKVPRSERL ICQGGLFWIA LRSDWDQSHP DRIQTLPKPS SADVLHLSSD LRCGTQGSMY DLCLKSGFNT NAKTWEPVDY GTQGSIYDLC LKSGFSTNAK ATEPAEIREE QFNEARGGLW YLASRSPLSR DRRLEPYSVK GKPRTRPHGA IKIQSDYKTD RECGAKGTLH LCMTLQLQVN
|
| |
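
The coordinate and length fields in the record above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the unspliced CDS includes its stop codon (the gene sequence shown ends in TAA). The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Amino-acid count for an unspliced CDS that includes its stop codon (assumption)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


# Values taken from the record for locus tag PHATRDRAFT_50171.
length_bp = gene_length(261348, 263570)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)
```

Under these assumptions the computed values agree with the Gene Length (2223 bp) and Protein Length (740 aa) fields listed in the record.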