Gene Information |
Locus tag | PHATRDRAFT_13053 |
Symbol | |
ID | 7201626 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011678 |
Strand | - |
Start bp | 98507 |
End bp | 101014 |
Gene Length | 2508 bp |
Protein Length | 725 aa |
Translation table | |
GC content | 57% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002180942 |
Protein GI | 219120406 |
COG category | |
COG ID | |
TIGRFAM ID | |
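The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive (an assumption, though it reproduces the listed Gene Length); `span_length` is a hypothetical helper name, not part of any database API.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


# Values copied from the record above.
gene_length = span_length(98507, 101014)   # matches "Gene Length | 2508 bp"
coding_bases = 725 * 3                     # bases needed to encode 725 aa
non_coding_bases = gene_length - coding_bases

print(gene_length)       # 2508
print(coding_bases)      # 2175
print(non_coding_bases)  # 333
```

The 333 bp left over after accounting for the 725 codons is presumably intronic, which would be consistent with the "Is gene spliced | Yes" field.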
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.34373 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | GAAGACGAGA CTCGTCAACA CCCGTACGAT GTGGAACGTT GGCTTGTGTA TCTCGACGCT GTCGACGATT GGATGGTCAC CGATCAGCAG TCCCCGTCGT TCCGGCGTCT CGTGGGCCAA TGGATTGGGC AACGCGCACT CTGCCGGTTG CCCCGGAGTT ACAAACTCTG GAAACGGCAC TGGGAGTTTC TCGTCGATAC TCTTTCTTTA CTAGACGACG ACGACAGTAA CGACAACGAC GACAACAACA ACAACACTAC GTCCGTCGTG GTAGCCTTTG AACGGGCGCT CGTCACCTTG TCGGCCTACC CCCGGGTATG GGTTGCCTAC ATCGACTTTT TGCGGACCCA TCCGGGATGT TGTTCCGTTA CCCACGTGCG TCGGACCGTC AATCGCGCGC TACAGACGGT CGCAATCGCC CAACACGAGA AAGTCTGGCC CGGGATCGTG GAATGGTTCG GAACCGAACC GCACGACGAC ACCACGCCGA CGCCCCGCTG GACCCTGCCT TTGGAAACAC GCGTACGCAT CCTACAACGT TACGTTACCT TCCAACCCCG CTACGGACGA GACTTGTGCG ATTTCCTCGG ACGACACGGA TTGTGGGGAC AAGCTGCCGT GGCCTTTCAA CGCTTGTGGA ACGCCAATCC GGCAAGTACA CGGAGTGTGG AATTGTTGGC TACCGATAGT GGTAGTAGCA GTGCGGCTGC CGTCACGACA CACACTGCCA CCACTGCTAG GAACGACAGG TCCACGATAC GACCCGACCT GGACGATACG GCCTGGGCCG ACTTTTGTCG TCTCGTTACG ACACATCCCG TTGAAGTCCA ACAAGCTGGG GTTCCTTGGG AAGCCATGCT CCGGGCCGTC CTGCCCGACT CGAACATCAC CCAATCCTCC AATCCACGCC ACCACCACAA CAACAATACA CACCCGCACC GCAATACGAA TGCCTTGGAA GCCCTCGTTT GGACCAGTCT GGCCGACGCA TGGATTCGAC AAGGCCTCTT CGATCTGGCA CGGTCCGTCT ACGAAGAAGG CTTGCAGAAG GTACACACGA TACGAGATTT TTCCATCCTC TATAACGCCT ACCTTACCTT GGAAGAAGGC TTATTGGAAG CGGCCGTCGC CACTGTGGAC GCCATGGAAG ACGACACGGA CGAACACGAC ACGACGAGCC ACGAATACGA AACCACATCG CAATTCGCCA CGCTCCCGGA CGACGCGGAC GATTGGGACA TTTTGTTGGG GACTTCGTCG GCATCGCAAC TAGCCGATAT GGAACTGGCC GTGGCTCGGG CCGAGCATTT GACGTCCCGA CGACCGCTCT TGCTCAACGC GGTTCTCTTA CGACAAAATC CGCACCACGT GGGGGAGTGG CTGGAACGCG CCAAACTCTA CCAGTCGGTA AATCAACCCG GACAAGCCAC GGCCACGCTC GAAGAAGCCT TGCGCACGGT CGTGGCCAAC AAGGCCGTTC ACGGCCGTCC GTCGGAACTC GTGGCCGCCC TCTCCAATCT GTACGAGACG GTCCGCAACG ACGCGGCCGC GGCTCGGTCT ATGCTGGAAC GTATTTGTGT CCATCACGGC TATGCCTTTG CCAAAACGGA TGATCTAGCC GAATGCTGGG CCACGTGGGT AGAACTGGAA CTGAAACAAG AAGCTTGGGA CGACGCCTTG TTGTTGGCCC GACAGGCCGT CGCTGTCGGA TCCGGTACGC GGAAACTCCA TCTGACACAG TCACTACGGC TCTGGGATCT CTTGTTGGAT TTGGAGGAGA GTTTGGGGAC AACGCAGACT 
ACCAAGGACG CCTACAACCG TGCCTTGGAA ATCAAGGCGG CCACCGTCCA ACACGTATTG AATTACGGTA CGTTTTTGAC GGAACAAAAA TATTTCGAAG AATCTTTTAC GGCCTACGAA CGCGGTATTG AGCTCTTTGC CTTTCCACAC GCCGGGGCCA AACTCTTGTG GAAAGCCTAT CTCGAAGCTT TTTTGGACCG GTACCAGGGC ACCAAAGTAG AGCGAGCACG AGATTTGTTC CAACGCTGTC TGGAGGCGTG CCCGGCCGAG GACGCTGCCG ACTTTTACAT GATGAACGGA GAGTTTGAAG AAACCTACGG TCTGACACGG AGAGCGCTTT CGGTGTATCG TGCCATGTGC CACAGGGTGC CGAAGGAAGA GCGGCTGGTT GCGTACCAAC TGTACGTTGC CAAAACCATC CGGTACCTGG GTGTGACCGC CACGCGTGAT ATCTATCAGG AAGCGATCGA AAATTTGGCC GACAAGGATT CGTCAAAACT TTGCGTCGAA TTCGCGAAAA TGGAAACGGG ACTGGAACAA CTGGACCGCG CCCGGGCAAT CTTTACGTAC GGGGCACAAA TGGCCGATCC GAGACGCTTG CCAGAATACT GGAAGACGTG GAACGAGTTC GAGATTGCCC ACGGAAACGA AGAAACGTTT CGCGAAATGT TACGAGTGAA GCGGTCCGTG GAAGCCGCAT TCTCCACG
|
Protein sequence | EDETRQHPYD VERWLVYLDA VDDWMVTDQQ SPSFRRLVGQ WIGQRALCRL PRSYKLWKRH WEFLVDTLSL LDDDDSNDND DNNNNTTSVV VAFERALVTL SAYPRVWVAY IDFLRTHPGC CSVTHVRRTV NRALQTVAIA QHEKVWPGIV EWFGTEPHDD TTPTPRWTLP LETRVRILQR YVTFQPRYGR DLCDFLGRHG LWGQAAVAFQ RLWNANPAST RSVELLATDS GSSSAAAVTT HTATTARNDR STIRPDLDDT AWADFCPLVW TSLADAWIRQ GLFDLARSVY EEGLQKVHTI RDFSILYNAY LTLEEGLLEA AVATHLTSRR PLLLNAVLLR QNPHHVGEWL ERAKLYQSVN QPGQATATLE EALRTVVANK AVHGRPSELV AALSNLYETV RNDAAAARSM LERICVHHGY AFAKTDDLAE CWATWVELEL KQEAWDDALL LARQAVAVGS GTRKLHLTQS LRLWDLLLDL EESLGTTQTT KDAYNRALEI KAATVQHVLN YGTFLTEQKY FEESFTAYER GIELFAFPHA GAKLLWKAYL EAFLDRYQGT KVERARDLFQ RCLEACPAED AADFYMMNGE FEETYGLTRR ALSVYRAMCH RVPKEERLVA YQLYVAKTIR YLGVTATRDI YQEAIENLAD KDSSKLCVEF AKMETGLEQL DRARAIFTYG AQMADPRRLP EYWKTWNEFE IAHGNEETFR EMLRVKRSVE AAFST
|
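The sequence fields above are printed in space-separated blocks of ten characters. A minimal sketch of how such a field could be cleaned, checked for GC content, and translated is given below. It assumes the standard genetic code, since the Translation table field in the record is blank; the helper names `clean`, `gc_fraction`, and `translate` are hypothetical and chosen only for illustration.

```python
# Standard genetic code in TCAG order (an assumption; the record does not
# state which translation table applies).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block-separating whitespace and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame 1; unknown codons become 'X', trailing bases are dropped."""
    seq = clean(seq)
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE.get(seq[i:i + 3], "X") for i in range(0, usable, 3))


# First three blocks of the gene sequence field above.
first_blocks = "GAAGACGAGA CTCGTCAACA CCCGTACGAT"
print(translate(first_blocks))              # EDETRQHPYD
print(round(gc_fraction(first_blocks), 3))  # 0.533
```

The translated prefix matches the first block of the protein sequence field. Because the gene is marked as spliced, translating the full gene sequence in a single frame is not expected to reproduce the whole protein; the intervening non-coding segments would have to be removed first.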