Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_44505 |
Symbol | |
ID | 7197773 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011672 |
Strand | - |
Start bp | 760129 |
End bp | 761961 |
Gene Length | 1833 bp |
Protein Length | 521 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002178571 |
Protein GI | 219115551 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CGGCCCGGTT CCGGCTAGAA CGCAACGTGA ATATTGGAGG CACTAGACTT TGACTTTGAA CAGAATCAAC TGTTTAACTC TCGTATGTCT ACGTTCGACA TCTTTCGGAT TGGCGTCTGA TTAGAAGCTA CAATGCCGTA CTCAGTATCG CTTTGTTCAT TCGGACAGCA TACTCCTAAG ATCCAACTCG TTGCGGATAC GTTTGACGAG AATGCCGCCA TCGATACAGT CTCGCAATGT CCAACAACTT GCCGGGCTTT GGAGGAAGCA TTGGAAGATT TTCTGGACAA GGACGAATCC GCTTCCCCCT GCCTCCGGTA CGTCGCCCGC AAAGATGCAG GGTGGCTGTC CTGTCGCAAA CCGTATCATC GCCCCCACTC AAGTGTTCTA GGAATTCTAG AAGAATTGGA AGATCCTACA GCTGACGTAA CGATTCGATT GAATTGTTAC CGCGGACGAG AAAGCATAGT GGCCTCTCAC GATTGCGATT GTCCGGAACC TCACAATAGT TGGGAACGCA CGGAACTGCA TCTACCGGAA TCGACGGTCG AATCCAAACG GACGCAAATG AGCCTGGACT GGATTGACAT TCGCGATGTT ACGAACAGTG CACAATTCCT GTCAACGCCC TCTAGCTGTG TGCATTTTTT GAGGTCGCTT TTGGGTTGTG ATCATCGAGC GTCACCGTGG ATGGCGAGAA GAGGGCGGAA GCGCAAAGAA GACGCTGTCG GAACAGTCAT GTTAGTGTGT CGCAGCGAGC CCCCTAGAGT ATTGATTGAC TTCCGTGTGA CTGATGCCTG TCCCTCATGG TATCTGCATG GAATGGGGCG AATCAATCTA CCTATTCGCC AATCTGTATG GTGCAAACGT ACTACAAACA AAGCTGAAAT GGCGACACTT TTGATATATC GCGTATTGCC CCCTCGCGAT CAGATGAGCG TACTGCATCC AATAACCGGT GTTCCAGCTA CGGTAGGCTG TTTATGGGAA ATTGCAACTA TGCATCCAGC GACAACAGAC GAAATGGTTC CTCCGGAGCC CTTTCGGGAA ATTCGTATGG TTTCGCCACC ATATATCGCC GCAAGCGAAT ACCCCGCTGA TTTGTTAGGT CCCTTACTAA GCGAGCATTC CCTACAAGTG CTACAAATCG AAGCGAAAGC TATCGCACAC TGGACGGCGT GGCCAGAAGC TCAGCATTAC CAAGCCAAGA ATGATTCGCC TCTAGCGCCC TGGAATGTTT TCCCCTTATG TTACTGCTTC CCTGCTACCA ATGTGCAAGC TAGACAATGG ATTCACCAGA CATGTGCGTT GGTGCCATTG ACAGTGGAAC TTCTTCGGAG ACATCTAGGC GATGCCTTAC GAACGGCGTT GTTTTCCCGG CTCGATTCTG AATCAGTCTT GGAACCACAC ACTGGTTGGG AAGACCTTGC GAATCACGTA TACCGTGTGC ACATTCCTCT GATCGTCCCA GATGGAGATC TTTGTGGTGC CTGGGTTGAT GGCTGCGTCG AAACTCACCG ACGTGGGCGC CCCTTAATAT TTGATGATTC CAAAACTCAT CGAGCCTTTA ACTATAGCTC GCAAGAGAGA ATTGTCCTAA TCATTGATTT GGCCCGTCCT TGCACTTTTC CTGATGGCAC TGCTAGGGGC GGACATTCCG AAGAGCTTGA CAAGTTTATC GAACAAATGG GAGTTTAGGG TCTACAAAGC GAGTCGATAT CGCTTTGTGG TTTATCTGTT CACAACAACT AGCGCCCTAG AAAGTTGTAG TGTAGCAAAC AGTTCAAAAT AATTGTTTGG AAGCAATAGA TAGGTCGAAC GGTTCCCCCA TGT
|
Protein sequence | MPYSVSLCSF GQHTPKIQLV ADTFDENAAI DTVSQCPTTC RALEEALEDF LDKDESASPC LRYVARKDAG WLSCRKPYHR PHSSVLGILE ELEDPTADVT IRLNCYRGRE SIVASHDCDC PEPHNSWERT ELHLPESTVE SKRTQMSLDW IDIRDVTNSA QFLSTPSSCV HFLRSLLGCD HRASPWMARR GRKRKEDAVG TVMLVCRSEP PRVLIDFRVT DACPSWYLHG MGRINLPIRQ SVWCKRTTNK AEMATLLIYR VLPPRDQMSV LHPITGVPAT VGCLWEIATM HPATTDEMVP PEPFREIRMV SPPYIAASEY PADLLGPLLS EHSLQVLQIE AKAIAHWTAW PEAQHYQAKN DSPLAPWNVF PLCYCFPATN VQARQWIHQT CALVPLTVEL LRRHLGDALR TALFSRLDSE SVLEPHTGWE DLANHVYRVH IPLIVPDGDL CGAWVDGCVE THRRGRPLIF DDSKTHRAFN YSSQERIVLI IDLARPCTFP DGTARGGHSE ELDKFIEQMG V
|
| |