Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_38545 |
Symbol | |
ID | 7203494 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011684 |
Strand | - |
Start bp | 357818 |
End bp | 359846 |
Gene Length | 2029 bp |
Protein Length | 592 aa |
Translation table | |
GC content | 53% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002182669 |
Protein GI | 219124770 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAAGGAG GTCCCGTGGT CGCCCTGGAT CGGATCTTGC ACGGTACGTG TCGACGTTAA TGTGGACTGT CAGTTTCGAA CGTTTCCAAA AATCAATCGT TGACAGTGAA CGGTTGCCTT TGGTGCCCTA TCAATCGGAG ACTCTGTATT ACGCGCCGTC GCGATTCTAT TCTGTGACGA GACTCAGTGA TAAACATTAC TTTCACACTC GCATCCTAAC CCTGCTCTCC CCGCTTGCTA CCGTTCTCTG GTGTGCTTTG AAACATTGGG ACGGAACGGG GTTTGCTCCA CAGACTTGGC GCACGCATTG ACGGTTTCCG AACGATGGCG AGCGCTGAGA CGATTCCGAG AGATACTGGA ACAGGATTTG AAACAGTCCA CGCATCAACA GTGTCGGGAA CGTGAGGTGG ATGTACTGCT CGTCTGGAAT GTTCAACTCG CACTCCAAAC GTCCGATCCC ACACTGTTAC CCGACTTGCT AGAATGTCTG CGTTCCCTCT GGGCGCACCC ACCACAACGT CACCAGCACC ACAACCACCA CCACCACGCA CGATATACCC AACCGGATCC GGAACTGGAG TCGTTGAGTC GTTTCCGTAA CAAGGTGGCT CCCGTAGCCG ATGCTAATGG TACCACCATT CAAACTCTTT TGACGTGTTG GAGGACTTAC CGCTACGATA CAGTCGTACT GGAGTCGCTC GTGCCCTGTT TTCGGGCTCT GGCCAAACAG GGTCGTTTCT CCTCGACCGC ACCTTCGTCC GGGGTGGGTC AATGGAAGGA AGTCGCCGAA CGATTGCACG CGGCGTTGGC GCGCGAAACA CGTCCCGAAA ACGCTCTCCG ACCGACCCTG TGGGGACTGC TGAAAGATGC CGTCTTTCGG GCCTTACCGG ACGAAAAGTG CGGTTGGTTC GGAACCTTTC AGGAACTGAC ATTGCGGGAA CTATTGCCCA AGGATACAAC AACCAACACC GACATTGGTA TGGATAGTAG CAATTCCGTA TCTAACGTTA CCATGGAGCA CGGGGTGGGA ATATTGTGGA GTTGGGCGGT AATCCCTCGC TTGGGACGCA CCTTGGCCGA ATCCACATCG CTCTGGATTA TTCTAGATAT GCTACACAAT GATCGTTCAC CCCGATCGCG GATTCTGGCA ACCCGTCAAA AGGCAACGGC AACTCTCGGA ACCATTCTTA CACACTTTTC CAGCACAAAC GATGTTAATG ATGAAAATCC TGTTCCAGAA ACTATGACCA AACAGACATG GATAGCGTCC CGCTTGTTGA CTTTACTAGA GACAGAATCG GATCCGGATT GGAGAAGGCG CTGTCTACGG ACTTTGCGAT GTCTCTGCTC TTGCTCATGG GGACGACGGA TGCTCGACAA TGCATGTACA CATCTTTTGC TACCAACACT GCTTCGCATT GTACGGGACG ATACCGCCGA TTCTCCCGAC ACCCGGGTAC AGGCCTGTCA ATGCCTCTCG ACTATGCTGC CCTTACCACC GCCTTCGTGG ATCGATTCGC TGCCGTTGGT GGAAACTACT CTGATACAAA CAGCGTCCAA AAAATCCACA CCGGACAAAG TCGTATTGGC GGCCTCACAA GCGCTCGCGG CTAGCCTGCA ATACAGTCCC TGGAAACGCA GTGCCTCCTG TTTTTCAACA AGTTTTCACG AGCGTGTAAG AGATGTTCTA CAAGACAACG AAGCACGGGA GACGTACCAT GTCGCCTTCG GTGAGCTCTT GCACCAGATC GTACTCGAAC ACGAACATCA AACCAGCGTT CCAGATCTGC TGTCATCCAC TACCATTCTC GATATCATTA CGCTTCTGTT GACTCCGCTC GGTCCCGACT TTGGCTTGAG CCGTGACCAC GCCTTGCGGG TAGTGGAGAT TCTGTCGAGG CACGAAAAGA AGAAGCTCGC CGATCACGAA TCACTACTGG GTGCCCTCGT CAATCTCTGC CTCGTCACGA GTGGGGAGTT GAAAGTCCAG GTAAAAACCA TTGTACTCGA TCTCGTCCCC GAGCTGTAG
|
Protein sequence | MEGGPVVALD RILHDLAHAL TVSERWRALR RFREILEQDL KQSTHQQCRE REVDVLLVWN VQLALQTSDP TLLPDLLECL RSLWAHPPQR HQHHNHHHHA RYTQPDPELE SLSRFRNKVA PVADANGTTI QTLLTCWRTY RYDTVVLESL VPCFRALAKQ GRFSSTAPSS GVGQWKEVAE RLHAALARET RPENALRPTL WGLLKDAVFR ALPDEKCGWF GTFQELTLRE LLPKDTTTNT DIGMDSSNSV SNVTMEHGVG ILWSWAVIPR LGRTLAESTS LWIILDMLHN DRSPRSRILA TRQKATATLG TILTHFSSTN DVNDENPVPE TMTKQTWIAS RLLTLLETES DPDWRRRCLR TLRCLCSCSW GRRMLDNACT HLLLPTLLRI VRDDTADSPD TRVQACQCLS TMLPLPPPSW IDSLPLVETT LIQTASKKST PDKVVLAASQ ALAASLQYSP WKRSASCFST SFHERVRDVL QDNEARETYH VAFGELLHQI VLEHEHQTSV PDLLSSTTIL DIITLLLTPL GPDFGLSRDH ALRVVEILSR HEKKKLADHE SLLGALVNLC LVTSGELKVQ VKTIVLDLVP EL
|
| |