Gene Information |
Locus tag | PHATRDRAFT_48837 |
Symbol | |
ID | 7195136 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011687 |
Strand | + |
Start bp | 407891 |
End bp | 409357 |
Gene Length | 1467 bp |
Protein Length | 488 aa |
Translation table | |
GC content | 55% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002183354 |
Protein GI | 219126208 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
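The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS includes a single terminal stop codon; neither is stated in the record, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length(407891, 409357)
print(length, protein_length(length))  # prints "1467 488"
```

Under these assumptions the result agrees with the Gene Length (1467 bp) and Protein Length (488 aa) fields.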
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.0535878 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACTGTAT CGTCTTTGGG GGGAACGTTG GAATTTGACT TACTGTTAGG AGTATTCTGG AAGCGACATA GTCGTTGTCG GACCATGGTA TTGCCGGAAC GGCGTCAGCA TCCAATACGG CGACGAAGTC TCCTTCGAGC AACGCATTTC CTGTGTCATC CCTTCCGGAT TATATTCACT GCTCTCTGGT GTTTTTACGT CTTTTACTGC TTCCGTTATG GCGATGTCAG CAACGGTAAC GGCGCCTACG CCCCTCCAAT TCACGAGAAT CAAGGCGGGA ACCATACCGG AGCAAGCAAT CCACACTATT CCAGTACTGT CATCGACGGT AAACGGAATG ATTTCGTCTC ACCGCAGTAT GATAGATCCG TCGAGACGGT GGCCTTCAAC ACCACACAAC AGTACATTAA GTACAACAAA CGTGGTCGAG CCGTGGTCAA GGCCATTACG CGGGTGCGCC TGGAAGATCT TACCGATTTG CCGGAATCGC TCTTGGCGAA AGCGAATGCG ACCAGTCCCG CTCTCCCAGA TCCCAATCGC GCGGACAAGG AACCGATTCT GCGCATGCTC CGCCAAGCCG GAATCACCGA CGTGGATCCC CGCGTGATAA CGCTCTTGCC CTCGTGGCGG GACGTGACGG ACTTGTACGG CGATGACGTC CCCATTGTTG GTCTGGATGA GGCGTCTTGT CGGTGGTTCC GTGACACTGT CCCCCTCCGT GACGCATACC TCGGCGCCGC CGGACTCTTC AACACGGGAA CCAACGCGTT GACGTACTAC CTCCGCGCGA ACCTTCTCCT ACCCTTCCGG GGCGAGGCAC CGATTCATCG GGACGGGAAG GATTCCAATC GACAAGGAAT TCTGACGCAA GTGCCCTGGG ACAAACACTG GTTTGCTCGA CTGCGGAATC ATCATACGGT GGATCTGTAC GCCAATGTGA CGAAAGCACA CGTTTTGCCG ATTGTCATGA TTCGGGACCC GCTCTCCTGG AGTCAGTCCA TGTGTCAGCA ACCGTATCTG GTCCGCTGGC GAGGTCGGAC GGTACACTGT CCCGATTGGC GAGAACCCGT GCTCATTCCA CACATGACTG GTGGACACCA TTGGGACAGT CTCTGGCATC TCTGGAACGA CTGGTACCGG GATTATTGGC AACACGAAAA TCCCCGACTC ATCGTTCGGT TCGAAGACTT GTTGTGGCGA CCACAGCAAG TCCTACGGGC AATTCAATCT TGTGTGGGGG CGACGTGGAC TACTCCCGGT ACTTTTTACT ACGTCGTCGA CCGGAGCAAA TGGGAACACG TCAAAACATT CCGGGCTCAG TCCAATATGG TGTCAGCCAT GATCAAGCAC GGCACGCCAT CGCAGCGCGT CCGGAATCTG TCTTTGGAGG AATTGGAACA AGCCAGGCAG ATTCTGGATC CACAGATCAT GGAATTGTTT GGCTATTCGG TGCCAAAACC AAGTTGA
|
Protein sequence | MTVSSLGGTL EFDLLLGVFW KRHSRCRTMV LPERRQHPIR RRSLLRATHF LCHPFRIIFT ALWCFYVFYC FRYGDVSNGN GAYAPPIHEN QGGNHTGASN PHYSSTVIDG KRNDFVSPQY DRSVETVAFN TTQQYIKYNK RGRAVVKAIT RVRLEDLTDL PESLLAKANA TSPALPDPNR ADKEPILRML RQAGITDVDP RVITLLPSWR DVTDLYGDDV PIVGLDEASC RWFRDTVPLR DAYLGAAGLF NTGTNALTYY LRANLLLPFR GEAPIHRDGK DSNRQGILTQ VPWDKHWFAR LRNHHTVDLY ANVTKAHVLP IVMIRDPLSW SQSMCQQPYL VRWRGRTVHC PDWREPVLIP HMTGGHHWDS LWHLWNDWYR DYWQHENPRL IVRFEDLLWR PQQVLRAIQS CVGATWTTPG TFYYVVDRSK WEHVKTFRAQ SNMVSAMIKH GTPSQRVRNL SLEELEQARQ ILDPQIMELF GYSVPKPS
|
| |
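The sequences above are displayed in space-separated blocks of 10 characters, so they need cleaning before use. The following is a minimal sketch of stripping that spacing, computing a GC fraction, and translating a coding sequence. It assumes the standard genetic code (NCBI translation table 1), because the Translation table field in this record is empty; the helper names are hypothetical.

```python
# Standard genetic code (NCBI translation table 1), with codons ordered
# by base in T, C, A, G order. Using this table is an assumption.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def clean_sequence(text: str) -> str:
    """Remove the display spacing and line breaks; return upper case."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate a cleaned CDS, stopping at the first stop codon.

    Raises ValueError if a codon contains a character other than T, C, A, G.
    """
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        index = (16 * BASES.index(codon[0])
                 + 4 * BASES.index(codon[1])
                 + BASES.index(codon[2]))
        residue = AMINO_ACIDS[index]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three 10-base blocks of the Gene sequence field.
gene_start = clean_sequence("ATGACTGTAT CGTCTTTGGG GGGAACGTTG")
print(translate(gene_start))  # prints "MTVSSLGGTL"
```

The translated fragment matches the first ten residues of the Protein sequence field, which is consistent with the standard code for this stretch. Applying `gc_fraction` to the full cleaned gene sequence gives a value that can be compared with the GC content field.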