Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_39006 |
Symbol | |
ID | 7194703 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011686 |
Strand | - |
Start bp | 109037 |
End bp | 111043 |
Gene Length | 2007 bp |
Protein Length | 668 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002183155 |
Protein GI | 219125789 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.517261 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCGTCCC CAAATGCGAT AGACCCGCAC CAGCCGACGC GTTCAGGTTC GTCCCAAGGG CTTCCGCAGT CTTCGAGATC CAGAGACCAA TCCATGCGCA TAAGTCCGTC GAATGGAGAC TCTGACTGCC AACAATGGCG TCCCTTCGCG CTCCCTTCGA ATCGAGATGC GAGTTACGTG CAAAAGAGTG CTGGCTCGCA AGTAACGCGT CTCGACGAAG TCAAAACAGT TTTGCCGACA AACTTTGCCA CTGCACCAAT CTTTCGTGGT GTACCGGCCA TTCAACCCGG AGAGAACTTT TGTTCCTCGA CAACGGGCCT GGCAAGTACA TCTCCTGAAG AATTAAAGCC CTCATCGGAC TGTGTTCGTA GAGGCGGAGC TACATCCGAA TCGACGTATT TGGACCATCT GGTTGATATG AGCAGTATCA ACAGTTCCTT CTACAACGAA GTTGGTGCTG GTACTAGTCC CAATACTGAT CCTAGTCCTG GTCTTCCAAT TGGATGCTCT CCAGTTTCAA CGAGACAGAG CGGCAAGGAA TCCCAGCCCT TCCTGCCGAC ATCTTTGCCC ATTTCGTTCC GTGATCTGGC CTCGGGAATA CCCGGGGAAG TCAAGTCCAA TTTTCTGACT GGCGAACCCT TACTTATGGA TGAGGATGAT TTGTCCATAT CTATCGAAGA TGAAACCAAT GCTCACTTCA ACGATACTAA TTCGGTGAAC TTGGACCCAA CAAAAAATAC AGCAGCAAAG AAAACGCCTT CGCGCACGTT ACGTCCCATT TCCACTTCTA GAAATTCGTC ATCCCTTAAG AAAGGATCGA CGATTAACGC ACGTTCCCGT AAACGCAAGA AAACACCCTC CGTCTCAACG ATCTCCCATC GGGACGAGAA TTTTTTGGGC AGGAAGCCCA AGTTTGTCAA AGACCCTGTG TTTTTGAACT CGGCGCCCAG CTCCAACTCT ATCTCAAGCA GGCAGAAAAG GGCGACTCTT GAGTTAGTAG GTCCCGACGC GCCCCAAATG CAAACTCTAT GGCGCATGTT AGCCTTTCTC GAAGGTAACC CCAATCCAAC ATTCTCGACT GTAGCTAGCG ATGTCCGGAA GGAGATTGCT CTCTGTACAA AGCGTCATCT GGATGGGGAG CACCGCTTCC GTCACTTACC CGGTGCCATC TTTGAGCGTC TCGTAGCCCA GTTCGGAGGT GATTGGTTTC GGGTGGTGTT TTTAGAATCT CAAACTCCCA ATCGTACGGT GAGAAAGAAG TCTGTCGAAC CTTTCGATGG TGCAGGTGCA CTTGCGAAGG TTGGTGTGCA ATCAAACCCT TGCAAGAGAG TCACGGACCA TCTCTCGAGC CATGTCCCGA AGAGCAGAGG CAGTCTTTCA ATCAATCCCA CAGATAGGCT TGTTGATACT CTGGTGATAA GCGAAGCAGA ACAAGGACGC ACAGAGACCT CTCCCTTGAA AACAGTCAAA GATGGCAGTG TTGATAAGAA AGAAGCCTTA GGTCGAGGCA GCACAGCAGG TGTCCAACTT ATCGAGGCAC CCTCAACCAT GACGAAGGAA TCGTCACCCA AAGCTCGCAC ATCGACAGTG ATGGAAGAAT CCGAGCATCC ATCACTTGCA AAGTATGAGA ATCCTGCGAA TGCTACGCTG CCAATCACAG GAAATACAAA GGCTAGCCCT AACATCGTCT TGGAAGGCTT GGTAGAAAAC GTATTGCGGG GGAATGTGGT TTTAGCGCCA GTGGGCCACA ACACACAAAT TCCAACGGCA CCCGAATTGT GCGTCGCGTA CGGTTACTTT CTCGGACAGC GCGGATCCTT GTCTCCAGCA CTTCCGTTGT CGCCATTGTG CAAGTCTCGG CCTCGCCACG AGGAACAATT ACTTTGCCAG GCTCGCACGG GATCGGCACT CGTCAGTGAC TTGTCCCGTC CGGAACGATA CACCTTTTGG AAAGCTTTGT GGACATTGCA ATCCGACGAA GCGACTGCTT TCGCAAACAC GGTGTAG
|
Protein sequence | MASPNAIDPH QPTRSGSSQG LPQSSRSRDQ SMRISPSNGD SDCQQWRPFA LPSNRDASYV QKSAGSQVTR LDEVKTVLPT NFATAPIFRG VPAIQPGENF CSSTTGLAST SPEELKPSSD CVRRGGATSE STYLDHLVDM SSINSSFYNE VGAGTSPNTD PSPGLPIGCS PVSTRQSGKE SQPFLPTSLP ISFRDLASGI PGEVKSNFLT GEPLLMDEDD LSISIEDETN AHFNDTNSVN LDPTKNTAAK KTPSRTLRPI STSRNSSSLK KGSTINARSR KRKKTPSVST ISHRDENFLG RKPKFVKDPV FLNSAPSSNS ISSRQKRATL ELVGPDAPQM QTLWRMLAFL EGNPNPTFST VASDVRKEIA LCTKRHLDGE HRFRHLPGAI FERLVAQFGG DWFRVVFLES QTPNRTVRKK SVEPFDGAGA LAKVGVQSNP CKRVTDHLSS HVPKSRGSLS INPTDRLVDT LVISEAEQGR TETSPLKTVK DGSVDKKEAL GRGSTAGVQL IEAPSTMTKE SSPKARTSTV MEESEHPSLA KYENPANATL PITGNTKASP NIVLEGLVEN VLRGNVVLAP VGHNTQIPTA PELCVAYGYF LGQRGSLSPA LPLSPLCKSR PRHEEQLLCQ ARTGSALVSD LSRPERYTFW KALWTLQSDE ATAFANTV
|
| |