Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATR_25666 |
Symbol | |
ID | 7204368 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011671 |
Strand | - |
Start bp | 466862 |
End bp | 469929 |
Gene Length | 3068 bp |
Protein Length | 885 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002186352 |
Protein GI | 219113537 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CGATGATCTT GCTGATGAAA TGCCTTGAGC ATTACTTTTC GGCATGAAAT TTAACTACAA ACTGAAACGT CTCTGTGGCG CATACTACGG CCATCCTTTG ACTAACGAAA CTACTGGAGG GACACGATGG AGTGGATCCA ACGTTGTCTA CGATTCCAGT GGTGACCTCC TTATATCCTC TGTTTCCAAC CGCATCCAAG TTTTGGATTT GAGGACACAC ACAGTCCGTA CATTGCCTGT CGAATCCCGA TCGAATGTTC GCTGTCTCGC CCTTTCACCC GACGATGCTA TTTTGATCGT CGTTGATGTC AAGAACTACG CACTCATTGT CAACTTTCGA AGAGGGATTG TTCTCCACCG ATTTCAGTTC AAACGTAAGG TAAGGCAAGT TCTGTTCAGC CCAGATGGAA ACTATATCGC AGTTACACAC GGCAAACAGA TTCAAGTTTG GTGCGCACCG TCGCAACTTC AAAAAGAGTT TGCCCCCCTT GTACTGCACC GCACATACAC TGGACAGGCC GATGACGTTA TGTGCCTGAG CTGGTCCTCA GATTGCTCCG TGATAGCAGC TGGTAGCAGG GACTGCTCCG TTCGAATTTG GAGCTTGCAC ACAACGCGAA ATTTCGAACC CGTGACACTC TCTGGACACA AACGCGCGAT AGTAGGCGTA TACTTGATAG GGAACCATAG CGGCCGGGTG GAAACCTGTT ACAGTGTGAG CGAAGACGGA ACGCTTGTAT CGTGGGAATG CAAATTGAAG GAAGGGGAAT GGGATGTTGA TCACCAGAAT CGAGAAACAC CCCCGGAGGA TTCAACGGAC GATGCAGTCG ATTTTTTCAC AGGCGCATTT CCTAGGCGGG CGTCGGAACG TACTGGGAAG TCTCAGGCTC ATGATTTAGT ACAGTCTTTT TGGTCTGTCA AGTCGAGACA TTATTTCCAC CAAGATGGTG CAGATGTTAG CTCTGTCACA TACTGTGAAC GTGGTCAGCT ACTGGCGGTC GGATTTTCTT CGGGTCTCTT CGGGCTTTAC GAAATGCCTT CCGTTTCCAA TATTCATACC CTTTCTGTTG GAAACAACCA ACTAGTAAAA ACGTGTGTCC TTAACAAGAC CGGCGACTGG CTCGCTTTAG CCTGCCCTCA TTCACAGCAA TTGTTCGTTT GGGAATGGCG TTCCGAGACA TACGTCTTGA AACAGCGAGG TCATGCGTAC GGCATGCGAT GCATGGCCTA CTCGCCAGAT GGTGTAATTG TTTGTACGGG AGGCGACGAT GGCAAGCTTA AGCTATGGAA TGCTACATCT GGATTTTGTT ATGTGACAAT GGAGAAAAGT CATACAGCCC CGATAACGGC CGTCGCATTT GCCAATGCTA GCGTTGTTCT GTCCTCCAGT TTGGATGGCA CCGTTCGAGC CCATGATCTC TATCGATACC GCACTTTCAA GACTTTTACC ACACCAACTC CCGTACAGTT CTTGAGTCTC GCAGTAGATC CAAGCGGTGA GATTGTAACT GCGGGTAGTA CAGATCCTTT TCATATTTAT GCTTGGAATC TCCAAACCGG TAAGCTTCTC GATATATTGA CTGGTCATAG GGGACCGGTC TCGGATTTGT CTTTTCAAGG GAATGGCGGT ATTCTAGTTA GCGGTTCTTG GGATGGTACT GTGAAACTCT GGGATCTTTA CAAGGGGAAT GTTCCAACTG AAAGTTTGCA ACACACGGCA GATGTGGTAT GCGTGACCTT TCGGCCGGAC AGTAGAGAAG TGTGCACGGG TACAATGGGC GGTATTCTAA GTTTTTGGGA TGTCGACAGT GCCAAGTTAA AATTTGAAAT CGACGGTCGG AGAGACATAG CTGGTGGTCG TAAGATTAAT GACCGAATGA CAGCCGACAA CAACGCTTCT TCTCGATATT TCACATCTGT TTGTTACTCA GCAGACGGAT CCTGCATCCT TGCCGGGGGC AACTCCAAAT ATGTCTGCAT CTACGAGGTC TCGCAGCAAA TGCTATTGAA AAAATTTCAA GTCACTTTTA ATCGAAGTTT GGACGGAGTT TTGGACGAGG TACGTTCAAT GATCTCGGCC ATAGTCAGCC TTGGCTCTTT AACGCTATGA TGATGAACTT CCCATAAACA TTCGCAGTAT GCAGCTGACG GGTTAGACCC ATTTGGAGCT TTTAACTTCC TTGGATGTCT GAGCGTTGAT TTTGCTCGTT GTCTCTGAAC ATGTTCGTTA TGCAAATCTA ACATATCCCT TTGGTACTGT CCTTGCAGCT CAACTCCAAA AATCTGGGTC CCGGAGGACC GATTGATGAT CACGCCGATT CGGGGGATGA TACGATGTAC AATGCCTTGC AATTGCCGGG TGCTCGTCGA GGTGACGATG GCTCGCGTAG TTCTAGGGTA GAAGTACTGA CGCTACAAGT CGCGTTTTCA TCTACTGGCC GAGAATGGGC CACTATTTCG GGAGAGGGGC TTCATGTCTA CTCGTTGGAC GAAGATATGA TATTTGACCC GATCTCTCTC ACCGAAACCA TCACCCCCGC TGCGGTGGAA TCTAAGCTAA GCACGGGAGA CTACATCATG GCGCTGCGTA TGGCCCTTCA TTTGAATGAG TTTGCCCTTG TAAAGAACGT GCTAGAGTCG ACACCGTTTG ACTCGATTGC TCATGTTGTT CGATCCATCG GTCCTGAACA TTTGGAACGA GTCCTGCAAT ATGTGGCAAA AGTGATGGCG GACTCTCCGC ACATTGAGTT CTATCTGCAT TGGTGCCTGG AACTGCTACG TACACACGGG ATTCACATGG ATAGGAATCG TGGCAATTTT ATGCGAGCCT TCCGTGCAAC GCACAAGTGT GTTCAAACGA AATATGACGA ACTCCGGGCC ATATGCCAAG AAAACAAGTA TAATCTCGAC TTCCTTCAAG ATCACGCTCG TCTACTTCTC ACTCATGAGG AAAGTGAGAA GATGAAGGTG GAAGACTATC GGGAGTAACT AGTGCTGTTG CCGATTGTTT GTTCTGGGCA CTCCGTTAAT GTAAAGCAGA TATAATACCG AGAATTTTGT TTACAGTT
|
Protein sequence | MKFNYKLKRL CGAYYGHPLT NETTGGTRWS GSNVVYDSSG DLLISSVSNR IQVLDLRTHT VRTLPVESRS NVRCLALSPD DAILIVVDVK NYALIVNFRR GIVLHRFQFK RKVRQVLFSP DGNYIAVTHG KQIQVWCAPS QLQKEFAPLV LHRTYTGQAD DVMCLSWSSD CSVIAAGSRD CSVRIWSLHT TRNFEPVTLS GHKRAIVGVY LIGNHSGRVE TCYSVSEDGT LVSWECKLKE GEWDVDHQNR ETPPEDSTDD ASFWSVKSRH YFHQDGADVS SVTYCERGQL LAVGFSSGLF GLYEMPSVSN IHTLSVGNNQ LVKTCVLNKT GDWLALACPH SQQLFVWEWR SETYVLKQRG HAYGMRCMAY SPDGVIVCTG GDDGKLKLWN ATSGFCYVTM EKSHTAPITA VAFANASVVL SSSLDGTVRA HDLYRYRTFK TFTTPTPVQF LSLAVDPSGE IVTAGSTDPF HIYAWNLQTG KLLDILTGHR GPVSDLSFQG NGGILVSGSW DGTVKLWDLY KGNVPTESLQ HTADVVCVTF RPDSREVCTG TMGGILSFWD VDSAKLKFEI DGRRDIAGGR KINDRMTADN NASSRYFTSV CYSADGSCIL AGGNSKYVCI YEVSQQMLLK KFQVTFNRSL DGVLDELNSK NLGPGGPIDD HADSGDDTMY NALQLPGARR GDDGSRSSRV EVLTLQVAFS STGREWATIS GEGLHVYSLD EDMIFDPISL TETITPAAVE SKLSTGDYIM ALRMALHLNE FALVKNVLES TPFDSIAHVV RSIGPEHLER VLQYVAKVMA DSPHIEFYLH WCLELLRTHG IHMDRNRGNF MRAFRATHKC VQTKYDELRA ICQENKYNLD FLQDHARLLL THEESEKMKV EDYRE
|
| |