Gene Information |
Locus tag | PHATRDRAFT_44468 |
Symbol | |
ID | 7197700 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011672 |
Strand | - |
Start bp | 655320 |
End bp | 658190 |
Gene Length | 2871 bp |
Protein Length | 932 aa |
Translation table | |
GC content | 47% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002178550 |
Protein GI | 219115509 |
COG category | |
COG ID | |
TIGRFAM ID | |
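The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below uses only values from this record. It assumes the Start/End coordinates are inclusive (which is consistent with the listed Gene Length) and that the coding sequence ends in a single stop codon. The variable names are illustrative.

```python
# Values copied from the Gene Information table.
start_bp = 655320
end_bp = 658190
protein_length_aa = 932

# Assumption: coordinates are inclusive, so the span is end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# Assumption: one stop codon follows the 932 coding codons.
coding_bp = (protein_length_aa + 1) * 3

# Bases in the gene span that are not accounted for by the coding sequence
# (the record marks the gene as spliced, so some of these are intronic).
non_coding_bp = gene_length_bp - coding_bp

print(gene_length_bp)  # 2871
print(coding_bp)       # 2799
print(non_coding_bp)   # 72
```

Under these assumptions the span matches the listed Gene Length of 2871 bp, and 72 bp of the span lie outside the coding sequence.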
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.00222146 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCGTACG ATTACGATGC ACTCTGTCGA GAGTTTGCAC AGCACCGAAA GGCCTCAATG GTGACGGAGA ACGTGATAGA GCGCTCGGGC CATTTGCAAA AGGCAGTCCA GGCCTTGGAA ACTCTTTCAT CTTGCTGTCC CATGACACCT GCGCTTTGGA TACAATATGC CAGCACTGCT GCGGAATGGA TTTCACAAGC GTTGTTACGA CAGGAGGATT CGTGTGACGC AGATTCGAGA ATCAATAAAG AATCTCTCCA AACGCGTTTA CAGACTCTGG AATTGGGCTT ACAAGAATTT CCCGGATACG TTTTGTTGCA TTTACATTAT ATAGAGCTCT TGATGCACAA AAACGCATGT ACTGATGCGC CCAAGATCGA ATCGGCGCTG CGGACGGCGA TTGCACAAGT AGGAGGGGGT TCTCACCGGA ACGAGGGTAG CTGGGTCGTG CAGCTCTACA ATCATCTTGC GACCTTCTTG GTCAAGCAAA ATCGAGTGAA AGAGGCGTTG CAGTGTTTTG TACAACGCGC CCGAATTCCC ATGAAAGATG TGAACGATGA GATTGCCAGT GACTACAGAG GATTTTGCGA GAACCACGGC CTTACTCCTT CTACCAAGCA CTTGGAACAA ATGGAGCAGG GCCGGCGACT AGAAGCCAAA CTTTTTAATC GGTATATAAC GCTGGAGGAC GAAATTGATG CAGTCATGCA TAGCCAGGGG ATTCTTCCGA GGTATGATGT TGGTGTAGAC AAACTCGACT GGAAGATTAT GCTTCACACG GATCGCTACG GAATGGGATT GGGAGGGGCA GACGTGGCAA CCGCATTTGT CAAATATGCT CTCGAGTGTT CAAATATCTT CAAAAGCGCC GCTCGGCAAG TAGACGAGGA CGACCAGGAC CTAGAAATAG AAGAAATGCG TCGCCATATT TCTGGCCTGG CCTCGGCAGT ATTTGAGCGC GGTGTGGCCG AATGTCCGAC CGTCGACTCT TTGTGGCTTT CCTATATTCG CTTCTTGACG GAACAAGACA ACGGAGATTC TCTGTCCCTG CTACCAAGCG TTTCTCAACG GGCCGTACGC AATTGTCCAT ACAGCCAGGC TTTGGCTTGT CAACAAATGG ACAACGTATT ACTCTTGGCT GACAAGGGTC TTATTGTTTT TGATCCCGAT GCACTAATGG AACAGGTGCA GACAGCACTA GACACAAAAT TTCTTCCAAA TCCAGTACAG TTTTTGGAGC TCTACCTTTG TGTTATTCGT ATCGTCAAGC GGCGCATGCT AAGTATCTTG TCTGGTGTGG CTGTAACTGA ACAAAGTGGC AAGGCGGTAC TACGGTATGA CGAGGCGGAG CCTATCCTAA AGTCAAGCAA TGCGAGATCT CCAAAACGTG ACGGAAAGAT AGACGGGGTG CTACAAGAAG TCCAGGACTT ATGCGAGGAC TTGACAGACA TGTACGAACA TATCGAACAA AAAATGAGGA AAGTTCTGGG AAAATGGTCA GAAGGCCGAT CACTCTTGTG GTTGGAGCGA GCATATACGG AAAAATATTT CTTGAACCCG CTGCGTCGCA TTTTTGAAGG CTCTGGTGAT TCCTCCAGAT CTACAGAAGA CCTAGAAATG CTGCTATTGT GCGAGAAGCC AGTTCGGTCT CATAGTCCGC CCCATCCTGA CTTGTACTTG CGCTACATCG AACAATACTT GTTGAGCTAT CCTGTTGTGA ACGCGACTGA TGTCCTTCAT CGTTTGAGAC GTACTCGTTG GCTGTACCAA AAGGCCATTG TGGGGGTCGG AAGGAGCAAG 
GAATCAAAGC CCGTTCCTTC CTTGGTGATA CCAGATTTCG ATAGCGCGTT CGCACATTTA AGTCATCACT GGCTAGAGTT TGAACAGATG TTCGGGTCTC GAAGTTCGGT GGCTGAAGCG CACAAGGCAA TTGCACGAAA GATGCACAAA CTTGGTGAGA ACGTTTCACA TCCTTCATCG GACCCCTTGC GAGAAAGGAG GAGTGATGCT CCATCTTCTA TGAATGTGCA TATGGTGCAA GGTCCCGATC GAAAACGAAA AGTACATATT GATTCTGATG ATTTTGAAGG GAAAAGAATT CGTACAAAAA CAGATTGGGT AGATCGAGAC ATTACGACTG ATGATGACCG CTATGTCGAT CCGGGGCAAT CAAAAAATGC TAAAAAGCCA AACCAGGATT TTAAATATCA TCCTTTTTCT GTCCGTGTAT CTGGATTAAG TGAGAAAACA GACGATATGG ATCTGGTGGA TGTTTTCCGA CCTAAGTGTG GTGAAGTAGT TCATGCGAGA ATCATCCGAG AGAAGGAAAT TCGTCACAGC TTGAAGGGAA AATCAAAAGG ATGGGGATTG ATACAGTTTG AAGATAGAGA ATCGACCGAG AAAGCCCTGG CATTGGACGG TATTATCGGT ATACACGAAA AACTTGTTGT GATTGAGCGA TCTTACATGC CTGCGGTAAT GATTGTGCCG CCCGGAATGC ACAGAGTTCA ACCCAAGGGG GAGGGAAAGA GCTCAAAAAT AAACGAAAGG CGCAAGGAAC GAGAGCAAAA AACGACGCGA TCCACAAAGT CAGGATCTGG CCCTGTATTA GGCCCCTTAT CGGAAGATTC CTTCAATCCT CTGCAATTTC GACCCCGCGG CATTCAGGCA AAACCACGGT CGAAAGTAAG TGTAGACTTG CAATAGCAAG GCAAGTGCGC AATAGGTCAC TATCACTTCG ACTTGGGTAA AGCCTCTGCC ACAGTGGCGT TATTTCCGAC ACACCACTGT CGCCAGAATA TTCTTTGAAG TGGATACTAC CATTACATTG AACTTTTAAA CAAGCTTATC TGCAAAGTCT TTGCAATAGT C
|
Protein sequence | MSYDYDALCR EFAQHRKASM VTENVIERSG HLQKAVQALE TLSSCCPMTP ALWIQYASTA AEWISQALLR QEDSCDADSR INKESLQTRL QTLELGLQEF PGYVLLHLHY IELLMHKNAC TDAPKIESAL RTAIAQVGGG SHRNEGSWVV QLYNHLATFL VKQNRVKEAL QCFVQRARIP MKDVNDEIAS DYRGFCENHG LTPSTKHLEQ MEQGRRLEAK LFNRYITLED EIDAVMHSQG ILPRYDVGVD KLDWKIMLHT DRYGMGLGGA DVATAFVKYA LECSNIFKSA ARQVDEDDQD LEIEEMRRHI SGLASAVFER GVAECPTVDS LWLSYIRFLT EQDNGDSLSL LPSVSQRAVR NCPYSQALAC QQMDNVLLLA DKGLIVFDPD ALMEQVQTAL DTKFLPNPVQ FLELYLCVIR IVKRRMLSIL SGVAVTEQSG KAVLRYDEAE PILKSSNARS PKRDGKIDGV LQEVQDLCED LTDMYEHIEQ KMRKVLGKWS EGRSLLWLER AYTEKYFLNP LRRIFEGSGD SSRSTEDLEM LLLCEKPVRS HSPPHPDLYL RYIEQYLLSY PVVNATDVLH RLRRTRWLYQ KAIVGVGRSK ESKPVPSLVI PDFDSAFAHL SHHWLEFEQM FGSRSSVAEA HKAIARKMHK LGENVSHPSS DPLRERRSDA PSSMNVHMVQ GPDRKRKVHI DSDDFEGKRI RTKTDWVDRD ITTDDDRYVD PGQSKNAKKP NQDFKYHPFS VRVSGLSEKT DDMDLVDVFR PKCGEVVHAR IIREKEIRHS LKGKSKGWGL IQFEDRESTE KALALDGIIG IHEKLVVIER SYMPAVMIVP PGMHRVQPKG EGKSSKINER RKEREQKTTR STKSGSGPVL GPLSEDSFNP LQFRPRGIQA KPRSKVTITS TWVKPLPQWR YFRHTTVARI FFEVDTTITL NF
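The sequences above are printed in space-separated blocks of ten characters. As a minimal sketch, assuming the GC content field is the fraction of G and C bases in the gene sequence, a helper like the following strips the block spacing before counting. The function name `gc_fraction` is hypothetical and the input shown is only the first two blocks of the gene sequence, not the full record.

```python
def gc_fraction(seq: str) -> float:
    """Return the fraction of G/C bases, ignoring whitespace between blocks."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for b in bases if b in "GC")
    return gc / len(bases)


# First two 10-base blocks of the gene sequence in this record.
first_blocks = "ATGTCGTACG ATTACGATGC"
print(round(gc_fraction(first_blocks), 2))  # 0.45
```

Running the same function over the full gene sequence would give a value to compare against the listed GC content.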
|
| |