Gene Information |
Locus tag | PHATRDRAFT_50103 |
Symbol | |
ID | 7198821 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011695 |
Strand | - |
Start bp | 47017 |
End bp | 48900 |
Gene Length | 1884 bp |
Protein Length | 627 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002185036 |
Protein GI | 219129733 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGATCGA GGAAGGCCTC TCGCTACGGC GTGGAAAACG CTCAATGGAC GCAAGCACCC CTGATCTGTC GGACAATCAT TTGTTTCTTA CTGATAAACC AGTCTAGAGC TATAGTGATC GAGGGCATCC CGCGAAGTTA CATAGGACCA TCTTGTCGTT CCAAGTACTT CTGCTGCCGG AACCCGGCGT TTCCGTTTAT GGCAGTATCC AGAGCACCCT GTTTACGGAA TCGGCGTCAA AGAACGAGGC GTTATGAGAG CTTTCGAGAC GATAGGGATA GTGCTTTGGT ATCCGCATCC GACCTTGTAT TTAATGGATC GACATCGGCT GCTTTAACAT GCTCACCCGA GGAGCAAACC AGCCGTACTT CACAATCTTC CGCCACTTTT TCCGAAGTCG ATGTACTGTA CGGAAGGCGA GCTGTGCTCG TGTATGATCC CTTACAAGAG CGCTACGTGA AAGTTTCCGA GAAGAACAGA GTAGCCGACA GCACCAAGCA AGAGTCGGTA GCTCTACGGG CACGCCGATC ATCTCTCGCT CGATTTATTA CCACCAAAAT ACTCCCCCGT CTTTCGCTCG CCTTCCTACC ATCAGGTGTC ACAAACGACT ATTATCGATT TGTTCGTTGG CGTATACTGC AGCGTTTCGT CAATGCCAAT CTGAACGTCT TTGGCACGCA GAGTCTGCTT TTGGCGCTGC GAATTAAGAG CTCGGCTTCG CAGCTCGGCG CCTTGTCCGC CGCTCTTAAC TGGGTCCTCA AAGACGCCTT GGGAAAGATT GTCCGGATGC TCTGGGCTTC CCGTATGGGA CGGAGGTTCG ACTCGGACGC TAAACGATGG CGGTTTCGTT CCAGTTTTGT CTTTGCTGCT GGCAATGGAC TCGAAATCAT CACCTACGTG TTCCCATCGC TCTTTCTACT GTGGGCAACG TTGGCAAACT GTTGCAAACA AATATCGATG CTCACGTCCA GCTCTACACG CACGTCAATC TACAACTCCT TTCGGGACGG ATCACGGGAA AACATTGGCG ATATTACTGC GAAAGGTGAA GCGCAAATTG CCATTGTCGA CCTATTGGGG ATCGCGAGCG GCGTAACCTT GTCCCGCACG GTTGGTACCT CAATTCGTGC TGTACTCGCC GTATACGTAA CACTACAAGC GATTGAGATT GTCTGTGTGT ATCACCAGTT GCGAGCGGTC ACCTATCGAG TTATGAATTT TGAACGAATG ATTTCCGTTG TGGCAGACTT CTGTCAAGCC CGCCAAGGAC CAAAAGACGG ATTAGAAGGA CTAGCCGCGT CGTGCACGAC GCCTACTCCC GCTGGAATTC CCACTCCACA GACATTGGCG TCGCAAGAAC GCATATTTTT GCCACCGAAA CATTTGACTC GTCGCGCCAT TGCCTTTGGG TCCATCGGCC GTGCCAGGCT CTCTCCCGAC GAGCTGGGAA CGCTTCTCGA AATTTTTAAG AGAGAGCGTT TCATTCTCGT TGTTGGAAAG AACGTCAAAC ATCCGAGACC ATTTATGGCG AAGACTGCAA AGCAGAATGA AGATCCGGTT TCGCGGATTC AAGAAAATTG CCATATTGTG CTGCACGAAG CAGCCACCAA TATGGATATT GTGAAGAGTA CACTTGCGTT GACGCTTTTA CGACGGAAGT TGGCCTTGTC AAAATTCGAT CCGTCTCAAG TGAGGTCGTC CAATTGTTTT GATATAATGA AGGTGACGCA AGAAGAAACA AACGATTTGT TCCCCTTACT GCTGCGAGAA ATGAATACGC AGGGGTGGGA GTCGCCGGCG 
CGATTCATGT TTGGAAGGGT GCACATGAGA GCTGACTGGC CTCTCACAGC AAGGTCCAAG GGAAGAACAA CATCTGCCAC ATAA
|
Protein sequence | MRSRKASRYG VENAQWTQAP LICRTIICFL LINQSRAIVI EGIPRSYIGP SCRSKYFCCR NPAFPFMAVS RAPCLRNRRQ RTRRYESFRD DRDSALVSAS DLVFNGSTSA ALTCSPEEQT SRTSQSSATF SEVDVLYGRR AVLVYDPLQE RYVKVSEKNR VADSTKQESV ALRARRSSLA RFITTKILPR LSLAFLPSGV TNDYYRFVRW RILQRFVNAN LNVFGTQSLL LALRIKSSAS QLGALSAALN WVLKDALGKI VRMLWASRMG RRFDSDAKRW RFRSSFVFAA GNGLEIITYV FPSLFLLWAT LANCCKQISM LTSSSTRTSI YNSFRDGSRE NIGDITAKGE AQIAIVDLLG IASGVTLSRT VGTSIRAVLA VYVTLQAIEI VCVYHQLRAV TYRVMNFERM ISVVADFCQA RQGPKDGLEG LAASCTTPTP AGIPTPQTLA SQERIFLPPK HLTRRAIAFG SIGRARLSPD ELGTLLEIFK RERFILVVGK NVKHPRPFMA KTAKQNEDPV SRIQENCHIV LHEAATNMDI VKSTLALTLL RRKLALSKFD PSQVRSSNCF DIMKVTQEET NDLFPLLLRE MNTQGWESPA RFMFGRVHMR ADWPLTARSK GRTTSAT
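
The coordinate and length fields in this record agree with one another, and a short sketch can check that. It assumes 1-based inclusive coordinates and that the CDS length includes the terminal stop codon (the gene sequence above ends in TAA). Both assumptions are inferred from the numbers in this record, not stated by it, and the function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming it ends in one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return cds_length_bp // 3 - 1


# Values copied from the Gene Information table above.
START_BP, END_BP = 47017, 48900
REPORTED_GENE_LENGTH = 1884
REPORTED_PROTEIN_LENGTH = 627

computed_gene_length = gene_length(START_BP, END_BP)
computed_protein_length = protein_length(computed_gene_length)

print(computed_gene_length == REPORTED_GENE_LENGTH)
print(computed_protein_length == REPORTED_PROTEIN_LENGTH)
```

The strand is not needed for the length check. The record lists the gene on the minus strand, which affects only how the sequence is read from the replicon, not how many bases it spans.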