Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_39864 |
Symbol | |
ID | 7195517 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011689 |
Strand | - |
Start bp | 225990 |
End bp | 229265 |
Gene Length | 3276 bp |
Protein Length | 966 aa |
Translation table | |
GC content | 54% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002183952 |
Protein GI | 219127458 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAGCAG CGAGTCACAC CGCAACTAGT CGGACTCCAC TGTACGCAAC GTCGTCAATG CATGATATAG CCACCGCGAC TGTGGAACCA ACTCTGCTGG CACAAGCGTG GACACCACAA TTGCTGCAAG CCGAGTCCGG GACGCCGACA CCTTCGGGAT CCACACTCGA CGACTCGTCC GCGCTTCGGG TAGTCATCGC ACGAGCCGCG GCACGCACGC TGCCTCTTTT GACGGACGCT TCCGCTGTCC CGTTCGTCTG TCGGTACCGG GTGGACTTGG TCAATCCGTT GACCACACGG CAAGTACACT TGCTACAAAC CTTGTCTTCC CGACACGCTA GTCTCCAGAG TGTACGAGCG AAAGTGCTCC GAGCCGCGGC GGTGGCACAC GACGGCAAAG GCAACGCCGA AAACGAAGCC TGGATCAGTA AAGTGCAAAC CAGTACGTCC AAGGCGGAAT TGGAAGATTG GTACGCTCCT TATAAGCCCC CGTCGAAAGG TTCCGTGCTG GATCGTATCC AAAACGACCA TCCCGAACTA ATTCCCCAAT TGGACGCCTT TTGGCACGGC GATCAAGATT CATTTTCGAT TCATCGTATT TTGAAGCAGC ATCCGAAAGA CGCCGTTTTG CACGTCTTGA GCACCAAGCT TATCGCCAAC GAACCATCAG TAGTGGAAAC CGTTCAAAAC GAATTGTGGA AACATGCCAA AATAAGGACG AAACCACCAG CTTCGCCGTC AAACGACCCC GCGGCCGACC AAAAGTACGT TGTATACCAC GACTTTACTG CCCCATTGTC CCGGTTACGG GATCATCAGA TACTGGCAAT TCGCCGGGGA GTCGAACAAA AACTACTGCA ATTGTCGTAC GAAGTGGACG GATCCAAAAT TGAAGCCTGT ATGCGATACG CCGTGCGGCG CCGATGGCCC CGCCACGGTG ACGCCCTCCC AGCCAATCTC TTGGACGAGG CCGTGCACGA AGCGTATACG CGTACGTTGC GCCGAAAACT ATTGACTCGA TTGTGGTCGA AAACTTGTCT GCCCCAAGCT CAGGCCCGGG CAAGTATTTG CCGAGAACAC CTCGCGTGCA CTCTTGGCGC CGCCTGCGAG TCGCGGCGGC GGTAGTAGTG GGAATAACGC CAGCTCTCGA GCTTCTCCAC CACTGTATCT ACTCAGTGTT GATCCAGGAT TTCAAGCAGG ATTGAAATGC GCTGTCCTGG ACGTTAATGG GCATGTTGCG CTACAACCGT TGACGACAGT CAAGTACTTG GGCAATGCCC GAACGACTGG TGTGCGAACG ATGAGTACGT TGCTACGAGA CGTTGCCGAT GCGACGCAAT CGAATACTGT GGTGGTCACA TTAGGGAACG GCCACGGTAC ACACGAAGCA CGTGATTTGT TACGGGAAGC CGCCGCTGTT GCTACTGAGA AATTAGAATT GGATATTCAA GTAGTCCACG AAGCTGGTGC CAGTGTGTGG AGTGTGACGG AAGCGGCACG AGAGGAATTT CCGAATGATC CCCCTTCCGC CATTGCGGCC GTTTCGATTG GTCGACGGTG GCAGAATCCA TTGCACGAGC TCATCAAGGT CCCGCCAGCT AGTTTGGGAT TGGGTATGTA CCAGCATGAC GTACCGCCCG CTGATTTGGA CGATGTACTG CATCGGACCA GTGTAGATGC CGCTGCCGCC GTTGGCGTGG ACGTGAATAC CTGTCGGGTC GAAATTTTGC GCAAGGTACC GGGATTGGCC AAACTAGCCG ATGCCATCAT GGCAGCGCGG CCTTTGGCAA CGAGGCAGGA TTTGCTGTCG AGGGTAACCG GTTTGGGTCC GAAAACATTC CAAGCCTGTG CCGGCTTCTT GCGGATTGTC GACGGACCAG AACCGTTGGA TGGAACCCTG GTACATCCGG AGTCCTACGA CACTGCACGA TGGCTACTAC AAACATTGTC GTGGGATCTG TTGACGGTAC CGACGAACCT CCCACCGCGC GCCGAATGGA AGTCCTGCTG GAAGGATGTA CTGAACGCTG GGTCGACGCA ATTTGGCGTT AGTCCGGAAC GCATGCTTTC CGTGTTGGAA AACCTGGTGG ATTCGTTAAT AAACGTAGAT CCTCGATTGC GCCAAGGCAA TGATACCGGG AGCCGTCGTG GTGCGTCTCC GCGGTGGGAC GGCTCCAACT GCGCACGATT ACCGGCGGGG CTGGCCCACG ATCTAGCGGC CTTGCAAGCG GCGTGTCCGG TCCGCAACGT CCAAGGGACG GTTCGAAATT TGGCCGATTT TGGTGCCTTT GTCGATTTTG GTGGACCCCA CGATGGCTTG TTGCACATTT CGCAAATGAT GGCGAATCGT GTCGCGTTGG ACACGCTCCT GATTGGACAA GGAATCGGGA TCGACATTGT CAGCGTCCAA GACCATAAAG TGTCGCTCGC TTTGGCCGGC GGTGCAGTAA AGAACGGCGC GAGCGTTCTT ATGCCACCGC AGACGGGTAC GGGAGTAGGG TCCCGTCGGA CCATACAAGG TCCGCGGATC ACAGTTCCAG TGGGTCCGAA GCCAACCGGT GGTGGCAAAC GTAGCGCTTC TACTAGCAAA AGTGTTCGAC CAACAAATAT TGGAAGGAAT ACGAAACGGA CCAAGCGATC GACGTGAGTC ATCTAATCAT AGATACATGC ACGTGAACTG TGTATGCCAG GAATATAGAT CTTTTTGGAA GTGGCACGGT GCATTGGTGG AGACCTGGAT CATCGGAAAC GTTGTAAATT TCACACGCTC TTTTTTTTTG TCTGGGATGG CTCCGAATGT CTCGACGCTC ATACCGATAA AGCGGTAGTT ATCCTTGGAG ACTGGTGCCT TTCCTCATCA GGAACACTCA CCTCGCAATT GTTTACCCGT CGCTCCAGGA ATAGGTAGTT GATTGCTCTC TCACTACTGG TATAGTAGCT ATTGTAACTA TAATAGTTCA ATAACTCTAT TACTAGCGAA CGGGGAAGGT ACCAAGGATT TCTCTGTCGG TGCGTGACTG GCAGTCCCAG AAAGCAACGC TGTTTCCTCT CGCGCTGGAT GGAGGATCGT TGATGGCCGT GTACGATCCG ATCTGTTCCA AGTTCTCGCG AATCATGGAA CTTGTCGGTC CTCGTGTTCA CGGGCTCTTT TCGCTTTTGC GACCTCTCGG CGTCGTCACG TTGCTGTGTG CGATTTGTCG AGGTCGCTCG CTCTTGTTTG TCACTAACAA CTCGTTTTCC CCGTCGGCTG ACTGTGACAA CAAACACCCC TATTGA
|
Protein sequence | MKAASHTATS RTPLYATSSM HDIATATVEP TLLAQAWTPQ LLQAESGTPT PSGSTLDDSS ALRVVIARAA ARTLPLLTDA SAVPFVCRYR VDLVNPLTTR QVHLLQTLSS RHASLQSVRA KVLRAAAVAH DGKGNAENEA WISKVQTSTS KAELEDWYAP YKPPSKGSVL DRIQNDHPEL IPQLDAFWHG DQDSFSIHRI LKQHPKDAVL HVLSTKLIAN EPSVVETVQN ELWKHAKIRT KPPASPSNDP AADQKYVVYH DFTAPLSRLR DHQILAIRRG VEQKLLQLSY EVDGSKIEAC MRYAVRRRWP RHGDALPANL LDEAVHEAPG QVFAENTSRA LLAPPASRGG GSSGNNASSR ASPPLYLLSV DPGFQAGLKC AVLDVNGHVA LQPLTTVKYL GNARTTGVRT MSTLLRDVAD ATQSNTVVVT LGNGHGTHEA RDLLREAAAV ATEKLELDIQ VVHEAGASVW SVTEAAREEF PNDPPSAIAA VSIGRRWQNP LHELIKVPPA SLGLGMYQHD VPPADLDDVL HRTSVDAAAA VGVDVNTCRV EILRKVPGLA KLADAIMAAR PLATRQDLLS RVTGLGPKTF QACAGFLRIV DGPEPLDGTL VHPESYDTAR WLLQTLSWDL LTVPTNLPPR AEWKSCWKDV LNAGSTQFGV SPERMLSVLE NLVDSLINVD PRLRQGNDTG SRRGASPRWD GSNCARLPAG LAHDLAALQA ACPVRNVQGT VRNLADFGAF VDFGGPHDGL LHISQMMANR VALDTLLIGQ GIGIDIVSVQ DHKVSLALAG GAVKNGASVL MPPQTGTGVG SRRTIQGPRI TVPVGPKPTG GGKRSASTSK SVRPTNIGRN TKRTKRSTNI DLFGSGTRTG KVPRISLSVR DWQSQKATLF PLALDGGSLM AVYDPICSKF SRIMELVGPR VHGLFSLLRP LGVVTLLCAI CRGRSLLFVT NNSFSPSADC DNKHPY
|
| |