Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_48873 |
Symbol | |
ID | 7194953 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011687 |
Strand | + |
Start bp | 538664 |
End bp | 541278 |
Gene Length | 2615 bp |
Protein Length | 808 aa |
Translation table | |
GC content | 54% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002183375 |
Protein GI | 219126251 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.424343 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | AGGTCGGTAA CACCGACAAC GAGCCATCCC CGGAGGTTCA ACGCACACAA CATGAGAAAA GCGAAAAAGA AAAAGTCGGC AGCGTCCAGA AGAAACCACG GCAAAGGCCC GCCACAGAAC GCCACACAAT CGTCCGTCTC CTGGATCGAC GATCTAGTCT TTGCCCCAGT TCTTCCACAG AACATGGGAG ACGAAGCTAA GAATACATTA CACCGTACAA ATGTTGGTCC CCGCACCGGT ATCCACGCCA TTAGCGACGA CAACGCCAAC AACAACAACC ATACCATTTC CGTCGTGTCT TGGAACATGC TGGCGGAGGC CTACTGCAGC CCCCGTTCCC ACGTCAACTT GCCCGCATCC TACCGAAAGG TTGTCTTTGA TCGAACCGCA CGAAGGGATC GTATATTTGC AATCCTCAAG CGCTTGGCCC AACAGCAACA GCCCGTCATC GATGTCTTGT GTCTGCAAGA AGTCGACGTC CATCTGGACG ACGCCTTGCA AAGGTGGGGG TACGCCGCTG GACGGCAAAC GCCACGTACC CGAGCTGGTG GCGGATCCGG AGGCCGTGTA GACGCTTGCG CCGTTTACGT ACGCGACCAT CGCTGGCAGA TAGTCGCACA CGAACTTGTG CGCTTGGATG ATCTCGCCAC GCTCCGCTCC GCGCCGCATT CGGTCCCTGA CGACCGCCGT ATCGGTAACG CCAGTAATAG TGCCAGTGCT ACCAGTCATC TGCAAGGTCT CCAACAGGGG TTTCTGCGTC GCAACGCCGC TCTCTTGGTA CGTCTGCGGG ACACTTCGGG AAGGACAGTG GTAGTAGCCA ACGCGCATTT GTATTGGAAT CCTGGATTTG AATACGTCAA GGTAAGGTAT CGGGTGCGGG TAGTTCCTCC CGTAGACCAG TACAACAACA ATATCAACAT TGTCTTACCA CGGATCCCTT GCTGCTTCTG CGTTCGTGGC ACAATCCTTA GATGTGTCAG GCACACTACA TTTTGCAGCG CGCCCGAGCC TTTCTCCAAC ACGCCGAGGA ACCCCTCATT TTTTGCGGGG ATTTGAATTC ACGACCACAC GGAGCCGCTC ATACCTACCT GACACAAGGA TTCATCAACG CCAAACGCAT AGCACCATGG TACAGCTACG GTAAGGCAGA CGAAAGCATG ACAAACGATT GCGAAACAAC TGATGACGGA GCATTGTACA ACGAGGCAAA GGATGACGAC GAACCGGTCA CACACAGCGT TCTTGAGCAG ACGTTTTCGG CTCTGGATCT CAACGACACG AACAACACCT CAGAACTAGC CAGTCCTCGG ATTCGCTACG TACTCGACGC GACGTTGAAT AAACTCTGTC GATGGCTCCG AATTCTGGGT CAAGACGCCG CGCTGGAAAC AGACAAGGAG GAAAAATCTC GGACCCAATA CAACGGTACA ATACCCCTTT TCGATCGCGC CCGTGAGGAA AGACGTACGC TTGTGACGAC TTCGACGCGA CTCATGCAAC GCCGGGATTG TCCCACCGGG GCTTACTGCT TGAATCCCTC TGTTTTGCCG CATTTGGAAG TCGCCTTGGT GCATCTGTTG CTGACCCATG GAGTCGTGTT GGAACCGGCT ACGTTCTTGA GCCGGTGTGT GGTCTGTAAC GGGAAGATCG TTAAAGTCCA GGAGAAATCT CGACAACGTC AAATTTTGGA AGGCTACGAT GCTCCCGTGT TGGCGGAAGA GATGGTAGTG TACGAATGTG ACGGCTGTCA GCAAGGCTAC TGGTGGTGTG ATCTGCCAAC GAGTTCAGCA TCGAGGGTGA AAACGCAAGC CACCCGACTA TTTCAAGTCT GCTTACAGGC GGGTGTTCCC ATAGACGGGC GCGTTCCGGA TTTGTTTGCC CACGTGAATG TGCCACAAGA ACAGGAACAG GGCTGGGACT ATACCCAGCC AGGCAGTGAT CTGCTACGGC AAAGGCTGGA CGTAATCGAT TGGCTCAAGG ACGAATCTCT ATCTTGTCCC TTTCATTTAG AGAGTGCATA CGCGATGCGA GACGAATCTA ACAATCTGGT GGGCGAAGAA ATACCTTTCA CCAACGTGAC GGATTCATTT GTCAATACAT TGGATTATAT TTTCTTTGAA CCCAAGCGAA TGGAATTGTT GTCCCGGCTG TACGTTCCGA AAAGCTTTCG AGAATTGAAC ACCAAAAATA TTCCGCGGGG ACATTTGTTG CCGAGCGATA TATGGCCAAG CGACCACCTT GCGATTGGCG CGACGTTTGC GTTGCTTCCT AGGACCGCTT CAACGGACCC ATCCGTGTCG TTAGCGGAGA GGACTGATTC TGCGGCTACA CTGCAAAAGG ATCATCGAGG CGTGGCACCC TTGGAGCCAG CCTCAGCGAG AGTCACCAGC GGGTCGGACA TAGACAGCGA GTACTGTTTG CCGACGGGTG CTGGAGTAGG TGCGCTACCG ATTCGACCCA CGGCTCCAGT TTTTCCAAGG CGGCACAAGA AGCGATGCGA CTGTGGGTGC GTACCGTCTA TTCTATCAAT GTTCGAAATG GCGGAACTAC GCAAGCAGGC ACGACTGGCC AAAGCGCAGC GCACGGAATC GGGACTGAAT TCTGTAGTCT CATAA
|
Protein sequence | MRKAKKKKSA ASRRNHGKGP PQNATQSSVS WIDDLVFAPV LPQNMGDEAK NTLHRTNVGP RTGIHAISDD NANNNNHTIS VVSWNMLAEA YCSPRSHVNL PASYRKVVFD RTARRDRIFA ILKRLAQQQQ PVIDVLCLQE VDVHLDDALQ RWGYAAGRQT PRTRAGGGSG GRVDACAVYV RDHRWQIVAH ELVRLDDLAT LRSAPHSVPD DRRIGNASNS ASATSHLQGL QQGFLRRNAA LLVRLRDTSG RTVVVANAHL YWNPGFEYVK RARAFLQHAE EPLIFCGDLN SRPHGAAHTY LTQGFINAKR IAPWYSYGKA DESMTNDCET TDDGALYNEA KDDDEPVTHS VLEQTFSALD LNDTNNTSEL ASPRIRYVLD ATLNKLCRWL RILGQDAALE TDKEEKSRTQ YNGTIPLFDR AREERRTLVT TSTRLMQRRD CPTGAYCLNP SVLPHLEVAL VHLLLTHGVV LEPATFLSRC VVCNGKIVKV QEKSRQRQIL EGYDAPVLAE EMVVYECDGC QQGYWWCDLP TSSASRVKTQ ATRLFQVCLQ AGVPIDGRVP DLFAHVNVPQ EQEQGWDYTQ PGSDLLRQRL DVIDWLKDES LSCPFHLESA YAMRDESNNL VGEEIPFTNV TDSFVNTLDY IFFEPKRMEL LSRLYVPKSF RELNTKNIPR GHLLPSDIWP SDHLAIGATF ALLPRTASTD PSVSLAERTD SAATLQKDHR GVAPLEPASA RVTSGSDIDS EYCLPTGAGV GALPIRPTAP VFPRRHKKRC DCGCVPSILS MFEMAELRKQ ARLAKAQRTE SGLNSVVS
|
| |