Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_49652 |
Symbol | |
ID | 7198299 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011691 |
Strand | + |
Start bp | 311924 |
End bp | 315054 |
Gene Length | 3131 bp |
Protein Length | 842 aa |
Translation table | |
GC content | 54% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184343 |
Protein GI | 219128277 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.390924 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CCTTCCTGGT CGAGTTCACT ACTCACTGTC AATTTAAGTA GTAGTGAGCT ATACGGTGTG AACAATCCTT CCTTCGTGCA GAAGAGTTTG GGTGCACGTC AACAAAACAC GTCAACGTCA ACGTATACCT CGCGACCGAC CGATCACTCT TCATGTCTCC GTGCAGCGTC GTCCTGTTTT GCTGCGTCGT CGTGGTCGTC GTGGGTTCCA ATGGGATTGT CCACGGCTTT ACCAGTGTAC CCCCATCATC ATCACCACCA TTGCTGCGGG TAGTAAAGGG TGGGATTCGG ACGCACCGTC CATCACTATC ACCATTATCT CCGCACTCGT ACAACCATGG TAGTCGGCGA CCACTCCGTA TGCTTTTGCC TGCGAACGAT TCCGCGTCTC ATACTCGTCG AGTCGCGACA CGCCCGACAC GGTACGTGAC AATCTCACGA CGCTACTTGC TCCTGTTCGT GGACTCGATC GTTGCCATTG ACTCGCAAAC TACTTCTGTT GCTGATACTG GAACGATTGC GTGCCTGCAT GTTCGTGTGT TGGGATTCTC GTGCTCTCGT CTTTCGTCAC TCACGCGCTC ACTCGTTCGT ACTACTATTA CTACTACCGT GACAGACTTC TCCGTCCGAC GTCGGGGTCG TCCAAGGGGT ATCCGTCCCG TAAATTTCGG GTGTTGTCCC TCGCCGGACG TCCCGACAAC AACAACGACA ACGACTCGAA AGAGCGTCGC CAGTTTATGC TCGATCCCGT CACCCAAACC TTCGACCACG TGACCAAGTC CGTGCAAGAA TCGGTCGAGA ACGCCCGGAC GCGTCTCGAA CAGCTACTCC GCTTCCTACC CGCACCCCTC CGCAATTTCT ACCGCGCCTT CTTCGAAAAA CTCCACGCGT GGAAAGGATT CTTGGTTAGT TTCACTGCCG GAGCCTTTCT CGCCACCGCC GCCATTATTT ACCCCATTTA CGCTTCCGTA GAATCACTAT CCCAACCCGT CACACTCTTC GAAACCATAC TCGGCGATCT GGAACAAGCC TACGTGGAAG AAGTCGACAC CAACAAACTC TTCGAAACCG GAATTGCCGC CATGCTCCGG TCACTCGATC CCTACACGGA ATTCGAAGCT GCCCAAGAAG CCGTGGCACT CACGGAATCC ATCGAAGGCC GCTACGGTGG TGTGGGTCTC GTCATTGCCG GAACACCGCG GGCCGCGGCC GAACCGAAAT CCAACGCCAA CCAACTACTG CCCGCCGCCG CACAATCCGA CACTGCCAGT CAAGAAGATA CCGAACGCAA TCGGAATACC ATGAGTAACG TCATGACCGA AGAGGAAGAA GACGAATACA TGGATCGCAA GGAACAACGC AAGGCTCTGG AGAAGGCACG CAAACAAGGA ATCCGGGTAG TCACCGCCTT TGAGGGGTAC GCCTTTGATT ACGGTACGTT CGTTGGAACG CCCGCCCCGA CATTTTTACT AGTACAGCCA GTCGGGTCGG GGCTCTACAC GAACCTCCGC ACGCTCAACC GGCACTTTTT TCTTCTGTTT CCCAGGCTTA CGCGTCGGCG ACAAGCTCTT GGCGATTGAC GATAAGCCTC TCACAGCGGA TACGACGGTC GAAGACGTCC GCAATATGCT CCGTGGACAA CCCGGAACCT TGGTAAGTAT TGAGTTCAAC CGAGATGGCG TCGATGACGT ACAAACCGTT ACCATGCCCC GTGCCGTTGT TCGCCTCCGG GACGTCAAAC TCGCCACCCT CGTGGGAAGT CCCCGGGACG GGATCGGCTA CATCCAATTG AGTGGCTTTA CCTCCAACGC CGGTGCCGAA ATGCGTCAGG CCATTACGTA CTTACAGCAA CGGACACTGG ACGCGACCAA CGGAGACAAG AGTTTACAGG GACTCGTTCT CGATTTGCGG GGCAACCCCG GTGGCCTTTT GACGTCGGCG GTAGACGTAG CGTCCCTGCT CGTCCCGAAC GGCAGTGACA TTGTGTCGGC CCGCGGACGG GGCTTTCCCG GAATGCTCTA CCGGAGTCGG GTGGATCCCA TTCTGAATCC CAACACCAAA CTGGCGGTGC TCGTCAACGG ACAAACGGCG TCAGCGGCCG AGATTGTGGC CGGGGCCGTC CAAGATTTGG ATGTGGGCGT CATTGTGGGT GCGGACCGCA GTTTTGGCAA AGGGTTGGTG CAAAACGTGG AAGAGTTACC TTTTAATACG GCACTCAAGT TCACCGTAGC CAAGTATTAC ACACCCAGTG GCCGGTGTAT TCAAGGCGTC AACTATAAAG GAGGGGGTGG CCTCAAGGAA GAAAATGGAG GATACATTGC CAGTAAGGTG GCCGACGCTG ATCGCAAGGT GTACTATACC AAAGCGGGCC GCATGGTGAG AGACGGCGGC GGTGTGGAAG CGGATTACAA AATTGAAGCT CCCAAGGCTT CGGCCCTGGA AGTGACGTTG CTGCGATCGG GAATGTTCAA CGAGTATGCC GCGGAATGGA GTAAAACACA CATGCTGACC AACAATTTTG CCGTGGACGA AGATATTTAC CGAAACTTTA TTGCCTTTGT CGATCAAAAG CAGAAAACTG GTGACATTGA GCTGGATGCG CTGTACAGTC GACCGCTATC CGATTTGAAA AAGGCTCTTA AACGGAGTGG ATATAAGGGT GCCGAAAAGG AGGTGGAAGT GCTACAGGCC AACATTGTTC GGGAAGTCCA AAAGGATTTC GACAAGTATC GAAAAGATAT TAAAGAAGAT ATTTCCCAAG GCATTCTGGC CCGATATCTT CCGGAGAGTA TGTTAATTGA ACGAGGTATG AAAAACGACG CACAGGTGGA GGCAGCGATC AAGCTGGTGG CCAACAAGAA TACATTCGAT AAGATTCTCG CGCAAGGAAA CACGGCCGAG CGCATGGGGG GCGCCAATAG TTTGAATATG GCATCCGGCG CCTCTGCACA AAGCACTAGC GGTGTACGAG CTACTATCCA ATGGTAGAAT TGGATCCTGC AAACGTCTTG TACAAACAGT AACAGGATGA AGCCAACCGA GGAGAGAACT GGCCAGCAAA CTATTCACCG GTCCCCTCGG GTTCGCTTGG GCAGCAACAT TAAGCTAACT ACGCTTCGGC ATTTACCCAT T
|
Protein sequence | MSPCSVVLFC CVVVVVVGSN GIVHGFTSVP PSSSPPLLRV VKGGIRTHRP SLSPLSPHSY NHGSRRPLRM LLPANDSASH TRRVATRPTR LLRPTSGSSK GYPSRKFRVL SLAGRPDNNN DNDSKERRQF MLDPVTQTFD HVTKSVQESV ENARTRLEQL LRFLPAPLRN FYRAFFEKLH AWKGFLVSFT AGAFLATAAI IYPIYASVES LSQPVTLFET ILGDLEQAYV EEVDTNKLFE TGIAAMLRSL DPYTEFEAAQ EAVALTESIE GRYGGVGLVI AGTPRAAAEP KSNANQLLPA AAQSDTASQE DTERNRNTMS NVMTEEEEDE YMDRKEQRKA LEKARKQGIR VVTAFEGYAF DYGLRVGDKL LAIDDKPLTA DTTVEDVRNM LRGQPGTLVS IEFNRDGVDD VQTVTMPRAV VRLRDVKLAT LVGSPRDGIG YIQLSGFTSN AGAEMRQAIT YLQQRTLDAT NGDKSLQGLV LDLRGNPGGL LTSAVDVASL LVPNGSDIVS ARGRGFPGML YRSRVDPILN PNTKLAVLVN GQTASAAEIV AGAVQDLDVG VIVGADRSFG KGLVQNVEEL PFNTALKFTV AKYYTPSGRC IQGVNYKGGG GLKEENGGYI ASKVADADRK VYYTKAGRMV RDGGGVEADY KIEAPKASAL EVTLLRSGMF NEYAAEWSKT HMLTNNFAVD EDIYRNFIAF VDQKQKTGDI ELDALYSRPL SDLKKALKRS GYKGAEKEVE VLQANIVREV QKDFDKYRKD IKEDISQGIL ARYLPESMLI ERGMKNDAQV EAAIKLVANK NTFDKILAQG NTAERMGGAN SLNMASGASA QSTSGVRATI QW
|
| |