Gene Information |
Locus tag | PHATRDRAFT_42729 |
Symbol | |
ID | 7196363 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011669 |
Strand | - |
Start bp | 915431 |
End bp | 918453 |
Gene Length | 3023 bp |
Protein Length | 1003 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002177189 |
Protein GI | 219110875 |
COG category | |
COG ID | |
TIGRFAM ID | |
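The length fields above can be cross-checked against the coordinates with simple arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive (an assumption, though it reproduces the listed 3023 bp) and that the coding region is the protein length plus one stop codon. The function names are hypothetical and not part of the source database.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode a protein, optionally with a stop codon."""
    codons = protein_aa + (1 if include_stop else 0)
    return 3 * codons


# Values taken from the record above.
gene_len = span_length(915431, 918453)
cds_len = coding_length(1003)

print(gene_len)            # 3023, matches the listed Gene Length
print(cds_len)             # 3012, i.e. 1003 codons plus a stop codon
print(gene_len - cds_len)  # 11 bp of the annotated span lie outside the coding frame
```

Under these assumptions the annotated span is 11 bp longer than the coding region implied by the protein length, which explains why the gene length is not a multiple of three.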
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGAGCAAGA AAAGGAATGC CAAGAAACGA CGTCGGGAAC AACGCCGCCA GATGGAAGAA CATTCCAGGC TTACTAATGA TTCCTTCCAG GACCAGCAAA GAGCTACCAG TACGAACAAA CATTCGAAGG GGGATGGTGC ACCACCATCA TTTGACAAGA GTCGCAGCGT CAATTCATCT TCTCTCCCTA CAGCAGTGGA TGGCTTCGGT TCCCAAAAGC GTTCCATCGC TGCAATCCAA TTTGTTTACC CCGAACGGAA TGGAAGAAAA CTTTACAGAT TTCAAAAGGA GCTGCTCGAC ACCTTCATCG TTCCTCCCCA AGAAAAGATT GAAAAAGGGC TCCCTGAATT CGTCGCGACG AACGTGAGTT TGAACGACAT TCTATACCCG GACAGCAACA GAGAAAAACA ATCCGATTCT GGTAGTCAGC TCAAAGGCGA CTCCTCTCAG AAATCTCCAA AAATCCTGCA TGGCGAGCAT CACACCAACA ACGTAGTAAA GGAGCTGCGT CAGAAGAACT TGGATGTACA GTCACTTTCT GGGCAATCGC AAGGCCAGCT GTCGGGGATG TCCGTGGAAG ATGCCGTTCG CGACAAACTT GGATGTCGGC CTCGTGCGAA TTCGACAGAT GGAGAACTCA ATTTACCGCA ACGAGGGCTT TGTGATGAGC GGAAGGTACT AGAGTCTTTC AAATGGATCC CCACAAATGT CAATTTGTCT TATCCGAAAG GCTTTGTCAA TCTGGGTAAT ACGTGCTTTC TGAATAGCAC TGTACAGTGC CTCGCCTACC TACCACCATT CTGCCAATCC TTGCTTTCAA TATTATCTCA TGAATCAAAA CATGGTGAAA AAAGAAAGAC CAGTCAAGGA AGGAAGGTAA CATTTATCTT GCGCTCCCTC TTTTCTCAAG TACACGGCAT TGATGGTGGT ACCACACACT CTGGAAGTTC ATTGGCTCCT CGTGCCATTG TCCAAGCAGT GCCAACTCTT GGTTCCTGCG GTAGTCGGAA AGGGTACAAG TTTCGACCCG GTCGACAAGA AGATGCACAT GAGTTTTTGG TGCATTTGCT AGATGCCATG AACGACGGTG AGCTGAAAGA AGCGGGAATC AACCAGAATG CAAGTGGATG GCGTGATCGA TTGCCAATTC CTCGCCTAGA TGAGACGACT TTCATACATA GAATATTTGG TGGATATTTC CGAAGTCAAG TTCGCTGCAC ATCCTGTAAC AATCGCAGCA ACACGTACGA TCCCTTACTA AACTTGTCTC TGGAAGTCAG TCGCAAGGCG TGCAACTCAG TTGCTCAGGC TTTGCACGAG TTTACTCGCA AGGAAACCTT AGACTCTCAG AATCAGTGGA AGTGTTCGGG CTGCAAAAAA TATGTTTGTG CTACAAAGCA GTTGACTGTA TTTCGACCAC CTTTGTCTCT TTGCATTCAA TTAAAGCGAT TCACATACAG CGGTAGACTA AAATTTAGTG TTGGCTTCGG GAGCTTCGGC AATGGGGGAG GAGGACAAAA GATATCGAAA TCTATCGAGT TTCCAGCCCA GCTGAAGCTT CCTTTGAGTG ATGGTCGCTC CTGTGGGTAT TCCTTGACCG GAATTGTGAT CCATGTAGGA GGTAGTGCTA GTTCTGGACA TTACACGGCG TATGTTCGCA AGCCAGGTGG TGGTAGCAAA TCACAATGGT TTCACATGGA CGACTCTTTC GTCGAAGCTG TATCGGAACA AACAGTCCTT CGACAAAGAG ATGCCTACTT GCTGTTCTAC TGTCGCGAAG AAGTTAAACT TGAGTTTCCA 
ACACCTCCAA TGTCGGCCAA GGAAGCGCAA GAACTTGGCA GAGCCAAAGC TCGCGCCCGC GCGGACAGTT TAACAGAATT ACAAGCAAGC GCATCGACAT CTTTAATCAC AATCACTAGC AAGTCGCTTT GCCATGAAAG TACAGCGGCT TTGGAACAGA AAAGGAAGGT TAATAGAGTG GAGGAGAACT CGAATGGTAC GGTGGCTTCT TTTCCACCAA ATCTGCGAAA GCAGGAACAA AATGAGAATG GAGACTTGTT GGAGAAACAA CGGAATGAAA CATCCTTTTC AGCCCCAGGC AAGAAAACTG ACAGCGCCGA ACTACTGCCA ACGCCAATCA CAGCTCCGGA TTTTGGCTTT GAGATTCAAC GCTCGTCAGT AAAGCAATTT AGCCGTAGGA GTCCGGATCC GTCGAAAGTG GTAGTTTCCT CGAAATCCCA GCGGGAAAGA TACGGAGGCA ACATTCAGCT CGGACAATCA CCAATGATGA AACCTGCCAT TGCACTTCCG GATCAAGGCG AAGAATCCTC GTCGGAAGAG TCCTCACCCA GCGACGATTC TTCCGTACAA GACCAGAACG ACAGCACCAG TTCCGCAAGG ATTTCTCTGC CGGTTATTAA GTCGGAGGCC AAGACGTCAT TCGCCGACGT TTCTTCCTCG TCCGAAAACG AAGAAAAAGC GTTTAGGACA ATAAATATGA CAGAATCCGA AAGTCCACAT GCTGTCCAGA AAAAGGCGTT GGAGAAACCG CGGACTCGCA TCGTGCTTGA TCGTGGTGAA GGCCGTGAAA AGGTGGAAGT CATGATGGGT CCGCGCTCCG AGACCAAGGC GTGGACGCCC AGGGCTGGGG CGGTTACAAA AAGTGAGGAC TATGCGCTTT TGGGGAACCA GCGAGTAGGA AGGTGGGATG ACGAAGGCAA CGATGTTGCA ACTCGCCAAC ATGACCGCGG CCGCGAGAAC CTTATCCAAC AAATGGACAA GAAAGAATCC AACCGAAAGC GCAAAATGTA CTTAGATCGC TGGGACGCCA TGCTGGATCA AGGCCAAACC AAAAAGGTGA AAGAAAAGAC TGATTCTATC AAGCCGACGA CGCCCAAGAA AAATGTGTTC CAGCGAATCC AAAGCAGCGT GCAGCGTATG AATCGAGGCC GTGCAAAAGG ACACTTTCGA CCCGAGACGC AAAAGAAGAA ACGCGGTCGA CGCAGTCTCT GACAGTGTAC AAA
|
Protein sequence | MSKKRNAKKR RREQRRQMEE HSRLTNDSFQ DQQRATSTNK HSKGDGAPPS FDKSRSVNSS SLPTAVDGFG SQKRSIAAIQ FVYPERNGRK LYRFQKELLD TFIVPPQEKI EKGLPEFVAT NVSLNDILYP DSNREKQSDS GSQLKGDSSQ KSPKILHGEH HTNNVVKELR QKNLDVQSLS GQSQGQLSGM SVEDAVRDKL GCRPRANSTD GELNLPQRGL CDERKVLESF KWIPTNVNLS YPKGFVNLGN TCFLNSTVQC LAYLPPFCQS LLSILSHESK HGEKRKTSQG RKVTFILRSL FSQVHGIDGG TTHSGSSLAP RAIVQAVPTL GSCGSRKGYK FRPGRQEDAH EFLVHLLDAM NDGELKEAGI NQNASGWRDR LPIPRLDETT FIHRIFGGYF RSQVRCTSCN NRSNTYDPLL NLSLEVSRKA CNSVAQALHE FTRKETLDSQ NQWKCSGCKK YVCATKQLTV FRPPLSLCIQ LKRFTYSGRL KFSVGFGSFG NGGGGQKISK SIEFPAQLKL PLSDGRSCGY SLTGIVIHVG GSASSGHYTA YVRKPGGGSK SQWFHMDDSF VEAVSEQTVL RQRDAYLLFY CREEVKLEFP TPPMSAKEAQ ELGRAKARAR ADSLTELQAS ASTSLITITS KSLCHESTAA LEQKRKVNRV EENSNGTVAS FPPNLRKQEQ NENGDLLEKQ RNETSFSAPG KKTDSAELLP TPITAPDFGF EIQRSSVKQF SRRSPDPSKV VVSSKSQRER YGGNIQLGQS PMMKPAIALP DQGEESSSEE SSPSDDSSVQ DQNDSTSSAR ISLPVIKSEA KTSFADVSSS SENEEKAFRT INMTESESPH AVQKKALEKP RTRIVLDRGE GREKVEVMMG PRSETKAWTP RAGAVTKSED YALLGNQRVG RWDDEGNDVA TRQHDRGREN LIQQMDKKES NRKRKMYLDR WDAMLDQGQT KKVKEKTDSI KPTTPKKNVF QRIQSSVQRM NRGRAKGHFR PETQKKKRGR RSL
|