Gene Information |
Locus tag | PHATRDRAFT_44310 |
Symbol | |
ID | 7197971 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011672 |
Strand | + |
Start bp | 197430 |
End bp | 200838 |
Gene Length | 3409 bp |
Protein Length | 974 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002178179 |
Protein GI | 219114767 |
COG category | |
COG ID | |
TIGRFAM ID | |
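The Gene Length row follows from the coordinates if Start bp and End bp are read as 1-based, inclusive positions on the replicon. That reading is an assumption, though it is consistent with the values in this record. A minimal sketch, with the function name `gene_length` chosen here purely for illustration:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


# Coordinates of PHATRDRAFT_44310 on NC_011672, taken from the record above
length = gene_length(197430, 200838)
print(length)  # 3409
```

For comparison, a 974 aa product needs 974 x 3 + 3 = 2925 bp of coding sequence including the stop codon. This is shorter than the 3409 bp span, which is consistent with the gene being marked as spliced, although untranslated regions may account for part of the difference.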
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CGTTTACATT ACTTCCAAAA ATATACCTGA CAGTGACTGA AAGTTTTATC CAAACGAAGG GTACAAGTTT ATCTTACGTT GTATCTAACC TTCAAAATCC TCATAAGTTT GAGGTGCAAG CTCTCCTCGA CTTACGTTGT ATCTAATCTT CAAAATCCTC ATAAGTTTGA GGTGCAAACT CTCCTCGACT TACATCACAC TACACGATAA GCACAAACAT GAAGGACGGC AGCAACAGTC GAAATGATTC AAGCACGGAG GGTCGTTCCG TTTTTCGGAA TCTAGTTCCA GATCGCCATG ATGAACATGA TCCGTCTCCT TATGTACACG AACAACGCAC CACCGCTCCA CCATTGGTGT ACGATGCGCC CTCCTTGTTG AATGAATCCC AAGCCGAAAC GTCCAACTTT GCAGCGATTC ACTCGAACTA CAACGACAAC AGCACAAGCA GCACCCGCAC ACCCGCTGTC GGATCTCTTC GGGATGCCGC GCGGCGTGTT TCCATGCTCA ATCGGACTGG ATTGTTATAC AAACATGTAC TTTCCTCTTC TCACCAGGCC CATCAACGCG TCTCGTCCCG TGCACAAGAT CTGTTGTCAG CTATTCAAGA AGGTGGACAT CTCAGCAATG ACGAGCATGG CGATGACAAT GTTCTTGCCG ACGATATGTC AAGGTCCGAT TTGACGGACG AAGAAGAGCA ATTGCTCGGC ACCGCCAGCG ATGCGGCCAA CAATACGAAC GACACACTTC CGACTCGTCG TTCGGAGTAC GGCTCCGTTC TCATGCAGCG ACAAGAATCA ACATCCCAAA AGTCTGTAGC ATTCCGGCGC ATGCGGTCCT GGCGTCGCAA GGTGGCAGTA CTTTTTCACC CGACCCGCAT GCTGCGAATT GCTTGGGAAA GCTTTCTTTT CCGTCTAGGC TTGCCCTTTT GCCTAGCAGC GTGGTTGCTG TATTATCCAC TCGGAAACCC CGAAGTAGAC GCCTTACCTG GCAAGACCCG GCTCTCTTGG TGGTTCAATT TTGCCGGTCG ACAAGTGATT ACTCTAGAAT TGGCCCGACT CACGCAATGG CTTCTATTGG ACCTATGGCT GTGTAGAAGT GGTAGTAGTG CTAGGGAACG ACCGTGGTGG AAAGTTGATC CATTGGTGAC ACTCTGGAGC ATTCACAGCC GCGGATGGCC GTTCCTCATT TCGACTTGGG CGCTTTGGGA TTTACTCATT TTGCATGGCG ATAATCCCTT CAATACGCAC TGGTTGTATT GGACAGGCTG GGCTTTGTAC AGCAACGAAA ACGCCAATTC TGGGGCCTTC GTAATCAGCA GTGACCTGTA TTTGCGGATC TTACTGGCCA TGTTGGTTGC AGGCATTGCC ACCGCAGCCA AATGCGTTTA TGTTGTCTTG CAATTCAGTC GTCGTCAGGT ACAAACATTT CAAATTCGGC TTAAGGCAAT TTTGGGCGAG CTCGTCACGG TCTCGCAGAT TGCTGCGTTG GCAACGCAGG CTGATCTTGT GGCGGCACAA TTGGAGATGC AAATGGATGA AGACTACAAC GCTCCGGTAG GCACATCCAT TCGTTCCATC GGCTCGTCTC TCAACAAGTC GACCAGACCT CAATCTTCCC TGCCACGCGG GGCTGCGCCT GTTACAGGGA ACACCAATGT TCGTTGGAGT AACCTACATT TTGAAGAACA CGATTCCAAA GCTCCAGAAC ACGATGATGA TGATGACGAT TCATCCAGAA GCGAGAATAC ACCGGGATCC TCGTCTGGTC TCAAACGGTC GTTGTCCCAA GACTCGTCCG 
GTAGTTTGGC AGTTCTAAGC CTACTTGATC GCTGGGAGCC ACCAGTCAAC AAGGCGAACA AGAGCGATGT TGCAATTTCG GACGTGCTTA AATTTCAGCA AGCCTTGCGT TACATGGACG ATGACGCTGT CTTTGGTGAA GACTTCGGAC CCGCTCGAGA TCGTAACGAG TGCGTGGGAT CAGCTGTCGC AATGTACCAC AAACTTGTCA AATGGACGCC TGATTCCTAC GTACTTAAGT TTGACACGTT GGAGATTTTG GCCATGGACG AAGACGGTGT TGTGGATCCA CTAAAGCGAA AAATGTTACG CAAGCTGTTC CGACCAGATA GATCCGGTCG AATCCCATTG GTTGCATTCA TCCAATCCAT TGATGCTGTC TATAAGAGAT TGCGTTACTT TCGTGCGTCC GTGACGAACG CGACAGTGAT TGATGACGTC TTGGAACATA TTGTGGATGG GTTGTTCTAC TTTGTATTGA GTTTGGTTGT ATTGAGTTTG TTAAATTTCA ATCCCTGGAC CTTTTTGGTT CCCATCACGT CCCTCATGGT GTCGCTTTCG TTTGCCTTTG GTGGGAGTCT CAGCAAATAC GTCGAGGTAT GTTATTCGCG GGTCGTCTTG GGAAATTTGC GAATTCTGTT GAGCGCTCAC AAAATTTGCC GGTTTGCTCA GGGTGTGCTC CTCATTGCGG TCCGACGCCC TTACGACTTG GGCGATCGCA TTTTCATTGG CAGCGCGGAA GCTCAGGCCG AAAGCGATAT GTCGATCCAA ACTTGGTTTG TTGAAGGTAA GTGCTTTTGT TGTTGAACAC ACCAATTTTG CGCCGTATGC GTAGGGCTCT TCCATTAACC TCTTATGTCT ACTATTTCTT TTAGATATCA ATTTGACCAC GACGACTTTG CGATTCGCTC GTACCAACGA GGTCTCCACT GTTAACAACT GGGCCATTTC CGGCTCTCGT ATTATCAACT GCAATCGCTC ACCCAATGCT CTCATCTTCT ACGAATGGAA GCTTCATATT AGCATATTCG ACGGCAAGAA CTTGGATAAT TTCAAGGAAG CTTTGAACAA GTACGTCCGG GACCATCCCC GAACTTGGAA CAGTCTGGCG TTCATCCGAC ACGACGTTAT TGACGCGGAT ATGGAACAGG TGGGCTTCCG CATGGCCTTT CGCCACCGGA ATGGATGGCA GGACGCAGCG CGGATCAAAC TCAACCGGGC AGACCTATTG CGCTACATTC ACGACACGGC CAAAGCCATG GGGGTCAACT TTGAGACCTC CCCGGCCCGA CGTCTCTTGT ACTACGGTGG CGTCTTGGAA AGCGGCCAAG TCAAGGATTA CAAGAAGAAT CTGTTGCGTC CATCAAACAT TCGTAGTCAC AGTCACACCT TTGACGATCA TCGTTTTGAG TCGTCCTACC CCGGTACGGC GGGGATTCCG CATTCTCCTC CTCCGCAAGC AACTCGAGTC GCTCCACCCC CGGGAGACGT TTTAATGGGT GAATAGCCTA GCTGCAGGAG TTGTCAATGA ATTATGCAGT GAACGATTCC GAACCGATCA GGATGGTCCA TTCTTTATAT TTAAGTCTAC TTACTTTCGT GGTTTTACC
|
Protein sequence | MKDGSNSRND SSTEGRSVFR NLVPDRHDEH DPSPYVHEQR TTAPPLVYDA PSLLNESQAE TSNFAAIHSN YNDNSTSSTR TPAVGSLRDA ARRVSMLNRT GLLYKHVLSS SHQAHQRVSS RAQDLLSAIQ EGGHLSNDEH GDDNVLADDM SRSDLTDEEE QLLGTASDAA NNTNDTLPTR RSEYGSVLMQ RQESTSQKSV AFRRMRSWRR KVAVLFHPTR MLRIAWESFL FRLGLPFCLA AWLLYYPLGN PEVDALPGKT RLSWWFNFAG RQVITLELAR LTQWLLLDLW LCRSGSSARE RPWWKVDPLV TLWSIHSRGW PFLISTWALW DLLILHGDNP FNTHWLYWTG WALYSNENAN SGAFVISSDL YLRILLAMLV AGIATAAKCV YVVLQFSRRQ VQTFQIRLKA ILGELVTVSQ IAALATQADL VAAQLEMQMD EDYNAPVGTS IRSIGSSLNK STRPQSSLPR GAAPVTGNTN VRWSNLHFEE HDSKAPEHDD DDDDSSRSEN TPGSSSGLKR SLSQDSSGSL AVLSLLDRWE PPVNKANKSD VAISDVLKFQ QALRYMDDDA VFGEDFGPAR DRNECVGSAV AMYHKLVKWT PDSYVLKFDT LEILAMDEDG VVDPLKRKML RKLFRPDRSG RIPLVAFIQS IDAVYKRLRY FRASVTNATV IDDVLEHIVD GLFYFVLSLV VLSLLNFNPW TFLVPITSLM VSLSFAFGGS LSKYVEGVLL IAVRRPYDLG DRIFIGSAEA QAESDMSIQT WFVEDINLTT TTLRFARTNE VSTVNNWAIS GSRIINCNRS PNALIFYEWK LHISIFDGKN LDNFKEALNK YVRDHPRTWN SLAFIRHDVI DADMEQVGFR MAFRHRNGWQ DAARIKLNRA DLLRYIHDTA KAMGVNFETS PARRLLYYGG VLESGQVKDY KKNLLRPSNI RSHSHTFDDH RFESSYPGTA GIPHSPPPQA TRVAPPPGDV LMGE
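The GC content row reports the share of G and C bases. A minimal sketch of that calculation for a sequence pasted in the 10-base grouped layout used above. The helper name `gc_content` is hypothetical, and the record does not state which span the database measures or how it rounds, so a recomputed value may differ slightly from the listed 50%:

```python
def gc_content(sequence: str) -> float:
    """Percentage of G and C bases; whitespace between base groups is ignored."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("sequence is empty")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# First three 10-base groups of the gene sequence above
fragment = "CGTTTACATT ACTTCCAAAA ATATACCTGA"
print(f"{gc_content(fragment):.1f}%")  # 30.0%
```

Passing the full Gene sequence field to the same function gives the figure for the whole displayed span.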