Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_36079 |
Symbol | |
ID | 7201258 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011677 |
Strand | - |
Start bp | 498878 |
End bp | 500707 |
Gene Length | 1830 bp |
Protein Length | 565 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002180650 |
Protein GI | 219119795 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCGTCGA CAAGGTCGAA GAGAGCCAGT GCACGCATTC GAAAGAGACC CATCAGCTTT GGCAACCCGG CTCTTGGAAC CTATCATGGC GATGTATACA CGGTCGAAGG TGACCACGAT AACATTGGCA CCCAGCTTCT AGCGGAAGGC GTAGTGACCG CCAACAATTT AGTTGCCATG GTCGATGTGC CACCAGAGCA GGTACCCGAA GGCGTATTGA ATTTGGCGCG TTCCCATCGA CCGTTGATTG ACCATATGCG CGTAGTTATT GCGGAAGGAA ATGGCCAGGT CGAGGCCGTA CATCGGTCTA CAGAGACCGA TGTGGTTAGC GAAGAGGCAA GAGGGAAACC GCTCGCTTCC TACACCTTCG CTGGCGTCAA TGCCGCTTCC ATTTTAGCTT CTGACGTTGA AGTGTCGGAA GCTGCGATTG ATCCAAAAGC GCACATGGAG CAATTACTGG AAGACGATGG AGCCGAGCAC CATCCATGTC GAACATACCT TGTCTTGCTA GAAGCATGCT CGCCCGAAGC CGCAAAGGAC CTTGTGCAAG ACTTGCGTGG AATGCCTTAT ACATTTCTAG ACGAGACACA AACTTGCAGC GTTTTCCACG TTGTGGCACT AGAAGGGGCC GACGGAGTTT CCCTCATGTC CCCTTTCTTC GCGCCGTCCA CAAAAGCGAC AGACAATGGG CACTTGGAAA TTTCATCGTC TCATTCAAGT GACTCCCGAA TGGAGGCAAG AAATCGCTGC GGGAGCGTTG ATTGTAAATC TGGAGAATCT GAGCACCCTT CAGGGCAGCA CCAACGCTCG GAAGATTACA ATTGCGCAGT CTGCCTAGAG CATATGGACA TGACCTATCC CAGATCTGGC GAACGGACCT CAATTTTGAC AACAGTCTGT AATCATTCGT TTCATATGGA TTGCTTGCTG CAATGGCAGG ACTCTCCCTG TCCGGTTTGT CGCTTTGATC ATTCTGGTTT GAACGAGGCG TTGTCACAGT GCCACCTATG CGGAAGTACC GCCCACAACT ACGTTTGTTT GATATGTGGT ATTGTGTCGT GCAGCGGAGG GCCCCGCTCC TCTAGTGCTG CTGCTGGTAG GTTAGGCCCA CATGACAGCT TTTCACAGTG TCGATCCGAA GACACGCCCA TATTGCCGTG TTACCAGAGA CAAGCGTTGT CGCATGCACG GCAGCATTAC GACGAAACAC TACATGCATA TGCATTAGAT ACGGAGACGC AGCATGTATG GGACTTTGCC GGTCAAGGGT ACGTGCATCG CCTCTTACAA AACAAAGAGG ACGGGAAACT AGTAGAAGTA CACGATCCCT ATAACACCAC TTCCCAAGAA CGTTCGCTAA GTCCTGGTTT GAGCGAATCG CAAGAAGGAG AAGTTGTGCA TCGCAAGCTA GAGGGGTTTG CTAGTCAATA CTATACATTG CTGAAATCGC AATTAGAGCA GCAACGTATT TTTTATGAAG GTCGATTGGA AGAGATTCGA CGCGATTACG ACGTGGCGAA GCCTCTTAAA AAGTCGACCG ACCTGATTGC TGCTCTAAAA CAAGAGCGCA ATCAACTTTC GCAGCGACTA GTTACGCTAG AGACGCGTCG ACGAAAAGTG CTGGAAGACG TTTCCTTTCT CGTCAGTATG AATGAGAGCC TGGTAGCCAA CAAGGAACCA CTCAGGCGAC AGATCGAAGA AGCTCAACAA CAAAGCTTAA ATGCTCGTCG TACCTTCGAA GAACTTTTAC AGCCATTGCA GGATAAGGTC ACGGCTCTGA TGTTACAGCT GGAGGATGAG GAAAGCGATA AAAAGCCAGC AGCTCTATGA
|
Protein sequence | MSSTRSKRAS ARIRKRPISF GNPALGTYHG DVYTVEGDHD NIGTQLLAEG VVTANNLVAM VDVPPEQVPE GVLNLARSHR PLIDHMRVVI AEGNGQVEAV HRSTETDVVS EEARGKPLAS YTFAGVNAAS ILASDVEVSE AAIDPKAHME QLLEDDGAEH HPCRTYLVLL EACSPEAAKD LVQDLRGMPY TFLDETQTCS VFHVVALEGA DGVSLMSPFF APSTKATDNG HLEISSSHSS DSRMEARNRC GSVDCKSGES EHPSGQHQRS EDYNCAVCLE HMDMTYPRSG ERTSILTTVC NHSFHMDCLL QWQDSPCPVC RFDHSGLNEA LSQCHLCGST AHNYVCLICG IVSCSGGPRS SSAAADTETQ HVWDFAGQGY VHRLLQNKED GKLVEVHDPY NTTSQERSLS PGLSESQEGE VVHRKLEGFA SQYYTLLKSQ LEQQRIFYEG RLEEIRRDYD VAKPLKKSTD LIAALKQERN QLSQRLVTLE TRRRKVLEDV SFLVSMNESL VANKEPLRRQ IEEAQQQSLN ARRTFEELLQ PLQDKVTALM LQLEDEESDK KPAAL
|
| |