Gene Information |
Locus tag | PHATRDRAFT_37218 |
Symbol | |
ID | 7202017 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011680 |
Strand | - |
Start bp | 414596 |
End bp | 417528 |
Gene Length | 2933 bp |
Protein Length | 671 aa |
Translation table | |
GC content | 58% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181375 |
Protein GI | 219122066 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.00000635734 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCCCCGT CCGCGACACT TAAAACTAGT AATAACGATC TAATCAATCA TATTACTAGT CTTAATTTGT CTGTAGTCCC CTCCCCGCCT AGTACCATGA CCTCGACCAT TGCCGACACC TTTTGCACAG GACACTACAT CACCGTGTCC TGCCCCCACT TCAACCGACA ACCTGCCGCC TCTCCACTCT CCGTCCGTGT TCCCAACGGC GCTACCCTCC GTTCCAGCCA CACGGCAACT CTCGACCTCC CTGGTTTTTC CCCTGCTGCT TGCCAAGCTT ACATCTTTCC TGGCCTCGCT TCCCACCCCC TCATTTCCAT TGGCCAACTC TGTGACGACG GCTGCACCGC CACATTCTCC GCCACCCGAC TCGACATCTA CCGAGACACC ACCCTGCTTC TTACCGGCGC TCGAGCACCC GCCACCGGCC TCTGGCACCT TGACCTCACC CCAGCCAAGA CTGCCCATGC CCTCATTCCC GACAGCTCCC TGGCAGACCG CATCGCTTTC GTACATGCCT CCCATTTCTC CCCTGCTCTC TCCACTTGGT GTACCGCTAT TGATGCCGGG CGCCTCCCCA CCTTCCCCGA CATCACCTCC AAACAAGGGC GCAAGTACCC TCCCCTCTCT ATGGCCACCA TCAAGGGCCA CTTAGACCAA CAACGCGCCA ATCTTCGGTC CACCAAACCT TCCTCCGTTC CTCCAGTGGC CTTGCCCAAC CCTCTCCATG AATCCCAGCT AGACTTCTGC CCGGCCCCGG CCACTCCTCC CGCTGGCCGA ACCCACCATG TCTTCGCCGC GCACCAACGA GTCACCGGCC AGATCTACAC AGATCAACCG GGCCGTTTCC TCACTCCCTC GAGTACAGGC CACACGGACA TGCTTGTGCT GTATGACTAC GATAGCAATG CCATCCATGT TGAACTCATG AAGAGCAAGT CCGGCGCCGA GATCCTAGCA GCCTACCAAC GTGCCCACTC ACTCTTCACT CAACGGGGCC TTCAGCCGCA ACTCCAGCGT CTAGACAACG AGGCGTCTCC CGCTCTCGAG TCCTTTATGA CGGCCAACCA GGTCGACTTC CAGTTGGCAC CACCCAATCT GCACCGTCGC AACGCCGCCG AACGCGCCAT ACATACCTTC AAGAATCACT TTATTGCCGG TCTCTGCAGT ACGACCCCGG ATTTTCCGCT TCACCTTTGG GACCGCCTCA TTCCCCACGC TCTGCTTAGT CTCAATCTCC TCCGTGGCTC TCGCATCAAC CCCACCCTCT CGGCCCACGC ACAGCTCCAT GGCGCGTTTG ACTACAACCG CACCCCGCTC GCCCCTCCCG GCACTCGCAT CCTCGTCCAC GAAAAGCCCG CCGTTCGGGA AACTTGGGCA CCCCATGCTG TTGAAGGCTG GTACCTCGGC CCCGCTCTGC ACCACTACCG CTGCCATCGC GTTTGGATCA CAGAGACGCG TGCCGAACGT GTTGCCAACA CTCTTGCGTG GTTTCCCAGT CGCATTCCTA TGCCCACTGC CTCCTCCACC GATCGCGCCC TGGCCACCGC CCGTGATTTA GTGCGCGCCC TCCAAAATCC CTCTCCCGCT TCGCCGTTTG CACCATTGGA CGCCACCCAA CACCAGGCCC TTCTACATCT TGCCGATCTC TTTGCTTCGG TCGCTGCTCC GGCCTCTCCG ACCGCTGCAC CGACTCCCGC GCCCCCGGTC CCAGCACCTC CCCCTGCTCA AGTCCGCTTT GCTGTTCACA TTGTCACGGC CGAGCATGCT CCTGCACTTC CGAGGGTGCC CATCCTTGCG 
CCGCCAGCTC CGAGGGTGCT CTCTCGGACC CGCAATCCCG GCCGCCGCCG TCGCAAAGCA CGCAAGCAAC CGCCAACCCC AACCTTAGTT CCGGCTCATC CACACAACAC CCGCACCCGA CCCTTTCTTG TCCCAGCCTC CGCCAACGCA GTTGTCGACC CCGCAACCGG CGCCTCTTTA GAGTACCGCC ACCTACGCAC CGGTCCCAAT GCTCCCGATT GGATTCAAGC CGCGGCTAAC GAAATTGGCC GTCTCACCAG CGGTAACCCG CCTCACAGCA CTCACGGTAG CCAAACTATG CACTTCATCG CGCATACCGC CATTCTTCCC GGACACAGGG CCACCTACTT ACGCATTGTT GCCAGCATTC GCCCGCAGAA AGCAGAACCC AAATGCATAC GTTTTACCGT CGGTGGCAAC TTAGTTCAGT ACCCCGGCAA GGTTAGCACC CCAACTGCGG ACATCACCAC AGCCAAGCTC CTCTTCAACA GTGTCCTCTC AACTCCTGCG GCAAAGTTCA TGTGCATTGA TATCAAAGAC TTCAATCTTG GCACCCCCAT GGCATGCTAC GAATACATGC ATATCCCGGT CCCAGATATT CCTCCCATCA TTTTGGCTCA GTACCAGTTG GCCCCTCTTA TCCACAACAA TTCAGTTACC GTCGAAATTC GCAAAGGTAT GTACAGCCTT CCCCAAGCCG GCATTCTCGC CCATGACCGC CTTGTTGAAC ACCTCGCTCG CCACGGCTAC GTCAAGACCG CGCATACTGC GGGCCTTTTT CGACACGTCA CACGCCCGAT TCAATTTACC CTAGTTGTCG ACGACTTTGG CGTAAAATAC ACCGGCACCA ACAACGCTCA GCACCTCATT GACACATTGC AAGCGCTCTA CACTATCACA ATTGATTGGG ATGGTACGCG TTACCTAGGT CTTACACTCG CTTGGAATTA TGAACATCGA ACCCTTGACA TGTCCATGCC CGACTACATT GATCAAGCCC TAACCCGCTT CCAACGTTCG CCTCCTACCA AGCCGCAACA TGCGCCTCAT CGCAAAATGC GCTCGCACTA CCTTTTCGAA CAGTCATCAA CCAATGACAT AGCAACTTCT CATTTGCAGC AAGGGTGTGT TGA
|
Protein sequence | MSPSATLKTS NNDLINHITS LNLSVVPSPP STMTSTIADT FCTGHYITVS CPHFNRQPAA SPLSVRVPNG ATLRSSHTAT LDLPGFSPAA CQAYIFPGLA SHPLISIGQL CDDGCTATFS ATRLDIYRDT TLLLTGARAP ATGLWHLDLT PAKTAHALIP DSSLADRIAF VHASHFSPAL STWCTAIDAG RLPTFPDITS KQGRKYPPLS MATIKGHLDQ QRANLRSTKP SSVPPVALPN PLHESQLDFC PAPATPPAGR THHVFAAHQR VTGQIYTDQP GRFLTPSSTG HTDMLVLYDY DSNAIHVELM KSKSGAEILA AYQRAHSLFT QRGLQPQLQR LDNEASPALE SFMTANQVDF QLAPPNLHRR NAAERAIHTF KNHFIAGLCS TTPDFPLHLW DRLIPHALLS LNLLRGSRIN PTLSAHAQLH GAFDYNRTPL APPGTRILVH EKPAVRETWA PHAVEGWYLG PALHHYRCHR VWITETRAER VANTLAWFPS RIPMPTASST DRALATARDL VRALQNPSPA SPFAPLDATQ HQALLHLADL FASVAAPASP TAAPTPAPPV PAPPPAQVRF AVHIVTAEHA PALPRVPILA PPAPRVLSRT RNPGRRRRKA RKQPPTPTLV PAHPHNTRTR PFLVPASANA VVDPATGASL EYRHLRTARV C