Gene Information |
Locus tag | PHATRDRAFT_45431 |
Symbol | |
ID | 7200676 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011675 |
Strand | + |
Start bp | 131608 |
End bp | 135232 |
Gene Length | 3625 bp |
Protein Length | 1158 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002179586 |
Protein GI | 219117587 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.693925 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCTTTTC CAACCGCATC CCCCAGCTAT GATTCGGACG ATCCGCCATC TCATACACAG TTCTTGGATT TGACCAACGG CAAAGGGGGT GATCTGCTCC CGCATCCATA TCTTCCCCCT CAGCCGGAGT CCGATTACGC CCACGCAACA GAGACTCCTG TGGACGCGCA GGATTCCGAC GATCAAGACG ACAGCAACAT GGGCGACGAC GAAGACGACG GGGGTATGGA GGGATCCCCC ACAATCCGCG GTACGGTACC ACCCGGTAGC GGTGCAATCG GTATTAATCC AAACACTCCC GGTAAGGTCG TTGACGTTGG GCAGGAACAC ACGGGCCGCT GGACCAAGGC CGAACACGAA GCTTTCTTGT CGGCGCTGCA AACGTACGGT AAGGAATGGA AAAAGGTGGC TGCCAAGGTC AAAACTCGGA CGGTTGTCCA AACTCGTACG CATGCACAAA AGTACTTCCA GAAACTCCAA AAGACAATCG AAAGCACAGG GAAAGACGAC GTTACTCAGG TTCACATGGG CATTGATAGC GGTGTCCTTG ACAAACAAGG TAGCGGGAGT GCTGCCGGAA GTTCACACCA GAAGAAGCAA CGATGTCCCG CTCCGGTCTC TCTTCAAAAG CCTGAACGTC GGTCCAGTAG TGCCACTATT TCCGCCGCAC AGGTCATATC CAACCTTTCG TCCCATACGA GCACACAACC TTCTTCTATG GGCCCATCTG TGGCTGCAAG AGGTAGCGCT CCGCTAAAGT CCAAGTCAGC CGATCCGCAG TATAGTGCAA TGCGTCCACA TGGCTTTTCA ACGGAGGTTT CTTCATCGTC TTTCCCATCC TCTTTTTCTT CCTGGATGGG AAACAATTCC ATGAAAATTA CAGCTCCAAA TCCGGAAGAC ACCAAAAACA GCTTTCCGGA ACCGTCTCCT GCAGCTACTG GCAAACGAAA ACTAGCGGAA ATTGCCGCTG CTAGAATGTT GGCCGGTGTA GGGCAACAGC AACAAAGGCA GCTTCAACCT CTGGTGGATC GCAACGATGA AGCTCCGACA CCGCCTCTAC CCGACACAGA AAATAGTAAA GGTTCAAGTC TAAATCTCCA TGAAGCCCCA CCGCCACCAT TACTATTTGG GGATGGGTTC AATATGTCAA GTTTGACGTC GAAAAAGGGA GTGGCTCTAC AAATTGTGAA TCCAGAAAGC TTGGGCGTTT CGCACGATAA ACCTCGTCGT GGAGGTGGAG ATTCGCCTGT CACACCCTGG GATGGGCAGC TCGAAGCCTT AGTCTACGAA AAGGCCAAAG TTGAATCTAA GGAGGAAGAG ACAGGAGGAT CAAAACCTGC TGCATTGCAT CCGGTATGTG GCCCGAGTAC AGCATATGGT CGAACGCCAC TACATCAAGC TGTCTGCGAA ATGGATTTGG ATGGCGTAAG GTGCCAATTG CAGGATATGC CCAGCCAAAA TGTCAGTGTG CTGCATGGCC TCGATGAAGC AGGCTATTCT CCGTTGCACA GTGCCTGTGC TTTACGATTG AGCTACGGCC AAAGCGCTAT CGTGGCTCCC CAACTTGTCA GACTCCTCTT GTCTGCCGGT ATCTGCGATC CCTCTCGACC CGACATAAAG GGAAATACAC CACTACACTG GGCCGCACGT TCCGGGGATC GAGATGTTGT GGAAATTTTG CTTCTGAAAA ATTCTCTACT GGATGCCAGA AACCAGGCGG GCGAGGCTCC CCTTCACTGG GCGATGCGGG CAGGTGAACG AGGGACTACG GTCGCTTTAT TACTTTTGGA AAACGGTGCT 
CGACCTAGCT CACTGAGCAA AGAGTACCGT CGACCCTTGG ATGTAGCAGC GGATGGATTC TTAGACGAAG AAAGGTCGTT GGCTGTTCTG CGGGTCGCGG AACAATCATA TCGAGGGATA AAGCCAAGCA AAGCATTGAA AAAACGGCTA AAAGAAACCG CAAGCGAACG GAGAGATGCG CGAGCTGCTT TGCTAATTCG GTCCGCACAG TCTAGAACGC TCGTATTGCA TCACCCCGAA TGTTTGGAAC ATCACCCGAA ATCAGCTACG GATTGGGAAG CGCCAGATCG AATAAGGAGT ATTATGCGCC GAGTACTGCC TGCAAGTGAC CCTACCGGTG CGACCGAGAC ATCGGGCATT TTCCCTCACG AGGTAACGGT GTCCAAAGAA TTTGAAAGGG CAAAGCTTGA TCTCCTCAGT CGAGTGCATA GTACAGATTA TCTATCATTT GTCAATGCAT TGAGCAAAGA CCTCGAAAAG CAATTGCGAG AATCAGGGGG GAGCTTCAGC GCAATGGACG AGTCTGACAA TGGTTTTGGA TCACCACCGC CGGTAGTTCC GTTCACACCG CTCGTTCAGA GATCGCTCAT TAAAGTAGAT GAGTCTAGAA TCAAGCTGGG TGTAAACTCC GATACATCGT TCAGTGCAGG GTCACTCCGT GCTGCACGGC GCGCAGCTGG GGCAGTGCAG CATGCAGTGG ATTGGTAAGT CATGACTCCT GAATTTTCAA AGCGTGCTTG CAAATCACTT TCTAATTTTA AATCTTCCTT GTCTAGCGTT TTGGTTGGGA GAAATCGCAA TGCTTTTTGT GTAGTTCGGC CACCCGGTCA TCATGCCGGC ATAAATGGTT TGCTGGATGG GGGTGAATCT TGTGGATTCT GTATTTTCAA CAACGTTGCC GCAGGCGCTC TTTATGCGAT TTCAGAAGAT AGGCTCCTGT GTGGCCGGTG CGCAATTGTT GACATTGATG TCCACCATGG AAATGGAACT GAAGACATCG TTCGAAAATG CCACGACCCT AGCAAACTTT TGTTCTTCTC AATACATCTC TACGACAACG ATAGGAAAAA GAGGGGTTCA AATCAGTTTT CCTATAAGTT CTACCCTGGA ACCGGTTCTG AGGATGACCT TGCATTGAAT ATCATCAACG TGCCCATTGT ACCTTTGTGG AAAGAACACT CCGCTACTGT GCAACCTTCG ATAAAGACCC ACAACACAAG ACGGAAAACT CGAACATCTC AGGAAGGTCC AGACGAAGAA AGTGATACCA CGCCAAAAGA TAGTTCACGT ACAAGCGATG TTGGCAGCGA AGAAGGCTCT ACCGCTGCGT CTAATTCATC TCCCAGACCT GGAGGACTGT CATCCGGAAG AACTGCGTAT CGAAATGCAA TCCAAAATCG CTTACTACCT GCGCTTCGGG CTTTCAACCC TGATCTCATC CTCATAAGCG CCGGTTTTGA TGCAGCAAAA GGAGATGTGG GAAATGCTCG ACACGAGCGA GGCGGAGAGA AAGTTGGGCT CGACTTAGAA CCCGAAGACT ATGCATGGAC AACAAGAAAG ATTCTGGAGA TTGCCGATAT TTGTTGCCAG GGCCGCGTTG TTTCGGTACT TGAAGGGGGA TATGGAAGAA CGCCAGCTGC CTTGCCCACA GGCTCGTCCG CCCTGGATCG CACCTTGTTT GCCGAGTGCG CCATCCGGCA TTTACACGCC ATGGTTGATC CGTACGACAC CGAGCAGCGA TTTAGCTGAA TTTTGCAGTT AGCTTGAGAT GGTCCGAAAT TTGATTTGAA AAATTATGAA TGTATAGCAA 
TAGTGGAATT AGAAA
|
Protein sequence | MSFPTASPSY DSDDPPSHTQ FLDLTNGKGG DLLPHPYLPP QPESDYAHAT ETPVDAQDSD DQDDSNMGDD EDDGGMEGSP TIRGTVPPGS GAIGINPNTP GKVVDVGQEH TGRWTKAEHE AFLSALQTYG KEWKKVAAKV KTRTVVQTRT HAQKYFQKLQ KTIESTGKDD VTQVHMGIDS GVLDKQGSGS AAGSSHQKKQ RCPAPVSLQK PERRSSSATI SAAQVISNLS SHTSTQPSSM GPSVAARGSA PLKSKSADPQ YSAMRPHGFS TEVSSSSFPS SFSSWMGNNS MKITAPNPED TKNSFPEPSP AATGKRKLAE IAAARMLAGV GQQQQRQLQP LVDRNDEAPT PPLPDTENSK GSSLNLHEAP PPPLLFGDGF NMSSLTSKKG VALQIVNPES LGVSHDKPRR GGGDSPVTPW DGQLEALVYE KAKVESKEEE TGGSKPAALH PVCGPSTAYG RTPLHQAVCE MDLDGVRCQL QDMPSQNVSV LHGLDEAGYS PLHSACALRL SYGQSAIVAP QLVRLLLSAG ICDPSRPDIK GNTPLHWAAR SGDRDVVEIL LLKNSLLDAR NQAGEAPLHW AMRAGERGTT VALLLLENGA RPSSLSKEYR RPLDVAADGF LDEERSLAVL RVAEQSYRGI KPSKALKKRL KETASERRDA RAALLIRSAQ SRTLVLHHPE CLEHHPKSAT DWEAPDRIRS IMRRVLPASD PTGATETSGI FPHEVTVSKE FERAKLDLLS RVHSTDYLSF VNALSKDLEK QLRESGGSFS AMDESDNGFG SPPPVVPFTP LVQRSLIKVD ESRIKLGVNS DTSFSAGSLR AARRAAGAVQ HAVDCVLVGR NRNAFCVVRP PGHHAGINGL LDGGESCGFC IFNNVAAGAL YAISEDRLLC GRCAIVDIDV HHGNGTEDIV RKCHDPSKLL FFSIHLYDND RKKRGSNQFS YKFYPGTGSE DDLALNIINV PIVPLWKEHS ATVQPSIKTH NTRRKTRTSQ EGPDEESDTT PKDSSRTSDV GSEEGSTAAS NSSPRPGGLS SGRTAYRNAI QNRLLPALRA FNPDLILISA GFDAAKGDVG NARHERGGEK VGLDLEPEDY AWTTRKILEI ADICCQGRVV SVLEGGYGRT PAALPTGSSA LDRTLFAECA IRHLHAMVDP YDTEQRFS
|
| |
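The Start bp, End bp, and Gene Length fields in this record are consistent with 1-based, inclusive coordinates (135232 - 131608 + 1 = 3625). The sketch below shows that arithmetic, plus one way a GC percentage could be computed from a sequence string that uses space-grouped blocks like the ones above. The function names `gene_length` and `gc_percent` are hypothetical helpers, not part of any database API. The GC routine is only an illustration of the usual definition and is not confirmed to be how the 51% figure in this record was derived.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases, ignoring whitespace used for display grouping."""
    # Remove the spaces and line breaks that split the sequence into blocks.
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("sequence is empty")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


if __name__ == "__main__":
    # Coordinates taken from the Gene Information table.
    print(gene_length(131608, 135232))  # 3625
    # First two 10-base blocks of the gene sequence, as a small input sample.
    print(gc_percent("ATGTCTTTTC CAACCGCATC"))
```

Passing the full Gene sequence field to `gc_percent` works the same way, since all whitespace is stripped before counting.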