Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_45684 |
Symbol | |
ID | 7200462 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011675 |
Strand | + |
Start bp | 922652 |
End bp | 924638 |
Gene Length | 1987 bp |
Protein Length | 561 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002179746 |
Protein GI | 219117921 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | AATCTTCGAA GGGGGAGGAG GAGAACAACT CTACGTATAT CCTTGATTTG CTTCTATTGT TCTTTGTTTC CCGAATAGAA TAGTGTACAA CAATGAGAGA TGGGGCAACG TTAATAGCGT TTCTATTCGC GAGCGTCTTC TCGGCAAGTA CCATCTCGGC TTGGATTCCC CTGCGTATAT GCACAAGATC TCGTGTCCGC AGTCGCAACG TCCTTGCGGC GTCGTCGGAA TGGTCCGCCA CGGACGATTG GAACAACCTT TCGTCCGAAA ATCCGGACAA TGGACGACAG GATTACGCCG TCGATCAGGA TTTTGCACAG CGCGAGGCGA TTAGGATGCA GAATTGGGAT CTGGATAGCC TCGACCCAAC GGCACTATCC CCGGAAGACG CTTGGTTGCA GGACGCGATT GAAACCGTTT TATTGGACAG CACGATTACT CCAGAGGAGC GCTTGGATAC CCAAGACTTC TTGGAGGATA TGGGTAGAGA AATAGCTTTA CTCGTTCGTT GCAACCAAAG TCCGCAAGAA ATGCTTATCG CTGCAGGTAA GGCCTTACCC ATCTTGACGA CCGAAGACAA GCACAACCCA CGACAACTTG TGCGATTGGA ACCCTCCAAC CAGAACGAGG AGACGGAAGG GGAAGTCGTG GTTTGGGCCG CTACCGATTT CTTGAAAACC GCCACGCGTG TCATGTTCGA ACAACACGCT CACAACGCGA AGACCGGCGC GGGGGACACG AAGGCCATTT TGGATCCGCG TGGCGTCGCA TCGTGGATGA AAAAGAGCTT GCGGGAAGGC GCGATTGGAC CGCACGATCC TCGTGTAATG TTCATCATCT CCAAGTTTGG AACCTACGGG ACCGGAACCT TGCAGTACGA AGACTTTCTG AATCTCTACG TTTCCACCAT TTGTGGAAGC TCCCCTAGTC GTTGGAAACA GCTCGAGTAC CGTAGCGAGG AAATTGAAGC CGTCTGGCGA GATCTTCGCA ATCACGACAT TGTTTCTCCC GTGGAGCAGG AGCGGGTAGC CCTCTTGCAA AAAATGAAGG AGAAATACGA AGAGAGCTTT TCACACGTCA CGGACGAGAC ATTGCTGGAC GAGTGTGAGA TTATTGACGA TAAAGTCGCC TCATGGGAGG AGACTTCCCA GGGCCAATGG CGGCAGACTG GCAAAAGCAG CCACGAGCTA GTCGAATTGG CGTACGATGG CAAAACACCC CTGCGTCTCA AAGACGGTGA ATTTGTCTTT ATCGACGAGG ATTCGTGTAT TGGATGCAAG CAGTGCGCCT CGGCATCTCC AGCATCTTTT CATATGCTGG ACGACGGTCG GGCCCGTACG TTTGCACAGC GCAATAGCCT TGATGTCAAG GCTGCTGTTG CGGTGTGCCC TGTCAGCTGC ATGCACTATG TGGGTTTTGA CCGTCTCAAA GAGTTAGAGA CTTCGCGTGA TTCGCCAGAT GGCGATGGTC GGACGGATCA CCGCCATTTT GGACAAAACC ATCGAAACGG AGGCTATATT GCGAGAGCGC CGCTGCAGTA AGTTTTGTCA TCGAACCTTA AATCTATTTT GATTATTTTT AAAATCTTAC ACCATTTCTT TTTATGATTG GTAGTTTGAC GAGAAGAGAC AGCGACGCTA ACCACAAGAG CTCCTGGTAT CATTACTTGG TGAACAAGTG CTATTGTAAG TGTCGAGTCC GACGGAAAAT TGGACTTTCC CTTCAATACG TATTCTAACC CTTCTGTTGA TATTGTCTAC AACAGTATCA TCCGATTGTC CTCAGAGGGG CTGTTTTGAC TGTCCTCAAT TTCGCACCCA GCCAGGTAGC AACCCGAGTT GCCAATCGAA AATGAAAGAC GCACTGCACA TCAAGGCGGA ACATTTTATC CAAACCGGAG AAGCTAATCT TTACCGTAAA TCGGCCGACC TCTGAAAATG AGAAAGATAC TCAGCCGGAA ACATATAGAC GCTTTGTTTT GATTAGA
|
Protein sequence | MRDGATLIAF LFASVFSAST ISAWIPLRIC TRSRVRSRNV LAASSEWSAT DDWNNLSSEN PDNGRQDYAV DQDFAQREAI RMQNWDLDSL DPTALSPEDA WLQDAIETVL LDSTITPEER LDTQDFLEDM GREIALLVRC NQSPQEMLIA AGKALPILTT EDKHNPRQLV RLEPSNQNEE TEGEVVVWAA TDFLKTATRV MFEQHAHNAK TGAGDTKAIL DPRGVASWMK KSLREGAIGP HDPRVMFIIS KFGTYGTGTL QYEDFLNLYV STICGSSPSR WKQLEYRSEE IEAVWRDLRN HDIVSPVEQE RVALLQKMKE KYEESFSHVT DETLLDECEI IDDKVASWEE TSQGQWRQTG KSSHELVELA YDGKTPLRLK DGEFVFIDED SCIGCKQCAS ASPASFHMLD DGRARTFAQR NSLDVKAAVA VCPVSCMHYV GFDRLKELET SRDSPDGDGR TDHRHFGQNH RNGGYIARAP LHLTRRDSDA NHKSSWYHYL VNKCYLSSDC PQRGCFDCPQ FRTQPGSNPS CQSKMKDALH IKAEHFIQTG EANLYRKSAD L
|
| |