Gene Information |
Locus tag | PHATRDRAFT_50019 |
Symbol | |
ID | 7198717 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011694 |
Strand | - |
Start bp | 154552 |
End bp | 158225 |
Gene Length | 3674 bp |
Protein Length | 722 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184903 |
Protein GI | 219129453 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.352902 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GATATTTTGC ATTCTCGGTT ACATAATTTC TTTGCCGTAG TTGACTGTGA ATACCACTGC TCCTACTGCT TCTTCTTCTG TAGCGGACCC ACTTTCAAAG TGTCCAACAG TCCTCACATT TCACGTCGCT GTCGATAACG CTCTAGCCTC TCTGTCCGAT CGGTACATTT CCTTCCTCTT GCATCAGTAC GATGAAGTAC GGTGAACACC TAAAAGCGAA CATTGCTCCG GAATACGGTG AGGAAAACTA CCTGCATTAT GAACGACTCG ATCAGATCAT TACGCAGCTG ACGGAGACTA AGCCTTCGCG GTAAGTTACC ATTCAAAGGC TGGAACGGTA GGAGGACACC GTGCGAGCTG CCAGCGAACG ACTTTCGGAA AGTAGGGTCC GGAAGGAGGC GTGCTGATAT CAACCCGATA GCAAGTCTCA CCCACACACT TGTCTCAAAT ACATCTCTTC ATCGCTGCCT TTGACACATT TACCAGTGCC GTGGAAACCT CCCGCGTGGT ATCAATGACT GCTCCACCGC AAACGAACGC ACAAGGGTTG GATACATCCA AGATAAACAT TACGGAAGAA GATTTTCTCA AGCTCCTCGA ATCGGACATG CAGAAGGTGG AAGTATTTAC GTTGTCGCAA GTGACAGATC TTCGCCATAA GATTCTCGGA ACCGAGGCTT TGTTGAAGCA GACGGACGGG AAGGGTGATC CGACCTGGGA TCCAATCGTG CTGGAAGAGA ACGCGGACGA AATCGCGGAA GACTTTTTGC GGTTGGAAAA ATATGTCAAC ATCAATTTCA TGGGCTTTCA CAAGATTCTC AAGAAACACG ATAAGCGCTT GCCCAATCTT GCCTGCAAGG AGTTCTACGT CAACCGTTTG CACGCCCAGG CCTGGGTCCG GGGTGATTAC AGTGATGTTG TCGTGCGTCT TTCCAATATT TACGCCGCCC TGCGCAACGA CAATCAAGCC GAGGAAAACC AAGACGCTTC GCAAAGCTTT TTGCGATCCA CCACAAAATA CTGGGTCAAA ACGGAAGACG TTTCGAGAGT CAAGTACGCA ATTCTGAGAC ATTTGCCCGT ATTCTTACAA AAAACTAGTA CGGGTGAGTC GGATTCGCAA TTTACCAACT CGGTCTATTT GGACAACGAC CAACTCGAAC TGTATCACGG TCGTCTGGAC AAGACTCCAG GTGCGATTGC CTTGCGTTTG CGTTGGTACG GACCGGGTGA CCCTAAGTTG GTCTTTTTCG AACGAAAGAC TCACAATGAA ACCTGGACCG GAGACGTTTC GGTGAAGGAG CGCTTCACGG TAAGTTTTTC TTTGGCTGGT AGGATTACCG GAGACACTTG GCACGAGTAC TATTGTACTC ATTCTATCTA TTTGTTATAG GTCGACGAGT CCGAGGTCCA ACAGGTGTTA ACCAACACTT ACCCAATTGC GGAAAAAAAG CAAGCTATGC TTGAATCGGG CACGTCCCAA AAGGAAGCCG ACGAGTGGGA AACCCTAGTC CGTGAAATCA CCCAGGTCAT TGTTTCCAAA CAACTTGTGC CTACCATGCG TACACAGTGC ATGCGGACCG CCTTTCAGAT TCCTTTCGAT GCAACAGTTC GTGTCAGTTT GGATACCAAC TTATGTATGA TATCGGAGCG TGGCTACGAG TTGGAGGATA TGAAGGTCTG GCATCGTGAT TCTTCATGGG TATTGGAACC AAACGAAATT CATCGTTTCC CGCACGCTGT TCTGGAAATC AAACTCGAAC TCGGTGGTGG TAGTCTCACG CCGCCAAAGT GGGTAACGGA 
CCTGCAAAAT TCGGGATTTT TGTACGAGTG CCATAAGTAC TCCAAGTTCG TTCACGGCTG CGCTGTCCTT CTTCCGGAAG ACGTGCGTGC CGTCCCGTAC TGGATCGACG ATGTATCGCT TCGCCAATCC ATTGTGGATA GTCAAGCTCA ACGCATTTTG GCAGACGCCA CTGTTGGTGT CGGGCCTGGT GCCAACCAAG TGTACAATCA TTTGCTGCCG TTTGGAACAA CTGTCAACGA TCGATCACTA ACCGCTGTGG GACGCACGAA CGCCGACCGC AACGCCTCCA AAGGGTTGGT GGGTGATAAG GCACCCCTAT TAGGTGCCAT TACCACGGGA GCAAATTTCT ATGCTGACGA GGATGGCGAC GACTATGGAG AAGACGAGGA GGACATTTCG TGCTGGCCCT TTCCCTTTTG CTCTCGGCAG AATTACGTTC TGGATGGCCC ATTGGCGCCC ACTTCAATTC AAAAGATTGA ACCCAAAGTC TTCTTTGCCA ACGAACGTAC ATTCCTACAT TGGCTGCATC ACGGTGTTAT TTTGTCAACC ATCGCTTCGG GAATTCTATC CTTTTCGCAT GAAACTGGTG CCGGCTGGGG CCAATGGTAC GCCTTGGCTC TGCTGCCAAT TTCGCTCAGT TTTTGCGTGT ACGCCGTCCA CATCTTTCTG TGGCGAGCGG ATCGTATCAA AACACGCATT CCAGGACGTT GGGATGATCC CTGGGGTCCG CTCATGCTGG GATCAACGGT GGCTATAGTT CTTTTCTGCA ACTTCATTGT GAAACTGGTG GCCATTTCGA AAATGGACCT AATGTAATTG GGGAGACGTA GAAGAGGAGA CAACTCCGGA ATGATGTACG ACCATAAGCA ATTAAAGTTA GTATGTGCGA CTAAAATTGA GAGGCAATTT CATAGATAAA TGTTGTTACA CAATATGTTG GTTTGCCGAG GTAGAAAATA GCACCGAGGG CTTTGAATCG AAGGAATAGA GTGGCGTAGT AGAGAATTTC TTCCAGTCGG ACAAGCCCGG ATCCTCCAAA CTCTCTTCCT CGCCACGAAA ATAAATTGCC TGTTTAATAC GGCTATGCCT TCCGATACAA GGTGGAGAGT GTCAAAATAT AGTGCTGTGT ATTGAGAAAA TTCGCGGATC CCTTCATGCA GCTTGATGAG GCACGTGGAC AATGTGTTCG CTTCCTTGCA TAATCAGGTT TTCCACGTCA TCGACGTAGA TGAAGCGCTC ATTCCCTTCT TCCTGCCCTA ATTCCAGCAT CAAATTGGTA ATGTCGTCTG TGGTCAAACA ATCGTCCGCT CCTATGTTGT GAAAAAATCG GCAGAGTTCG TTTCGCGAGA TTTTGCCGTC GCAGTTGGTG TCCGCGGCTA CGAGAATTTC CTTAACGAAT CGGCTGAACA AGGTCTGTCC AAAACTATTG CAAAAAAGGA TGAATGGCAG AAAAGGGCAG GAGAGTGAGT TCGCTGCTCA ATCGTCTGAT ACATGTTTTG TTTTTTGGAT ATTCGGAGTG CTTGCTTACT TTTTCTGTCG GTACTCATTG AGAATGGACC GCATCTTGAG GCGTGCTGGA CTGAGTTCAT TATCTTTGGG CAAGTTCCGT TCGTAAAACA CGGTCCGCGC GGTCGTTTCG TGTGGGAAAG AAAAGTCTTG TACGGTAGAA AGGGTGCGCG TCGAAGACGC GCCCCCCACC ATCAGGGAAG AACGTACGGT GAGACTCGTA ACGGCACACC GGAGTCCAGA AGTGGACAAG AGGAGTCGAG TAGAAAACAT TCGGAAAGAA AGGAAGCAAA AGAAAATGAT 
ACAATGATCG CTGTGTACGG TTTTTTTCAC AGTAGGTGGA GTAGCAACTT TAGGGAGGAG CGAG
|
Protein sequence | MKYGEHLKAN IAPEYGEENY LHYERLDQII TQLTETKPSR AVETSRVVSM TAPPQTNAQG LDTSKINITE EDFLKLLESD MQKVEVFTLS QVTDLRHKIL GTEALLKQTD GKGDPTWDPI VLEENADEIA EDFLRLEKYV NINFMGFHKI LKKHDKRLPN LACKEFYVNR LHAQAWVRGD YSDVVVRLSN IYAALRNDNQ AEENQDASQS FLRSTTKYWV KTEDVSRVKY AILRHLPVFL QKTSTGESDS QFTNSVYLDN DQLELYHGRL DKTPGAIALR LRWYGPGDPK LVFFERKTHN ETWTGDVSVK ERFTVDESEV QQVLTNTYPI AEKKQAMLES GTSQKEADEW ETLVREITQV IVSKQLVPTM RTQCMRTAFQ IPFDATVRVS LDTNLCMISE RGYELEDMKV WHRDSSWVLE PNEIHRFPHA VLEIKLELGG GSLTPPKWVT DLQNSGFLYE CHKYSKFVHG CAVLLPEDVR AVPYWIDDVS LRQSIVDSQA QRILADATVG VGPGANQVYN HLLPFGTTVN DRSLTAVGRT NADRNASKGL VGDKAPLLGA ITTGANFYAD EDGDDYGEDE EDISCWPFPF CSRQNYVLDG PLAPTSIQKI EPKVFFANER TFLHWLHHGV ILSTIASGIL SFSHETGAGW GQWYALALLP ISLSFCVYAV HIFLWRADRI KTRIPGRWDD PWGPLMLGST VAIVLFCNFI VKLVAISKMD LM
|
| |
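The coordinate and length fields in this record can be cross-checked with a little arithmetic. The sketch below is illustrative only. It assumes the Start bp and End bp values are 1-based and inclusive, which is consistent with the listed Gene Length of 3674 bp but is not stated in the record. The function names (`gene_length`, `clean`, `gc_fraction`, `min_coding_nt`) are hypothetical helpers, not part of any database API. The GC calculation is shown on the first 20 bases of the gene sequence only, not on the full sequence.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def clean(seq: str) -> str:
    """Drop the whitespace used for 10-base display grouping and uppercase."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return (s.count("G") + s.count("C")) / len(s)


def min_coding_nt(protein_aa: int) -> int:
    """Nucleotides needed to encode the protein plus one stop codon."""
    return (protein_aa + 1) * 3


if __name__ == "__main__":
    # Values copied from the record above.
    print(gene_length(154552, 158225))   # 3674, matches "Gene Length"
    print(min_coding_nt(722))            # 2169, fewer than 3674

    # First 20 bases of the gene sequence, as displayed in the record.
    first_20 = "GATATTTTGC ATTCTCGGTT"
    print(len(clean(first_20)), gc_fraction(first_20))
```

The 722 aa protein needs 2169 coding nucleotides (including a stop codon), fewer than the 3674 bp gene span. That gap is consistent with the record marking the gene as spliced, though the span may also include untranslated sequence.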