Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_50140 |
Symbol | |
ID | 7198843 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011695 |
Strand | - |
Start bp | 169424 |
End bp | 171765 |
Gene Length | 2342 bp |
Protein Length | 652 aa |
Translation table | |
GC content | 47% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002185062 |
Protein GI | 219129787 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.155969 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GAAACTTTGT AAATTGTATC TGGTAGCATA GTTCTGGCAT TGAATACTTA CTGTTAGCGA TACAGACTGG TTGACAAAAT GCCGGTGGCA AATGTGATTG ATGATTTCGA AACGGCCTCT TGGCCAGATG CCTTGCCGCA TCCGGATGCG GAGTTTGTGT TTTCCATTGT GTGTGGAAAC TCGCACTATC GATGGTGTGT TCTTTCTAAG GATCCAACGG ATGGAACGCT GTCACCGACT TTATTCTGGA AGTAAGATTT CACCCTACGA GAGTTGAGAT TCCTTTTTTG ATAAAAGTGA TTCTCATGAT CCGTTGTGCC TTTTTTGTGT TGTGTTTTGC GCAGAACTGC CCCTGTACCA GCAAACGATA TGACCGAAGA TTCTTCTCGC ATTCTGTTGA GATACCTTCC AGATCAGGCC AAAGACTACA TTTTTGGCGT TGACTCTTAC ACGCACACTC GTGATCGGGC TTTGGAGTTT TGCAACTCCC GACGTATGCC TATTGTCCAC GTCTACGTAA TTTCCACCAA CGCAGCGCAC GAAAAGGGGA TTGCCTTTTT ATTTCGGGAC ATTCCATCCC GTGTTTTGCG TTTGGGCAAC ACGGACTTTT ACACACGCCA GCAAGGCTGC TACGACACAT TGGGGGTAGA CCGGGCAGCT GCAGCCCGGG CTGCGGCTGG TTTGTACGGC TACCCCTGTT TGGTCATTGA CGGTGGCACC GCGCTGACGT ATACGGCGGT CGACGTGGAT GGCCAACTGC AGGGTGGAGG GATTTGCCAA GGCCTTAATC TTCGCCTAAA GTCCTTTGCA CCCTATACAG ATACACTCCC AGCAATTAAA CTGGAATCAG CTTTGGCGAT TCTGGAAAAA AGGCACAATG CGAACGAGCC GTTCGGGTTG TGTGCCCGAC GAATCGACGA CGCTATCCTC GGAACAATCA TGAGAGAAAT GGCGTGCTTA TTGCGTAATA TAGTGGACGA ATGGTCCGCG CAAGCCATGG AAACATTCAG CACTATACCC TCGGAGCAAG GAAACGAAGG CAGTCGAAAG TACAACAAGA ATCTTGTTGT TTGTGTTACA GGTGGCGATT GTCAGGTTAT TGAGACGCTA TTGCAACCAA ACTTTGGCAA CATTATTGCC ATTGGCGCAT CTACAAGGTA CAACTCTTTA AAGACAAAGG AGTCGTCCCC ACTTACCAAG CTGAATGTGA ATAAGCAACT GCTGCACCAA GCTCTACCGG CGCTCATTCA GGAAAAAGCT AAAACCGGCC AAGCGAAGTT TGAAGACGTT CGAAGAGCAC TGATTGGACA ACGTGTTGCG GTGAAGTTTG CAAAAGACGG AAAATTTTAT CGGGGAACCA TTGCTTCTTG CCAACGGGAT GCTGACTTTA CCCGAGATGT ATACACGGTC TTCTACGATG ACGAACGCGA AGATATGGAC ATTGAGCAAG TGCATGGTAA GTCTTCTCTC GGGGAATGCT GTCTGTCCGT GCAAGAAAAA ACTCAACTAC TGCTTCTATG GTAGCCGCCC TGCTATTATA CGTCAAGAAG GGAGAAGAGC TGGACTCATT CGATGACAGT ATTCGTGAAT CTCAAGAAGA GAAACGTCGC GGAGCAGATA AGGCTGCTGA ACTGCTAGGC CAAGTGAAGT CGACACTGCG ACTGGAGCCG ACATCTCCAA AGCAAGGTAT CACTGCCTCC GCCAAAACCT CGTCAACGCT CAACGCTGCA GTGGCAGTTG AGAGCACTGT GCATGGTAAG GGGGACCGAC TCAGGATTCT GAACGTTTGG AAAAAAAAGG TCCGTCATTC TGACCCCAAA AACACTTTAT TTTCAGACAC TTCAATTGAA GCGACAGAGG TAGAAGCAGC GGAGGTAGAA ATTGTAGAAG TGGAAAGTGA TGGGAGAAAG CGAAAACGAA GTATACAACC GCCTTCAGCA GAAAGAACGA TTGTGGAGGT GACGGGGAAA GATGGAAAAG ACTTTATTGG CTGTCGAGTT GCTAAATTCT TTGATGTTGA CTTATATTTT GGAACCGTCT CCAGATTTAT GCCCTCAGAA TATGTGGAAG AAAAGGTTGA CGTTTGGGCT ATTGAATACG ACGATGGGGA TAAGGAAGAC TTTGATGCAT CAGAATTGCA GGAGCACTTG GCCCTGTACG ACGTGCAACA GGGAAAAGAC CCGAATCAAA GTTTGTAAAT CGGTAACAGA AATCAAGGAC CTGTTTGCTG TCGACCGGTG GCCGACGAGT CTTTTTACAT TTCGTCTTCG ATACGATAAT CTGACAATCG TAGCAATGCC TCTTTACCTT GGCAGAGAAA TAAAGCAAAG GTATCCTGTT TC
|
Protein sequence | MPVANVIDDF ETASWPDALP HPDAEFVFSI VCGNSHYRWC VLSKDPTDGT LSPTLFWKTA PVPANDMTED SSRILLRYLP DQAKDYIFGV DSYTHTRDRA LEFCNSRRMP IVHVYVISTN AAHEKGIAFL FRDIPSRVLR LGNTDFYTRQ QGCYDTLGVD RAAAARAAAG LYGYPCLVID GGTALTYTAV DVDGQLQGGG ICQGLNLRLK SFAPYTDTLP AIKLESALAI LEKRHNANEP FGLCARRIDD AILGTIMREM ACLLRNIVDE WSAQAMETFS TIPSEQGNEG SRKYNKNLVV CVTGGDCQVI ETLLQPNFGN IIAIGASTRY NSLKTKESSP LTKLNVNKQL LHQALPALIQ EKAKTGQAKF EDVRRALIGQ RVAVKFAKDG KFYRGTIASC QRDADFTRDV YTVFYDDERE DMDIEQVHAA LLLYVKKGEE LDSFDDSIRE SQEEKRRGAD KAAELLGQVK STLRLEPTSP KQGITASAKT SSTLNAAVAV ESTVHGKGDR LRILNVWKKK VRHSDPKNTL FSDTSIEATE VEAAEVEIVE VESDGRKRKR SIQPPSAERT IVEVTGKDGK DFIGCRVAKF FDVDLYFGTV SRFMPSEYVE EKVDVWAIEY DDGDKEDFDA SELQEHLALY DVQQGKDPNQ SL
|
| |