Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_41318 |
Symbol | |
ID | 7199193 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011697 |
Strand | + |
Start bp | 106734 |
End bp | 109722 |
Gene Length | 2989 bp |
Protein Length | 916 aa |
Translation table | |
GC content | 62% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002185282 |
Protein GI | 219130250 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.305609 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCCCCGA CCGCCGACTT CACCCTTTCC GACTTTCCTC ACAAAGTCCT CGATCCCATC GCCACCGACA CCACTGCTCC CTCGTACGCG TCGCTTCTCC TGGCCCAACG CCAACTCAGT GCCAACGCGT CCGCCATTCC CAGCCTTAAT GGCGGCGGGG CCCATGGTCA CATGGCCCTC ACGCTCACTG ACGAAGCTTA CGCGGAACTT TCCGACATCC CGTTCGTCAT CCCCGTTGCT CCCCCTGCCG ACCCTGAACC CGGCACCACG CAACCTCAAA TCACCGAAAA CAACCGCCTC CACAAACGCG CTGTAGCCAC CCACAGCCTC TACGTGGCGG TCAACAACGC TCTCCGTCGC CAGATCCTCG ACGCTGTTCC TCGCGTGTAC GTTCGCGACC TGGAGCACCC CCAGTTTGCC TACAGCCATG TTTCCTGTCG CGACCTTCTC GACCATCTCT GGCGCAACTT CGGTACCATC ACCGCTTCCG ACTTAAAAAG CAACATCCAA TCTATGTACA CCCCTTGGAA CCCGGTTGAC CCCATCGAAA CCATTTTTCA TCGCTTAAAT GATGCCATCG CGTACTCGAT AGCCGGCCGT GACCCCATCA CCGAGGCCGC CGCCGTTCGC GCCGGCTACG ACGTGCTCGA GCACTCGGGC CTGTTTCCAC GTGCCTGTGA AACCTGGCGC ACCGCCTCGC CCGATACCCA CACGCTTGCC AATCTGCGCG CCCTTTTCAA AGTCGCCGAT ACCGACCGAA AGCGCACCGT CACCACCGGC ACCCTCGGTT ACGCCAACGT CCTTACCACC GCGCCATCGG TTCTCCCTTC GCCCTTGCCC GACGCGCTCA GCCTTCCTTT CTCAGCCCTC TCGGTGTCCA ATTCCTCTGC CACCCTCTCT GAGAAAACTT ATTGCTGGAC CCATGGGTCC AGCAACAACC GTCGGCACAC TAGTGCCACG TGCAAAAACA AGGCCCCCGG ACACCGCGAC GACGCCACGG CCACCAACAC CCTTGGCGGA TCCACCAAGG TTTGGACTGC CCCCAAACCT CCCGAATAGG AAAGAGGGAC GGCTACGCCG ACGATTAACA CTAGTAATAC CGATTATTTA AATCATATTA CTAGTCTTAA CTCGTCTGTA GTCCCCTCCC CGCCTAGCAT ACACACCTCA GCCATCGCCG ACACCGGCTG CACAGGACAC TACATCACCG TGTCCTGTCC CCACTCCCAC CCACAACCTG CCTCTCATCC CCTTGCCGTC CGCGTTCCCA ACGGCGCTAT TCTCCGCTCG AGCCACACAG CCACTCTCGC TCTCCCTGGA TTTTCCCCCA CCGCTTGCCA GGCGCACATC TTTCCCGACC TAGCTTCCCA TCCCCTCCTC TCCATCGGCC AACTCTGCGA CGACGGCTGT ACGGCCACTT TCTCGGCCAC TCGCCTTGAC ATTCATCGCG ACGCTACCCT GCTGCTCTCT GGCGCCCGCT CCCCCCACAC CGGCCTCTGG CACCTCGATC TTACCCCTCC CCAGCCCCCT GCCACAGCCC ACGCTCTGGT TCCCAACACC CCACTTGCCG ACCGCATCGC TTTTGTTCAC GCCTCGCTCT TCTCCCCAGC TCTCTCTACC TGGTGCCAGG CCCTCGACTC CGGCCACCTT GCGACTTTTC CAGACGTTTC CTCCCGCCAA GTCCGCAAGT ACCCACCTAG CTCCCCCGCG ATGATCAAGG GTCACCTCGA CCAACAACGC GCGAACCTGC GCTCCACCAA GCTCTCCCCT GTCTGTTTCC CTCTCTCGAC GGAACCCCCT GCTGCCGCTG CGCCCGACCT CGACCCTCCT GACGCCCACC CCGTCGCCCG CACACACCAT GTCTTTGTTG CCCACCAAAG GGTTACCGGT CAAATCTACA CGGACCAGCC GGGTCGTTTC CTCACTCCTT CCAGTGCAGG CCACAACGAC ATGCTTGTTC TTTATGATTA CGACAGCAAT GCTATCCACG TCGAACTCAT GAAGAACAAG TCCGGCCCCG AGATTCTGGC TGCCTACCAA CGTGCTCACG CTCTTTTCAC CCAGCGCGGC CTACGTCCCC AACTTCAGCG CCTCGATAAC GAAGCCTCTA CCGCCCTCCA AGCCTTCATG ACCTTAGAGC ATGTCGACTT TCAGCTAGCA CCCCCCCATC TGCACCGTCG TAATGCCGCC GAACGGGCCA TACGCACCTT CAAGAATCAC TTTATTGCTG GCCTCTGCAC CACGAACCCG GATTTTCCCC TTCATCTTTG GGACCGCCTC CTCCCACAAG CCCTTATCAC CCTAAACCTT CTTCGGCGCT CCCGCATCAA TCCCAAGTTG TCCGCCCACG CACAACTTCA CGGGGCCTTC GACTACAACC GCACCCCGCT TGCTCCTCCT GGCACGCGCG TCTTAGTTCA TGTCAAGCCC GCTGTTCGCG AAACCTGGGC CCCCCATGCT GTTGAAGGTT GGTATCTCGG CCCCGCTCTC AACCATTATC GCTGCCATCG CGTCTGGATC ACGGAAACAC GTGCCGAACG TGTTGCGGAC ACCCTTTCCT GGTTCCCGAC CCGCATTCCC ATGCCCGCCG CTTCGTCCAC CGACCGCGCC CTGGCCGCCG CCCGTGACCT AGTACATGCC CTCCAGAATC CTTCCCCTGC GTCTCCGTTC GCCCCCCTCG ATGCCAACCA GCACCAGGCC CTTACCGACC TCGCCAATCT CTTTGCCACC GTGGCCGCCC CAGTCGACGA CGTCCCCGCA CCCGCTCCAG TGCCTCCGGT CCGTCCCCCT GCCCCAGCAA CTCCCCTTGC TCAGGTCCGT TTTGCCGTTC CTCTTGTCAC GGCCGAACAT GCCCCGGCAC TTCCGAGGGT GCCCATTCCG GCCCCAGCAC TTCCGAGGGT GCCCACCCCG GCCACCTATC ACTCTCGCAC CGGCAACCCC GGCCGTCGCC GCCGCACAGC ACGCAAACAA CCGGCAACCC CAACCCTAG
|
Protein sequence | MSPTADFTLS DFPHKVLDPI ATDTTAPSYA SLLLAQRQLS ANASAIPSLN GGGAHGHMAL TLTDEAYAEL SDIPFVIPVA PPADPEPGTT QPQITENNRL HKRAVATHSL YVAVNNALRR QILDAVPRVY VRDLEHPQFA YSHVSCRDLL DHLWRNFGTI TASDLKSNIQ SMYTPWNPVD PIETIFHRLN DAIAYSIAGR DPITEAAAVR AGYDVLEHSG LFPRACETWR TASPDTHTLA NLRALFKVAD TDRKRTVTTG TLGYANVLTT APSVLPSPLP DALSLPFSAL SVSNSSATLS EKTYCWTHGS SNNRRHTSAT LNSSVVPSPP SIHTSAIADT GCTGHYITVS CPHSHPQPAS HPLAVRVPNG AILRSSHTAT LALPGFSPTA CQAHIFPDLA SHPLLSIGQL CDDGCTATFS ATRLDIHRDA TLLLSGARSP HTGLWHLDLT PPQPPATAHA LVPNTPLADR IAFVHASLFS PALSTWCQAL DSGHLATFPD VSSRQVRKYP PSSPAMIKGH LDQQRANLRS TKLSPVCFPL STEPPAAAAP DLDPPDAHPV ARTHHVFVAH QRVTGQIYTD QPGRFLTPSS AGHNDMLVLY DYDSNAIHVE LMKNKSGPEI LAAYQRAHAL FTQRGLRPQL QRLDNEASTA LQAFMTLEHV DFQLAPPHLH RRNAAERAIR TFKNHFIAGL CTTNPDFPLH LWDRLLPQAL ITLNLLRRSR INPKLSAHAQ LHGAFDYNRT PLAPPGTRVL VHVKPAVRET WAPHAVEGWY LGPALNHYRC HRVWITETRA ERVADTLSWF PTRIPMPAAS STDRALAAAR DLVHALQNPS PASPFAPLDA NQHQALTDLA NLFATVAAPV DDVPAPAPVP PVRPPAPATP LAQHFRGCPP RPPITLAPAT PAVAAAQHAN NRQPQP
|
| |