Gene Information |
Locus tag | PHATRDRAFT_39556 |
Symbol | |
ID | 7195367 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011688 |
Strand | - |
Start bp | 119249 |
End bp | 121084 |
Gene Length | 1836 bp |
Protein Length | 611 aa |
Translation table | |
GC content | 55% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002183679 |
Protein GI | 219126887 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
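The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS length includes the terminal stop codon. The record does not state either convention, and the helper names `gene_length` and `protein_length` are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming it ends in one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return cds_length_bp // 3 - 1


length = gene_length(119249, 121084)
print(length, protein_length(length))  # 1836 611
```

Under these assumptions the computed values agree with the Gene Length (1836 bp) and Protein Length (611 aa) fields of this record.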
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.423065 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGTGACC GTGACGTACC CCATTGGGAC TCACAGTACC CATCCATAGG TTCCTTACCC GGCAGAGGTG TCGCATCCCG GAATGACCCA CAAGGGGGCA ACTATCGTTT TTTGGAGGAT TCACGGGTCA TTTTATTCGT GTGGTCCCAG AACTGTATCA GTCTCCTCCC ATTGCCATGC CCGTACACGG CAATGGGTGT GGAGTGGAGT GCCCAGTTCT TCCCTCAACC GCGGAGTAGG AACAGTCGAC TGACGAACAA TGCAACAGGA ACTACCAGCC AACGGACGGG TAACCGTACC CGTTCCATGA TTCCCCTTGT CCGCGGAATG CGGGGCAAAG CAGTCGTCGG TCTCGTGACG ATGGGCTTGG TGTGGACTTG GCGATCGCTC CGCTGGGCCG ACGAAGGCAT CCTCTCGGTA GCAGGAAACC CCTTGAGTCA TCCGTTGGCT GGCGTGTCAA TACCCCTTCC GACCTCGTAT CCCGTGCAGG TCTTGCTACA CGTCATTCAC ACCCGTTTCC AACAACACCA ACCCCGTTTG GTACACTTGG GTCGAGCCCG TCTCGCACTC TTCCGTACCC TCTGTGTACC GTCCTTGCGA GCGCAAACCG AAACCAACTT TCTCTGGATC CTCCGAGTCG ATCCCGCACT CTCCGCTCCG TTGCGGACAG CCTTGCTCGC GATTGTACGG GACTACGCGA GCCGGAATCA CTCGGTCCTC GTCGTGGCCT CCAACCAATC CCCCGAACGC TTCCATTTGC ATCGGGAGAA CGACTACGCT GTCGACGATG ATATTCGCAA CGATACACTC TGGTACGGAA ATATGCGAAT CTGGCAACGC TATCAACAGA CAGCTCGAAG AACCGGGACG GTCGTCTTGG AAACGAATCT GGACGCCGAC GATGGACTGG CGATCGATTT TGTACACACC GTACAACAAC GTGCACTCCG AGACTTTCGG ACAACACCAC CCCCAACGGC CGAGTCCGTG ATTGGACTCC GTCGCTTGTA CCGCATCTAT TGTGTGGGAC AACATGTCGA ATGGCACTTT TACGCACCCT GGGATCGTCG ATTGTTCGAC AAAATACCCG ACTACGTCAT TCAAGGAGCC TTGCGGACAC GGATCAATGA TCGTATTTGC ATCACACCCG GACTCACGGT AGCCTCCCAA ACCGTGCCGG ACTCTCACGT GGCTGCTACT ACTCCTACTG CTACTACAAC TGCTACGGGT ACAGTAAACA CAACTGCTAC TGCTACTACT GCTCCCAACA CCGCCACCAT ACCAACACCA CCAACGTCAT CCCCACCACT ACTACCCTTT GACGTCGTAC GTTGGCATCA TCTCATTCAA GAAAAATTTG CAGCCTGTGA ACAGACATCC CCTTCTCAGC TCCTCGACTT TCACAGGGAA ACATTTGTCT CCATGTCATC GGTCTGTTAC GAATACCTCG ACAACAGTTT GGATACGGGT AATGTGACCG AAATATTACC TACCGCACGG CACCCTTGGG CCATCCGCGC ACGTACACCT ACCAGTGTCG GGGTCGAGGA TCTACTCCGC AAACGATCGC GTTGGAAATA TTCCGACTAC TCGGCCTCGC AAGCGCTGTG GACCCAATTG TCGCCAATAT TCAGCGTAAC GGCGGAAAGT CTACGGCAGA CGCACCACGC TTTGGCGGAT GATCTCATCG CCATTCTATC GGACGCCATA GAAGGGCAGT GTACCCCCAA TCACTCGTGT CACGACAAGT CCCGAACCAT TTTAAACCGG GTACTCCAAA ACGAAAAGAA ATACGGCCAG 
CTCGTCGTGC CGGATAAACT CGTTTGGGAG CAGTAG
|
Protein sequence | MGDRDVPHWD SQYPSIGSLP GRGVASRNDP QGGNYRFLED SRVILFVWSQ NCISLLPLPC PYTAMGVEWS AQFFPQPRSR NSRLTNNATG TTSQRTGNRT RSMIPLVRGM RGKAVVGLVT MGLVWTWRSL RWADEGILSV AGNPLSHPLA GVSIPLPTSY PVQVLLHVIH TRFQQHQPRL VHLGRARLAL FRTLCVPSLR AQTETNFLWI LRVDPALSAP LRTALLAIVR DYASRNHSVL VVASNQSPER FHLHRENDYA VDDDIRNDTL WYGNMRIWQR YQQTARRTGT VVLETNLDAD DGLAIDFVHT VQQRALRDFR TTPPPTAESV IGLRRLYRIY CVGQHVEWHF YAPWDRRLFD KIPDYVIQGA LRTRINDRIC ITPGLTVASQ TVPDSHVAAT TPTATTTATG TVNTTATATT APNTATIPTP PTSSPPLLPF DVVRWHHLIQ EKFAACEQTS PSQLLDFHRE TFVSMSSVCY EYLDNSLDTG NVTEILPTAR HPWAIRARTP TSVGVEDLLR KRSRWKYSDY SASQALWTQL SPIFSVTAES LRQTHHALAD DLIAILSDAI EGQCTPNHSC HDKSRTILNR VLQNEKKYGQ LVVPDKLVWE Q
|
| |
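The relationship between the gene sequence and the protein sequence above can be illustrated with a minimal translation sketch. It assumes the standard genetic code (NCBI translation table 1). That is an assumption, because the Translation table field of this record is empty. It also assumes the gene sequence is listed in coding orientation, which is consistent with it starting with ATG and ending with TAG. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Standard genetic code (assumed), in NCBI order: bases T, C, A, G with the
# third codon position varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence in frame 0, stopping at the first stop codon."""
    # Drop the spaces that separate the 10-base groups in the record.
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases and last 36 bases of the gene sequence in this record.
print(translate("ATGGGTGACC GTGACGTACC CCATTGGGAC"))          # MGDRDVPHWD
print(translate("CTCGTCGTGC CGGATAAACT CGTTTGGGAG CAGTAG"))   # LVVPDKLVWEQ
```

Both fragments reproduce the corresponding ends of the listed protein sequence (MGDRDVPHWD at the start, LVVPDKLVWE Q at the end). The last 36 bases begin at an in-frame position because the preceding 1800 bases are a multiple of 3.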