Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_42776 |
Symbol | |
ID | 7196150 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011669 |
Strand | + |
Start bp | 1078101 |
End bp | 1081031 |
Gene Length | 2931 bp |
Protein Length | 976 aa |
Translation table | |
GC content | 52% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002176718 |
Protein GI | 219109931 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.161289 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTGCAAC CTCCCGGAAG TCACAATGGT GCCGGATCCC GTAACGTTCG TCGCCAGAAC CTAGGCCATT CGAAGAACAC ATCCACGTCG CCGGACGATC AAGTCCGCCT TCTCCAGCGA TTGCCCCCGA ACAAGCGCTG TTGCGATTGT CGCGCCAAGC TCCCGTCCTG CGTCAACTTG ACCGTTGGTA GTTTCGTTTG CCCCGCCTGT GCCGGCATTC ACCGCGAACT CAATCAACGC GTCAAGGGCG TGGGACATTC CAGCTTTACC GACAAGGAAG TCGAATTCTT GCAGAGCGTC GGTAACAACG ATCTCATTAA CGCTATCTAT TTGGCGACCT ACGACGACGC ACAATCCTCA AGGGGAGGAA GAATCCAGGA ACCGAAGGAT AATACCGATC CGCAACATTT GAAGACTTGG ATTCGCCGAA AATATGTGGA TCGTGCCTGG TATCGTCCTT CTTCTTCCAC GGCTGCGGCT CCGCATCCCA CACAGCAGCC TCCACACGCG ACTATTGTGG CGATTCCTCC GACGGCGCCT GGAACTGCAG GATCCACCGA TTTTTGGAGC AACAACAACA ACCACGCCTC TCCCGCACCG GCTTGGGACG CATTCGGGAG TAACAACACT ACCGCCACGA CGAATGCGGG ATCATCGCAA TCAGGATTTG GACCAGCGCA GTCTCCTCCT CCTCAGGCAC CAAACGTGGT AACGAACTTT GCTAATTTCA ACACGGCGCC ACCACCCACC CAGCACGGAC CTCCCAATCC TGGGCAGCAG CTGCAACAAC AGCAACAGGC CGTCAACAGC TTTGCCAACT TTGAAGCTTC GGGATCGAAT GGCAACGGAA TCCAGCAACA GCCCAACTCG GGCTTTGCCA ACTTCAACTC TCCAGCGGCT ACTATTGTGG GACAACAGCA GTTTTCAACA AACGCTGGTG CCCAACAGCA GCCGCCTTTG AACACCTTTG CAAATTTGAA CGCTTCAACA TCTACTGTAG CTGCTTCCGG AAGCCAGCAG CCGCAATTCC GTTCGAACAC TGGTGCTCCC CAGCAACAGC AACCGGTCCC TCAGATAAGC GCATTCGGAG CCACATCGGC TACCACAGCA CCACAGCCCC AACCAGGATT TGCCAATTTT AATTCTGGTG CTTATGCAGA CCCTCAGCAA TACTTTTCGA CGAACAGCGG ATCTCAGCAA CCTCAATCGT CGCCACAACC AAGCGGGATT GCTAATTTGA ACACGGGTCA GGCACCAGCT AAAATGAATA ACGGAACACA GCTGCCTCAA CCAGGATTCG CCAATTTCAA TGCAACATCG GCTGGTGTTG CCGCGAGCCA ACAATATTTC CCTCAGAATA ACAATAACAG CTCTCACCAA GCAAAGCTTC CGTCGGCACA GACGAGCGGA TTTGCCACCA TTCATTCGGG AGGTGTACAT GGTACAAACA ACGCTCCAAA TCAGCCGCAA CCTGCAGTTC CAAATTTCAA TCCAAATGCG GCTGGAAGCG TTCCCGGAGG CCAACAGCAA TTTTCGCCGA ACGGGAACTC GCACCAAATA CAGCAGCCGT CGCCACACTC GAGCGGATTC GCCAACTTTA ATTCAGGAGG TCCACCTGCT GCCACGGATA ACGCTGCACA GCAACCGCAT TCTGGATTTC CGAATGTCGA CGGAACATCA GTTGGTAAGG TTCACGGAGG CCAACAGCAA ATTTCGTCGA ACAGTAACTC CCACCGAATT CAGCAGCCGA GCGGAGTTGC CACCTTTAAC TTGGGATCTG CGCCCGCTTC ACCGAAGAAC ACAAGACAAC AGCCTGGATT CAATCATTTC AACACTGCAT CGGCCGCTCC TGGGGGGCAG CAGCAATTTT CGAACGGTAG CTCACCGCGA ATGCAGATGC CTTCGCCACA GCCGAACGGA TTTGCCAATT CAAGTACAGG AGGTGAAGTA AATAAGGCAA TGCAGCGGGC GCAATTTGCT TCGCCTGGCT CACAAAGCGT ACAGGACAGC TCCATTCCAG GCAATGATGG ACTGCACCAC AGGGGAGTCA ATAGTTCGAT CACTCAAGGA GGTATACCTT TCGTGCAAGG ACGTAACACA GAGCAACCGA TATCAAGTGT CCAGCAACCT ATGCACTTCC AAGGCGGTAG CATCAGTCTT TCTGACGGTC TTAGAGAAAG TCACATCTCG TCCATTACAC GAGGTATGGG GAACTTGGGT AATATTGCCT CCCAGGCAGG AAGCGTGCCG CATGCTATGA ATCAAACGTC GATTCATCAA CAGCCAATCA GTGAATTATC CGAACAGCAA AAGTCACAGC ATAATATTCA TCATATCCAC TCTTCCCCTG GTCCAGATCG ACAAACACCT ACGGAAGCTG CGAAAAATAC ATCTACGGAC GGTGACTCCG CCCCAAATGA TGGTAAAACA GCATCCGTAT ACATGGAAAA CCATCCATCC AAATTCACGG CTGGCCAAAC TGTGTACTAC AAGAGCTCCA CTTACGTGGG AAAAGCGAAG ATCATGAAGG TGCATTTGGA CGATGATCTT GAACCATTCT ATACTATTCT TGTAGACGGC AAAGAGAAGC AAACAGATAA TGGGCATTTG TCGGAAAGGA GTCCTTTGGA GGAAAAGGTG CAGGAATTGA TTGGTTCTTT GACTGAGGAT CAGCTCTTGC AAGTTCATCA GTTTATTATA AGGTTCCCAT TGACTGTGAC CAGTTCAGAA ACAGATTCTG TTGTTCCTCC TGCTGCTTCT GCCACCATAA TTACTGGCAG TAGACAGCCA CCGGTCTTAG CTTCGACTTC ATCAATGTAT CCTCCTGCTT CTTTGACTGC AAACGTCCCC ATTCCTGGTT CGGGGCAACA ACAAGCTGCA ACAGGAGACG CGCCAATGTC TCCGAAAGGA AATCCATTTG ATTTGTACTA A
|
Protein sequence | MLQPPGSHNG AGSRNVRRQN LGHSKNTSTS PDDQVRLLQR LPPNKRCCDC RAKLPSCVNL TVGSFVCPAC AGIHRELNQR VKGVGHSSFT DKEVEFLQSV GNNDLINAIY LATYDDAQSS RGGRIQEPKD NTDPQHLKTW IRRKYVDRAW YRPSSSTAAA PHPTQQPPHA TIVAIPPTAP GTAGSTDFWS NNNNHASPAP AWDAFGSNNT TATTNAGSSQ SGFGPAQSPP PQAPNVVTNF ANFNTAPPPT QHGPPNPGQQ LQQQQQAVNS FANFEASGSN GNGIQQQPNS GFANFNSPAA TIVGQQQFST NAGAQQQPPL NTFANLNAST STVAASGSQQ PQFRSNTGAP QQQQPVPQIS AFGATSATTA PQPQPGFANF NSGAYADPQQ YFSTNSGSQQ PQSSPQPSGI ANLNTGQAPA KMNNGTQLPQ PGFANFNATS AGVAASQQYF PQNNNNSSHQ AKLPSAQTSG FATIHSGGVH GTNNAPNQPQ PAVPNFNPNA AGSVPGGQQQ FSPNGNSHQI QQPSPHSSGF ANFNSGGPPA ATDNAAQQPH SGFPNVDGTS VGKVHGGQQQ ISSNSNSHRI QQPSGVATFN LGSAPASPKN TRQQPGFNHF NTASAAPGGQ QQFSNGSSPR MQMPSPQPNG FANSSTGGEV NKAMQRAQFA SPGSQSVQDS SIPGNDGLHH RGVNSSITQG GIPFVQGRNT EQPISSVQQP MHFQGGSISL SDGLRESHIS SITRGMGNLG NIASQAGSVP HAMNQTSIHQ QPISELSEQQ KSQHNIHHIH SSPGPDRQTP TEAAKNTSTD GDSAPNDGKT ASVYMENHPS KFTAGQTVYY KSSTYVGKAK IMKVHLDDDL EPFYTILVDG KEKQTDNGHL SERSPLEEKV QELIGSLTED QLLQVHQFII RFPLTVTSSE TDSVVPPAAS ATIITGSRQP PVLASTSSMY PPASLTANVP IPGSGQQQAA TGDAPMSPKG NPFDLY
|
| |