Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_42443 |
Symbol | |
ID | 7196646 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011669 |
Strand | - |
Start bp | 73439 |
End bp | 77097 |
Gene Length | 3659 bp |
Protein Length | 1095 aa |
Translation table | |
GC content | 56% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002177011 |
Protein GI | 219110519 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0730577 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CACGGGCCAA CCAAGAAAGT CCCCGTGGTT CCAATCAGTC GGGGAGAGAA CGAAGCGGCA ATCGGTAGAT CGATCGAACA AACGATCAAT CGAACGATCG AACGATCCTT GACGACAAGG ATCGATTCCG TCAGAGTTTG CATAGGTATC CTACCGTTTC TGGACCCAAA TATTCCGAGA GGGAGAGGCT CGTTCCATCC GCACACCCAG ATACACACAA TCACGTACGA ACGGATCTCA TGGGAAATTG TCAGTCCGGA AACAGCACCA AGCACGTGCG CGTGGGAGAA GAGTTGCGTC CACGTTCGAC CGTCGGATCG TCGGTGCGGC ATGCTAGTCC CACGCCATCG CGTCCCGTTG CCGCGGACGG TACGGACCTC CCCATTTCTC CCGACAACGA GCCGTTGGAA TCCGCCCTCG AAAACAAGAT TCTGGAACAC TGGGCGGATG AAACCACACA GTCGTCCACC ACCGAGCCCG GATACGACGA AGTCGAAGAC GTCCTCATAC TCACCCGGAC TGCCGCTGAA ATGGCCAGTC GGAGTCCGAC GCGGCCTACC GCGGATGCGG ATCCCGATAA TGGTGCCGGA AGTCATCTCC GGCTTTCGCC ACCCCGCGAA TCTGTTCCTC ACGGTGCCGT TCTCTCCACC ACCGGCGATC CGGCAACCAC CACGACGTTG ACCTCGGTCA CCGCGATACC CATTACTCCG GATACACCGG GAGACGATCG AGACGATCGC AATACGCTCC TCCAAATGCC ACAACAACAA CATCCGTCCT ACTACGAGGA TGACGACCAA TCGATCGAAG TGGTTTCGGT CGCTTCCGGT GCGTCTTCCT CGCGACAGCG GCACACCGCA GGACGACTAC TGGTCTCTCC CCCGACCCTC CCGTCACCCC GACAGCGACA ACCGGAAGTC CTCTACGATA TGGCTCCCGC TAAACCGTCC CGTCCACCGA CACCCTCGCG ACAGCGCCTT CGACGGGGCG GGAGAGCCTA CCAAGCTCGT GTAGAAAAAG CCCGATACCT GCAAATGAAA CTCAACCAGT CCGGTGGATC CGGGACGGGA CCCACCAGCG GTGCCAACGC CTTGCGAGCG CTATCACCCT TTGGCGAATC CATCGCCGGC ATGAGTATCC TGGAAGACGA AGAACTATCC TGTACGGATC ACTCGGCGCG GTCCCGCTAT TCACTCAACA CCTCGTCACT ACTGGCCGCC AGCAACGAAC TGTTCAACGC CACCACCTCC CGCGGAGTCT TGTGGGCGGC CGCCGCGACC CAGGACCACA CACCCGTCAC AACCGTTACC TGGTCGCGGT GTTCCCCCAC ACTCCAATGT GCGGCACCGC CGCTCTACGT CGCCTTCGGA ACCGAAGCCG GCCTCGTGCA AGTGCAAGAG GCCCGGACCA CTCCACCGGT GCCGTCCTCA CTGGTGTTGA GTCGTGACCC CAAACACCAA GGGACCGCAA CTTCCAAGAA CAATACTTCC AAGCTCGGAC CGGTCGTTGC AGTACCCCGA GGATCCCGTA CACGTTCAAT CGATTTTTCT CGAAACGGAC AGTACCTCGC CGTCGGTGGC GACGACTGCA TTTGTGCCGT CTATCGAGTC ATCCACGAGC CAACCCCGGA TGACCCCGAC GGAGAATCCA AGTTGCGTTT GGAGCCGCTC GTGGAAATTT CCCGCGTGGA TCGAGTGTAC GCGGTGCAGT TTCGACCGGA CGGCAAATAT TTGGCCATTG GCGGTTTCGA CGGCACGGTC GCAATTGTCG CCACCTCGAA TTGGGAAGTG ACCGCCGAGA TTGCTCGAGA CGGACTCGTT TTGTGTCTGG ATTGGAGTCC AGACGGGAAG CTCCTCGCCC TCGGCGCCAG CGACAAATCC TGCGCCATCG TACACGCCGA CTCATCCTGG AAGATCCGAA CGGAGTTCCA CCGACCCGCA GATGTTGTAG CGGTAAAATG GCATCCCAAT GGCCGTCTCT TGGCCGTTGG CTCTTCGGAT GTGGCTATTG TGGAAGCCCA TTCCTGGATC ACCCGCCATG AAATTGATAC CAAACCCACC GCCGGCAGTC CCCTGCGTCG GTCCGTTTAC AAAGCTCACG CCCTTTGCTG GAGCCCCAAC GGAAGCTATC TCATGTTCGC GGGGACCAAC GGGACTCGTT GTGTTTTGTT GGAAACCAAA TCGTTTACCA TGTTGCACGA GATTCAGAGG GACGAAGTTG TGACGAGCGT CGCCTGGGGC GAGCAAATCA TTCGGGAAGG TATTCCACGG CGCTTCATGG CGATTGGTAG TGAAGACGGG TCCGTTGCTA TTATGCAAGC GGGTTTGGAA ACCCGGTCCG GAGCGTCTAC GGTTGGCGAC GACGTGTCTT CAATGTCGCA AATGTCCAAC GCAAGTTCGT ATTTTAGTAC CCGTGGAGAG TGGGAATTGC GCGAAGACGT CTTTGACGAT ATCGAAGGCG AGTTGGTCGA CGCCCCGGAC AATCGCGCAA AGGCCCAATC CGGAACCTGT GTTCGCGCCC TGGCTTTCTC ACGAGGTAGC AAGTCCCGGC CGTCCGCTTA TTTTGCCGTG GCCACTGATG ATTGCGTCGT CACCGTACGA TCCACCGTGG GCTGGAAAGT TGTGTCGCGG GCAGAATTCG CCCACCCGAT TCGTTGTCTC GCCTTTTCCA ACGGCAGTCG CTTTCTCGCC TTGGGTGGAG AAGACGCCAA ACTCAGCATT CTCGCTACTG TCCCGTCATG GACAGTAGTG ACCGATTTTT CCTTCGCCGC ACCGTTGACC TGCTTGTCGT TCAGTAAAAA CAACGAACGC CTCGTGGTTG GGAGCTTGGA CGGCACACTC ACTTTCCTGG ATCCCCGCAA GAATTGGCAA GTTGTGGGAG AGATTCACGA CAACGAGTCG CCGGTTTTAT CTTTGGACTG GAGCACCCAG AACTTGGTTA TTGGCCGGCA CGACGGATCG GTGCAGATTT ACGATTCGGT TCAAATTCTA CGTAATAGTC TTAAACCGTC GAAGCAGCTC GATGTGTCGG CACCGGCACG AGCCTTGGCC TTTGGGGTCA ATAGCCGATT CCTGGTGGTG GGTGGTGGCA ATGGTGTCGT TAACGTGTAC AGCTCCAAGG GTGGTTGGGT ACTCTGTCAC CAGATATCGG AAGGTGACGT GGGCATCGCT ATGTTAAGGT GGAGTCCAAC CGGACGATTC TTGGCGTACA CTGGAGAATC ACGGTTGTTC AAAGTTGTCG ATACCGTCTT TTGGGCCGAT GTGGAAGAGG CCGACGAGAT GATGGCGATT GCTAAATCCG TCCCTAGGGA ACAAGAGGAC AAGTCGCCCG CTCTAGCTTT TAGCCAGGAT GGCAATCTAA TTGCCTGCGG TGCCAACGTG TTCGACTGTC GACGGTGGGA GTCGGTTTTT GCACTCCAAC AGAAGAAAGT TGGACGCCAC AAGACCGGTA GTAACAGTGA CGGAGAACAC TCTAGTCTAT CCTCACGGGA AGAACAATCT AACGTGATGG TGATTGCCGA AGAATAGAAA GAGTATATGT TTGAGTTGAC GCTTTTAGCA ATGCACGGCA TTTTGTATCC ATAGAAGCCC AAAGCGGTAG GCCAGGCGAT GATGATTTAT TTCAATGTAT TGTTCCGTAA ACAAAAATGT TGCATTCCT
|
Protein sequence | MGNCQSGNST KHVRVGEELR PRSTVGSSVR HASPTPSRPV AADGTDLPIS PDNEPLESAL ENKILEHWAD ETTQSSTTEP GYDEVEDVLI LTRTAAEMAS RSPTRPTADA DPDNGAGSHL RLSPPRESVP HGAVLSTTGD PATTTTLTSV TAIPITPDTP GDDRDDRNTL LQMPQQQHPS YYEDDDQSIE VVSVASGASS SRQRHTAGRL LVSPPTLPSP RQRQPEVLYD MAPAKPSRPP TPSRQRLRRG GRAYQARVEK ARYLQMKLNQ SGGSGTGPTS GANALRALSP FGESIAGMSI LEDEELSCTD HSARSRYSLN TSSLLAASNE LFNATTSRGV LWAAAATQDH TPVTTVTWSR CSPTLQCAAP PLYVAFGTEA GLVQVQEART TPPVPSSLVL SRDPKHQGTA TSKNNTSKLG PVVAVPRGSR TRSIDFSRNG QYLAVGGDDC ICAVYRVIHE PTPDDPDGES KLRLEPLVEI SRVDRVYAVQ FRPDGKYLAI GGFDGTVAIV ATSNWEVTAE IARDGLVLCL DWSPDGKLLA LGASDKSCAI VHADSSWKIR TEFHRPADVV AVKWHPNGRL LAVGSSDVAI VEAHSWITRH EIDTKPTAGS PLRRSVYKAH ALCWSPNGSY LMFAGTNGTR CVLLETKSFT MLHEIQRDEV VTSVAWGEQI IREGIPRRFM AIGSEDGSVA IMQAGLETRS GASTVGDDVS SMSQMSNASS YFSTRGEWEL REDVFDDIEG ELVDAPDNRA KAQSGTCVRA LAFSRGSKSR PSAYFAVATD DCVVTVRSTV GWKVVSRAEF AHPIRCLAFS NGSRFLALGG EDAKLSILAT VPSWTVVTDF SFAAPLTCLS FSKNNERLVV GSLDGTLTFL DPRKNWQVVG EIHDNESPVL SLDWSTQNLV IGRHDGSVQI YDSVQILRNS LKPSKQLDVS APARALAFGV NSRFLVVGGG NGVVNVYSSK GGWVLCHQIS EGDVGIAMLR WSPTGRFLAY TGESRLFKVV DTVFWADVEE ADEMMAIAKS VPREQEDKSP ALAFSQDGNL IACGANVFDC RRWESVFALQ QKKVGRHKTG SNSDGEHSSL SSREEQSNVM VIAEE
|
| |