Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_37523 |
Symbol | |
ID | 7202399 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011681 |
Strand | + |
Start bp | 339708 |
End bp | 343685 |
Gene Length | 3978 bp |
Protein Length | 1325 aa |
Translation table | |
GC content | 47% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181533 |
Protein GI | 219122399 |
COG category | |
COG ID | |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.706155 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAATCGA TTGCTGACGT CCAAGATTCA ATTTACGGGA AGGGAAATGG GCCGCCTATT CTGCCCCACA CAGACGACCC AAAGGCTGTA ACACATCCTC GTAAACATCC CAGTGGGCAC GGCGAGACTA ATCCTTACAC TCCAACAACA GTGCATGCAG AGAAAAAATC CTCGGATGAT GAGCTGGCTT TTCCAGCATC TGAATTGCCA GAGGGTTTTC CCCGCGAGCT GTCATTCCGA GCTCTGGCAG CCTACTCATT GATTCGAACT TTGAGTATAC AACTGCGGCT GTCACCCTTC ACACCCAATG TATTTTTACG AGCACTGAAC CTCCCTTATC CGAATCGTTT GCTAGGGCAA ATTCATGTGG CAATTCTTCG TCTTTTACTC GGCAGTTTAC GTATGGGGTA CCACTGGGGT AAGGGATTTT CTCAAGTTGT GGCGAAGAAG CGAAAATTGG ATGGATTGCG TTGGCCTTTG CGAGCTGGTG ATAATTTGGA ATTTCTGGAC ACCTTTTCAT GGCCAATCTT TTACTACGAT TATTGTCATT TGACTGCTGA TATCCTCCAC ACTGCAATGA CTGACACAAC CGATTATGTA GACGTGCGTC ATCTGGATAC ATCATCTATC AAATCAGATC GGCTAGAGGG CGAGGATCTT GATTGCTCCG TCGGCCCAAC ATCGTTTGAC TCACCATCAG TCATATATTT GAATATTTTG GAGGAAGAAA ATGACGAGGA CGAATACATG GCAGAAGATG ACTTTGTTCA AGCCGATTCA GATGAAGATT TCGATGCGTC AGATCGGCCA ATGAAGAAGC GGAAATCATC GTTTTTCACC GCGGCGACAA AATGTTGTAC ATATTCGGCA TCGGGGTCGC TAGTACATGA CACTTCTTTG CAAGATCAAG CCCAAATGGC AACAATGCAA CAAGATGAAA CCACCGCACA CGAAACATAT CTGCATAATG AGTATTCAAA CGTGTCATCC AATGGCACAA AAATAGGGCT TAAAACGTGT TCAGCTTATT TATCGGGAAA AGTTCTGTCA AGTTCCTTTC CACCAACTCT ACTCGAGACA AAAAAAGAGG AAACGGTTCA TGATCTCGAA GTTCGGCTGG AAAGGTCCTT GGATCGAATG AATGAAGAAG GGGGCTTCGG TGGCTCTTCT TGTATCCCTA TGCACCCAGG GTCTCCTGTT CCATTTTCGA TAGAAGGTGA CACGAGCGTT TCTTCCACGG ATGTTCGGCA ACCTGATCGA AGGCGCGTGT ACCAGATACT CCCTGTATGT TCTGAGTCGA GAAACGAGGT AAGTTCGCAC CAACAAGAAG AGAAAATTGA GAATGATGGA GTCGCCTTCA GGCACAGCTC AGAATGTGAT AGCGAGATAG CTTACGTCCT TGCTAAAAAG ACGTACAGCG CAGACCAATT CAAATATGAT CACGCACACA ACGAACTTGC GACCATAATG GCTTCAACCG CGGCTGATTG CTCATTCTTT CAACCTACTC TATCGGAAGA AGGGAATAGC GCCAGATGCA AGTCGAACAA AGACGACGCT TCAAGCAAGT TTACAGTGAA TGAAATTTGT ACTTCACCCC AATCAGTTGG GTACATCCAT AGTAAAGCAG CAATAAGGGT TCAGACATCA AAGCAACATA CGAATGTGGA ACACACTGCC TGTAAGAATG AGCCACGTGA ACTCGGAATT GGTACGACTG ATGATGACTC GTCCAGGGAC ATAGAGCATG CCTTAAAACC CAATGAAGAA AAGGAGAGTA GGGAGACAGG AACCTCTTGC CTTTCGCGTG GCAAATCTGA GCTCTCAGGC AGCAGCACGG AGTCACATAA ATTTTCAGAT GTTGGCGATC CGTGGGCTCA CTTTCAACCA CTTCTTCGGA TGCGCTCTGG TGAATCATAT TTTAGCCTTC CTCTTGAAGA CAAAGTGACG ATTCTTGAAT TTCTGATCGA CGAATTGCTA TCGGTCGATG TTATAGCGGC AGAAATGGCA CGGCGACACT TGTCGAATTC AACACTTCGG TTTACCTGCG AACAAAATCT TTCAGTTGCA CAATCACAGG CCGCAGACAA TGAGGATGTG TGCGCTGTTT GTAGAAAAGA AGGCGAGCTA CTGTGCTGTG ACGGATGTAT TTTCAGCTAC CATAAGAAGT GCCTTGGCAT ATCGGAAAAT GAGGAAATAT CTGACGTCCG TTGGCATTGC CCTGAATGTA CACTTGTAGA TCCTGCCAGC TTTGGATCAT TCCAACATGT CAGAAAGGAA TGCGTTGAGT GGTTTACAAA GGAGGATTTA AAAATGGAGC CTCAATTGTT TGAAGGTCTC ACAGAGCCCT TCTGCCAGTC TTTGTCAAAT TCAAATTTTC TATGTCCTCC AGTGCTCGAA TGTGTCAGTT CCAGAACAGA AAACCCTTCC GAATTCTTGA TTGTTCACGG GTTTATTTTT GGTCGCGCCT TGCGTGGGGC AAATGCCGAT GTCAATGTGT TGAACCTTGC ACGTGTTCGA CACATGTTTC TAGGATTACC ACCGAAAATT TTGTCCTCTT GGCCGTTTGC CCAGGTACCG TACGATCCAA GCCACCTTTA TAAAGACTAC ATCCTTGCAA GTGGCGCTCC TTATTTCTCT GGACTGCCTG AGCTGTACGA TCCCTCAGTC TATTTGAGCA AATATCGCTT GGCACCGTAT CCTCCACCTA TACGGAAAGA TGTTGAGCCT AGCGCTTTCG ACTTCGAGCA TCAATGTCAT CCGGCTAGTT ACCTGGAAGT GTCGCGGAGA CTTGACAGCC GCATGTCGAA CGATATGTCG GTCATAAAAT TGCTTCGAAA CCGTCTTTTC GACCATTTCA AGATGATCAG AGAATATTTG TTGTCTCTCG AGATTACTCT AGGGAGAGCC TGCCTCCTTG ACGAATTCTG GGGCACTCAT CGGAGTGCAA CAGGTGGTAC AACTATCTGG GCTCGCAATG TTAAAATGGC AAAGTCTGCC CAACGGTTAT CAAACTTAGC GGTCAAACTC GTAGATGCTA CTCACCCCAG AGCATTCCTG GAGACATGGT TTGACCACGC TGTCAAGGTA AAAGGCTTTG ACGCCAAAAA CGGTACAGCT GTTTCCAAGG AAATGAGCGA CGCTATTCCG ATATGTCCGG ATAGCAACCC TACATTAGAG TCTTTGAATC GGCATTGGGA ACGATGTCCC CCATCTGCTA TCTTTTGTCT TCTTGCGAAG GAAGGTAAAA GTCTTAGTAA CTGGGTTGGT GAGTATAGAC CAGATCTTGT AGCGAAAGCT ATCCATCGCT CGAAACGGAA GAAAGGAGTC GAAAATTCTT CCCCCCATTT AGCCGTCAAA AACATGAATG TTGGACCGAA GCGTTTATCA GTGAACGGTC ATACTCCAGG GACGTTTGCT AGACGTGAAC ACCTTAGGGA AAAGAGAGAG TCGGCAGATT CACCTGGAAG CTCTTCTTGT ATGCACAACT TCGAAAAAAG TGGAAAAGAA CTGGTCGTGG ATTTGCCTGA TAAGGAACTG TTTATGAAGC CTTTACGTAA GAGGAGGCGG TTTGATCGAG GCCTCCCTGA CAAGGCACAA CTGGAGAACT TTGACCCAAA GAGCGTAACC AGCGCAAAAG CAATTCTAGA AGCCAGGAAG CGAGCGAAAG TCACCAAGTT TGTGGAGCAA CAAAATCCTA TATTTATCAA AGAAATGCCC TGGCCGGTCG CTGGTCGAAA GTTGTTTGAT CCTGTAGGGT CGCTGTCTCC GTCAATCGTG CGGAGTCTAG CTCGAAGCGC TGGAGGGTTG CGGGCTCCCT TTGTCACGTA CATACCTTCC TACGAAGTAG GGCAAATGTC GCACTACCAT GTTTGGAGAA AGCGCACCCG TTTGTGCGCA AGTTTCGAGG AGCTCAGCTA TTCTCTTCGA ATGCTACAAA CGTTCGTGGA TACTGGGGTG GGTGCTCGGA ATGTTTAG
|
Protein sequence | MESIADVQDS IYGKGNGPPI LPHTDDPKAV THPRKHPSGH GETNPYTPTT VHAEKKSSDD ELAFPASELP EGFPRELSFR ALAAYSLIRT LSIQLRLSPF TPNVFLRALN LPYPNRLLGQ IHVAILRLLL GSLRMGYHWG KGFSQVVAKK RKLDGLRWPL RAGDNLEFLD TFSWPIFYYD YCHLTADILH TAMTDTTDYV DVRHLDTSSI KSDRLEGEDL DCSVGPTSFD SPSVIYLNIL EEENDEDEYM AEDDFVQADS DEDFDASDRP MKKRKSSFFT AATKCCTYSA SGSLVHDTSL QDQAQMATMQ QDETTAHETY LHNEYSNVSS NGTKIGLKTC SAYLSGKVLS SSFPPTLLET KKEETVHDLE VRLERSLDRM NEEGGFGGSS CIPMHPGSPV PFSIEGDTSV SSTDVRQPDR RRVYQILPVC SESRNEVSSH QQEEKIENDG VAFRHSSECD SEIAYVLAKK TYSADQFKYD HAHNELATIM ASTAADCSFF QPTLSEEGNS ARCKSNKDDA SSKFTVNEIC TSPQSVGYIH SKAAIRVQTS KQHTNVEHTA CKNEPRELGI GTTDDDSSRD IEHALKPNEE KESRETGTSC LSRGKSELSG SSTESHKFSD VGDPWAHFQP LLRMRSGESY FSLPLEDKVT ILEFLIDELL SVDVIAAEMA RRHLSNSTLR FTCEQNLSVA QSQAADNEDV CAVCRKEGEL LCCDGCIFSY HKKCLGISEN EEISDVRWHC PECTLVDPAS FGSFQHVRKE CVEWFTKEDL KMEPQLFEGL TEPFCQSLSN SNFLCPPVLE CVSSRTENPS EFLIVHGFIF GRALRGANAD VNVLNLARVR HMFLGLPPKI LSSWPFAQVP YDPSHLYKDY ILASGAPYFS GLPELYDPSV YLSKYRLAPY PPPIRKDVEP SAFDFEHQCH PASYLEVSRR LDSRMSNDMS VIKLLRNRLF DHFKMIREYL LSLEITLGRA CLLDEFWGTH RSATGGTTIW ARNVKMAKSA QRLSNLAVKL VDATHPRAFL ETWFDHAVKV KGFDAKNGTA VSKEMSDAIP ICPDSNPTLE SLNRHWERCP PSAIFCLLAK EGKSLSNWVG EYRPDLVAKA IHRSKRKKGV ENSSPHLAVK NMNVGPKRLS VNGHTPGTFA RREHLREKRE SADSPGSSSC MHNFEKSGKE LVVDLPDKEL FMKPLRKRRR FDRGLPDKAQ LENFDPKSVT SAKAILEARK RAKVTKFVEQ QNPIFIKEMP WPVAGRKLFD PVGSLSPSIV RSLARSAGGL RAPFVTYIPS YEVGQMSHYH VWRKRTRLCA SFEELSYSLR MLQTFVDTGV GARNV
|
| |