Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_52174 |
Symbol | |
ID | 7202040 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011680 |
Strand | + |
Start bp | 566051 |
End bp | 569924 |
Gene Length | 3874 bp |
Protein Length | 1143 aa |
Translation table | |
GC content | 45% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181228 |
Protein GI | 219121760 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.338516 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | AAAAAAAAGA CGGTCGAGGA AACTTATCAG AAAATGTCGC AGCTCGAGCA TATCCTCATT CGTCCCGACA CCTACAGTAC GTATACATCT GAAGCAAAAA TAGCCATACG CTTGTTTTTA TGTTGCTTAT TCGCTTACTC GTATTTTTTG AAATACCTTT AACAGTCGGC TCAGTGGAGC CTCACACCAA GCTCATGTTT ATCTTGGAGG ACGATAAGAT TATCGAAAAA GAAATCACCT ACACTCCAGG TCTTTTCAAG ATTTTTGACG AAATTGTTGT CAACGCAGCA GATAACAAAC AGCGCGACCC GAACATGGAT AAGCTTGATG TGACTATAAA TGCCGAAGAG AATGTCATTT CTATTCAGAA CAACGGAAAA GGTATTCCAA TCGTGATGCA TAAGGAACAC GGCATTTATG TGCCAGAACT CATTCTAGGA CACCTTTTGA CGGGCTCCAA CTTTGATGAC GGAGAAAGGA AAACAACAGG TAAGTTTAGC ATGCAATGGA AACCATCCCA AAGATCCTTG CCAAGTGAAT AGACACTCAC CTATGGCATT ATTCCAAGGT GGACGAAATG GATATGGTGC CAAATTAGCG AACGTATTTA GCACGGAATT CGTCGTAGAA TGTGCGGATA CGGAAAACGG GCTCAAGTAC CGCCAGGTAT TTCGAGAAAA CATGAAGCAA AAGGACAAAC CAGTAGTCAA GGCGCTCACA GCCAAAGAAA TGAAGGCTGC GGACTACGTC AAAATCACCT TTAGTCCCGA TTTAGAGCGC TTCAAAATGG ATCGTTTGGA CGAAGACACA GTCGGTCTCC TGTCCAAACG TGCCTACGAT ATTGCGGGAA CCATGTCTCA GAGCAGTGGC AAGAAACTTC TCGTCTCCCT CAATGGCAAA AAACTTCCAA TTAAGTCTTT CAAAGATTTC ATAGGCGTAT TTGAGGGAAT TAACAAGCCG TCCGCGTTCG AAACGGTACG TCTTGCAAAT GTAGTTCGCT TCGAATTTGG GTGCCCTTCA GTATCTCACC TATGCGATTC CTTGTGGATA GTCTGATCGC TGGGAGGTTG GTGTTGCCGC TTCCGACGAT GCGGCGATGA AACAGGTTTC GTTCGTCAAC TCGATTTCGA CGTCCAAAGG AGGCACCCAC GTTCAGTACA TCGCAGATCA AGTGGCCTCT CACTTGGCCA AAGCCATTAC CAAGAAAAAC AAGAAAGGTG GGGAGGTGAA GGCGTCGTTT ATCAAAAATC ATTTGGCGAT TTTTGTCAAC TGTTTGATAG AAAATCCCGC CTTCGACAGC CAAACCAAGG AATTCATGAC GAGCAAACCC AAGGACTTTG GTTCCACCTG TAAGCTCTCA GACAGCTTCT TGAAAAAGCT CGAGAAAAGT GCAATCGTTG AGAATGTACT AGCATTTAGT CAGTTTCGGG ATCGTCAAGC CTTGAAAAAC AAAAGCGGTA AGAAGAAGGC CAAACTCACT GGTATCGACA AGCTCGATGA CGCCAACTTT GCAGGGACGG CAAAGTCAAA AGATTGCACT CTCATCATCA CAGAGGGAGA CTCAGCCAAG TCGTTGGCCA TGGCGGGCTT ATCAGTCATT GGACGTGACT ATTACGGAGT CTTTCCCCTT CGTGGGAAAC CATTGAACGT CCGTGATGCT CCCATCAAGG CCGTCACCAG CAACGAAGAG ATCAAAAACG TGGTTGAGAT AATGGGTCTA AAGTTTCAGA CAGTGTACGA TGAAACGAAT ATCAAGCAAC TCCGTTATGG ACATTTGATG ATTATGGCGG ATCAGGATAA TGATGGGTCA CACATCAAGG GTTTGATCAT TAATTTTATT CACAGTGAGT GACTGTTGCT CTTCCAAAAT TTTGCGGGTG TCGTTTTCTC ACTCGAATTC CGTTTCTTGC TAGACTTTTG GCCGAGCCTG CTCGATATTC CTGGATTCCT CCAACAGTTT ATCACACCGA TTGTCAAGGT ATCGAAAGGC CAAAAATCCC AGTCGTTTTT TAATTTACCA GAGTACGAGA ATTGGCTGGA ATCAACGGGA AAAAACGGAC ACGGATGGAA AATCAAGTAC TACAAAGGGT TGGGAACTTC AACGAGCGCT GAGGCAAAGG AGTACTTTTC CAATTTAGAC CTTCACGAAG TTCACTTCGG GATGCTCTCT AACGACAAGA TTGAGGTAGC TATCGATGAC GATCTGCAGC AGGTCCTTCC AGACACAGTG CAGTCTGGCA ATGACCTCAT CGACATGGTT TTTCGCAAAA ATCGTGTGGA AGATCGCAAA CAATGGTTGA ACGCCATTGC TAAGGATACG TTCCTCAACT ATTCGGAAGT ATCCAAGGAA GGGGTAATGT ATTCAGAATT TATCAACCGT GAATACATTT TGTTTTCAAA AAGTGATAAC GAACGCTCTA TACCTCATCT ATTGGATGGC TTTAAGCCAT CTCAGCGCAA AATCCTCTTC GCATGCTTTA AAAGAAAGCT GAAAGGCGAG ATAAAAGTTG CCCAGCTCAC CGGCTATGTC GCCGAGCATT CGGCGTATCA TCACGGTGAA GCGTCGCTCC AAGCCACAAT AGTGAACATG GCTCAGAACT TTTGTGGTTC TAACAACATC AATCTTCTCA CGCCGTCTGG TCAATTCGGT ACGCGTCGAA TGGGTGGCAA GGATGCTGCC TCGGCTCGAT ACATTTTCAC CAAACTCGAG CCAATCACAA GAGCCATCTT TCATCCGGAC GACGACGAGC TCTTAAGCTA CATAAATGAC GACGGTGTGA CCGTCGAGCC AGAGTATTAT GTACCCGTCA TCCCCATGAT TCTCGTCAAC GGAGCTGATG GAATTGGTAC GGGCTGGTCC ACGTCAGTCA ATAACTATAA TCCAAGGGAA ATTGTACGCA ACCTGCGCCG TAAGATTGCT GGTGAAGATT TTGTCGCAAT GGCGCCCTTT TACAGTGGCT TCAAAGGCGA GGTATGATGC TTAATGACCA GGAGAATGAT ATGAGGGTAG TACTGATTGT TTCTCACATC TCGCTAAACC TTGCTCATCT CAGATTATTC CTGTTGATTC CAACTCCCGC CGATCTGGAT CGTACGATAT GCTTGGCAAA GTCGAACGCA TCAATGACAC AACGATCATT ATCTCTGAAT TACCCGTTCG GAAATGGACT CAAGACTACA AAGCCTTCCT TGAGATCATG TTGACTGGCG ACGGGAAAAA GAAACTACCA GAGATCAAGG ATTTTACTGA GAATCACACT GAGACAACTG TCTCATTCAC CATCATTGCC GAGAAAGAGA AAATCGACGA ATTTGAGAAA GAGAAAGCCG GCTTGATGGG TAAATTTAAG TTGACTGGGT CGCTCTCTAC TTCGAACATG ACGCTTTTCG ATGAAAGAGG AAGAATCACA AGATTCGAAG ATCCTGAATC GATTATGAAT GCTTTCTACG ATATCCGTCT AGACTTCTAC GATAAACGAA AGAGGTTGCT GGTCAAAAAG CTGAAAGAGG AACAACGAAA ACTCTCCAAC AAAGCCAGAT TCGTGGAGGA AGTGTGTCGT GTGGAACTTG TTGTCAACAA TCGTAAAAGA CAGGACATTC TACACGAGCT TCGAAATCGG GGCTATGAGA CTTTTGGGGC AGATGCACGT TCGAAAGAGA CTAGCGACAG CGATGGCGAG GAGGATTCTA TCAATGAGAG CCAGTCCGAC GCTGAACTCG CTCGAGGGTA TGAGTACCTC CTTGGAATGA AAATTTGGTC GCTGACCTTC GAGAAAGCTG AGGAGCTGCG ACGTAAGCTT GGTGAAAAAA CTACGGAACT CAACGCGTTG CAGGGTACTT CACCTTCTGA GTTATGGTTG AACGACCTGG ATGA
|
Protein sequence | KKKTVEETYQ KMSQLEHILI RPDTYIGSVE PHTKLMFILE DDKIIEKEIT YTPGLFKIFD EIVVNAADNK QRDPNMDKLD VTINAEENVI SIQNNGKGIP IVMHKEHGIY VPELILGHLL TGSNFDDGER KTTGGRNGYG AKLANVFSTE FVVECADTEN GLKYRQVFRE NMKQKDKPVV KALTAKEMKA ADYVKITFSP DLERFKMDRL DEDTVGLLSK RAYDIAGTMS QSSGKKLLVS LNGKKLPIKS FKDFIGVFEG INKPSAFETV RLANSDRWEV GVAASDDAAM KQVSFVNSIS TSKGGTHVQY IADQVASHLA KAITKKNKKG GEVKASFIKN HLAIFVNCLI ENPAFDSQTK EFMTSKPKDF GSTCKLSDSF LKKLEKSAIV ENVLAFSQFR DRQALKNKSG KKKAKLTGID KLDDANFAGT AKSKDCTLII TEGDSAKSLA MAGLSVIGRD YYGVFPLRGK PLNVRDAPIK AVTSNEEIKN VVEIMGLKFQ TVYDETNIKQ LRYGHLMIMA DQDNDGSHIK GLIINFIHNF WPSLLDIPGF LQQFITPIVK VSKGQKSQSF FNLPEYENWL ESTGKNGHGW KIKYYKGLGT STSAEAKEYF SNLDLHEVLP DTVQSGNDLI DMVFRKNRVE DRKQWLNAIA KDTFLNYSEV SKEGVMYSEF INREYILFSK SDNERSIPHL LDGFKPSQRK ILFACFKRKL KGEIKVAQLT GYVAEHSAYH HGEASLQATI VNMAQNFCGS NNINLLTPSG QFGTRRMGGK DAASARYIFT KLEPITRAIF HPDDDELLSY INDDGVTVEP EYYVPVIPMI LVNGADGIGT GWSTSVNNYN PREIVRNLRR KIAGEDFVAM APFYSGFKGE IIPVDSNSRR SGSYDMLGKV ERINDTTIII SELPVRKWTQ DYKAFLEIML TGDGKKKLPE IKDFTENHTE TTVSFTIIAE KEKIDEFEKE KAGLMGKFKL TGSLSTSNMT LFDERGRITR FEDPESIMNA FYDIRLDFYD KRKRLLVKKL KEEQRKLSNK ARFVEEVCRV ELVVNNRKRQ DILHELRNRG YETFGADARS KETSDSDGEE DSINESQSDA ELARGYEYLL GMKIWSLTFE KAEELRRKLG EKTTELNALQ GTSPSELWLN DLD
|
| |