Gene Information |
Locus tag | PHATRDRAFT_53969 |
Symbol | |
ID | 7196100 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011669 |
Strand | + |
Start bp | 808636 |
End bp | 813141 |
Gene Length | 4506 bp |
Protein Length | 1423 aa |
Translation table | |
GC content | 55% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002176657 |
Protein GI | 219109807 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
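The listed gene length matches the start and end positions if the coordinates are read as 1-based and inclusive. The following minimal sketch checks that arithmetic. It also estimates how many base pairs of the gene span are not accounted for by the protein, on the assumption that the span runs from the start codon through a stop codon and that the remainder is intronic, which the record itself only implies by marking the gene as spliced. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def non_coding_length(gene_len: int, protein_len: int, has_stop: bool = True) -> int:
    """Base pairs in the gene span not covered by codons.

    Assumes three bases per amino acid, plus one stop codon if has_stop is True.
    """
    codons = protein_len + (1 if has_stop else 0)
    return gene_len - 3 * codons


length = gene_length(808636, 813141)
print(length)                           # 4506, matching the Gene Length field
print(non_coding_length(length, 1423))  # 4506 - 3 * 1424 = 234
```

Under these assumptions, about 234 bp of the 4506 bp span would be spliced out, which is consistent with the "Is gene spliced | Yes" field.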
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCTCCGT CGATACCCAA GGCCGCCGCC ACCGCCGCCG TCACGAACGG CAGCCGCGGT GCAAAGAATC TCAAACAGGG CAATTTGTTT TCCTTCTTTA CGACATCCCG CAAGAGCTCC GCAACGACCA AGCCGTCGTC ACCGTCCGTA CCTCTCGAAT CTACCAACGC TCCCGTCCCC GGTGACACCA AACCCTCCAC CGACGCGACG ACTCACTCCC CAAAGTCCAC CACGAAACGT CCGTTGACGC CCCCGTTGCG GGTAGGGGAT GTCCTCCAGG TCTACTGGCC GGACGACAAA GCCTACTACC AAGGCACGTT GGTCCAGATA CGGCGTGCGA CGTCGGCGCC GCTCCACTGC ATCGATTACG GAGACGGACA AGCCCCGGAA TGGATCGATC TCAACACGAC CCAGTACCAA CGCGTCTCCA ACGACAATCA GCAGCAGAAT ATCGACAACA ATAACAACCG AAATCAGCAC AAGCGACGAC GTATTCCGGA ACCCGACGAG CCAGACGAAG AAGCCGAATT CGAGTTTTCG GGAGACTTGG AAGAACCCGA GAGTTGCGCC GACGAAGACG ACGAGTCGGC CTACGAGCAA GACGAACAGA ACGATAACGA CGACGAGAAT GAGGATCAGT GGATGGTCAC CGACGACGAA GATGAAGCAC CGCGCAAACC CAAGCGTCAC AGTGTATCAT CCCGCAGACC CTCACTCTCC AAGTCCCGTC CAACACCCTC CGTGACCGTC ACGGCACACC CGCCAGCTGC CCCTCATCAG ACACCGGCCC GACCCGACGT TTCCACCACT CCTTCCGGAC CAAAGCGGAA CCCACCGCAC GTGACACCTT CCCTGGCGGG GGCTTCGCTC GCAACCAACA CTCAACCGTA CCCTTCACCC TCCCTCGCCG CTACCCCCTC CCCTACCCAC GCTACCCAAA CCAAAGGCAA GCCTCCCATG TTCGAAAAAG GTGTCGTCAA CATTGCGGGT TCCCATGCCC ATAACCATTT ACCATTTCTC CAGAATCCCC GCGACGCCCA GGGACGCACT CCGGACCACC CGGACTACGA CCGTCGGACA CTCAAGGTCA ATTCCCGAGA CTGGCTCAAC GTCACGGGCG GAAACATGAC CGACGCCGTC CAACAGTGGT GGGACCTCAA AGCCCGCTAC GCCGATACCG TCCTCCTCTT CAAAACCGGA AAATTCTACG AAATGTTCCA CGCTGACGCC GACGTGGGCG TCCAAGTCTG CGGACTGCTC TACATGAAGG GACACGTCGC CCACGCCGGC TTTCCGGAAA TATCCTACGG ACCCATGGCG GATCAACTCG TCCGCGCCGG CTACAAGGTA GCCCGTGTTG AACAGACCGA AACACCCGAT GCCCTCGCGG TACGCAAAAA GGCACACCAC CGACGGAACG GACCCGCCCC CAAAGTCGTC AATCGCGAAG TCTGCTCCAT ACTTACGCTC GGCACCCGCA CCTTTGGCTA TCTCGACGAT GACACGCACA TTGCCACTGG TCAAGGCGGC GTGGGACCCC TCCTGGCGAT TCGGGAAACG CTCGTGGACC AAGGGGAACG GCAAGATGAC GTAGAGGTGG AAGTGCAGCA GGCGCCCGTT TGCGAGTACG GTATCACTCT TGTCGATGCA GTTCACGGGG TCGTGACGAT TGGACAGTTT GCCGATGATG TACGCCGGTC CCGGATGGAT ACTCTCTTGA CCAATTTTGC ACCCTCCGAG GTACGTAGCT GACGTTTGCT TGTCCGCGCG GACAGCATTC CGGGATTGGT TTTTCCAATC 
CATTTGTTAC TCTGATGTTT CTCACACAAT CGCAAATTTG TTGTGGGCGC TCGACAGATT CTGGTAGAGG GCGGTCCCAA TGGTGCATCC GATACTCTCT TGTCGCTAAT TCGGACGGCC CAGAAAACCT CGCTCCAGTC TACGCGTTTG GAAATCATTC GGGCCACGGA GCAATTCCCG CAGTCCACCG CACTGGATCC GGAGATTCGC CGCAAGCTCG ACCGTCCGTT GTCGCAGATT CATCCCTGGG ATGTGTCGGA GACTCTGGAC GAGCTGCATC GCCGACGATA TTATCCGCGC GCTTCGAAGC AACAAACCGA TCACGTGAGT GTAAGCCGCT GGCCGGCCGT GCTGCGGGCC GCGGTGGAAG GCGGGGCCAC CCTGGCCTTA TCAAGTTTTG GCGCCGTCTT GTTTTACTTG CAGCGGAATC TTGTGGACGG TGAGCTACTT TCCATGGGTG TCGTCAAAGC CTATATTCCA CCGTCCTCGT CGACAGTCAC GGAAGAGTCA CCCAGCCGTA TTCAGACAAT GGCCGAACGC GACAGTTGGA GTGAAGCTGG TGTTGACATC GACGATCAGC GCACTACGGC ACCTTTGTCA ACGAAGACTT CGCCAAACGT TACACAGGCA ACACAAGATC CTGTGCCCAT GCAGTTTGAG ATGGTCGAAG CCATTAATAT CGAAAACGAT ATCAATCACA TGGCACTGGA TGGAACGACT TTACACAACC TCGAGATATT GTACAACTCC GTCGACCACA AAGCCAACGG TAGTCTGTGG TCCAAAATCA ACCTTACCAA GACTCCGCAC GGATCCCGGT TGCTTCGTGC ATGGTTGCTC CGTCCTTTGT TCCGCCGAGC GGACATTGAT CGACGGGCCG ATGCTGTGCA GGAACTCGTG TCGGGAGGAG CCGGCATGGC CTTATCGGAA GCGCGGTCTG TTCTCGCCAA GTGTGGAGAC ATTGAGCGAT TATTGAGCCG GGTCCACAGT ATGAGTGGAA TGACCCGGAT TCCCGGGGAA GAAGACGATG CAGACGACGG GAGCAGTTAC TATCCGAGCG ACCGCGCGGT GTTGTACGAA ACATCCACCT ATACAAAACG CAAGGTTGGC GACTTCTCCA AAGTGCTAAA AGGGTTGCAG CACGCAACAC AGATTCCGGA GCTCTTTGAC GGCATCGAAA TACAAAGTGG GCTGCTGAGC AAAATCGTGC GCTTTACAGA TCAGGGTGGG TGTTTTCCCA ATATGATCCA GGAACTTGAA TGGTTCTTTG AAAACTTTGA TCTTGACCAG GCTGCCAAGG GATTCTTCGA ACCATCCCGC GGAATCGATG ATCTGTATGA CCAGGCTTGT GACGCCATCG CGCACATTCA GTCCGAACTG AACGATTACA AGGAGGAAAT GTGCAGCACC TACTTGCAAC CCCGGTCTGC CGCCAGATCG TCCTGGAAAT ACATCAATAC CAAGCCCGAG TCAAAAGAAA AGTACACAAT CGAACTTCCA GCCAGCGTAC GCGTTCCAGA TAACTTCATT CTCAAGGGAA AACGTGGGAG CGGCACCAAG CAGATGAATC GATACAGAAC GGCGCAATTA GAGCACTTTG TCCAGGAGTT CGAAAACGCG TATGAAGTAC AAAAGAAACG CAAGGCTCGA GGCATGCAAC TTATATTTGC CAAATTTGAT TCAATGCGGT CCTTGTGGGC TGCTGCTGCC CAAGCTACAT CGTTGTTGGA TGCGATAGGG GCGCTAGCCC AGACGGCTTC GAAACCCGGC TATACACGAG CCAAGATCCT GGATTGTCCG CAACATGCTT 
CCCCGACTAT TCGAGTGACG GGCGGGCGAC ACCCTTGTAT TGAAAGTTCG ATTGGATCCA ACGATTTTAT CCCCAACGAT CTTTCGTTGG GCACGGAAAC GTCGCAAGAC AACGCCTCCC GAGTGCTTTT ACTTAGCGGA CCCAATATGG GTGGGAAAAG CACGCTTTTA CGACAGACTT GTTTGATTTC GATACTTGCC CAGATTGGTT GCTTCGTTCC GGCCGAAGAC TGCGCGTTGA CTCCGATAGA TCGCATCTAT ACTCGACTAG GTGCCACCGA CCGTATTCTA TTAGGACAAT CGACATTCTT TGTGGAGGTA AGCGACACTC TGCTCTGATG CACTCAGCCG AGTCTTTCCC ACTTGTTAAC TCGCATACTC TGTTCTTCTT GTCAATGTAG TTAGCGGAGA CCGCCGCTGC TCTTCGAGGA GCGACACGTC GCAGTCTGGT AATAATGGAT GAGCTTGGCC GAGGAACCAG TACTTTTGAC GGTACCGCGA TTGCGAGCTC CGTTGTCAAA CACCTTGTCG ATCGAAGCAA ATGCTTGAGC CTGTTTGCAA CTCACTACCA CTCCTTGTTG GAAGAATGGA AACATAATAG GAACGTACGG CTCGGACATA TGGAGTGCAT CGTCGAAAAT GGTATCACTA CTTCTCGGCC TGAGAATGAA GAAAAAGACG AAAGTACAAT TACTTTTCTA TATACGCTCG GCGAGGGGGT TTGTCCCAAG TCGTTTGGCA TAAATGTGGC TCGCCTGGCC GGCTTACCAG AAGATGTCTT GTCGAACGCT AAGCGCATTA GTTCCGAATT TGAGCAGGAG GTCAATGGCA ATGGGTCGAG TTCCTTCACT CCATGTAATG GCGTTGTGCG AAGGAGCCAC ATCACCAAAG CTATAG
|
Protein sequence | MAPSIPKAAA TAAVTNGSRG AKNLKQGNLF SFFTTSRKSS ATTKPSSPSV PLESTNAPVP GDTKPSTDAT THSPKSTTKR PLTPPLRVGD VLQVYWPDDK AYYQGTLVQI RRATSAPLHC IDYGDGQAPE WIDLNTTQYQ RVSNDNQQQN IDNNNNRNQH KRRRIPEPDE PDEEAEFEFS GDLEEPESCA DEDDESAYEQ DEQNDNDDEN EDQWMVTDDE DEAPRKPKRH SVSSRRPSLS KSRPTPSVTV TAHPPAAPHQ TPARPDVSTT PSGPKRNPPH VTPSLAGASL ATNTQPYPSP SLAATPSPTH ATQTKGKPPM FEKGVVNIAG SHAHNHLPFL QNPRDAQGRT PDHPDYDRRT LKVNSRDWLN VTGGNMTDAV QQWWDLKARY ADTVLLFKTG KFYEMFHADA DVGVQVCGLL YMKGHVAHAG FPEISYGPMA DQLVRAGYKV ARVEQTETPD ALAVRKKAHH RRNGPAPKVV NREVCSILTL GTRTFGYLDD DTHIATGQGG VGPLLAIRET LVDQGERQDD VEVEVQQAPV CEYGITLVDA VHGVVTIGQF ADDVRRSRMD TLLTNFAPSE ILVEGGPNGA SDTLLSLIRT AQKTSLQSTR LEIIRATEQF PQSTALDPEI RRKLDRPLSQ IHPWDVSETL DELHRRRYYP RASKQQTDHV SVSRWPAVLR AAVEGGATLA LSSFGAVLFY LQRNLVDGEL LSMGVVKAYI PPSSSTVTEE SPSRIQTMAE RDSWSEAGVD IDDQRTTAPL STKTSPNVTQ ATQDPVPMQF EMVEAINIEN DINHMALDGT TLHNLEILYN SVDHKANGSL WSKINLTKTP HGSRLLRAWL LRPLFRRADI DRRADAVQEL VSGGAGMALS EARSVLAKCG DIERLLSRVH SMSGMTRIPG EEDDADDGSS YYPSDRAVLY ETSTYTKRKV GDFSKVLKGL QHATQIPELF DGIEIQSGLL SKIVRFTDQG GCFPNMIQEL EWFFENFDLD QAAKGFFEPS RGIDDLYDQA CDAIAHIQSE LNDYKEEMCS TYLQPRSAAR SSWKYINTKP ESKEKYTIEL PASVRVPDNF ILKGKRGSGT KQMNRYRTAQ LEHFVQEFEN AYEVQKKRKA RGMQLIFAKF DSMRSLWAAA AQATSLLDAI GALAQTASKP GYTRAKILDC PQHASPTIRV TGGRHPCIES SIGSNDFIPN DLSLGTETSQ DNASRVLLLS GPNMGGKSTL LRQTCLISIL AQIGCFVPAE DCALTPIDRI YTRLGATDRI LLGQSTFFVE LAETAAALRG ATRRSLVIMD ELGRGTSTFD GTAIASSVVK HLVDRSKCLS LFATHYHSLL EEWKHNRNVR LGHMECIVEN GITTSRPENE EKDESTITFL YTLGEGVCPK SFGINVARLA GLPEDVLSNA KRISSEFEQE VNGNGSRATS PKL
|
| |
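The sequences above are displayed in blocks of ten residues separated by spaces, so they need to be joined before any analysis. The sketch below shows one way to do that and to compute a GC fraction comparable to the GC content field. It assumes the raw text contains only residue letters and whitespace, and the helper names are hypothetical. The example uses only the first two blocks of the gene sequence, not the full record.

```python
def normalize_sequence(raw: str) -> str:
    """Remove all whitespace (spaces, newlines) and upper-case the sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C in a non-empty nucleotide sequence."""
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First two ten-base blocks of the gene sequence shown above.
blocks = "ATGGCTCCGT CGATACCCAA"
seq = normalize_sequence(blocks)
print(len(seq))          # 20
print(gc_fraction(seq))  # 0.55 (11 of the 20 bases are G or C)
```

Applying the same two helpers to the full gene sequence should give a value near the 55% reported in the record, although that depends on how the database rounds the figure.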