Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_33013 |
Symbol | |
ID | 7197020 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011670 |
Strand | + |
Start bp | 1312570 |
End bp | 1314285 |
Gene Length | 1716 bp |
Protein Length | 571 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002177804 |
Protein GI | 219112105 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.0011992 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGCTTG GTCGAAAGCA GCTGCCACGC CGGCGCAAAG CAACGGGTCC ATTGCGACTC GATTGGCCTG CCGTCGGCGC CTCTCAAGAT CTCGTCGGCA CCAAAGATCT TGTGTCGTTT TCACTGGAAG AAAGTGCATT CGTTCCAGTG CGTCGGAAGC AAAAGCGACC ACGCAAGAAT GCAGTCTCCC ATACCGGAGG TGACACTAGT TCACATGACA ATGCTCTTTG GATCGATAGG TATAATCCTC AATGTATAGC CGATGTCTGT GTGGCGCCAA AAAAAGTTCA GGAAGTTCGT CAATGGATCC AGTCTGCCAT GAAAGATCAC GTTCACAAAC TCTTAATATT GGTAGGGAGC CCTGGAATTG GTAAGTCGAC AATGATCCGT TGTTTGGCGA AGGAAAATAG ATGGTCAATC TCTGAATGGA ACGAAACGTT CTCGAATCAA TACAGTGCTT TGAACTCGGC AATGCACTCT GTAGATCAAC AGTCTTCTCT GAGCTCGTTT CAGGAGTTTC TCCGGCAAGC AGGGACCGGC TACCATTCTC TGACCTTCGA ATCGTCGTCA AATTCATCAA CAAAGCAAGA CGGATCTCAA ATTTCGGGGT CCATCATCTT GCTTGAAAGC CTTCCGACGC AACACGAATC AACCCAGATG CGATTAAGAG AGCTGTTTAC TGAACACGTC CGCACCACAT CCGTGCCAAC GGTTCTCATC TTCAGCGATG TCTTGGAAGG GAAACACAAA CGAGAGGATC TGGAGTCCTT GGTGGACCCC AATTTGCTGT ATTCAGACCT TTGCCTCATT CTACAAATTC AGCCTTGTAC TAAGCAAAAC ATGAAAAGGG TTTTGTCGCT TATTGTCCGT GCAGAAAGGC TTTCGGTACC TTCCAGTATA TACGAAGACC TTCACGAGCG GAGCAACGGG GACTTGCGCT CGGCAATCAC AACTTTTCAG TACGAAGCCA TGGGGCAGTC GATGACCGTA AAGAACACAG ATACAACCAA CCGAGACCGC AGACTTTCGC CATTTCACGC CTTGGGTAAG CTCCTTTACG CGAAGCGCGT CACTGGTGCA CACAAGGATC CATTAAGTTG GTGGAAATGG AAGGATGATC GTCCACCAAT CGACTTTAAT CCGGAAAACG TACTGGAGCA TAGTGGGATT GAACAATTTG GGACTCTATC GTTTCTCGAA CATCACAGTC CGGATTTCTT TTCCGACATA TCGGAGCTGA GCGATACACT CGCGACTTTT TCAGATTCGG CGTTATTGAT GGACTGTTCC TCTATTTCTG GCTCCCAGAA TGCTGCCGCC TCGTTGGCTG GCCGTGCCGT CGCCGCCTTC AATCGACATC CACGCGCAAA CAAATTCAGG CAACTTTCTG CTCCCAAAAT TTTCGAAGTC AATCGCAACC GTCGGGAAAA TGAAGTGCAC CTTCGTCACC TACACCATTC TTTATCAACG AATCGTAGCA ATGAACTTTC TTTGCATTCG GCCCTCGGAG CAACTTCGCA CTTCGTCTCC GACAGCCTTT CGTTTCTCCG ACGCATCATC CCCGAGTCAA TAGATCTGTC TCTGAATACT ATGCACTCAC GATTCCGTCT TATAGACAAA TCCATCTCGA GCAGCAATGA TGCGAAAACA GGATTGTTAA AGGAGCAGCA GCAAGTGCTT CTGGACGACG ATATTGGTGA TTTTGATTCA GAATAA
|
Protein sequence | MKLGRKQLPR RRKATGPLRL DWPAVGASQD LVGTKDLVSF SLEESAFVPV RRKQKRPRKN AVSHTGGDTS SHDNALWIDR YNPQCIADVC VAPKKVQEVR QWIQSAMKDH VHKLLILVGS PGIGKSTMIR CLAKENRWSI SEWNETFSNQ YSALNSAMHS VDQQSSLSSF QEFLRQAGTG YHSLTFESSS NSSTKQDGSQ ISGSIILLES LPTQHESTQM RLRELFTEHV RTTSVPTVLI FSDVLEGKHK REDLESLVDP NLLYSDLCLI LQIQPCTKQN MKRVLSLIVR AERLSVPSSI YEDLHERSNG DLRSAITTFQ YEAMGQSMTV KNTDTTNRDR RLSPFHALGK LLYAKRVTGA HKDPLSWWKW KDDRPPIDFN PENVLEHSGI EQFGTLSFLE HHSPDFFSDI SELSDTLATF SDSALLMDCS SISGSQNAAA SLAGRAVAAF NRHPRANKFR QLSAPKIFEV NRNRRENEVH LRHLHHSLST NRSNELSLHS ALGATSHFVS DSLSFLRRII PESIDLSLNT MHSRFRLIDK SISSSNDAKT GLLKEQQQVL LDDDIGDFDS E
|
| |