Gene Information |
Locus tag | PHATRDRAFT_44716 |
Symbol | MYT1 |
ID | 7199700 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011673 |
Strand | - |
Start bp | 17502 |
End bp | 21230 |
Gene Length | 3729 bp |
Protein Length | 1108 aa |
Translation table | |
GC content | 54% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002178912 |
Protein GI | 219116234 |
COG category | |
COG ID | |
TIGRFAM ID | |
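The coordinate fields above are internally consistent, and a short check makes the relationship explicit. The sketch below assumes that Start bp and End bp are 1-based and inclusive (which reproduces the listed Gene Length) and that the gene span contains only coding exons, introns, and the stop codon; the function names are illustrative and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def noncoding_bp(gene_len: int, protein_len_aa: int) -> int:
    """Bases in the gene span not accounted for by codons.

    Assumes one codon per residue plus one stop codon; the remainder is
    attributed to introns, which is plausible because the record marks
    the gene as spliced.
    """
    cds_len = 3 * (protein_len_aa + 1)
    return gene_len - cds_len


# Values copied from the record: Start bp, End bp, Protein Length.
length = gene_length(17502, 21230)
extra = noncoding_bp(length, 1108)
print(f"gene length: {length} bp, non-codon bases: {extra} bp")
```

Under these assumptions the span exceeds the coding length, which agrees with the "Is gene spliced | Yes" field.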
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGAGCAT TGCCGGTGTC TAAATCAATC GTCGCGGGTT TCGGTTGGCA GCAGCCCTCG CGTTCTCTCT CGGACGGAGT CCATCTCCAC GACATCCCGG CTCCCCCCGC TCCAGCGGTC TACCAGCACG CCGGTAGTCC AGCCAAAGCG TATCGCACCT TGTGGTCGCT TTTTGGTTCC ACACACACTG TCAAACATTC CGTCCTTGTC ACTATCATCC TCCTCTTCTT TTTTACTCGT CAGGAAAAGC ATACAACGAC ACAAATAGTG TGTGCAACGA GAAAGGAACT GTACCGCTGA CGGGATTCCC GCCAACGAAA TCCGCGCCCA TTTTCGCAAT CGTGTCTGGG AACCCGTTCC GTACAATCTC CGTAGAAAGA CTACCGAGCG CCACGAACGG AGCGTTGGAC GCGGCGTCGT GACGTTTCGT CGTTCCGGCA ATTGCAGGCA ATTGCAGGCA ATTAGCATTC CCTACACAGT CGTACTCGCA GTCACAGTCC CAATCACATA CAATTGTCAG CGTTGCACCT ACACAGTTCG ACACGAACCA AGAGCTCCGT CAATTCACGC AATGGACGCC ATGAGTGTAT CCGCCGTGGC GTCTCTCGAA ACCGTAGCGA CGGTAGCAGC AGCCGCCGCG CAGTCTCATC GCACCGCCAT TTCGACGCCG GCGTCTTTCG TGACCTCTGC TTCCGAGCCC ATTCCCGGAA CGATTCTGCG CACGCGGAGC CTCGACGAAG CGGCGCCGCG TTCCTCACTG TCAACGATCC GTGCCGCAGA CTTGGCAAAG ACAAACACAA ATACAAAAAC ACACACGCAC CAAACAGACG TTCCGCCAGG CAATACCGCG AGTGCTCTTT CCCACCATCC GGCGACTCTG TCACCAAATC CTGCGTTCGC GAGAAGAACC CGGTCTCTGA GCGTCACGGA CGATTCGGAA CCATCCTTCG AGGACGAGCA TCGAACAACA TCCCACCTTG CTTGGGAAAT GTCCGATCTC CCCCCACCCT GCACGACTCG TAAGGCATCT CGCCGCGTCA CCGATCCCAC CCTTTCCAGA ACACCTTTAC GCCCCCCGCA TGCGCGGCTC CGGTATCACG TCCAAAAAGA TGGCGATCGG AAAGCTCACT CCCGCTTCGG CGAGGCGGCA TCGCAAGTCT CCCATGCGTA GAGTGCCCCC CTTGACACCG CATCGGGCGT CCACCGCGTC GGGGGTTCTC ACGGCACCGT GCTCATCGGG GGATTGCATG GAAGAAGACG ACTGCGACGA TGACAACAAT AACAACAGCA ACACTAGCAT CTATCGACGC CCTCTACAGA ACATTGGGAC GGTCAAAAAG ATGCCGAAAC TCTCCCTAGG AACGGTCGCT ACTTCCGCTA GTAGTACCGC TTTTTTGGAG ACGACACACG ACACCGCCGA TAGTGACGAA ACCAACGAAA CCTCTACTCC TTTCAGGTTT ACATCCTTCC CGGCCTCGTT GCCTCGAATT CCTTACCATC CGTCCACTTT AGATACACAC GATGCCCTTC TTTGTCCCGC AACAACCGTC AAGAAACGCA TGACCTTTGA CGAAGACGAC AAGACGAACA ATACCAGCGT GAGCTCGATA CACGATGATG AAGACTTTGA AACCTTCTCC GCGGCCGCCG CGTCCCCAAC GGGTACACCC GTCGCGCGGA CTCGACTCAA CTTTGCCTCC GTACTGAGCC CGTTCGACGC ATCGACACGT GCCGCATCCA TTCCTCTCCA AGGTACGTTG GGAATGTCTC CTCTTTTGTC CTTTTTTGCT TACGGTTTTG 
TTTCTATCAA TTATTCTCAC CTTATGTTTG TCATTTAACG CGCAATTTCG GCGTCGAACA GAGTCAGATA GAACGCGCTC GCTGGACTCT GGCGACCGAT TGCCCCCCTC TCGACATTCA AACTTTTTGA GCTTTGCCGA CCACGATCAT ATCAAGCCAA AAACACCGCG CGAGGTACAG TTGCTCTTTC ACTTGGATGC GGCACCTTGC TCTCCGATTC CCGGAATTCC CGAAGAAGAG GGCGATCTGA GCAGTTCCGA CCGCAGCGAC GGCAACGACA ACCGAGAAGT CTTGACGAGT ACGAGTCGGC TATCGCAGAG TTCCACAGCC ACCACCGAAA GTGGTAGTCG ACAACGCAGA CCTATGCCCG ATATGTCGGC GTTCGATAGC GAAGCACTGT TCTCGCGCGA TCGCTCCAGT CTTGGAGTGG CAGCATCTTC ATCAACGCCA CCGGCTTCAC CCAAGCTATT GTGTCCACCT ACCCCCGTAA GGACTCCAGC TTGGGCGCAC GCCGACACCG CACATGCATC ACTCGGAGGT GGCTTCGCCG CCGGAGGCCA GTCCAAGTTA AGTCGATGCA ACTCGCTGAC TGTTACGAAG GTCCTGGCAA CGTGTCCTCT GCAAGTGTTG GAAGGCCGCT CATCGCTAGA GAATTCTTTG CTCGAGGAGG AAGGTGGTCG CAACGAATCC AACATGAGTT CAGCCTTTGG GTCTCTGCAT CACTTGGACG GCTCCGACCA CGACACAACG ATCGACGCAG ATATGGAAGA CGCCGGATCG GCCGCATCAG GCGAGCCGTC GCCATTTAAG CAAAGGCAAT CCCTCGGAGA AGTCGGGGCA GTCGTGTCCA TGACAAGCAG CTTTGAAGTT TTGTCTACTT TGGGCAGTGG AACATTTGCG GACGTTTACA AGGTACGTTC CAAAACAGAC GGAAGCCTGT ACGCCATAAA ACGAAATCGG CGCCAGTTCC GAGGAAAACG CGATCGCGAT CAAGCCCTTG CCGAGGTTCG CTATATGCAA CGATTGCAGA GCATAGTCGC TACTGCTCCG AGTGTGTCAA CCCAAAATTC GAGCTATTGT CTGCACGTGC TTTTCTTCTA CCAAGCATGG CAAGAGGATG GTCATTTCTT TTGTCAGACC GAACTATGTT GCCGTGATAC CTGTCGAGAT TTGATTGATT CCGTCAAGAC GAAGTGGAAC GAGGCCAAGC TTCGGTATCC CAGTGTTGCC AAGCTAGAGC ATTCAGGTCG CTTAGTACCG GAGTCCAGTG TATGGAAAGT ATGTCACGAC GCCTGCGCCG GACTGTCTCA TATTCACAGT CACGGATTGG TCCATCTAGA TATCAAACCG TCCAACATCT TCTTCGTCGA GCATCCGCGA TACGGGCCAA TGTGTAAGAT TGGGGATTTC GGCATGACAT GCGAAATCGG ATCTTCGGAA GACGGCCAAG AAGGCGATCA ACTCTACATG TCACTAGAAT CACTGACAAA CAGCGCCAGG CATCCCAGTG CGGACATTTT TAGTCTGGGA CTGACTCTTT ATGAGCTTGC ATCACACTCA ACTTTTGAGG TTCCGGTAGA GGGTGCACGG TGGCACGAAC TCCGCAGCGG TCGTCAGGTG CCAAATCTTC CAGAAAGTCG AAGCGCAGAT CTTGTTAAGC TGATTGGGTT AATGCTCAGT GCAGATGTCG CTCGGCGTCC GACCGCGGAC TTGGTTTTAG GAAATGATCA AGTACTACTG TTTGGAAATA ATCGCGAATC ATTTCTTATT GAATACCTGC GAGACGCATC AGCTGCCGAG CGAGCCGAAG 
TGCGAGGCAG CTTTATAGAT CGTGAGGATC AAACTCCTCG GATCGCCTCG CGGAGTCGAG TCTGTAGTCC TCCGGTCGGT ATGATGCCGC CTATGGCTCC CATGTTGTAC TCTCCGTAG
|
Protein sequence | MGALPVSKSI VAGFGWQQPS RSLSDGVHLH DIPAPPAPAV YQHAGSPAKA YRTLWSLFGS THTVKHSVLV TIILLFFFTR QEKHTTTQIR CTYTVRHEPR APSIHAMDAM SVSAVASLET VATVAAAAAQ SHRTAISTPA SFVTSASEPI PGTILRTRSL DEAAPRSSLS TIRAADLAKT NTNTKTHTHQ TDVPPGNTAS ALSHHPATLS PNPAFARRTR SLSVTDDSEP SFEDEHRTTS HLAWEMSDLP PPCTTQHLYA PRMRGSGITS KKMAIGKLTP ASARRHRKSP MRRVPPLTPH RASTASGVLT APCSSGDCME EDDCDDDNNN NSNTSIYRRP LQNIGTVKKM PKLSLGTVAT SASSTAFLET THDTADSDET NETSTPFRFT SFPASLPRIP YHPSTLDTHD ALLCPATTVK KRMTFDEDDK TNNTSVSSIH DDEDFETFSA AAASPTGTPV ARTRLNFASV LSPFDASTRA ASIPLQESDR TRSLDSGDRL PPSRHSNFLS FADHDHIKPK TPREVQLLFH LDAAPCSPIP GIPEEEGDLS SSDRSDGNDN REVLTSTSRL SQSSTATTES GSRQRRPMPD MSAFDSEALF SRDRSSLGVA ASSSTPPASP KLLCPPTPVR TPAWAHADTA HASLGGGFAA GGQSKLSRCN SLTVTKVLAT CPLQVLEGRS SLENSLLEEE GGRNESNMSS AFGSLHHLDG SDHDTTIDAD MEDAGSAASG EPSPFKQRQS LGEVGAVVSM TSSFEVLSTL GSGTFADVYK VRSKTDGSLY AIKRNRRQFR GKRDRDQALA EVRYMQRLQS IVATAPSVST QNSSYCLHVL FFYQAWQEDG HFFCQTELCC RDTCRDLIDS VKTKWNEAKL RYPSVAKLEH SGRLVPESSV WKVCHDACAG LSHIHSHGLV HLDIKPSNIF FVEHPRYGPM CKIGDFGMTC EIGSSEDGQE GDQLYMSLES LTNSARHPSA DIFSLGLTLY ELASHSTFEV PVEGARWHEL RSGRQVPNLP ESRSADLVKL IGLMLSADVA RRPTADLVLG NDQVLLFGNN RESFLIEYLR DASAAERAEV RGSFIDREDQ TPRIASRSRV CSPPVGMMPP MAPMLYSP
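The sequence fields are stored as space-separated 10-character blocks. As a minimal sketch of how they might be processed, the code below strips the block spacing, computes GC percentage, and translates codons. The Translation table field in this record is empty, so the standard genetic code is an assumption here, and the helper names are hypothetical. Because the gene is spliced, translating the full genomic sequence would be expected to diverge from the listed protein once the first intron is reached; only the first three blocks are translated.

```python
from itertools import product

# Standard genetic code (assumed; the record leaves "Translation table" blank).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame 1, ignoring any trailing partial codon."""
    seq = clean(seq)
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First three blocks of the Gene sequence field above.
first_blocks = "ATGGGAGCAT TGCCGGTGTC TAAATCAATC"
peptide = translate(first_blocks)
print(peptide)
print(round(gc_percent(first_blocks), 1))
```

The translated prefix can be compared with the first block of the Protein sequence field; applying `gc_percent` to the complete Gene sequence field would give a figure to compare with the listed GC content.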