Gene PHATRDRAFT_44716 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_44716 
SymbolMYT1 
ID7199700 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011673 
Strand
Start bp17502 
End bp21230 
Gene Length3729 bp 
Protein Length1108 aa 
Translation table 
GC content54% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002178912 
Protein GI219116234 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGAGCAT TGCCGGTGTC TAAATCAATC GTCGCGGGTT TCGGTTGGCA GCAGCCCTCG 
CGTTCTCTCT CGGACGGAGT CCATCTCCAC GACATCCCGG CTCCCCCCGC TCCAGCGGTC
TACCAGCACG CCGGTAGTCC AGCCAAAGCG TATCGCACCT TGTGGTCGCT TTTTGGTTCC
ACACACACTG TCAAACATTC CGTCCTTGTC ACTATCATCC TCCTCTTCTT TTTTACTCGT
CAGGAAAAGC ATACAACGAC ACAAATAGTG TGTGCAACGA GAAAGGAACT GTACCGCTGA
CGGGATTCCC GCCAACGAAA TCCGCGCCCA TTTTCGCAAT CGTGTCTGGG AACCCGTTCC
GTACAATCTC CGTAGAAAGA CTACCGAGCG CCACGAACGG AGCGTTGGAC GCGGCGTCGT
GACGTTTCGT CGTTCCGGCA ATTGCAGGCA ATTGCAGGCA ATTAGCATTC CCTACACAGT
CGTACTCGCA GTCACAGTCC CAATCACATA CAATTGTCAG CGTTGCACCT ACACAGTTCG
ACACGAACCA AGAGCTCCGT CAATTCACGC AATGGACGCC ATGAGTGTAT CCGCCGTGGC
GTCTCTCGAA ACCGTAGCGA CGGTAGCAGC AGCCGCCGCG CAGTCTCATC GCACCGCCAT
TTCGACGCCG GCGTCTTTCG TGACCTCTGC TTCCGAGCCC ATTCCCGGAA CGATTCTGCG
CACGCGGAGC CTCGACGAAG CGGCGCCGCG TTCCTCACTG TCAACGATCC GTGCCGCAGA
CTTGGCAAAG ACAAACACAA ATACAAAAAC ACACACGCAC CAAACAGACG TTCCGCCAGG
CAATACCGCG AGTGCTCTTT CCCACCATCC GGCGACTCTG TCACCAAATC CTGCGTTCGC
GAGAAGAACC CGGTCTCTGA GCGTCACGGA CGATTCGGAA CCATCCTTCG AGGACGAGCA
TCGAACAACA TCCCACCTTG CTTGGGAAAT GTCCGATCTC CCCCCACCCT GCACGACTCG
TAAGGCATCT CGCCGCGTCA CCGATCCCAC CCTTTCCAGA ACACCTTTAC GCCCCCCGCA
TGCGCGGCTC CGGTATCACG TCCAAAAAGA TGGCGATCGG AAAGCTCACT CCCGCTTCGG
CGAGGCGGCA TCGCAAGTCT CCCATGCGTA GAGTGCCCCC CTTGACACCG CATCGGGCGT
CCACCGCGTC GGGGGTTCTC ACGGCACCGT GCTCATCGGG GGATTGCATG GAAGAAGACG
ACTGCGACGA TGACAACAAT AACAACAGCA ACACTAGCAT CTATCGACGC CCTCTACAGA
ACATTGGGAC GGTCAAAAAG ATGCCGAAAC TCTCCCTAGG AACGGTCGCT ACTTCCGCTA
GTAGTACCGC TTTTTTGGAG ACGACACACG ACACCGCCGA TAGTGACGAA ACCAACGAAA
CCTCTACTCC TTTCAGGTTT ACATCCTTCC CGGCCTCGTT GCCTCGAATT CCTTACCATC
CGTCCACTTT AGATACACAC GATGCCCTTC TTTGTCCCGC AACAACCGTC AAGAAACGCA
TGACCTTTGA CGAAGACGAC AAGACGAACA ATACCAGCGT GAGCTCGATA CACGATGATG
AAGACTTTGA AACCTTCTCC GCGGCCGCCG CGTCCCCAAC GGGTACACCC GTCGCGCGGA
CTCGACTCAA CTTTGCCTCC GTACTGAGCC CGTTCGACGC ATCGACACGT GCCGCATCCA
TTCCTCTCCA AGGTACGTTG GGAATGTCTC CTCTTTTGTC CTTTTTTGCT TACGGTTTTG
TTTCTATCAA TTATTCTCAC CTTATGTTTG TCATTTAACG CGCAATTTCG GCGTCGAACA
GAGTCAGATA GAACGCGCTC GCTGGACTCT GGCGACCGAT TGCCCCCCTC TCGACATTCA
AACTTTTTGA GCTTTGCCGA CCACGATCAT ATCAAGCCAA AAACACCGCG CGAGGTACAG
TTGCTCTTTC ACTTGGATGC GGCACCTTGC TCTCCGATTC CCGGAATTCC CGAAGAAGAG
GGCGATCTGA GCAGTTCCGA CCGCAGCGAC GGCAACGACA ACCGAGAAGT CTTGACGAGT
ACGAGTCGGC TATCGCAGAG TTCCACAGCC ACCACCGAAA GTGGTAGTCG ACAACGCAGA
CCTATGCCCG ATATGTCGGC GTTCGATAGC GAAGCACTGT TCTCGCGCGA TCGCTCCAGT
CTTGGAGTGG CAGCATCTTC ATCAACGCCA CCGGCTTCAC CCAAGCTATT GTGTCCACCT
ACCCCCGTAA GGACTCCAGC TTGGGCGCAC GCCGACACCG CACATGCATC ACTCGGAGGT
GGCTTCGCCG CCGGAGGCCA GTCCAAGTTA AGTCGATGCA ACTCGCTGAC TGTTACGAAG
GTCCTGGCAA CGTGTCCTCT GCAAGTGTTG GAAGGCCGCT CATCGCTAGA GAATTCTTTG
CTCGAGGAGG AAGGTGGTCG CAACGAATCC AACATGAGTT CAGCCTTTGG GTCTCTGCAT
CACTTGGACG GCTCCGACCA CGACACAACG ATCGACGCAG ATATGGAAGA CGCCGGATCG
GCCGCATCAG GCGAGCCGTC GCCATTTAAG CAAAGGCAAT CCCTCGGAGA AGTCGGGGCA
GTCGTGTCCA TGACAAGCAG CTTTGAAGTT TTGTCTACTT TGGGCAGTGG AACATTTGCG
GACGTTTACA AGGTACGTTC CAAAACAGAC GGAAGCCTGT ACGCCATAAA ACGAAATCGG
CGCCAGTTCC GAGGAAAACG CGATCGCGAT CAAGCCCTTG CCGAGGTTCG CTATATGCAA
CGATTGCAGA GCATAGTCGC TACTGCTCCG AGTGTGTCAA CCCAAAATTC GAGCTATTGT
CTGCACGTGC TTTTCTTCTA CCAAGCATGG CAAGAGGATG GTCATTTCTT TTGTCAGACC
GAACTATGTT GCCGTGATAC CTGTCGAGAT TTGATTGATT CCGTCAAGAC GAAGTGGAAC
GAGGCCAAGC TTCGGTATCC CAGTGTTGCC AAGCTAGAGC ATTCAGGTCG CTTAGTACCG
GAGTCCAGTG TATGGAAAGT ATGTCACGAC GCCTGCGCCG GACTGTCTCA TATTCACAGT
CACGGATTGG TCCATCTAGA TATCAAACCG TCCAACATCT TCTTCGTCGA GCATCCGCGA
TACGGGCCAA TGTGTAAGAT TGGGGATTTC GGCATGACAT GCGAAATCGG ATCTTCGGAA
GACGGCCAAG AAGGCGATCA ACTCTACATG TCACTAGAAT CACTGACAAA CAGCGCCAGG
CATCCCAGTG CGGACATTTT TAGTCTGGGA CTGACTCTTT ATGAGCTTGC ATCACACTCA
ACTTTTGAGG TTCCGGTAGA GGGTGCACGG TGGCACGAAC TCCGCAGCGG TCGTCAGGTG
CCAAATCTTC CAGAAAGTCG AAGCGCAGAT CTTGTTAAGC TGATTGGGTT AATGCTCAGT
GCAGATGTCG CTCGGCGTCC GACCGCGGAC TTGGTTTTAG GAAATGATCA AGTACTACTG
TTTGGAAATA ATCGCGAATC ATTTCTTATT GAATACCTGC GAGACGCATC AGCTGCCGAG
CGAGCCGAAG TGCGAGGCAG CTTTATAGAT CGTGAGGATC AAACTCCTCG GATCGCCTCG
CGGAGTCGAG TCTGTAGTCC TCCGGTCGGT ATGATGCCGC CTATGGCTCC CATGTTGTAC
TCTCCGTAG
 
Protein sequence
MGALPVSKSI VAGFGWQQPS RSLSDGVHLH DIPAPPAPAV YQHAGSPAKA YRTLWSLFGS 
THTVKHSVLV TIILLFFFTR QEKHTTTQIR CTYTVRHEPR APSIHAMDAM SVSAVASLET
VATVAAAAAQ SHRTAISTPA SFVTSASEPI PGTILRTRSL DEAAPRSSLS TIRAADLAKT
NTNTKTHTHQ TDVPPGNTAS ALSHHPATLS PNPAFARRTR SLSVTDDSEP SFEDEHRTTS
HLAWEMSDLP PPCTTQHLYA PRMRGSGITS KKMAIGKLTP ASARRHRKSP MRRVPPLTPH
RASTASGVLT APCSSGDCME EDDCDDDNNN NSNTSIYRRP LQNIGTVKKM PKLSLGTVAT
SASSTAFLET THDTADSDET NETSTPFRFT SFPASLPRIP YHPSTLDTHD ALLCPATTVK
KRMTFDEDDK TNNTSVSSIH DDEDFETFSA AAASPTGTPV ARTRLNFASV LSPFDASTRA
ASIPLQESDR TRSLDSGDRL PPSRHSNFLS FADHDHIKPK TPREVQLLFH LDAAPCSPIP
GIPEEEGDLS SSDRSDGNDN REVLTSTSRL SQSSTATTES GSRQRRPMPD MSAFDSEALF
SRDRSSLGVA ASSSTPPASP KLLCPPTPVR TPAWAHADTA HASLGGGFAA GGQSKLSRCN
SLTVTKVLAT CPLQVLEGRS SLENSLLEEE GGRNESNMSS AFGSLHHLDG SDHDTTIDAD
MEDAGSAASG EPSPFKQRQS LGEVGAVVSM TSSFEVLSTL GSGTFADVYK VRSKTDGSLY
AIKRNRRQFR GKRDRDQALA EVRYMQRLQS IVATAPSVST QNSSYCLHVL FFYQAWQEDG
HFFCQTELCC RDTCRDLIDS VKTKWNEAKL RYPSVAKLEH SGRLVPESSV WKVCHDACAG
LSHIHSHGLV HLDIKPSNIF FVEHPRYGPM CKIGDFGMTC EIGSSEDGQE GDQLYMSLES
LTNSARHPSA DIFSLGLTLY ELASHSTFEV PVEGARWHEL RSGRQVPNLP ESRSADLVKL
IGLMLSADVA RRPTADLVLG NDQVLLFGNN RESFLIEYLR DASAAERAEV RGSFIDREDQ
TPRIASRSRV CSPPVGMMPP MAPMLYSP