Gene PHATRDRAFT_49856 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_49856 
Symbol 
ID7198580 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011693 
Strand
Start bp100363 
End bp102983 
Gene Length2621 bp 
Protein Length810 aa 
Translation table 
GC content52% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002184647 
Protein GI219128916 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACACTGA ACGATGATCC GACCCTTCAT CCAAAGGAAT CGGTAGCATC GGACGCCGCC 
ACAGTGTACC TGCCTTCGGG TGAAGGGAGA ACCACTGCCG CTACTTTTCC TTATTCTCGA
CACCGACCAC AGATACCGTG CATGACGGAT TTACGCAAAT GGCTCCAAGC ATCCCGAACG
CCGCGAAGGT TGGTCGTACT ATTCGTCCTG CTGGAGTCTC TGGCATTCCA GAAAGGATCG
ATGCGAGTGA CGGCAGAGAG CGAAGTACCC GACAACGTGG CAGGCCAGCT GTTGCGCCAG
ACGGTAAAGG CTGCAGCAGC CTCGTCTCCG TCGCGAGATA GCTTGGACGG CGCCACTGCT
AGCGGACTCG TCTGGAGTGA TCTGAGTGTA GTTTCGTCCA ATCGGGATGT CACCCTTCTG
CACCCGTTTT CTGGATGGAT CACCAGCGGT CAGATTGGCG GGATCCTCGG ACCGAGCGGA
AGTGGCAAGT CAACTTTTCT ATCGGCCCTC TCGGGTTCTT CACGACAACT TTACCAAACC
GGGCAAGTCT GGCACTATCT ACATACCGTT GTGCACGGCT CCAAGGACAC ACAAACACCG
ATACAATTGT CTCGAATACC CACACAAGAG GTTGCTTGGC TTCAGCAACA CGACGATTTT
TTCAGCATGC TAACGGTCCG GGAGACTCTA GATTTGGCGG CATATTTAGA GCTACCCCAC
TTAGTCCTGT CGCAACGAGA CGCTTTGGTT CAAACTCACT TGGATGCGCT GGGCCTAGCT
CACGCCGCCG ACCGACCAAT TGGCTCGGAT CTTACCGGTT TGGGCACTGC ACGCTTATCC
GGTGGTGAGC GCCGACGATT GTCGGTGGCG TTGGAACTGT TGACGGAAAA ACAACTTCTT
TTAGCGGATG AACCCACGTC GGGTTTGGAC AGCAGTATCA GTGTCAAAGT GATGCAAAAT
ATCCGAGACG TTTGTCGCAA GCGAAATATC CCTTGCTTAT GTGCAATTCA TCAGCCTCGA
TCATCCATTT GGCACTTATT GGATACCCTC ATCCTGATGG CCCCCGGTGG ACGCGTGGTA
TACGCCGGCC CGAAATCCGA AGCCGTCGCA TATTTTGCGA CCCAAGGGTA CCGTTGCCCC
GACGCGACCA ACCCGGCCGA GTACTTTGTC GATCTCGTTT CCGTCGATAC CGAAGACGAG
CAGGTGGCCG CTATCGACGA AGCACGGATT GATAAACTCG CTTCCGTCTT TCGTGACTAC
CAACAAACAT CTTTGCTTCT GCCTGCCAAA CGACCTCAAG TGAATTTGAG TCTGGATATC
GAGACTGATA TACAACAACC TTCAAATGGT CAATCGATGA GACGGGCTTT CCAAGAAAAA
AGCCAACTTG GGCTCCTGAA ATTTCTGTGG GTTCCACGAT TGGGAGCCTT GCTAAAGAGG
TCGTGGCGAC AAAATGTTCG CAACTGGGAA ATCAATATTT TCCGAGCGTT CGCCAGCGCG
GGTAACGCGA TTCTCCTGGC TCAAATCTTT CCAACTGTCC GAGGAAGTGT CGCCAAAGCC
AATAGTGTAG CCGACAGGGT GGCACTGTTG TCGTTCGGTG CAATCAACAT GTGCTTTATT
GCATTTATGA AGACTGTCAC GTTAATCGCG GAAGAGAAAC CGGTTGTTCA ACGGGAACAA
TCACGTCGTC AGTACTCGAG CTTGGAGTAC CTGGTGGCCA AGGTTTTAGC AGAATTTCCC
TTGGACTCCT TGTTTTCCGC TATCTTTACA GCCTTCCTAA AAAAGTGCTC AGGAGTCCGG
ATTTCGTGGG CCAAGCTGAC TGGGGTCTTT AGTTTGTTGA CGGTGTCTGG CGCTTCGCTT
GGTTTGATGC TGGGCAGCTG GCTTCCAACC GAAAAACTGG CTACGACGGG CAGTATTCCA
GTTCTAGTCG TATTGATGGT TGTGGGTATC AGTAAGTGCT GTTCCATGTG TTTGACTGAA
TCGATTTGGT TTTGTGTTGT CCGCAGCCTC TCAATCATGG TGGATACTCT CGAATAACGT
TGTAGTCAAT CCGAGTGGCG TAGATCAGTC CACCCCTCCA CCGGCGGTCG TGCAGGTATT
GAAACGTTGT AGCCCATTTG CCTACGCTAT TGAAGCGCTC TGTCTTGGGG AGTACCCCGG
AATGGAATTC GAACGTCAGT CAGGCTGGTT CGGCCGTATC AGGGACTTGC CTAGAATGGG
CGGATTGGTA CGTTTGTTGT TCGTCTCACG TTTACTCTTG CGTCGAACGA AAGAAAATAT
CACTGATCCT CGCGGGTGTT TTCTTTTTGC AATCGTTGCA GGCCATGGTT CGAAATGGTG
ATCAAGTCCT GGAGGCGCTG GGCTTACAAG ACAAGGGGTA TGTTCGAGTC ATGCAACACC
TTGGAGTATT GTCTGCTGCG TACCTGGCAG TTAGTTGGCT GGGCATGCTT GTACAGGGTA
GAAAACATGG CATGCATGGT GCGGTCGAAG CGGACACTAG CCAGCACGTA CAGCGAACCA
AGGCTCCAAA GGACACCGAG GGCAGCTTTC TGTCCAAGTC AACAACGGAA ACGTCAACAT
CACAACGACA TTTGAAGGTT CCTTTAAAGA TCCGAGTCTA A
 
Protein sequence
MTLNDDPTLH PKESVASDAA TVYLPSGEGR TTAATFPYSR HRPQIPCMTD LRKWLQASRT 
PRRLVVLFVL LESLAFQKGS MRVTAESEVP DNVAGQLLRQ TVKAAAASSP SRDSLDGATA
SGLVWSDLSV VSSNRDVTLL HPFSGWITSG QIGGILGPSG SGKSTFLSAL SGSSRQLYQT
GQVWHYLHTV VHGSKDTQTP IQLSRIPTQE VAWLQQHDDF FSMLTVRETL DLAAYLELPH
LVLSQRDALV QTHLDALGLA HAADRPIGSD LTGLGTARLS GGERRRLSVA LELLTEKQLL
LADEPTSGLD SSISVKVMQN IRDVCRKRNI PCLCAIHQPR SSIWHLLDTL ILMAPGGRVV
YAGPKSEAVA YFATQGYRCP DATNPAEYFV DLVSVDTEDE QVAAIDEARI DKLASVFRDY
QQTSLLLPAK RPQVNLSLDI ETDIQQPSNG QSMRRAFQEK SQLGLLKFLW VPRLGALLKR
SWRQNVRNWE INIFRAFASA GNAILLAQIF PTVRGSVAKA NSVADRVALL SFGAINMCFI
AFMKTVTLIA EEKPVVQREQ SRRQYSSLEY LVAKVLAEFP LDSLFSAIFT AFLKKCSGVR
ISWAKLTGVF SLLTVSGASL GLMLGSWLPT EKLATTGSIP VLVVLMVVGI INPSGVDQST
PPPAVVQVLK RCSPFAYAIE ALCLGEYPGM EFERQSGWFG RIRDLPRMGG LAMVRNGDQV
LEALGLQDKG YVRVMQHLGV LSAAYLAVSW LGMLVQGRKH GMHGAVEADT SQHVQRTKAP
KDTEGSFLSK STTETSTSQR HLKVPLKIRV