Gene PHATRDRAFT_49919 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_49919 
Symbol 
ID7198619 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011693 
Strand
Start bp290700 
End bp294011 
Gene Length3312 bp 
Protein Length1040 aa 
Translation table 
GC content46% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002184773 
Protein GI219129179 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.099413 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTGTCCAA CTTGCATTAA AGAGGCTTTC AAGGATCCAC ACGGGGAAAA CTTCAGGTAC 
TGTCCTATTT GCCGGGAAGC ATGCGCTGTT GATATGATCG AATCTGTATG CGGACCGGGT
GCAGTGAAGG AGATGGAACG ACAGTTGCGC AACAAAGTAG AAATAGAAGT CCAGAGAGGG
ATGGACAAGA AGCAGGAGGA AAAGCTCAAA ATGAATGAAA GCAAGCAAGT GGCCTTGAAG
CTCTACCAGG ATCTCTGCGA ACGATTGAAT ATGAAATGTC CGCGATGTGA ATTGGTCTTT
GATGACTACA CAGGCTGTAA CGCACTTACA TGCCGAGGTC AGAGCTGCAG AGCCGCCTTT
TGTGCCATCT GCTTGAAGGA CTGTGACACG AATGCCCATC ACCATGTTCG TACCTGCCAC
GGGGACCTGT TTGACAAGAA GGCGTTCAAT ACGGCGAGAA AGAAGAGAGA AATCGACACC
ATCAACGATT TTTCGGCCGA AATCAGCGGA GAGCCACACG AGGTGAAAGA ATTGGTTCGA
ATCGAATTTG AAAAGTCCCA GTCAGACCAA ACACAATATT GTGAGGGAGG TTTCCCGTTT
GCTCCATTCC TATCTAGAGC AAAGAGAGAT CTTCTTGCGG CAGTCAACTC TGGACGACTT
TCAATTTTAA GCGACGCGGA AGCATATCCT GTCGAAACTG GTCTTACTCG ACGTGACATT
TCTCCGCGAA ATGTCATCCC CGAGAACTAT AGGCTTCGCC TACTACCATC CATGGAAAAT
ATTTATTCAA TTATTCTTGA AGAACAAGTG CACACCATCA ACGGTATTGC GTGGAAGAAG
ATAGCGTTGC AGGACGAAAA AGAAATACGC TCCAAAGACT TGGGCAGGCC AACAGTAGAC
GCTTTGAAAA ACATTGCGAT GGCATTGTCT TGCGGAGTTG TAGCCTTTGT AGGTACCAGC
TCTCTGTACC AAAGTTGTAT TTCACGAAAA GAAGGCAAGT ACGACCGAGA CGAGGCGAAG
GATCCTACCA TCTGTGTACA GTTCCACAAA ATTTGTCGAA ACGGTAACAT GAAAAAAAAC
GGGCAATCAC TATCGGAGCT GGGATTGGAG GAACGTGATG TAATAGGCGT CGACCAAAAT
GTCCGTATGC TTATATTAGC GGATCACGTC TTGAAATCTT CGGATGAGTC GATGAGCTTT
GAACCGCTCC AACACTTTGT TACAGGTCGG CAGCCGTCTC GAGTTTTTAC ATCTATTTCA
ATGCCACCAC CCCCCAGCTT CTTGACCTTG AACAACAAAC AGCAAAAGGT TGCGCACCCT
CTTTCTCTCC TCACTGCAAT GGAAGTTGCC GGTCCTCCAG GTACAGGCAA AACAAAAACC
ATCATGGAGC TTGTTAGGGG CATTCTTCAC TGCACAGACT ACGATGTCAT TCTCATGTCG
GAGAGGAACG GCGCCATTGA TGCTATTGCC GAAAAGATGG CTGGCGATTG TTTAACCTTA
AATCAATCTC AGTCGGTAAA GAGCGTTTCA AATGTGGAGC TCTGGTCCAA GGTCCTATCA
TTTGGTTCGG TTGGAGGCAT GGGCCCATTC TCCGCTTTAT TTACCTCCAC CGCCAAAGAA
TTGTATGCAA TATTTGTATA TTCTTGCTTA CGGAAGTGCT GGACTTGAAA CCTCTCACAA
AGATTTTGAC TCTTCACTAT TGACAGGTAC CACCCAGAGG TGCTGGAAGC CGACCGAGTC
TTGAAAAAGA AAATCAAGTT CATGGAAAAT TACTCGAGGC GGTTGAGAGA GGCACTCAGT
AGGTCGATAT ATGATCTCGA GGGAGAATTA TTTGAGGAGT CGAAGCTCAG CATGAGGGGT
CGACTTATTC AAAACAACAA GGAGAATAAT CTAGAGAACG CCCGTGATAT CATAAGCTCA
ACCATCAATG CTCTGGGTGC AGTGAAACAG TTCCGAGCCG ATAGCCCAGG CAAGCAGAAT
TCTGATCTTG TTATCCTTTC ATTGGAGCAC TCTGCTATCC TCCGAGATCT GATTCCCATC
AATTGTGAGA AAGCGGATCA CCACTCTGTT CCAAAGTCAA TCACTTCCAA CCCGAAAGCA
ATCAACCGGG CTCTGGAAGC CATAGAAAGA CGACTTCAAG AAGTCCTTCA GAACAAGTTA
TTTACTGTTG CGGCATGCGA AGCCATGGAC TACTCGATCC GCCTATCGCA AAGCTCGCTT
CAAGACGTTA GATTACGCAT CAAGATCGAT CTGCAAAAGG ACGCTCGCGT TTTTTTATCG
ACCATTGGTT CCTCGCACAA AATAAACAAG AGTGTTCTAG AAGCCCACAG AGTAGAACCA
GAGATAGTTT CCTTGGACGA AGATGCATTG AACGAAATGC AGCGAATGAA TGAACAGACA
AAGCCCACCA TTGTCATTTT TGATGAGGCT GGTTGCATTC CTTCGTATGA GCTACTAGGA
CTTTCTCGAT TGGGACGATC AATTAAATCT ATAATCTGCG TCGGGGACAA GCATCAGCTA
CCACCCTACA ACCCTGGATC AACGAAAAAC GACTTTAAAA AAGGAGGTTC ATTTGGCAAC
GTAAGGAGGG GAAAGCCAGT ACGGCAGCCA GAAAAAGTAC AAAGTTTACT GGATGCAAGT
GGCTTGCGAT CAGAAAAAGT CAAAGTTGAA CTCACGGAGC AATACCGCGT TCCTCGGGAT
ATTGCCGGTG TTTTGAACGC TCGTATCTAT CGTGGAAATT ACCAGACTTC TGTTCATTGC
AACGCTCCGA TAAAAGGTTT TCGTCTCGTG AACGTTCCAA AAAGTGGCCG CGACCAACCT
TACGTGAATC ATGACGAAAT TGACGCTTGT ATTCAGCTCG TGGAAAGCTC CCTGCAGGCA
GGGTTGAAAC ACACAATGGT GCTGACACCG GTAAGACGCA TGGGCTTTTC TTTCCCTGGA
CCACTACGCG CTATGCCTCA ACGGAACTAA CACTCGGTTT TTGTCCAAAC TTCCAGTACA
AAAAACAGCA GCGGGAAATG GAGTTCAGAT TTAAAAAGAA AGGGTGGAAT GACATTCTTT
CTGTACTGAC AATTGATCAG TGTCAAGGCC AACAGGCTGA TATTGTAATA CTCAGTCTGG
TCCGCAAACC AACGCGATTT CTTGACAAGA ATCGTCTCAA TGTGGCGCTG TCGCGGGCCT
GTCAAAAGAT GTACTTCCTT TGCGACAAAA ACCTATTTGT TGAAGCGAGC CAGAATCAAG
CCTGGGAGTG TCACCTTTTG GCGAAGGATC TGCTTGATCT AGCCGGTAAT TGAGGCAAAG
AAAACAAAGA CC
 
Protein sequence
MCPTCIKEAF KDPHGENFRY CPICREACAV DMIESVCGPG AVKEMERQLR NKVEIEVQRG 
MDKKQEEKLK MNESKQVALK LYQDLCERLN MKCPRCELVF DDYTGCNALT CRGQSCRAAF
CAICLKDCDT NAHHHVRTCH GDLFDKKAFN TARKKREIDT INDFSAEISG EPHEVKELVR
IEFEKSQSDQ TQYCEGGFPF APFLSRAKRD LLAAVNSGRL SILSDAEAYP VETGLTRRDI
SPRNVIPENY RLRLLPSMEN IYSIILEEQV HTINGIAWKK IALQDEKEIR SKDLGRPTVD
ALKNIAMALS CGVVAFVGTS SLYQSCISRK EGKYDRDEAK DPTICVQFHK ICRNGNMKKN
GQSLSELGLE ERDVIGVDQN VRMLILADHV LKSSDESMSF EPLQHFVTGR QPSRVFTSIS
MPPPPSFLTL NNKQQKVAHP LSLLTAMEVA GPPGTGKTKT IMELVRGILH CTDYDVILMS
ERNGAIDAIA EKMAGDCLTL NQSQSVKSVS NVELWSKVLS FGSVGGMGPF SALFTSTAKE
LYHPEVLEAD RVLKKKIKFM ENYSRRLREA LSRSIYDLEG ELFEESKLSM RGRLIQNNKE
NNLENARDII SSTINALGAV KQFRADSPGK QNSDLVILSL EHSAILRDLI PINCEKADHH
SVPKSITSNP KAINRALEAI ERRLQEVLQN KLFTVAACEA MDYSIRLSQS SLQDVRLRIK
IDLQKDARVF LSTIGSSHKI NKSVLEAHRV EPEIVSLDED ALNEMQRMNE QTKPTIVIFD
EAGCIPSYEL LGLSRLGRSI KSIICVGDKH QLPPYNPGST KNDFKKGGSF GNVRRGKPVR
QPEKVQSLLD ASGLRSEKVK VELTEQYRVP RDIAGVLNAR IYRGNYQTSV HCNAPIKGFR
LVNVPKSGRD QPYVNHDEID ACIQLVESSL QAGLKHTMVL TPYKKQQREM EFRFKKKGWN
DILSVLTIDQ CQGQQADIVI LSLVRKPTRF LDKNRLNVAL SRACQKMYFL CDKNLFVEAS
QNQAWECHLL AKDLLDLAGN