Gene PHATRDRAFT_47550 details

Gene Information

Locus tag: PHATRDRAFT_47550
Symbol:
ID: 7202622
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011682
Strand:
Start bp: 89184
End bp: 90542
Gene Length: 1359 bp
Protein Length: 452 aa
Translation table:
GC content: 53%
IMG OID:
Product: predicted protein
Protein accession: XP_002181842
Protein GI: 219123044
COG category:
COG ID:
TIGRFAM ID:
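
The coordinates and lengths in the table above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are inclusive and that the terminal stop codon is counted in the gene length but not in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 89184
end_bp = 90542
gene_length_bp = 1359
protein_length_aa = 452

# Assumption: coordinates are inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# An unspliced CDS is read in codons of three bases.
codons = span // 3

# Assumption: the last codon is a stop codon and contributes no amino acid.
print(span == gene_length_bp)           # span agrees with the listed gene length
print(span % 3 == 0)                    # length is a whole number of codons
print(codons - 1 == protein_length_aa)  # codons minus stop agrees with protein length
```

Under these assumptions all three checks agree with the listed values (1359 bp, 453 codons, 452 aa).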


Plasmid Coverage information

Num covering plasmid clones: 35
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTACGACG CTGTCGTCGT GGGACTCGGA GGTGTTGGTT CGTTTGCTTT GCGAGCCTTA 
ACTCACAGCG GTACCCGTAC AGATACAGAA TTCGTGAGCA ATGACACCAA CACAAAGAAC
CCAAAGGGCA AGACGTACTT GGGCATCGAA CGGTTTGCCC GCTGTCACGA TCGTGGTTCG
TCGCATGGAT ACACCCGCAT CTATCGACAA GCGTACTTTG AACACGCCAA TTACGTACCC
TGGTTAAAGT TTTCCGTAAA AGCCTTCCGA GAACTTGAAG TCGCCCAAAA TGTGTCGCTC
TTGCACGAAT CTGGCTGCAT CGTGATGGAG CCAGCCGTTG CTCAAACGGA AGGAGATCCC
TCCCAAATTT CCATGCCACC GTACTGCAAG GCGTCGTACG AATCCGCCCT ACGGCACGAC
ATTGATGTGG AGTTTCTTGA CACGACGGCC TTAAAAGCAC GCTTTCCTCA ATTTCTATCC
GACCACGATA TGGTGGGACT ATTGGAACCC CAGGGTGCGG GCTTAGTTCG GCCCGAACGC
TCCATCGAGG CGGCTTTGCG AGACGCCGCA GAGCACGAGG GTGTGAAAAT ACAAGAGCAT
ACCCAGGTCT TGTCTTATCG GCAAAAACAA TACCACGATG ACACCGAGAT TGTCGAAGTC
GTCATTCAAC GGGACGGAGA AGACGCATCC GAAACGATTC TGACTAAATC GCTCCTGATA
GCCGCCGGTG CGTGGGCCTC GACTTTTGTT CCTTCTTGGA AACCGTACGT TGTTCCCAAA
CGGCAATTGC AAGGATGGAT CGATGTATCT CATACCGCGG ATGCGTCATT GTACGACGGG
GGTAAACTAC CGGGGTGGAT CCTCGTCACA CCATCGTGGC CGGTACCCAT GTACGGGCCG
CCGTGTGATC CGAGCGGCGA CGATCCGGCT CATCGCCATT GGCTCAAAGT TGGCTTGCAC
GGAAGAGACA TACCGATCGC GGATCTCTCC CAAAATCCCC GCGAAGCATC GGAAGACGAA
ATTCAAGAAG TCCGCGAGGC AGCAACCCAG GTATTTACCC GGGACGTCTG GGCCAAGAAC
GACGACCAGA AATTTCCTGA TCTAGCGCAA GTAACACCGT GTATATATAC CATGACCCCC
GACACTCACT TCGTTATTGG CTCACCGCCG CTTCTCTCTG ATCGGCTTGG AACGAGCCCT
GCTCCAAAAT CGTGCGTCTT TGCGATTGCT GGCTTGTCCG GACACGGATT CAAAATGACT
CCGGCCTTGG GACAAATGAT GGCGGATTTT GCTAACGGTG TTGACGTTGA AAGCGTTTGG
GGAACGTCTT TTTGTTCACC ATTCCGCTTT GGCATTTAA
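
The gene sequence is printed in space-separated blocks of ten bases, so it needs light cleaning before any analysis. The sketch below is one possible way to do that and to compute a GC fraction that can be compared against the GC content listed in the Gene Information table. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the sequence is used here for brevity.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (spaces, newlines) and upper-case the bases."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C (0.0 for an empty string)."""
    if not seq:
        return 0.0
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First line of the gene sequence as printed in the record.
first_line = "ATGTACGACG CTGTCGTCGT GGGACTCGGA GGTGTTGGTT CGTTTGCTTT GCGAGCCTTA"

seq = clean_sequence(first_line)
print(len(seq))                    # six blocks of ten bases
print(f"{gc_fraction(seq):.0%}")   # GC fraction of this line only, not the whole gene
```

Passing the full sequence block through `clean_sequence` should yield a string whose length matches the listed gene length.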
 
Protein sequence
MYDAVVVGLG GVGSFALRAL THSGTRTDTE FVSNDTNTKN PKGKTYLGIE RFARCHDRGS 
SHGYTRIYRQ AYFEHANYVP WLKFSVKAFR ELEVAQNVSL LHESGCIVME PAVAQTEGDP
SQISMPPYCK ASYESALRHD IDVEFLDTTA LKARFPQFLS DHDMVGLLEP QGAGLVRPER
SIEAALRDAA EHEGVKIQEH TQVLSYRQKQ YHDDTEIVEV VIQRDGEDAS ETILTKSLLI
AAGAWASTFV PSWKPYVVPK RQLQGWIDVS HTADASLYDG GKLPGWILVT PSWPVPMYGP
PCDPSGDDPA HRHWLKVGLH GRDIPIADLS QNPREASEDE IQEVREAATQ VFTRDVWAKN
DDQKFPDLAQ VTPCIYTMTP DTHFVIGSPP LLSDRLGTSP APKSCVFAIA GLSGHGFKMT
PALGQMMADF ANGVDVESVW GTSFCSPFRF GI
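
The protein sequence can be checked against the gene sequence by translating the latter codon by codon. The record leaves the translation table field empty, so the sketch below assumes the standard genetic code; the `translate` helper and the constant names are hypothetical.

```python
from itertools import product

# Assumption: standard genetic code, codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 0, stopping at the first stop codon.

    Whitespace is ignored. Bases other than A, C, G, T raise a KeyError.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence as printed in the record.
first_blocks = "ATGTACGACG CTGTCGTCGT GGGACTCGGA"
print(translate(first_blocks))  # MYDAVVVGLG
```

The output matches the first block of the protein sequence above, and the final nine bases of the gene (GGCATTTAA) translate to GI followed by a stop, which matches the end of the protein.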