Gene PHATRDRAFT_47247 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_47247 
Symbol 
ID7202300 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011681 
Strand
Start bp59062 
End bp60699 
Gene Length1638 bp 
Protein Length467 aa 
Translation table 
GC content47% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002181476 
Protein GI219122280 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000828474 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GCGAAGTCAG ACAACCTCCG GGATGAACTT CAAATACTAA CTGTAACAAG GGATACTCCA 
CTGCCTTTTT GCGAAACGAA GACATACCAT GCAACCATGA ACCACTAGAA AGCCAGCCTC
TTGTGAACTC TGATAGTCAA AGTCATTGAA AATAGCGCAA GGGCGTAATA TGGGCGAGCT
TGTGGCTCGG CCCCCCATTC ATACGGCGTC GTATGATCGC CTGCAGCCAG ACCGTCTATC
CACAATTCTG CTTTTCCACG AAGACGATCC CGTCAACTAT CGCTATAAAG TTGTAGTCGT
GGACCGAATG CTGCTTCCCT CCACCCCCCT TCAATACGGG ACTGCCGCAT TTCTCATTCC
AGCTGGACGG GAAGCTGAAT ACATCTTTGC TTCGGAGATT GGATTAAAAT CAATAGCGGA
ATCTGCCAGC ACGGCCCGAC TGATCGCAAT CAGTTTTGGA AGACATCATA GGTTCGGCTC
GCAGATAATC GTTCAGGAAG AACTGTCGTT CGTCGTACAG GTTCTCTCTC GCCAAGGCAC
CTTTCTACCG AAACCACACC AGGAGCTGCT TGCCGAGGTT GAAATCCCAT TCATGGCTGT
AGATGGGATT GGAAATCGCC ATATTGTTGC AGAAGGAGAG AGTCAAATTT CAGGCAAGTA
TCTTGTTGAG CAAGTCGATG TTGATGGAAT GCAGGTCCGT CGACTATATT TTGCGAACAA
TCCTTTTGTA ATTCAGTCAG AAGCTGTGCT GCGCGATCAG GGTCGGGTTG ATAAGAGCTG
TTCTGCGTTT GACTATCACA AAACTATGGC TGCCGGTATT CTGGCACTGG TTGATTCGGA
TGTTTTGACG CACGGTCTTC TGGTTGGTCT GGGGGGTGGT TGTTTCGTCA ACTTGATTGG
GCATCTCCTG AATGATCTTG AATTGTCGGT AGTCGAACTT GACCCTGCTA TACTGAAAAT
TGCCGAAGAG CACTTTGATC TAGATCTGGA GAGCAACCGC TTGGACATCC GGGTGGGCGA
CGGCCTAGAG ATCATGCCAC TTACCCATGA CGCCGTTTCG GGTTGTCCGA CAACCTTCGC
AAAAGAAAGT ATGGCATTTG TTGCCATTGA TGTTGATTCG AAGGACCAAT CAGCTGGTAT
GTCATGTCCG CCAGAATCTT TCGTAGAAAT TGAGTATTTG TCGAGGTTGG CTGAGCTTAT
CCACCCACAT GGAGTTCTAG TCATGAATAT ATCTGCTCGA GATCCTGAAA AGCTAGATCA
TGTTTGTCGA AGAGTGCAAC AAGTTTTTCG AAACGTTGCG CTGGCAGCAC CTCACGACGG
AGAGGGCAAT GGGAAAAAGA AAGATATAAA TATGGTTCTA TTTGGGAAGC ACGCCGTCAT
GGAAGTGTCA ACATCAAAGC TTTGTCCACT GGTGGAACCT CATACTACCT ATGAATCAGG
GCCTGACCTA AAGAGGGCAT TGGCAAATCT TGTTGCATGG AACGAAACAA AATCAATTGA
CACTGGTTCA TCAAACAAGA TTGGGCGAAC CAAGACCACT CGACAGAAGA AAAAGCGTGG
AAAGAGAAAA TAAAACTGCG CTGAATCCAC CAAGAAGTCG AGTTTGGCCG CAAACAAAAT
AGCGCAAAAT TATTTTAT
 
Protein sequence
MGELVARPPI HTASYDRLQP DRLSTILLFH EDDPVNYRYK VVVVDRMLLP STPLQYGTAA 
FLIPAGREAE YIFASEIGLK SIAESASTAR LIAISFGRHH RFGSQIIVQE ELSFVVQVLS
RQGTFLPKPH QELLAEVEIP FMAVDGIGNR HIVAEGESQI SGKYLVEQVD VDGMQVRRLY
FANNPFVIQS EAVLRDQGRV DKSCSAFDYH KTMAAGILAL VDSDVLTHGL LVGLGGGCFV
NLIGHLLNDL ELSVVELDPA ILKIAEEHFD LDLESNRLDI RVGDGLEIMP LTHDAVSGCP
TTFAKESMAF VAIDVDSKDQ SAGMSCPPES FVEIEYLSRL AELIHPHGVL VMNISARDPE
KLDHVCRRVQ QVFRNVALAA PHDGEGNGKK KDINMVLFGK HAVMEVSTSK LCPLVEPHTT
YESGPDLKRA LANLVAWNET KSIDTGSSNK IGRTKTTRQK KKRGKRK