Gene PHATRDRAFT_49225 details

Gene Information

Locus tag: PHATRDRAFT_49225
Symbol:
ID: 7195691
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011689
Strand:
Start bp: 277145
End bp: 279579
Gene Length: 2435 bp
Protein Length: 726 aa
Translation table:
GC content: 49%
IMG OID:
Product: predicted protein
Protein accession: XP_002183846
Protein GI: 219127238
COG category:
COG ID:
TIGRFAM ID:
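
The length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end coordinates are 1-based and inclusive, and that the coding region carries one stop codon after the 726 codons of the protein. The function names `span_length` and `min_cds_length` are hypothetical helpers, not part of any database API.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def min_cds_length(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode a protein, optionally with one stop codon."""
    codons = protein_aa + (1 if include_stop else 0)
    return 3 * codons


# Values taken from the record above.
gene_len = span_length(277145, 279579)
cds_len = min_cds_length(726)

print(gene_len)            # 2435, matching the stated gene length
print(cds_len)             # 2181
print(gene_len - cds_len)  # 254 bases not accounted for by the coding region
```

Under these assumptions the stated gene length of 2435 bp agrees with the coordinates, and the listed sequence is longer than the coding region alone.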


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
AAAAGTCTTG TGCCTAGTCG CGGTCGATCA TATCTTGCAA TCTCATCATA AGAGGGCGTG 
GAACAGCAAG GCTTTGTCTA TCCTCGAGAT CCAATATCAT CTTTTGCTTG TAAAAAAGGA
CCAGGCGTAG GAATGAATCA GACTGCTAAC GGACTACGAA GCTCGTTTTC GCCCCATGAT
CGCAAGTTTT CGGGGAAGGT CGATCACATA AATGTGAGCA ACCAAAATTC TCCACTATCG
CGCGCTCAGA TGTGGTACGT TCGTCAGGCT CGCCCTACAC GTGCCTCTAT GATCCGACAA
GTTAAAGCAA CTGCTGGAAT CGATATCACT GTGGAAGACG TAGGTTTATT GCCGTGGAAC
GAAGATGGCA CTAAGATTCG CTGGGAAGAA ATGCGACAGC TTCTCCAAGA TGAAAAAGCA
GGAGCCAAAC GAGCACCCCG GGCGAGTGAG CCGGAAAAAT CAGAACTAGA AACCGAGTCA
AAGTCTGACT CCGCAAAAGA CTCCGAGGCC GATGATGATG ACGACGACGA TGATGATGAT
GATGATTCCG ACGCAGATAC CGAGGAGTTG AACGCTGAGG CTCTCCTCAG ACAATACGAA
CCTCGGACAT TGAATAGGGA AGAATGCGAG ACCGCGGGCG TGACTGCGCC CGTAGAGGAG
GCTAGGTCTT CGTTTAGAGC TCCCGATTCC GCCCAGCCAC CACACTTTGT GAATCCCGAA
GCAGAGGACG TGAATGACCA AGATGATAGC ATGGAGAAAA TTGTGAATAC GGAGCTTCAA
AAGCCTGCTA GTTGGGCGAA GCGTGACGAC TCAGCCAGTA CAACTCCCCA AACTGTTATC
ATTCCGAAAC GCGAGTCGGA GCATGATCCT TCAGAAGAAT CGTCAGAGTC GCCCGACTCA
GTCTCCGTAC TAACAAACCC GTACGTGCCG AGCACCCCCA GGAACGAAGG CAGTTCTCGC
GATCAAGAAT CTCGCAAGGG ACGAGCCTTG AAGTGGTACT ATAAGCTGGC TCAGCCGTCG
TGCAAGACTA TGCATCGAAT CATCGATAAA ACCACGGGAG TTGAAGCCAC CAAAGAGGAC
GTTGATCTTT TAAACTGGAA CCACGATAGA ACTAAGGTTC TTGAGGAGCC GGATGACCCA
AATAGCGAGG AGAACCGACG AAAACGAGCG TTGCGATGGT ATCATAGCTT GGCGGAGCCA
ACTCTCGAGA CCATGAGAAA ATGTGTAAAT GAAACAAAAG GAATGGATGT TTCAAAAGAC
GATCTTGACC TGCTAAATTG GAATAAAAAC AAAACGAAAG TATTGCCCGA CGGTGAGTCC
GACGCCGAGG CCTTAAACGA CGCCGAAATG CGACGTCGGG AGGCTATGAG TTGGTACAAA
AGACTGGCCA GTCCTACCCG GGAGACAATG CGTCGTGTTC ACGATCAAAC CTCTGGAATG
AACATCGAGA AAGACGATAT CGATAGTTTG CGTTGGAATC ACGACAAAAC TAAGGTAATC
GAAGACGACA TCGGACTATC CAATCATTCA CAAAGTTCAA TGAGATCCAC GTCCTCTCGT
GCAGAAAACG ATCGTAACCG CTTGGAAGAG GCGCGCCGCC GTATGGAAGA AGGGGTACGT
AGGGAAGCGG AACGTCAGGA AAAGGAAGTG GATCGTCGCC GTCTTGAAGA ACGCAAGATC
CAGGATGATA TTGACCGGCG TCGACGTGAT GCGGACCACA AGGCGAAGTT AGAGAATGAT
CGTACTGAGC GTGCAATTCA GGATAACAGT AAAAATGAGA CGGTGCGTAG AGCCAAAGAA
GTTGATGAAC TTCGTCGCCA TGAAAAAGGA GGTCTGGAGA CCCCGGAAGC CAGGGAGGCC
AAGAGTGAGG AGGCGGAGGA GACAAAAACG GACGAGCTAG TGGAGGCCAA AGCGCCAGAT
TCTGAACGTC CTGTGAAAGA TGAGGAGAAT CGTCGTCTCC GTGATGTTTT GGGAATGAAT
CGCAACGAAA GCGCGCAATT GAAGCAAAAC AACGCACCGG GTCTTCCCGC AGCAAATAAT
CGAGCTAATT TCAACACGAA GGAAAGTGGA CGCAGTCAAC GTGCACGGAA GGATGAAACC
GAAGACCAAA AACAAGAGCG TATTCTCCAC ATTTACGCTT GGTGGGCGCG CTTGGGACAA
CCAACTCGAT CCGATTTTAA AAGACGCATG AAGACTGTAA CTCCTGCGGA CATGGTGCCT
GATGATGTGG ATTCTTTACC GTGGAGTTTT GACGGGAGTA CCGTCATGGT GAACAGAATC
AACAAGCTTG TGAATGCAAA TCAGGTTGTC TGAAAACAGT TGTGTAATTG AAATGCTACA
ACAACAAACC GAGCCTCTCT CCCCCAACGG CAGTCGTAAG GAGCAAAACA AACCTCAGTA
AAAAGTCAAA AAATAATACT TAGCATTACC TCGGC
 
Protein sequence
MNQTANGLRS SFSPHDRKFS GKVDHINVSN QNSPLSRAQM WYVRQARPTR ASMIRQVKAT 
AGIDITVEDV GLLPWNEDGT KIRWEEMRQL LQDEKAGAKR APRASEPEKS ELETESKSDS
AKDSEADDDD DDDDDDDDSD ADTEELNAEA LLRQYEPRTL NREECETAGV TAPVEEARSS
FRAPDSAQPP HFVNPEAEDV NDQDDSMEKI VNTELQKPAS WAKRDDSAST TPQTVIIPKR
ESEHDPSEES SESPDSVSVL TNPYVPSTPR NEGSSRDQES RKGRALKWYY KLAQPSCKTM
HRIIDKTTGV EATKEDVDLL NWNHDRTKVL EEPDDPNSEE NRRKRALRWY HSLAEPTLET
MRKCVNETKG MDVSKDDLDL LNWNKNKTKV LPDGESDAEA LNDAEMRRRE AMSWYKRLAS
PTRETMRRVH DQTSGMNIEK DDIDSLRWNH DKTKVIEDDI GLSNHSQSSM RSTSSRAEND
RNRLEEARRR MEEGVRREAE RQEKEVDRRR LEERKIQDDI DRRRRDADHK AKLENDRTER
AIQDNSKNET VRRAKEVDEL RRHEKGGLET PEAREAKSEE AEETKTDELV EAKAPDSERP
VKDEENRRLR DVLGMNRNES AQLKQNNAPG LPAANNRANF NTKESGRSQR ARKDETEDQK
QERILHIYAW WARLGQPTRS DFKRRMKTVT PADMVPDDVD SLPWSFDGST VMVNRINKLV
NANQVV