Gene PHATRDRAFT_49918 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_49918 
Symbol 
ID7198617 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011693 
Strand
Start bp285501 
End bp287721 
Gene Length2221 bp 
Protein Length652 aa 
Translation table 
GC content49% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002184771 
Protein GI219129175 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCCGGGC AAAGTCTGAT CAGTAGATGC CAAGCCTTTG CTCCGCCTCC CACAGTTGAC 
ACGAATGATG ACTGGATGCC AGTCGACGCC TATCGTCAAC AACACGGTAT TCAGTTCGAA
TACGAACCGA GGCACATCTC CCCCGAAGTT TGCCGGTACT TAAACGAAAC AGAATGTGCT
CAAGAAGACA AAGCTGCCAG GGAAACGTAC CAGCGCCATT TGCGAACAAT GGAAAGCCTC
CGACGAAGAC GTTTACAGAC TACCCCCGCA ACCAGCAAGG GCTCGTTCAA GGCTATGGTA
CTTCTTGTCC AGTTTGCGGA CCACCAAAAC CGGCCCCTTC CCTCGAAAGA ATACTTCGAA
GAGCTCTGCA ACGGTGCCGG AACGTCGACG GTCAACCCTA TCGGTAGCAT AAAGTCCTAT
TTTTCCGAAC AATCGCAAGG CTTGTATGAT GTTGATTGCG AAGTTTTTGA TTGGCGCACC
ACAACGTATA CCGAAGCGGA CGCTGCACAA GGTATTAGCG GTATATTGAG CAACGCCAAC
GCCCAAAAGT TTTTCCACCC AGTATTGGAT CAGATTGACG CAGAAAAAGT CGCCTCGAGT
GGAGACTTTT GGTTATTTTT CGAAGGCTTC GATGCCGATG GCGAAGACGG TATGGGCGAC
GGCTTCATTG ATGCATTGGT CGTGATTCAT TCTGGTTTTG GAGCACAAGT GGCAGAAATT
TGCGACGGTG TTCGACGTGC CGATCGGATC TGGTCACAGG GTCGCTCCAC TTCGGAAGGC
TTTGGTTGGA ATCCACGCAA TATTGAGGTC GGTTCCTACT CTATTGCCAG TGCATTCGAG
CGTTGCAGTA CAGACCGCCC AGCTTTGATG GGAGTCATTA CTCACGAATG GTATGGTCAG
ATCGCCTTCC TTTTCGCGGA AATTTTGTCA TTCTTCGTTG GAACTAACCG GTCGTCATTT
ATGATCTCAG GCTTCACACT TTCGGCGTAC CAGACTTGTA CGGAAAAAAC AATTTTCGAT
TCGGTGGTAT TGGGTCCTTT GGAATGATGT CATCCCCGTA TGGCCAGACT GGCGACGGGT
CCACGCCTGG ATCGCTAGTA CCATGGACTC GAAACCGCAT TGGTTGGCTG GAATACAACG
AAATTACGAC GGATGGGACG TATTCAGTAA GTGGGCTGCA GGCGTACGCA ATTCGTGATA
AGTTCCCCGC AGATGAGTTT TTGGTGATCG AATGCCGGTT TCCAAGCTCT TTCGATTCAG
ACTTCTGGGG AATCGGTGGC ATTGTATTTT ATCATATCGA CGATAAAATG GGAGGACAAG
ACCGCCCAGG CTGGCCCGGT CAATCTGGAT GGCCTGCAAA CGGCAACCAC TACCAAGTAG
CAGTACTACA GGCTGATGGA CGTTACGACA TCGAGCAAGA TGTCAACAGT GGAGATATCG
ATGATCTGTG GGTGGATGGG ATGGCTTTAC GCCCAAATGG TGGGAGCGGT ATTTTCCCTA
ATACCGATAG TTATCAGGGA ACGCCAACGG TGACAGGGAT CACAATCCGA ATCGTGTCCA
GCCCGGGGAA GACTATGCAG TTTGTTGTAG AAGGACTGGC ATCCAATCCC AACCAATTAT
CACTTCCTAC GCCCTTCCCA AGTGGCCTTC CTTCACACTC TCCGGTCATT GGGATTGCTG
CTACTCCCTT CGCGTCTCCA TCACAGTCAA ATCCCTATCC ACTCTGTACG TCGAGTATTA
CAGAAAATTG CTTTGATCAA CCCACAGCAG CTCCGAAAGA TTCCAAACCT TGTGTACCAG
GCATGGGAGT GGTCTGTTCT GATGGTACTA GCGATCCACA AACGGCCCCG ACTATAGCAC
CTGAAAGTTC CAAAGATGAA ACAACAGGAC AAGAAGATGC TTTGACTTCC AGTGCCCCAA
AGCTTGGCGA TATGCTTGTT CTGGTTGTCT CTGCTGTAAC CATCGTGGTT ACTTGGTAAA
GGGGGAAGGC TAGCTTAAAC GGGGCATTTT TGCATCTTTC ATCAAATCTT CAATGACCAG
AAGGATCTCC CGTTGGTTTT TGTCCGAAGG CTTTACTCTT TCGATTTAAA CCTGAAGATG
CATTCTTTCG GACCAATGGT TAACATCTTG GTTTGTCTGG TCTTCTCGCT CTCTCTAGAA
TTCATGTGAG CTCAAAATAG CACGAGCCTC AACTGTAAAG TAAATTTGAG TCGAAACCTT
G
 
Protein sequence
MAGQSLISRC QAFAPPPTVD TNDDWMPVDA YRQQHGIQFE YEPRHISPEV CRYLNETECA 
QEDKAARETY QRHLRTMESL RRRRLQTTPA TSKGSFKAMV LLVQFADHQN RPLPSKEYFE
ELCNGAGTST VNPIGSIKSY FSEQSQGLYD VDCEVFDWRT TTYTEADAAQ GISGILSNAN
AQKFFHPVLD QIDAEKVASS GDFWLFFEGF DADGEDGMGD GFIDALVVIH SGFGAQVAEI
CDGVRRADRI WSQGRSTSEG FGWNPRNIEV GSYSIASAFE RCSTDRPALM GVITHEWYGQ
IAFLFAEILS FFVGTNRLHT FGVPDLYGKN NFRFGGIGSF GMMSSPYGQT GDGSTPGSLV
PWTRNRIGWL EYNEITTDGT YSVSGLQAYA IRDKFPADEF LVIECRFPSS FDSDFWGIGG
IVFYHIDDKM GGQDRPGWPG QSGWPANGNH YQVAVLQADG RYDIEQDVNS GDIDDLWVDG
MALRPNGGSG IFPNTDSYQG TPTVTGITIR IVSSPGKTMQ FVVEGLASNP NQLSLPTPFP
SGLPSHSPVI GIAATPFASP SQSNPYPLCT SSITENCFDQ PTAAPKDSKP CVPGMGVVCS
DGTSDPQTAP TIAPESSKDE TTGQEDALTS SAPKLGDMLV LVVSAVTIVV TW