Gene PHATRDRAFT_40962 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_40962 
Symbol 
ID7198689 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011694 
Strand
Start bp429766 
End bp431815 
Gene Length2050 bp 
Protein Length648 aa 
Translation table 
GC content47% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002184875 
Protein GI219129394 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCAGAAG GGGATTCCCG CCGAAAGATT GGTGCTCAGG TGACAGCGAA GGCCTGTCAT 
GTTGTCCATT TGAGTGAGTG TGCTCGGCGA TACGGTGCTT TGAGGACCAC CAAGGTCGTT
GTGGGGACTG TTGTGGAGGT CAACAATACC AGAAAGGCGC CAAACAACCG TGTATCAACC
TTCATTACTG CTGACTTTGA TATTGGTGGA GGATCAGTCA AGCGGAGCAC TCTGAACATC
CGTAGCGTCA AACTTTTCAA ACCGGACCAG TCGACAGTAC CAGCCAGTCC CGCAGCACCA
ATACCGGCAG TAGACAACGC AGACACAGAT TTGGCCGTTC CAGAGCAAGA GGAAGGAGAA
GCGGTCTTGC AGGAGACTTC TCCTGATGAA GAATTGGAAT TTCCAGCACA ACCGATGATG
AAAATTGGAA TAGCTGCGGG GGAACAGGTA GCAGGACCTA CCGCACAAGT AGCCACGCAG
GTTTGGGGTG TTGAAGACGC TTCCTTTGTA ATGGCTCATG AAACAAAGTG GTATGCTGAC
GAGCAAGCTA CATTGATTGA TATAAATGGC AGTGTCCAAA GTAAGCAGTT TGGCATCAAT
ACACCAATTG GCGACCTTCT TGGTCCAGAC TCTGACATTG ATGGAAAATA TTCGCGGCTG
CAATTTTTTC TTCTCATGTT TCCACCCGAC CAACTGAGCG CCATGTGTCA GCTAACAAAT
GTGCAGCTTG CCCAACAGAA CAAGCACTGC ATGTCAACAG GAGAGCTGCT TTGATTCTTT
GGCATTCTAA TTCTTGCGAC AAAATTTGAA TTTAGCAGTC GATCGCAATT GTGGTCCACA
ACCGCGCCGT CAAAATACAT TCCTGCCCCT GCATTCGGAA AAACAGGAAT GTCGCGGCAG
CGCTTTGATG ATCTTTGGCG AAATATCCGA TGGAGCAACC AGTGTCCTGA ACGGCCGGAA
GGTATGAGCT CCCATACGTT TCGGTGGCAA CTTGTCGATG ATTTTGTTGA AAGATACAAC
AATCATCGAG CCAATACTTT CAAACCATCT CATCTTATTT GTGTGGATGA ATCAATGTCG
CGATGGTATG GACAAGGGGG GGAATGGATA AATCATGGAC TCCCCAATTA TGTGGCTATT
GACCGAAAGC CCGAAAACGG TTGTGAGATT CAAAATGCCG CATGCGGCTG TTCGGGTATC
ATGCTTCGAT TGAAGGTTGT AAAGGGTAAG ACAGCAACAG AAAATGATGG GGACTACAAT
GAACAGTTGC TGCATGGAAC AAAGATCCTC AAAGAGCTTG TCCTTCCTTG GTGGTGGACG
GATCGGATTG TTTGCGCTGA CTCGTATTTT TCATCTGTCG GTACAGCTAT GGAGTTGCAG
CGACATGGTT TGAGATTTAT TGGAGTTGTA AAAACAGCAA CAAAACAATA TCCGATGAGA
TACCTTTCGA CTTTAGAGTT GAACCAGAGA GGCGAACGGA GAGGGCTTGT GATGCGAGAT
GTTGATACAA ATTATAGCAC TCTGTTGGCT TTTGTGTGGA TGGACAGGGA CCGCCGATAT
TTTGTGTCGA GTGCTTCCAG TCTGGATGCA GGCAAGCCCT ACGTACGCTA TCGTTGGAGA
CAGATTGACC AATCTCCGGA TGCAGATCCA GAGAGGCTGG AAATTATCAT TCCACAGCCC
AAAGCAGCGG AATTATACTA TTCTGCATGT GGGATGATTG ACAGGCACAA TCGAAGTCGT
CAGGATACAC TGATGCTTGA ACGAAAGTTG GGTACAACAA ATTGGTCGAC AAGGGTTAAC
CTCTCAATAT TTGGAATGAT TGTTGTTGAC ACTTGGTTGG CCTACAGTCT GTGTACAGGA
ATAGGAAGAG CTAACGGGAG AGAAGAAAAG CAGAAAGACT TCTACACTGC CTTGGCTGAG
GAGCTAGTGG ATAACCAATA CGACAATGTT GGAAGTCGCA GAGTTTTCGT GGAGACAAAT
TTGGACAATG ACAGCCCAGC ACTTTCAAGG ACTACAGGAG AACCAAGAAG TGGCCTGTAC
GCACATCTAA
 
Protein sequence
MSEGDSRRKI GAQVTAKACH VVHLSECARR YGALRTTKVV VGTVVEVNNT RKAPNNRVST 
FITADFDIGG GSVKRSTLNI RSVKLFKPDQ STVPASPAAP IPAVDNADTD LAVPEQEEGE
AVLQETSPDE ELEFPAQPMM KIGIAAGEQV AGPTAQVATQ VWGVEDASFV MAHETKWYAD
EQATLIDING SVQSKQFGIN TPIGDLLGPD SDIDGKYSRL QFFLLMFPPD QLSAMCQLTN
VQLAQQNKHC ISRSQLWSTT APSKYIPAPA FGKTGMSRQR FDDLWRNIRW SNQCPERPEG
MSSHTFRWQL VDDFVERYNN HRANTFKPSH LICVDESMSR WYGQGGEWIN HGLPNYVAID
RKPENGCEIQ NAACGCSGIM LRLKVVKGKT ATENDGDYNE QLLHGTKILK ELVLPWWWTD
RIVCADSYFS SVGTAMELQR HGLRFIGVVK TATKQYPMRY LSTLELNQRG ERRGLVMRDV
DTNYSTLLAF VWMDRDRRYF VSSASSLDAG KPYVRYRWRQ IDQSPDADPE RLEIIIPQPK
AAELYYSACG MIDRHNRSRQ DTLMLERKLG TTNWSTRVNL SIFGMIVVDT WLAYSLCTGI
GRANGREEKQ KDFYTALAEE LVDNQYDNVG TQHFQGLQEN QEVACTHI