Gene PHATRDRAFT_29967 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_29967 
Symbol 
ID7195178 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011687 
Strand
Start bp650375 
End bp653608 
Gene Length3234 bp 
Protein Length871 aa 
Translation table 
GC content50% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002183400 
Protein GI219126303 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.497541 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CCGTCTTGGT CCTCCTACGC ATTCCGGTCC TTTCCAGTTA GTCGGTCGAC CTTCTTTCTT 
GTCTGCTACG TCTGTTCGTG TGTGCCAAAA GAGGTCCTTT GCAATAATTA ACCATGGCGG
ACTTGACAAG TATCCTGTTG GCTGTTGCCT CGTCTCCTGG TAGGTTTCGT CGAGAGAAAG
CAGAGAGGAA TCGGCGTAGG GTCTCGTCAC GGAACCAGAT GACTCTCGTT GGCCACGTTT
CCTTGTAGAA AGAATGTCTG TTTGTTTATT TGGTTGTCTT TGTTTACCAA TCCCAAATCT
CACCTCACTT TGCCTTTGTT TGGGAATTTT ACTGCGTAGA CCGATCGGAA GAAAAACTAC
TCGAAGAATA TATGCAGTCG AACTACAGTG AGTTCTGTTT AGCTTTGGCC AAGCTGCTCG
CTACGGAGGG CGCACCCTTT GCAGCACGTC AAATGGCGGC ACTTCAGCTC AAAAATACGG
TCCACGCCAA ATCTGCCGAG ATACTGCAGG AAAAGCACAA TCGCTGGAAG GCCACCGACG
CCACACATCG AGCCGCCGTC AAGGAGTGTC TCCTTGCAGC GATGCGTTCC GGTGTACCAA
AGGTGCCGCA TTTTGCCGCC GTCACTGCCG CGGAATTCGC TTCCATCGAA CTGCCTTTTA
ACGAATGGCC ACAGTTCATC GCAACGCTCA TGGAAAACGT CACTTCGCAT GCACCGGAGC
CCATCAAGAT TGCCTCGTTG GAATGCCTCG GATTCACCTG TGAAAGCATT GTAATTATGG
AGGAACTAAT GGGAGACAAT TTCGTTCCCG AATTGGCCTC TTCCACCGTC GACACCATGC
TGACAACGAT TGTGAACGGA GTGCAATCGA ATCAGACGGA TGCGATGCGT CTCGTCGCTC
TGACAGCATT GAAAAACTCG CTCGGCTTTG TCCGTCACAA CATGGAACGC AAGCAAGAAC
GAGATTTCAT TTTTCAAGCC ATGTGTGAAG CCACCAAGTC CAGCGATGCA CAGGTTCGGG
CTCTCGCTTT TGCCTGTCTA GACCATACCG CCGAACTATA CTACGACACC CTACCGGACT
ATATGACCGT CATTTTTGAG CTCACCACCA ACGCGATTCG ATCAAACGAC GAAGAGGAAA
CCGTTCAAAT GAATGCCATG GAATTGTGGA CCGCCATCGC CAGCACAGAG CAGACTCTGG
TGGACCAAGA CCAGGATGCG GCCGAGAGAG GGCAGCCCCT GGATCGACCT CCGTGTCCCA
AATACACACT CGCTGCTATG GAAGCGTTGG TTCCGCTATT GCTAGTCATG CTTGCCAAAC
AAGAAGACGC ACCCGAGGAC GACTCCTGGG GTTTGCAAGA GTCTGCCGGG GTGTGTTTAG
AGACAATCTC GCAAACTGTT GAAGGATCAA TTGTTCCACA CGTCATTCCC TTTGTCACGC
AACATATCCA GTCGGAAGAA TGGCGCTACC GCGATGCGGC TATCGTAGCT TTTTCCTCCA
TCATGGATGG TCCCAGTACC GAGGAGCTGG CCATATACGT GAACCAGTCC ATTCCGGTTC
TACTCCGTGC ATTTTCAGAT TCGAATGAGA TGGTCCGCGA CTCTGCCACA CACTGCATCT
CCACCGTTTG TCGCCTCCAC ATGATTGCTG TCGACCGAGA TATAGTGCAT TCCATTATCA
AAGGTCTAAT CGAGAAGCTA CGTGACTCTC CTCGTGTTGC TGCCAAAGCG TGCACGGCCC
TCTTCAATAT TGCCACATCC TTCAAAAGCC CCGAACCCGA GCCGACGTCA CTTTTGTCGG
AACCCATGTT ACCCCTTTTG CAAGCATTGT TACAAACCAG CGAGCGCCAA GATGCCACGG
AATGCCATTT GCGTGTCGGT GCTATATCGG CAGCCAATGA CCTCGTCGCA GCCGCACCCT
CGGACACCAC ACCCATCCTC GCCGAATTCC TGCCGGTTAT CATCGCACGG TACGAAGCGA
CAATGCACGC ACAAGTATTA GGCAACGAAG AAAAGGAAGA GAAGGAACAA GCCCTGGGCT
TGTTTAGTTC GCTTATTTCA GTCCTGTTTC AACGGCTCGA AAAACATGAC GTGCTTGCGT
ATGTGGACAA GGTCATGGAA TTGTTGCTCC AGGGATTGCA ATTGCGGAAC GCCTCGTGTC
ACGAAGAATT TTGGCTGGCG ATTGGATCAA TTGCCGGGAC GATGGAGGGC GAATTTATTG
TAGGTATTTG ATGTAGTTGC AGCGTGTCCT ACCTTTGCTT TCCTTTGACA AGTTTTCGTG
GCTGACCTCT TTTGCATCAT TTTCAGAAAT ACATGCAAGC GCTGAGCCCA GCCCTACTGA
CGAGTTTGCG CGACTTCCAC GCCAAGACTC TCTGTATTGT ATCTATAGGA GTTGTCGTCG
ATATTTGCTC CGCGATTGGT GATAAAATTC AACCGTATTG CGACGGTATT ATGAGCGCGT
TAGTGGACTG TTTGAAAGAT TCCGTCATTC AACGAGATGT TAAACCTGTG GTATTTTCCT
GTTTTGGCGA CATTGCCATG TCAGTGGGCG GTGCATTCCA ACCATACCTT CAAGTATCGA
CGATGCTTCT ATTCCAAGCC TCACAACAGC AAGCACCACC GGATGACGAA GACTTAATCC
TTTTTGTAAA TTCGCTTCGG CTAGGCATTC TGGAAGCCTA TTCGGGTATC ATCATGGGTC
TCGCTGACGG CAACGCCCTC CAAAGTTTTA CACCCAGTGT GCCCAACATT GTACAATTCG
TGCAAGTCTT AGCGGCCGAT TCGACCAAAG ATATCTATGT GTTGGAAAAG TCGGTGGCTC
TTTTGGGCGA TGTGGCGCAA CAAATGGGTA GCATTCCTCA AATTCGGGAA CAATTAAACC
AACATTTCGT TTCAAAGCTG TTGCAAGAAG CTCTCAACTC CAATGATGAA ACCACCGTCG
ACTCGGCTAA CTGGGCGGGA AACTTGATCA AGCAACTCAT TCGAGGCAAC GCTTAGTACA
CACACGGTTG TGGAAACTAG GGCGTCCTTT TGGACCACGC CGGTCGCCAT GGGACGAGAT
ATGATTTCGT TCCGTGGCCA CCGATGAAAT AGTGAAACAT GACAAAATGG ACAAGATGGA
GTGTACTTAT TTTGACACCC TGTTTTCTCC TCTTTTTCGA AATGAGCTAT CGAAAATCGT
ATATAAATGG AATCGCATTA GAAAATTTTT AACGAATTGA CTGTGATTAT TACC
 
Protein sequence
MADLTSILLA VASSPDRSEE KLLEEYMQSN YSEFCLALAK LLATEGAPFA ARQMAALQLK 
NTVHAKSAEI LQEKHNRWKA TDATHRAAVK ECLLAAMRSG VPKVPHFAAV TAAEFASIEL
PFNEWPQFIA TLMENVTSHA PEPIKIASLE CLGFTCESIV IMEELMGDNF VPELASSTVD
TMLTTIVNGV QSNQTDAMRL VALTALKNSL GFVRHNMERK QERDFIFQAM CEATKSSDAQ
VRALAFACLD HTAELYYDTL PDYMTVIFEL TTNAIRSNDE EETVQMNAME LWTAIASTEQ
TLVDQDQDAA ERGQPLDRPP CPKYTLAAME ALVPLLLVML AKQEDAPEDD SWGLQESAGV
CLETISQTVE GSIVPHVIPF VTQHIQSEEW RYRDAAIVAF SSIMDGPSTE ELAIYVNQSI
PVLLRAFSDS NEMVRDSATH CISTVCRLHM IAVDRDIVHS IIKGLIEKLR DSPRVAAKAC
TALFNIATSF KSPEPEPTSL LSEPMLPLLQ ALLQTSERQD ATECHLRVGA ISAANDLVAA
APSDTTPILA EFLPVIIARY EATMHAQVLG NEEKEEKEQA LGLFSSLISV LFQRLEKHDV
LAYVDKVMEL LLQGLQLRNA SCHEEFWLAI GSIAGTMEGE FIKYMQALSP ALLTSLRDFH
AKTLCIVSIG VVVDICSAIG DKIQPYCDGI MSALVDCLKD SVIQRDVKPV VFSCFGDIAM
SVGGAFQPYL QVSTMLLFQA SQQQAPPDDE DLILFVNSLR LGILEAYSGI IMGLADGNAL
QSFTPSVPNI VQFVQVLAAD STKDIYVLEK SVALLGDVAQ QMGSIPQIRE QLNQHFVSKL
LQEALNSNDE TTVDSANWAG NLIKQLIRGN A