Gene PHATRDRAFT_50026 details


Gene Information

Locus tag: PHATRDRAFT_50026
Symbol:
ID: 7198724
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011694
Strand:
Start bp: 180760
End bp: 184287
Gene Length: 3528 bp
Protein Length: 741 aa
Translation table:
GC content: 53%
IMG OID:
Product: predicted protein
Protein accession: XP_002184837
Protein GI: 219129315
COG category:
COG ID:
TIGRFAM ID:
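
The listed start, end, and gene length agree if the coordinates are read as inclusive and 1-based. The short Python sketch below reproduces that arithmetic and compares the gene span with the number of nucleotides needed to encode the 741 aa protein. The function names are illustrative and not part of the source database, and the inclusive-coordinate convention is an assumption that happens to match the listed 3528 bp.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming inclusive 1-based start and end coordinates."""
    return end_bp - start_bp + 1


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode a protein, optionally counting the stop codon."""
    return 3 * (protein_aa + (1 if include_stop else 0))


gene_len = span_length(180760, 184287)   # values from the record above
cds_len = coding_length(741)

print(gene_len)             # 3528
print(cds_len)              # 2226
print(gene_len - cds_len)   # 1302 bp not accounted for by coding sequence
```

The 1302 bp difference is consistent with the record marking the gene as spliced, since the listed span can include introns and untranslated sequence in addition to the coding region.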


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.301248
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GGTCACACTC GAATGGTTGA ATGGTGACTG ACAGTGACTG CTTGTCTTTA CTTAGAGAAT 
GGACAGATCT CCTCCCCGTT GCTTCAACTA ATTGTGTGTC CAGAGTACGA ACACCGATGG
TGGGATAGGT TCCTTTGCTA GAATACTCTT GCGTAGAAAA AGTACCATCT GTTAGGGAGG
ATGGATCTCA TTGACACAGT GAACGTCGCA TTTCAGGACC CATACGCCAC TCCATACCAG
TTCGTTCGGG GGAAAGAAAA TCCACAAGTA GCCGTAAGTA CAGCAGTTCG TTGGCTCGAG
TGAACGACAC TTACCGACGG TGTCATAAAA AGCAATCGAC ACCACTATCA ACAACACACA
CACACACACA CACACAAACA CCACACATTC CTCAAGGTCC GTTGGTTGCG TTGCGCTGTT
TCGGTTCACT GCCTCTACTT ACCTACTCAC CTGCGTACAC ACGTGCTCAC TTCCTTTTCC
TTCCAGCGTT CCCCCATGGT TGCTTCGTTT TCAATGGGCC GAAACTCGGC GACGTGGTCG
GTGTCACCGT GCACCCGCAT TCTTCTACTG ACGCTCGCAA CGTGGAACAC GTTACCATCA
CTCGTCCAAG CCGAGTGCTT GGCCAATACG GAGTTCAACG ACTTTTTCGA AGGCCTCGTG
GGAGCACCCA TTCCCAGTGA AGGCTCGTGC TGCATGTTTG ACGTGTGCGG ACTCGAATGT
CCCGTCGATG TGCCCGAACC CAGTAACGGT ACGTTCCAAA GGTGCTGCCG TCCAGCTCTC
GGGCGATGGA TGGATCTAGT ATTTCGTCCC TTGTTCCGGG GTCGATCGAC ACACACACAC
ACACACACAC ACACCTTTCT TTCTACCCAA CGATACGATA CGGTCAAAGT CTGCCCACAG
AAAAAACAGA CACGTGCCTT GGCTCGTGGA CAGGATTGGC AGTCGCCCCT TGTATTTACT
AGGAATGCAA CGAATCCTAT CCATTCTCAC CTGTGCCTGT TGTGTTCTTT GACAGGCTTT
GGAATTGCCG TGGCCATTAG CATCGTCATC TCCTTCTTGA TCGGCATCGC CACCTTCTTT
GTTGTCAAGG GACAGTCCGT CAACTTCTTC GTCGCCGGTC GGTCGCTCCC GTTATGGATC
GTATCCATGA CGCTCGCGGC CCAATCGATC GACTCCAACG CCATCCTCGG GAACGCCGAT
TTGTCCTACA AGTTTGGCTT TTACGACGGC GCCGTCATTC CCATTGGACT CGGACTCAGC
CTCATCCTCA ACGGTATATT CCTCGCCCAC CACGTCAACA ACGATGAAGT CCTGACGCTG
CCCGATATCT TCGCCAAGCG CTACGGCAAG GTCGTCGAAA TCCTCGTCTC CTTGACCACC
ATTTGCAGCT TTCTCATGCT CCTCGCCGGG AATCTCGTCG GATTCGGGGC CATTACCAGC
TACGTTTGGG GTATTTCCGA TACCGCCGCC ATCTGGATAG CTGCCGGTAT CGTCTGGGTC
TACACCGCCA GTGGCGGACT CTTCTCCGTC GCCTATACCG ACGTAGTTCA GGGACTCATG
GGATGGTCCG GATGCATCGT CATGGCCTTT TGGTTCATCG CCAACGAAGA ACCCAACGCT
CCTCCACCTT CGATTGGATA CCCACGTACG TAACAAGCGT CCCCACGCGT GCGGTAGACA
CCACATGGCT CAATCCGCGT CTTGCGCCGT TGATTTTTGC CACTCACCGT AGTATTCTTT
TTTGTTTCAA TGGCGTGTAG TCTACGTCTA CCCGGATAAT ATTGGGGACG GCGGTGTCTG
CGACATGTAC GACGGAGTCC CCTGTGCCGT CACGGCCGAC GCGTGCTGCT ACAATATTGA
ACGCTGGTGC CCGGAATTCA ACGTCACGGG ACAGTGCGAA CGCTTCGACC GCGGAGCCTA
CCCCGTCGGT GATCAGCGCA TTTTTTCCGA CCAAATGTCC AATTTTCGTG CGCTGACACC
CTTTCCCAAC GCCATTTTCT GGAACTGGGC CACCATTTTT ATTCTCGGAT TCGGCAACCT
CGCCGCCCTC GATTTTCAAG CCCGGTGTAT GGCTAGCAAG ACGCCCCGAA CGGCACGTAT
CGGCTGCATT ATTGGCGGGT TGTTTACCTT TTTGATCGGT ATTCCCTTTG CCTACATGGG
CGCCATTTCA CGGTACGTGG CTAGAGTCGT GTTGTGTCCT AGTGGCACCC GTACCTTGGT
CCTGGCTGCT GGTCGACGTC GTACGGTTCC CCATCACGTG TGTCTGACCC CGTATTTGTT
TTATTGTTGC GTTCCGTTGC GTACAGAGTG TACTACGGTC CGGATTCCAT CCACGCCGAA
TTCGAGGTGG ACACGTGTCT CAGTCAGTTG GCCCTGCCCA CCTGTGGCCT CTGGCTCCCC
GACAAGAACG CCTTTATCAA GTTGTTGACG CACCAAGCTC CCGACTTTTT GGGTGGTTGG
TGTTTGGTCG GGATCGTTGC CGCCAGTATG AGTACAGCCG ACGGAGCTAT TCTAGCCATG
GGTACGGTCT TTGCCCATAA CATCATGCGT CAATTTGATG AATGGATTCC CAATTTGGTG
ACGCCGGACA ATTTGTTGGT CACCGCACGC GTCGCCACTT TGCCCTTTAC TCTAATCAGT
ACCTTTATTG CGGCCTTTTA CCGATCCTCC CATTCGGCCG GGGCGACCGG GTACTTGTTG
ATTGTTGCTT TTGATATTGT CTTGGCCACC GTCGTGGTCC CTTTGTTTGG ATGCTTCTAC
TGTAAAAACC CTTCTCCCCG TGCCGCTTTT GTGGCCATTA TCGGCGGTGC CATTACGCGT
GTCGTGCTGG AATTCGCTCT GCCCAAGGAC GGTTTCTTGC TTTTGCCGTA CGATGCGCCC
GAATTTCTCA ACACTGGCCC CGGCGCCAGT ACCGGTACGC CGGTCTTTTG GGACGTGGAT
CCGGGCGACA TGTGGGACGA AACCGTCGAA CCCTGCGTCC AGGAATCCTT TGAAGACTTC
ACCGGCGTGG ATTCCTTGTC CGCCTTTTTG GTTTGCATTC TTTCGTTCGT GTCCGTGCAA
ACGATCGAGC ACTGTACGGG CAAGCCTTTG TTCAGTTTTG CCGGTATGCA GGGTTACCAC
AAGGATACTA CGGAACATCC ACTCAAGGGT AGCAGTATTG ACAAAATGGA CGAAACGGCC
TTTCAGGACG ATACCACGAA ACGCGGAAAT CCCGACGCCG ATGAAGGCGA TGCCTAAGGT
CGACACGAAC TCGGTGTAAG ATGTCTCTTC TTTTGAAGTT CGAGAGTGGA ATGTTGTAAT
ATCTACTTGA TCAGCAGTTT GGATGCCGGT GGGCTCATTG CTGGTTTTGT TGGCATATTC
TATACCGTAC ATGGTTTTCG GGTCGAGAAA TTCAACGTGT GTTTCGATTT AATATGTTTT
GTTGCTATGG GAACCGCTTC CTTGACTTCG GCAAGCAATT TGCGTTTTTG AATAATAGCT
TGTCCGTCGG TTTTAATTTG GTAGTATAGC GTGATTTTTA TGTTTTGC
 
Protein sequence
MVASFSMGRN SATWSVSPCT RILLLTLATW NTLPSLVQAE CLANTEFNDF FEGLVGAPIP 
SEGSCCMFDV CGLECPVDVP EPSNGFGIAV AISIVISFLI GIATFFVVKG QSVNFFVAGR
SLPLWIVSMT LAAQSIDSNA ILGNADLSYK FGFYDGAVIP IGLGLSLILN GIFLAHHVNN
DEVLTLPDIF AKRYGKVVEI LVSLTTICSF LMLLAGNLVG FGAITSYVWG ISDTAAIWIA
AGIVWVYTAS GGLFSVAYTD VVQGLMGWSG CIVMAFWFIA NEEPNAPPPS IGYPLYVYPD
NIGDGGVCDM YDGVPCAVTA DACCYNIERW CPEFNVTGQC ERFDRGAYPV GDQRIFSDQM
SNFRALTPFP NAIFWNWATI FILGFGNLAA LDFQARCMAS KTPRTARIGC IIGGLFTFLI
GIPFAYMGAI SRVYYGPDSI HAEFEVDTCL SQLALPTCGL WLPDKNAFIK LLTHQAPDFL
GGWCLVGIVA ASMSTADGAI LAMGTVFAHN IMRQFDEWIP NLVTPDNLLV TARVATLPFT
LISTFIAAFY RSSHSAGATG YLLIVAFDIV LATVVVPLFG CFYCKNPSPR AAFVAIIGGA
ITRVVLEFAL PKDGFLLLPY DAPEFLNTGP GASTGTPVFW DVDPGDMWDE TVEPCVQESF
EDFTGVDSLS AFLVCILSFV SVQTIEHCTG KPLFSFAGMQ GYHKDTTEHP LKGSSIDKMD
ETAFQDDTTK RGNPDADEGD A
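
Both sequences are printed in space-separated blocks of ten characters, so they need to be joined before any length or composition check. The following is a minimal Python sketch of that cleanup, together with a GC-fraction calculation of the kind that would underlie the "GC content" field. The helper names are hypothetical, and only the first line of the gene sequence is used as sample input.

```python
def clean_sequence(block_text: str) -> str:
    """Remove all whitespace (spaces, newlines) from a block-formatted sequence."""
    return "".join(block_text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence from this record, used as sample input.
first_line = "GGTCACACTC GAATGGTTGA ATGGTGACTG ACAGTGACTG CTTGTCTTTA CTTAGAGAAT"
seq = clean_sequence(first_line)

print(len(seq))                    # 60
print(round(gc_fraction(seq), 3))  # GC fraction of this 60 bp sample only
```

Applying `clean_sequence` to the full gene sequence block and taking its length gives a direct check against the listed gene length.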