Gene PHATRDRAFT_38086 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_38086 
Symbol 
ID7202946 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011683 
Strand
Start bp31993 
End bp33969 
Gene Length1977 bp 
Protein Length658 aa 
Translation table 
GC content55% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002182147 
Protein GI219123678 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTCTTTT ATTCCTGGTT TGGATCAACG GCGGCTCGAG CGCACCGTCA CTTTCCCGCA 
CTACCGGAAC CACTAGACGA CGCCGTAGCC ATACACGACG AGCACCAATC CTCTTCCGGT
TCCTGCAAGG ACGACCCGCC AGATATTCCG GTACGGAAGA ATGACGAAAC GTGTGTACGT
ACCGCCGTCC ATTCGATCAC GATTCCGAAA TGGGACGCTC CGCTCCAATC GGCAGCCCAC
GAACCGTCAC TGCACAATCT CGACAATCCG GACCAAAGTG CGCACCAAAC AGACAATAGC
GTCACCTCTT GGTGCCGTCA GCACGTACCT TCCGGTGCTT CTTGGAACGG TAGGGCTAGT
TGCCGTCACC AGCAGTACAT CACCAATCGA CAGCAACTCC TAACGGAAGA CGAAGAAACG
GATTCGTTGT TTCCTTTACA CACGATTCCG TTCGTCTCTA TTGAACTGCG CTTGGAACCC
ACCGATCACA GTGACGACGA TGACGACGAC GATGACCTCT TTGGATGTCC TCCCCACGGA
CCCCGGACTC GTCAAGCCTA CACGTCATGG GACGCGCCAG ATATTGCTGC AGACGTCGTC
TCGGCGACAC CATCCGATCC ACCCCAACGC ACGGGTCAAA TTCGAATAGG CGGAACAAAG
TCGATAGAAT TTCCGGACGC TCTGCCAAGT CGTCTTGACG AGACTGTCCA TTTATACAAC
GCGAGTCAAG GGTGCCATGC CGACTCACCG CCCGCGCAAT TCGTCTCGAC CATCCCGGCG
CATTTTGCAG TCCCCGTCGA AATTACTGTC GTTCCGAACC ACGATCCAGG AAATTGCAGA
AAGGATCAGG CCAGCTACAA TGTCTGCTGG AGCTCCCTTA TCGAGGACAA CGAATCGGAT
AACCGGTTGG CGGATTGCTC TCTGATGAAC GATCCAACGG AGCCCTTGCG GTATCCATGG
ACGAACCTCC CGACACAAAG ATTCAGCGAC CGCAGCACTC GCAACAAAGA GAGTCTTGTC
CCGCCACTGC CCGATCCTCT CGATCGATGC GCCGCCGAAT TCTACAGCTT GTTGCTGCGG
CACCTCGAAA CCAAACGTGT GGCCGATCAA GCCGCCACTC AGGACTTGCT TGCCTTTTTA
CGGAAATATC CCTTGGTTGG CCAAGTACAC TTCCGCTTAC CCGACTTTGC CAGTGCATAT
TGCTTGCCAC TGGCTTACTT TGCGGCGATA TCCTCGTTGG AGGGCTGTCA GCTCGCGTAC
CGTCTCTATC CCGAAGCGAT CGGGATGGAA GATGACTTTG GTCTCCCTTT GCATTATGCC
TGTTACTTGC AAGCCGATGT ACAGGTAGTA TCGTTTCTAC TGGCCCGCTA TGGGGAAGCG
GCTAAACGAA CCAATCAGGA ACATCAAACG CCGCTACACT TGGCTTGTCA GGTTGCCACT
CCTGGACCGA CGACGAGCTC ATCCTCGAGT ACTAGTTTGG ATCGACTGGT CCGGGAACCC
AATCTGCAGG TTCTGAAAGT TCTCCTCGAG CACTACCCTA CCGCTTCTCA ATTGGCCGAT
CGGGAGGGGA ATTTACCGTT GCACTGGGCC TTGCAAACCT CGGGCATTTC GTTGTCACGT
TGCCAGGCCT TGGCCGCTCC GAAACCGCAC CATCAAACGC TTCGCCGGAG CAACCGGATA
CTGGAGAAAC CTTTGCATGT GGCTTGCCGG TTTGGTGTGT CGATGGAAGT CTTGTCTTGG
CTGCTCGAGG AGCATTTGGG CGCGGCCAAG ACTACCAACG AAAGGTTTGA AACACCACTG
CACGCGGCTG TATTGGGGGA ATCCGAGTCG AGTCGGAACA TGCGGTGGAT GCAAGCGCTG
GTCCGAGCCT TTCCTGACGC CCGATCTTGG ACCGACGAGC GGGATGAACG GCCCGTGGAC
AGTGCCATAC GAATGGGCGC ACCGGAAGGT ATTGTGTCTT TGCTAAGCGT GGAATAA
 
Protein sequence
MFFYSWFGST AARAHRHFPA LPEPLDDAVA IHDEHQSSSG SCKDDPPDIP VRKNDETCVR 
TAVHSITIPK WDAPLQSAAH EPSLHNLDNP DQSAHQTDNS VTSWCRQHVP SGASWNGRAS
CRHQQYITNR QQLLTEDEET DSLFPLHTIP FVSIELRLEP TDHSDDDDDD DDLFGCPPHG
PRTRQAYTSW DAPDIAADVV SATPSDPPQR TGQIRIGGTK SIEFPDALPS RLDETVHLYN
ASQGCHADSP PAQFVSTIPA HFAVPVEITV VPNHDPGNCR KDQASYNVCW SSLIEDNESD
NRLADCSLMN DPTEPLRYPW TNLPTQRFSD RSTRNKESLV PPLPDPLDRC AAEFYSLLLR
HLETKRVADQ AATQDLLAFL RKYPLVGQVH FRLPDFASAY CLPLAYFAAI SSLEGCQLAY
RLYPEAIGME DDFGLPLHYA CYLQADVQVV SFLLARYGEA AKRTNQEHQT PLHLACQVAT
PGPTTSSSSS TSLDRLVREP NLQVLKVLLE HYPTASQLAD REGNLPLHWA LQTSGISLSR
CQALAAPKPH HQTLRRSNRI LEKPLHVACR FGVSMEVLSW LLEEHLGAAK TTNERFETPL
HAAVLGESES SRNMRWMQAL VRAFPDARSW TDERDERPVD SAIRMGAPEG IVSLLSVE