Gene PHATRDRAFT_50443 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_50443 
Symbol 
ID7199255 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011698 
Strand
Start bp77117 
End bp78819 
Gene Length1703 bp 
Protein Length517 aa 
Translation table 
GC content51% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002185374 
Protein GI219130442 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.00232774 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CGCTTTGTGA GATCCAAATA TCGACTCACT CTTTTCAGGA TCAGCTACAA GCCATTAACC 
TTCTGTAGCT TGTAGCTACC CCCTTGATAT TCGTATTCAA GGCTAAAAAG TATGAAATTT
CTCCATTCTG CTCTGATAGT TTTGACATCG GCTTCGGCGT CGTCTGCGTT TACCGCTACC
AATGTTCCGC TGAAAAGACC TACCGTCTAC AATAAAAGCA GTTTTTCTGC TTATCGCAGT
ACGGCTCTTC GGTCAGCGGT TGCCCCGAAG ATTTCATCCG TCAATGGAAA GCAGCAGGTT
CCACAACAAA CAGAATCAGC AAAAAAAATG TGGCTTATGA TTGCGATGAA GACGCCAACT
GCGTCATCGT CGACGCATGC GACGACGAAC AATGCCGAAC TTCGCTCGAC GTTCGCATTC
ACGGAAAATG GTATGATCTC TCCGGCTGGC GCAAGGCTCA CCCTGCTGGA GCCCACTGGA
TTGACTGGTA CGATGGCCGC GATGCTACCG AAGTTATGGA TGCCTTTCAT TCCGAAAAAG
GACGCGCTAT GTACAAGCGC TTGCCTGCCA GCTCTACTGA AAGTGTCGCC ATGTTGGAAA
CTACTATTGC TCCTGACAGT TCCACACAAA TTGCTTTTCG CCAACTACGA GACGATCTGG
AAAAGGAGGG TTGGTGGAAA CGTGACATGG TGCACGAATT TACGCAGCTT GGTATTTGGG
CCTCGCTCGT GGTCGGTGCC GCCGTAACTG CACATTCAGC CCCTCCACTT GCTACTTTTT
TGTTGGGACT TTCAATGACG GCAGCCGGTT GGTTGGGCCA CGATTTCATT CACGGCGTTG
ACTCGTTCAC CGATCGCCTC CGGAATTTTG CCGGTGTTGC CGCTGGTCTC GGGCCTACCT
GGTGGTCCGA CAAGCACAAC AAGCATCACG CTTTGACCAA TGAACAAGGC GTAGATGAGG
ATATTGCTAC GGATCCTTTC TTATTCACGT GGGCGCCGGA TCCTAAGGAT GATTCACCCT
TGCGCAAAAT CCAGCACTTG ATTTTCTGGG TTCCATTCTC GGCACTTTTT GCGCTGTGGC
GTGTCGATAC CATGCAGGTA GTAATTGAAG CCGTCGAAAA CAAGCGTGTC GGGGCAAAAG
GTGAACTGTA TGGACTTTTA CTGCACTATG CTGTGTTGTT TACTGTCTTT CCGGTCACTG
TCTGGCTTCC CGCGATCTTT TTGAGCGGAC TGATGTCAGC CTTGATCGTC ACGCCCACGC
ACCAATCCGA AGAAATGTTT GAAACTTATC AGCCAGATTG GGTCACGGCG CAGTTCCAAT
CGACCCGCAA CGCAGTAACC ACCAATCCTT TTTCCGAATG GCTCTGGGGC GGCATGCAGT
ACCAACTCGA ACACCATTTG TTTCCTTCTA TGCCACGCAA TCGCTATCCG GCACTGCGCG
AACGCCTAAT TCAGTTTGCC GCGGACAATA AGATTCCCGG TGGCTACCGA GAAAGCGGCG
AGTTTGAAAT TCTACGCATG AATTGGAATC TTTACAAGTC GGTGGCGGAA GCAGATGCGG
TCCCTGGTGC ACCTCCTACT CGTGGTCGTC TAGGACAGCA AGGTGCAATT CGCGAAACAA
ACAGTCCGGC TGCTCAGCAA GAAAAGGCGA AGATCGACCA GACGGTAGCA AAGGGGAATG
GCCCGGCGTT GGAATCTGTG TAG
 
Protein sequence
MKFLHSALIV LTSASASSAF TATNFFCLSQ YGSSVSGCPE DFIRQWKAAG STTNRISKKN 
VAYDCDEDAN CVIVDACDDE QCRTSLDVRI HGKWYDLSGW RKAHPAGAHW IDWYDGRDAT
EVMDAFHSEK GRAMYKRLPA SSTESVAMLE TTIAPDSSTQ IAFRQLRDDL EKEGWWKRDM
VHEFTQLGIW ASLVVGAAVT AHSAPPLATF LLGLSMTAAG WLGHDFIHGV DSFTDRLRNF
AGVAAGLGPT WWSDKHNKHH ALTNEQGVDE DIATDPFLFT WAPDPKDDSP LRKIQHLIFW
VPFSALFALW RVDTMQVVIE AVENKRVGAK GELYGLLLHY AVLFTVFPVT VWLPAIFLSG
LMSALIVTPT HQSEEMFETY QPDWVTAQFQ STRNAVTTNP FSEWLWGGMQ YQLEHHLFPS
MPRNRYPALR ERLIQFAADN KIPGGYRESG EFEILRMNWN LYKSVAEADA VPGAPPTRGR
LGQQGAIRET NSPAAQQEKA KIDQTVAKGN GPALESV