Gene PHATRDRAFT_40938 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_40938 
Symbol 
ID7198815 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011694 
Strand
Start bp314678 
End bp316233 
Gene Length1556 bp 
Protein Length416 aa 
Translation table 
GC content55% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002184942 
Protein GI219129535 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.445023 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCGAGTC GCCGTCTGCC TCGCGCATTG ACGACGTTGA CGATTTTTTT GGGAGGCCTC 
TATTTAGGAA ATTTGCAGCA TCAACTGACA TCACGTCAGG AAGCTCCCGT GACGGCTCTG
GGCGGCCCGG CATTGCCTTT GTCTTTCTCG GGAGCAAGCG TTCCACACTA CGAATCCGCA
CAGTCGAGAG AAATAGTTAA GTCGCCCGCT CTGTGGGAGA CACAGCTGGC TGACGCTCAG
CAAGAGACGA AGCGACTACG CTCTCAGATT AAGGAGATGG AGAAGCAAGT TCGGCGTAGC
CGATACCAAT CTTCGACATT CGAGCAACCG CTCGTTGATT CGAATCCACG GGCTGTGCCC
AATCCTCACT GGGTCTGTCG GCAGCTATCT ACAGCTGTCA ATGCTACTAA CGCAATCCCC
AGTGCATCCT TTCTTTGGAA TACTCGTTTG AAATCGATTC ACGCGGCTTC CCAGCTCAAA
GTGAACGACC CCCGCTACTA TTTTTCCGAT TTTACGGCCC AACTCTTGGC CATTGTTGCA
CCGCGATTGT CCCGGTCCTC CGGTCACGAT GCCGATGGTG TGACTGTCCA GTATTTGCTC
GATCGGATAC AGGCTCGGTA CGAATACTTA CACGCTCAAG GGCCGATTGC GGAGCCCGTC
AAAATCGTCG TCCTTGGAGG CAGCGTTTTG GTGGGACGCA ACTGCCGCAA GCTCTGCAAG
GATCTGGGGT TGCAACTGCG CATGCCTCAA CGCGAATGCA CCTGGGCTCA TCGACTAGGA
GTCTTCCTGA ACGTCCTGGT ACCTGACATC TTTCGCGTCA CCAAAATTGC CATGGGCGGG
ACCAATACGG CGGTCGGCAC GACCATTTGG AAGTACGATT TGTTGCCACC CGAAGCGCGT
CAACCCGACG TTGTCATCAA CGCCTACAGT ACCAACGATA TGCACATCCT CACGGCGTTG
GAAGCATCCT CCGGTAACCA AACGCTACGG GATCGCGTTT TTGTCATGCT CCAAGACTTT
GCGCGGGAAG TCCTGGTACC GCCGCCATTG GCGTGCACCA ACGCGCCTCC ACCGCCACTT
TTTCTCCACG TGGACGACTA CCTCGGCAAC GAGCAACGCG CAATTCTGGC CACGACCGAG
CTGCGACAGA GCGTGGATGT ACTAGCGGCG TACTACATTT TTCCTACCGT CTCCTACGCA
GACGTAATCC GGGATTTGGT ATACGGTGAT ACGGCGGAAT CGTGGTTTTC TCCCGAAGGC
TGGTACGTCA AGGGTATGTC AGGGATGCAG CGGGAGATTC ACCCCGGAAT GGGTATGCAC
ATTGTTATGG TCTGGGTGAT TGCCTTTAAC CTGTTGCACG TGGCGACAAC ACACTGTAGT
CGAGAGATAA GTTCCAGGCA GAACCTACAG CTTGATTACG ACAGGTCACT GTTGGCTCGG
GACGTGCCAT TGCAGAATGG GCCGTACACG AATGTTCGCG GCAAGCCAAA CCGTCTTCCC
GAAAGCTTGC CACCCCCGTT GACGTCCAAC ACAACTCTAG AGACAATATC GATTGA
 
Protein sequence
MASRRLPRAL TTLTIFLGGL YLGNLQHQLT SRQEAPVTAL GGPALPLSFS GASVPHYESA 
HASFLWNTRL KSIHAASQLK VNDPRYYFSD FTAQLLAIVA PRLSRSSGHD ADGVTVQYLL
DRIQARYEYL HAQGPIAEPV KIVVLGGSVL VGRNCRKLCK DLGLQLRMPQ RECTWAHRLG
VFLNVLVPDI FRVTKIAMGG TNTAVGTTIW KYDLLPPEAR QPDVVINAYS TNDMHILTAL
EASSGNQTLR DRVFVMLQDF AREVLVPPPL ACTNAPPPPL FLHVDDYLGN EQRAILATTE
LRQSVDVLAA YYIFPTVSYA DVIRDLVYGD TAESWFSPEG WYVKGMSGMQ REIHPGMGMH
IVMVWVIAFN LLHVTVGSGR AIAEWAVHEC SRQAKPSSRK LATPVDVQHN SRDNID