Gene PHATRDRAFT_40052 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_40052 
Symbol 
ID7195762 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011690 
Strand
Start bp33053 
End bp34348 
Gene Length1296 bp 
Protein Length377 aa 
Translation table 
GC content50% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002184172 
Protein GI219127917 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGACTC CGTCCGACTA CAAATGCTAC CACGAAAACG ATGGCCTCAC CAAATACGGC 
GGAAGAGATT TGCTGGGCTA CGGTAGTCAT CCACCCGCTC CCAAATGGCC CAAAGGTACC
AAGGTTGCTT TGAATTTCGT CATCAACTAC GAAGAGGGCG GTGAAAAGTG CCTTTTGCAC
GGCGACGGGG AGAGCGAGAA GCTTCTTTCC GAAATCACCG GGGCAGCGGC ATTCGGTATG
TCATTGAACG GGAGCACAAC TTGGTGCGAT CAAAGCCTCT AAACTCTCAA CCGCTTTGTC
GATGTTGTTC CTTCGTTACG CCAGAGGGCC AACGACACAA AAACATGGAA AGTCTATACG
ATTACGGAGC TCGGGTAGGA TTCTGGAGAT TCCATAGGCT TTTTACGAAA AAGAAGGTAC
CGACGACCGT ATTTGCGGTC GGTATGGCCC TGGAACGCAA TCCTGCGGTG TGTAGCGCTC
TCAAAGAGAC GGACTGGGAA GTTGCCTCGC ATGGATATCG CTGGATCGAC TACCAGGATG
TCGACGAAGA TACGGAACGG GAACATATCG CCCGGTCGGT TCGCATTCAC GAAAAACTCC
TCGGAAAAAG GCCCGTCGGG TTTTATCAAG GAAAGGTAAG TAAAAATTTC AATTTGCACA
TTCGCGGTGA TCACAATGTC TCATCAATTT TCATCGTCTC GCCGCCCCAG CCCAATCTCA
ACACTAGACG GCTCGTTCAC GAGGAAGGCG GCTTCAAGTA TGATTCCGAT TCTTATGAAG
ACGACCTTCC CTATTGGACT GTTGATGTCG ACGGGATGCC ACGTTTGATT ATTCCTTACT
CGTTGGCTGA GAACGACATG AAATTCACCA GTCCAGCCGG AGTCGCTACC GGTAAGGACT
TTTGCCAAAT GCTCAAAGAG ACACTCAAGT AAGTGCGGTC CGATAGTGCT CGTGGGACAA
AATTATCGAC ATGTTTGTGT CTCATACTGC TTGTGCGCCA GGTATCTCAT TGAGGAAGGT
CGTGCCGGTC AGCCAAAAAT GATGAGTGTC GGTTTGCACT GTCGTTTGGC ACGCCCGGGC
CGCGTCGCTG GATTGTCTGA CTTTATTGAC TTTGCCAAAT CCTACCACCG AGACGTATGG
ATTTGTACGA GAGAGCAGAT TGCCGATTTT TGGTACGAAA ATCACTACCC CCGGGGCGGT
GGGACACCCG TCAAAGCCGA AAAGGACGAC ACTGAGAACA CTACTAACGG AGCGGAACAA
ATGGCTACGG AAGATTTCGA AGGTGATGTA ATTTAA
 
Protein sequence
MTTPSDYKCY HENDGLTKYG GRDLLGYGSH PPAPKWPKGT KVALNFVINY EEGGEKCLLH 
GDGESEKLLS EITGAAAFEG QRHKNMESLY DYGARVGFWR FHRLFTKKKV PTTVFAVGMA
LERNPAVCSA LKETDWEVAS HGYRWIDYQD VDEDTEREHI ARSVRIHEKL LGKRPVGFYQ
GKVSKNFNLH IRGDHNVSSI FIVSPPQPNL NTRRLVHEEG GFKYDSDSYE DDLPYWTVDV
DGMPRLIIPY SLAENDMKFT SPAGVATGKD FCQMLKETLK YLIEEGRAGQ PKMMSVGLHC
RLARPGRVAG LSDFIDFAKS YHRDVWICTR EQIADFWYEN HYPRGGGTPV KAEKDDTENT
TNGAEQMATE DFEGDVI