Gene PHATRDRAFT_12346 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_12346 
Symbol 
ID7200800 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011676 
Strand
Start bp269450 
End bp270830 
Gene Length1381 bp 
Protein Length408 aa 
Translation table 
GC content54% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002180004 
Protein GI219118464 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCCTTGA TCGAAAAGGG TGTCAAGGAA GGGCTCAAGG CTGCCAACGT CAAGTTTCCG 
GCACAACTTG GCGGTTGCAT CTTTCTCTTT TCCTTCATGA TGCTGGCAGA AAAGATCAAT
CCCGAACTCG GCAACGCAAT CTTCGAAGCC TTGTCGCCCG GAGCCGGGAT TCTGGCCAAG
TGGCTTCCCG TCTTTTTCGT TCCCGGCCTA GCACTCTTGC CCTTGTCACC GAAGATTGGG
ACCAGTGTCG ATGTAAGTTT CGTGTCGGGA GAAATGTTGG TATTGGCGTT GAGATTCCGT
TGCCGGGTTC AACTCGTGAA GAAATCTCAC ACTTGGGATT GTACAAACTC TATACCGACA
GGTTGCCAAA GTCATTATGG TTTGCTGTCT CGGATTTGTC TACACCGTCA CCACCACCGT
TGCTTCCGTC TTGACCGCTC TCAAAGTACA AGGAACCCCG GTCAAAGTTG CGTCCGTTGC
CCAAACGAAG AAGACCGCCA CCGCTACACC GGCCAAAAAA CCGTTTAGTG ACGCCACAAT
GGGTTTCTTT ATCAAGGGAA CCTTTATCAC TGCAGTTCTC AGTCTCTTGG CCACCAAGAT
GAACAACGAC TTTAGCAGTC CATTGCAAAC AGCCTTTTTG GGTTTCTTTA CCTTTGCGGC
CTATGTTTGG GGGGCCCGTC TACCGACCGG CTTTGTTAAG GTTATTCATC CCTTGGTCAC
CAGTTCCATT CTCGTACTGG GACTCATGCA AGCGCTGGCC CGGATCAACG GCCAAGACTT
TCTGGACGTT GTCCGTAGCT ACAAGGTGGG ATCGTTGTAT CCCATGAAAG CCGGTGCCGG
AGATATCCTT TTGTATCTTT TGGGACCCTC CGTGGTGTCC TTTGCTATTT CCATGTACAG
TCGTCGTGAT TTGCTCAAGA GCAACCTGCT CGTAGTATTG ACGGCCATGT TCGTTTCCAG
TGCCGGTGGC CTCTTCGGGA CAGCCGCCTT TGTCCGTCTC ATTAATTTGG GAGGACGCGG
CGGACGCATG GTGCGACTCT CGGTATTGGC CCGCAACATT ACCACGGCCT TGTCCATGGC
ACTCACCGCC ATGTTGGGCG GAGACATTTC CGTCGCGGCC AGTGTAGTTG TCTTGACGGG
CATTATTGGT GCAACCTACG GCAAGGCCCT GTTGGCGTTG TTGAACATTT CGGATCCTAT
CGTTCGTGGA TTGGCCATTG GATCGTCGTC GCAGGGCCTC GGGGTGGCGG CCATTTCGGA
CGAGCCGGAC GCCTTTCCTT TTGCCGCTAT TTCCATGGTT TTGACGGCCA TTTCCGCCAC
CACTTTGGTT TCCATCCCGG CCGTCCGGGC AGCGCTGATC CGTACGGCCG TCGGTAACTA
G
 
Protein sequence
MALIEKGVKE GLKAANVKFP AQLGGCIFLF SFMMLAEKIN PELGNAIFEA LSPGAGILAK 
WLPVFFVPGL ALLPLSPKIG TSVDVSFVSG EMLVLALRFR TPVKVASVAQ TKKTATATPA
KKPFSDATMG FFIKGTFITA VLSLLATKMN NDFSSPLQTA FLGFFTFAAY VWGARLPTGF
VKVIHPLVTS SILVLGLMQA LARINGQDFL DVVRSYKVGS LYPMKAGAGD ILLYLLGPSV
VSFAISMYSR RDLLKSNLLV VLTAMFVSSA GGLFGTAAFV RLINLGGRGG RMVRLSVLAR
NITTALSMAL TAMLGGDISV AASVVVLTGI IGATYGKALL ALLNISDPIV RGLAIGSSSQ
GLGVAAISDE PDAFPFAAIS MVLTAISATT LVSIPAVRAA LIRTAVGN