Gene PHATRDRAFT_36301 details


Gene Information

Locus tag: PHATRDRAFT_36301
Symbol:
ID: 7201904
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011678
Strand:
Start bp: 115782
End bp: 116862
Gene Length: 1081 bp
Protein Length: 328 aa
Translation table:
GC content: 52%
IMG OID:
Product: predicted protein
Protein accession: XP_002180757
Protein GI: 219120018
COG category:
COG ID:
TIGRFAM ID:
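
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The Python sketch below is illustrative only and is not part of the database record. It assumes that the start and end coordinates are inclusive and that the protein length excludes the stop codon; the variable names are hypothetical.

```python
# Values copied from the Gene Information fields above.
start_bp = 115782
end_bp = 116862
gene_length_bp = 1081
protein_length_aa = 328

# Assumption: start and end are inclusive coordinates on the replicon.
span_bp = end_bp - start_bp + 1

# Assumption: the coding sequence is the protein plus one stop codon.
coding_bp = (protein_length_aa + 1) * 3

# Bases inside the gene span that the coding sequence does not account for.
noncoding_bp = gene_length_bp - coding_bp

print(span_bp == gene_length_bp)
print(coding_bp, noncoding_bp)
```

Under these assumptions the coordinate span agrees with the listed gene length, and the bases left over after subtracting the coding sequence are consistent with the gene being listed as spliced.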


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.208439
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCTGACTA AAAGCGAGGG ACGTCTTGGG ACTGACGAGA TGGTGCCTTT ACACGCTTGC 
CTTGCACAGT TTGACCTTTG GATTGCCGCC CTGCATGACG ATACTCACGA TTGGTCGCAG
ACATCACGCT CACATTGTAC CAGCGCCATA CTTTCCGTCC CAGCTTTGCT ATCTCTCATT
CTTTCCAATA GCAGAACTAG CACAATTTTA AAGGACAAAG AGCAAAGCAA TCTGGACCCG
GTTCTAAAGC TAACCAATGT TTGCTACGCT GGCAATACTA TTTCAAACAA GAAAACGCTC
ACCACGAGCT TGACTGCTGC CGTGGGTGTT GGTGGCTCCT CTGTTTTGGT TGCCATTGCA
GGATTGGGCT ACACCTCGGC CGGTATTACC GCCGGCAGTT GGGCTGCTTG GATCATGTCG
GCTGAGGCTG GCATGGCTGG TGGCGGCGTA GCTACCGGGG GTCTGAGCGC CACTCTACAG
AGCGCAGGAG CAGTTGGCCT CATGGGAGCC GGCTTGGGCC TCACTACATG TCTCTTTGCG
GTAGGAGCTG TAGTGGGTGG TACCGCGGCG ATGTATACCG TACGACACAA CCGGATGAAT
GCAATTCGAC TCGGTACAAT TGCTACATTG GATCAGGGCC TCGCAGTATT CGCCCATGGG
AATGTCGTTG CGTTGGTGTC GTCCAAGCAT AACCGCATTT TGAGAGTGGG TGATCAGTCG
CTGATAGATG CGTATGGAGA ATGTAGCGAT CAACCGCCTA GTTTACCGCT CGAATGGGAT
AGGGAACGAT TTTTGGTAGT CCGTGTTGGG GAAAGAAATT GTGCGCTTTA GCCTGTCTGC
GAGGCGTTTT ATACGCGTGG TGGATGAGAA CGGTCTTTCG TTGAGTGGAT TGGAGCGAGC
CTCGAATCTG GAACCGACGG GAGGAGAGAT CTTCACCATT CAAAAAGGCA TCAATGGCAA
TGTTTCCTTC TTTTCACAAA TGACGGGAAG CTTCGTCAGT ATCAATCAGA AGGGTGTTGT
TTCCGCTTTG GCTACTGAGG CTGGAGAATC CGAGCACTTC AAGGTATTGG TCTTGATTTA
G
 
Protein sequence
MLTKSEGRLG TDEMVPLHAC LAQFDLWIAA LHDDTHDWSQ TSRSHCTSAI LSVPALLSLI 
LSNSRTSTIL KDKEQSNLDP VLKLTNVCYA GNTISNKKTL TTSLTAAVGV GGSSVLVAIA
GLGYTSAGIT AGSWAAWIMS AEAGMAGGGV ATGGLSATLQ SAGAVGLMGA GLGLTTCLFA
VGAVVGGTAA MYTVRHNRMN AIRLGTIATL DQGLAVFAHG NVVALVSSKH NRILRSVLGK
EIVRFSLSAR RFIRVVDENG LSLSGLERAS NLEPTGGEIF TIQKGINGNV SFFSQMTGSF
VSINQKGVVS ALATEAGESE HFKVLVLI
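
The sequences above are displayed in blocks of ten characters, so the spaces and line breaks must be removed before the sequences can be measured or analysed. The Python sketch below shows one way to do that and to compute a GC fraction. The helper names `clean_sequence` and `gc_fraction` are hypothetical and are not taken from the database.

```python
def clean_sequence(text):
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(dna):
    """Return the fraction of G and C bases in a cleaned DNA string."""
    if not dna:
        return 0.0
    return sum(base in "GC" for base in dna) / len(dna)


# First two blocks of the gene sequence above, used as a short sample.
sample = "ATGCTGACTA AAAGCGAGGG"
dna = clean_sequence(sample)
print(len(dna), round(gc_fraction(dna), 2))
```

Passing the full gene sequence through `clean_sequence` gives a length that can be compared with the Gene Length field, and `gc_fraction` gives a value that can be compared with the GC content field.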