Gene PHATR_21006 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATR_21006 
Symbol 
ID7204601 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011679 
Strand
Start bp189507 
End bp191284 
Gene Length1778 bp 
Protein Length466 aa 
Translation table 
GC content44% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002185837 
Protein GI219121218 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.540977 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGACGAAC TTCCTATTAC CGACCCTTCC ATCCATGCCA ACCTCCTACT CGAATCGAAG 
GAGGATGTCT TCAAAAAATA TTCCGTCGTC CAAGTGCTCG GTAATGGATC AATGGGAACC
GTTTCCAAGG TCAAAATTAA GAAGCACAAG GTCGGGGGAA GCGCCTTTCA GCCGAAATCC
AAGGGAATTT TTGGCTTTTT GAAGAAACAG AACAACAAAA GGAAGGAAGG TGAGACCAGA
GAACACAATA GTCAGGACTA TATATACGCA CTCAAGTCCA TCATTCTGGA TCGGGTCTCT
TCTGTCTTCC TGGACGAGCT CCGTAACGAA ATTCTTATCC TTAGATCATT GGATCATCCC
AATATTGTCA AAGCGCACGA GGTTTACTAC ACGAGGAAGC AGATTTATCT CGGTGCGTGA
TGAGGAATGT GAAACGTGAT TGGTCAGAAG TCTGAACTTT TTCTCTTTGG AAAGGAGACT
AACGCATTTC CGTCATCGTT TTCTTATATC CGTGTGCTGT TTCTGGAAGT ATTGGAGTTG
TGTGATGGCG GAGACCTTTA TACCAGGTCG CCTTACAGTG AAAGGGAATC GGCAAGGATT
CTGCAACAAA TATTGTCGGC AGTGCGGTAC ATGCATGGTA CGCTATACCG ATACGCATAT
AGAATTCCCG CAGCAAACTT TGGGAAGCTA ACAATTCTTG TTCTGCTTTG TCTAGATCAC
GGAATTGTTC ATCGGGATCT CAAGTTCGAG AATATCATGT TTGAGAACAA TAGCCCCAGT
GCTCGGTAGG TTACTGATCA ACTCTCGAGG CTACACGTAG ATGAAGCGCT CTTACAGTCA
ATAATGTTCT ATTCATTTTG CACAGAGTCA AAATTATAGA TTTTGGATTG TCTAAAAAGT
TCCTTGGCAA ACCGTCGTAC ATGACCGAAC GCGTTGGTAC CGTCTATACG ATGGCCCCGC
AAGTCCTGCA AGGAGTCTAC TCATCGCAAG CTGATCTTTG GTCCGCTGGA GTGATAGCCT
ACATGCTGTT ATCGGCTTCA AAGCCTTTTT ATCACAAACG ACGGCGCAAG ATGATTGACC
AAATCATGAG GGCCGACTTC GGATATAATG CACCGGTCTG GAAGCAAATA TCAGAAAGTG
CGCAAGATTT TGTAAGTCGA TTACTAGTGG TGGATCCAAA GAAAAGACTG AATGCAGAAA
AAGCATTGGA CCATTCTTGG ATTGTGAATC GCGAACGCTT GCCAGATGAG ACACCATCCG
AGGATTTGTT GGCCGCTGTC GATGATTGCC TCGTGAATTA TCGACAAACG TCGGAGCTGA
AAAAGCTAGC TTTAAACATG ATCGCCCATC GTTCTACCGC GGAAGAGATC ATGCAACTTC
GGAAAGTTTT TGACAGCTAC GACACCTCGA ATGATGGAAT TATTACATTT GATGAATTCA
AAGCAGCTTT GCACAAAATG AAATATCCGG ATGAGATTGT ACAGGAAGTT TTTAGCAGTA
TTGATGTCAA CCGAAATGGC CATATACAGT ACACGGAATT CATTGCATCG ACCGTCTTGG
CACAGGGACA TATCGCAGAG GATCGGGTCG CAGTAGCTTT CGATCGCTTG GACTCTGATG
ACACCGGCTT TATTTCCAAG AAGAACTTGC AAAACGCATT GGGCAAGGAA TACACTCCAG
AACTCGTCGA AAATATAATG GAAGAAGTTG ACAAAGATAG GGATGGCAAA ATATCATATA
CCGAGTTTCT GCAATACTTT CGGAAGGAAA CGAGCAAT
 
Protein sequence
MDELPITDPS IHANLLLESK EDVFKKYSVV QVLGNGSMGT VSKTREHNSQ DYIYALKSII 
LDRVSSVFLD ELRNEILILR SLDHPNIVKA HEVYYTRKQI YLVLELCDGG DLYTRSPYSE
RESARILQQI LSAVRYMHDH GIVHRDLKFE NIMFENNSPS ARVKIIDFGL SKKFLGKPSY
MTERVGTVYT MAPQVLQGVY SSQADLWSAG VIAYMLLSAS KPFYHKRRRK MIDQIMRADF
GYNAPVWKQI SESAQDFVSR LLVVDPKKRL NAEKALDHSW IVNRERLPDE TPSEDLLAAV
DDCLVNYRQT SELKKLALNM IAHRSTAEEI MQLRKVFDSY DTSNDGIITF DEFKAALHKM
KYPDEIVQEV FSSIDVNRNG HIQYTEFIAS TVLAQGHIAE DRVAVAFDRL DSDDTGFISK
KNLQNALGKE YTPELVENIM EEVDKDRDGK ISYTEFLQYF RKETSN