Gene PHATRDRAFT_40792 details

Gene Information

Locus tag: PHATRDRAFT_40792
Symbol:
ID: 7198646
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011693
Strand:
Start bp: 411627
End bp: 413768
Gene Length: 2142 bp
Protein Length: 713 aa
Translation table:
GC content: 52%
IMG OID:
Product: predicted protein
Protein accession: XP_002184800
Protein GI: 219129236
COG category:
COG ID:
TIGRFAM ID:
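The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the unspliced CDS ends in a single stop codon. The helper names `span_length` and `expected_protein_length` are hypothetical and are not part of the source database.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 411627
end_bp = 413768
gene_length_bp = 2142
protein_length_aa = 713


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in an unspliced CDS minus one assumed stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


coords_consistent = span_length(start_bp, end_bp) == gene_length_bp
protein_consistent = expected_protein_length(gene_length_bp) == protein_length_aa
print(coords_consistent, protein_consistent)
```

Under these assumptions both checks agree with the record: the span covers 2142 bp, which corresponds to 714 codons, or 713 amino acids plus a stop codon.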


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.162987
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCCGCCT ATGAACGTGC CGTACGACAG CAAAATCCAA CACCGGATCC TCCGGTCAAT 
CCTCTTCCTA TAAAGTCAAC AGTAGCGGCG ATCGACAGCG ACGCGAATCC CTGGAGTCAA
CTGGCACACG AGTGGGGAAG TATCAATAGC GACAATGATG ATGGTGCCGG GGTTGCTACG
ATGGCACCAC CAGTCTTTCA ACCGCCGAGT CAAGGCGGTA ATTCCCTCCA CAGTGCATCC
GCGACCCCCA TATCTCCCTC TACAAACACA CCATCTTTGG AAGAACTCGC GATCATGAGC
CGCGACGCCA AAGAAAACTT ATCCCGCCTC GAAGAACTGG AGTCGGAAAT CATGGACATG
ATCGATAGTA TGAAAGTCAA AGACGCTACG TACCAGACTA CCATTCGCAA ACTGGAACAG
GACTCTTCCG TCCTACAAAA TACGCTACAG GGCCAACTGC AGGCCCTTCA AACGGAATAC
CAGGAGTTCA AGGCCACCGC ATCGCAGCAA TCTCAGGCCG ATCGGCAAGC CACCGTGGCG
GCACAAGCGG CGTGGCAACG CCAGGTCCAG GACCTGCAAG CAACGGTGCA ACAGTACAAA
CAGACCGCCT TGGAGGAATC ACAAAGCCTT CAAGATATGG TAACGCTCAA GACCAAACTG
GAAAATCAAC TTTTGGACCT GCAAAAGGAA TCCACACTTA TGATTGACGA ATTCCGTACC
AAATTCTTTG CCGAACGGGA CGCCCGCTAT CGTGAAGTCC AAAAGGCGCA AGAAGATATT
GGTAGAATGG AACTGCAAAC CCGCGATATG CTCCTTGATG CCTTGAACGA AGGCCAGCAG
CAGTTGAATC GGGAATGGAC GGACTATACA TTTCAACTCA TTCAAAAAAC GGAAGAGGTC
CTGGAATCGC AGAATGCGTT GGAACAAGCC AGCGAAGGAT TACGCCAAAA GGATGCCCAA
TTGGCCGACT TACAGGCGCA CACGGCCTAT ATGCAGAGTT TGGCCGACGG AAGTTTGCAG
CTTCTGCATA AAGTTCACGA TGAAACCCTC GGTGGACCCC TGTCGACGGA AGCGGTAGAT
TTGCCAACTT TACAACAACG AGTCGGCGCC ATGGTCGAGA CCTTTAAAGC CAAGGATGCC
GACTACCGCA GCACAATGCA ACGATTGCAG TCAGAATACG GTCAAAAAGA GGCGGCCCTG
CAGCAACAGC TGCAAAAGTT GGAGGCAGAG TATCAGCAAT TTCAGACACG GACACGAGAC
GATCACACGC AGGAGTTACA GGCTGCGAGT GAAGTTCAGT CTCGTTTGCA ACGGGAAGTG
GCCGAGTTGC AGACGAAACT GACGCAACAA GTGGAACTCA CCAAAACGGA AAGCGAAAAA
GCGGCCCAGT TTTTTCAACG AAGACTGGAA ACGGAACGGG AATTAGCCAT CCTCAAGACT
AACTCTACGA AGGCGATGGA CGAATTCAAG GCCTTGTACG AAGCCGAACA AAAAGCACGA
CTAAAGGAAC AGCGATTGGC GTACGAACGC TTTCAGGAAA TGTTTCGCAC AGGGCGACGC
TGGGTGCGCG ATGCCTATAA GGACCGACAG GAGGAAGTCA AGAAAGTTTC CGAAAGCCTG
TCGACAGAGT TGAAAGCTAA AGATAAAGAG CTACGTAAAA CGAAGGGATC GCTGGGATGG
AGCGCCACGT TTTTGATTGC ATTGGCCCTG TGTGTCCCAG TGTACGTTGG CTTGGAAGTG
CCTCCCGCGG TGTGGGACAC CGGACGGTCG GTCGGATCCG CGGCGTCGAC TCTGAAGGAA
CGTGCGTTTC CCGAATCAAG TACCGGCGCG GCTAGTAGCA AGGCTCTTCC GATTGGACCG
GAACGTGTGA AACAGGAACG TTCGAAACCT TCGTCAGCAA CTCCGGCCGT TCGTATCAAA
GCAGACGACG AGCCTCGTGG AGCACCTTCG CAGGCATCTC CCGAGCCTGT CGAAACCACC
GAATCAGCGA GATCAGCCGA GCCTTTGACG GAAACAGTTA AATCCATAGA AGTTGTGACT
ACAAGGGTTT CCCCCAAGAA ATCGCTGAAG AAGTTGTCAT CCGACAGTAT CCCTTGGGGT
GAGTATTCCG AGGAGTTTGT AAAGAGCTAC GCTAAGCAAT AA
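The gene sequence is printed in space-separated blocks of ten bases, so the whitespace has to be removed before any computation. As a rough sketch of how a GC content figure such as the one reported in the Gene Information fields might be recomputed, the hypothetical helpers below strip whitespace and count G and C bases. How the database itself rounds or computes its value is not stated in the record, so this is an assumption.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_percent(raw: str) -> float:
    """Percentage of G and C bases in the cleaned sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# Example using only the first two blocks of the gene sequence.
first_blocks = "ATGTCCGCCT ATGAACGTGC"
print(round(gc_percent(first_blocks), 1))
```

Passing the full gene sequence text to `gc_percent` would give the value for the whole CDS.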
 
Protein sequence
MSAYERAVRQ QNPTPDPPVN PLPIKSTVAA IDSDANPWSQ LAHEWGSINS DNDDGAGVAT 
MAPPVFQPPS QGGNSLHSAS ATPISPSTNT PSLEELAIMS RDAKENLSRL EELESEIMDM
IDSMKVKDAT YQTTIRKLEQ DSSVLQNTLQ GQLQALQTEY QEFKATASQQ SQADRQATVA
AQAAWQRQVQ DLQATVQQYK QTALEESQSL QDMVTLKTKL ENQLLDLQKE STLMIDEFRT
KFFAERDARY REVQKAQEDI GRMELQTRDM LLDALNEGQQ QLNREWTDYT FQLIQKTEEV
LESQNALEQA SEGLRQKDAQ LADLQAHTAY MQSLADGSLQ LLHKVHDETL GGPLSTEAVD
LPTLQQRVGA MVETFKAKDA DYRSTMQRLQ SEYGQKEAAL QQQLQKLEAE YQQFQTRTRD
DHTQELQAAS EVQSRLQREV AELQTKLTQQ VELTKTESEK AAQFFQRRLE TERELAILKT
NSTKAMDEFK ALYEAEQKAR LKEQRLAYER FQEMFRTGRR WVRDAYKDRQ EEVKKVSESL
STELKAKDKE LRKTKGSLGW SATFLIALAL CVPVYVGLEV PPAVWDTGRS VGSAASTLKE
RAFPESSTGA ASSKALPIGP ERVKQERSKP SSATPAVRIK ADDEPRGAPS QASPEPVETT
ESARSAEPLT ETVKSIEVVT TRVSPKKSLK KLSSDSIPWG EYSEEFVKSY AKQ
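The protein sequence can be compared against a translation of the gene sequence. The record leaves the translation table field empty, so the sketch below assumes the standard genetic code (NCBI translation table 1). The function name `translate` is hypothetical.

```python
from itertools import product

# Standard genetic code (assumed), with codons ordered over the bases T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(raw: str) -> str:
    """Translate a block-formatted DNA sequence, stopping at the first stop codon."""
    seq = "".join(raw.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence from this record.
print(translate("ATGTCCGCCT ATGAACGTGC CGTACGACAG"))
```

The first ten codons translate to `MSAYERAVRQ`, which matches the first block of the protein sequence.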