Gene PHATRDRAFT_47445 details


Gene Information

Locus tag: PHATRDRAFT_47445
Symbol:
ID: 7202567
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011681
Strand:
Start bp: 640416
End bp: 642119
Gene Length: 1704 bp
Protein Length: 538 aa
Translation table:
GC content: 49%
IMG OID:
Product: predicted protein
Protein accession: XP_002181772
Protein GI: 219122895
COG category:
COG ID:
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAGCAAG CATTTCCCAA TTTGCGAATA CGCATCAAAG TGATTCCCAG ATTTTTTCGC 
TGGTATGCGT GCCATGGCCT CGTGTTCGTA GGCCTCCTCG GGGGCTCAAC CGTGGCTACC
GCGAAAAGAC GAGCCGATAC CCGCCTGAGT ATTCCTAGAC AGCTGAAGCG TTCACTGAAA
GACCTAAGAG AGGCTGGTTC CAAACGGAGC AGGGAAATTT TTCAAAACGT AAGTTCCGTA
CGTGCAGGTG GCTCTGCGTT TGATATGGAC CAGTTTGGCC GGTCGGTATC GACTGTACTG
GGATTCATAG CGGGAACTCA GACACTGGGT ACGATACTGA CTGCGAATAA GAATGCCATC
TTTGATCAGG TGTGTGATCC GAAGTCGTGA TTATGACGGC CGCAAAAATC GGTTTGTTGA
CAGTGTCTCA ATTTTGTCTG TTACTTTCGT CTCCAGTTGG CTGTCCGTCT ATTGGAACCA
GACAGCGTCG CTGCGATGCA TACGGTGAGA AAAATTCTGA GCTCTGTTGA CGATCACGGG
ATTGTTGACG TCTTTGCAAA GTACCGTAGT AGAGACGTGC TACTGTCGAT CTACGCATTG
AGTCGCCTCC AAGACGCCGT GGCAGAAAAC GATCGTCGAC GAGAAACCTA TTCAGCGTTT
CTTGACAACG AGCTGATCAC TGACTTAGCC CATTACTCTG TTTACGCCAG TGTGGCTTAT
GGTTGGAAGA TGCATTTTGC CTTCGGAGGA GGCTTGCACC TAGGGGATTT ACAGGTGTTG
CTGAAGCGGA CAGGAATTTT TCTTGCAGAC TTGCTCGAAC ACAAAAAGGA ATCCAAGGCG
CATCGTCCCG CTTATTTCAT CGTGAGGGAT CGATCCAGAC GCAAGTTAGT GTTGTGTATA
CGAGGGACTC TGTCAGCACA CGACCTTTTA ACTGACCTTT GCTGTTCGCC AGATGAGTAT
GAATTACCAA GGTCGACGTC TCGATCGCGC ATCAAAACAT TATCGGATTA TTGGTGGAAC
GGCGGAAGCG CACATATAAA GATGCGTGCT CATCAAGGAA TGCTGCAAGC TTCTCGTTTG
CTCAAGAAAG ATGCAGAGGA TCTCATTCGC AGCCACCTTA AGGAAAATCC CGGTTTCTCT
CTGGTTCTCG TAGGGCATTC CATGGGTGGT GGTGTCGCAG CTTTGCTGGG CACATTGTGG
GAAGACACAT TCGAGAATCT CCAGGTTTAC GTTTTCGGCC CCCCCTGTGT TTCGTGCTTT
GGCGTGGCAC CTACCGGTAC CCGGAACATT GTATCCGTGA TTTCGGATGG TGATCCCTTC
CGAAGCTTTA GTCTCGGACA CGTCGCCGAC TTGTCCATTG GCGTGGCTCT GCTGTGTGAT
GATCCTCATC TTCGAAGGAT GATCCTCATG AAGACGAATG GTCGAACAAA AGAGATTGGA
GCCCTTGACT TGCAATGGTG CGTACAAACC ATGAAGAGAA TGCGTGGAGA CATGAAGTCA
GAAAAGCTTT TTCCGCCAGG TCGACTACTG TTGCTGTCAA GCAAAGGTGG CATGTGCAAA
GTTCGAGAGG TACCTACCGA GTTTTTCGGA GAGCTTGCCA TCAATCACAA GATGTTTGAC
GTCTCAAAAC ACATACCTGC GAGGTACGAG TCCATTCTTC GATCTATTCT AGAGCATCGA
GGAGCTAGTC CATCAGTAGA CTAA
 
Protein sequence
MKQAFPNLRI RIKVIPRFFR WYACHGLVFV GLLGGSTVAT AKRRADTRLS IPRQLKRSLK 
DLREAGSKRS REIFQNVSSV RAGGSAFDMD QFGRSVSTVL GFIAGTQTLG TILTANKNAI
FDQLAVRLLE PDSVAAMHTV RKILSSVDDH GIVDVFAKYR SRDVLLSIYA LSRLQDAVAE
NDRRRETYSA FLDNELITDL AHYSVYASVA YGWKMHFAFG GGLHLGDLQV LLKRTGIFLA
DLLEHKKESK AHRPAYFIVR DRSRRKLVLC IRGTLSAHDL LTDLCCSPDE YELPRSTSRS
RIKTLSDYWW NGGSAHIKMR AHQGMLQASR LLKKDAEDLI RSHLKENPGF SLVLVGHSMG
GGVAALLGTL WEDTFENLQV YVFGPPCVSC FGVAPTGTRN IVSVISDGDP FRSFSLGHVA
DLSIGVALLC DDPHLRRMIL MKTNGRTKEI GALDLQWCVQ TMKRMRGDMK SEKLFPPGRL
LLLSSKGGMC KVREVPTEFF GELAINHKMF DVSKHIPARY ESILRSILEH RGASPSVD
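
The reported gene length and GC content can be cross-checked against the coordinates and the grouped sequence block with a short script. The following is a minimal sketch, not part of the original record. The helper names (`clean_sequence`, `span_length`, `gc_percent`) are hypothetical, and it assumes the start and end coordinates are 1-based and inclusive, which is consistent with 642119 - 640416 + 1 = 1704 matching the listed Gene Length.

```python
def clean_sequence(block: str) -> str:
    """Join a sequence block printed in space-separated groups of 10
    (as in this record) into one uppercase string."""
    return "".join(block.split()).upper()


def span_length(start_bp: int, end_bp: int) -> int:
    """Feature length from coordinates, assuming 1-based inclusive positions."""
    return end_bp - start_bp + 1


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


if __name__ == "__main__":
    # Coordinates taken from the Gene Information section of this record.
    print(span_length(640416, 642119))  # 1704

    # Only an excerpt (the first 30 bases) is used here; paste the full
    # "Gene sequence" block to compare against the reported GC content.
    excerpt = clean_sequence("ATGAAGCAAG CATTTCCCAA TTTGCGAATA")
    print(len(excerpt), round(gc_percent(excerpt), 1))
```

Because the record marks the gene as spliced, the 1704 bp span presumably includes intron sequence, so translating the gene sequence directly is not expected to reproduce the 538 aa protein sequence.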