Gene PHATRDRAFT_47749 details


Gene Information

Locus tag: PHATRDRAFT_47749
Symbol:
ID: 7202733
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011682
Strand:
Start bp: 719781
End bp: 720948
Gene Length: 1168 bp
Protein Length: 327 aa
Translation table:
GC content: 51%
IMG OID:
Product: predicted protein
Protein accession: XP_002181962
Protein GI: 219123294
COG category:
COG ID:
TIGRFAM ID:
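
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that the coordinates are 1-based and inclusive and that the protein length excludes the stop codon; the record states neither convention, and the variable names are illustrative.

```python
# Values copied from the record above.
start_bp = 719781
end_bp = 720948
protein_length_aa = 327

# Assumption: coordinates are 1-based and inclusive.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the protein length excludes the stop codon.
coding_bp = (protein_length_aa + 1) * 3

# Bases within the gene span that do not encode protein
# (intronic or untranslated sequence).
noncoding_bp = gene_length_bp - coding_bp

print(gene_length_bp, coding_bp, noncoding_bp)
```

Under these assumptions the computed span matches the listed Gene Length of 1168 bp, and the bases not accounted for by the protein are consistent with the gene being flagged as spliced.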


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCTGCCG CTTCGTTGCA TGCACCGAAC GACGCTTTGG CGGAGCTTCT CGCTCCGGAC 
GGGTATTACA AGTACCTAGG CGTTTGCAAG CCTTCACCGG CAGCGGAAAG CTCTGGTAGA
TCGTCGGAAA TGGAAGGTCC TTCGGGGTCT TCTGCGAAAG AAGACACACT CGACGAAGAT
ACTGTAAAGA AAGCATACCG TAGACTGAGT CGAAAACACC ATCCCGACAA GCCTGGCGGC
GATGCCGATA CGTTTCGCAT GCTCAACCGA GCGCAGAAGG TTCTTCTCAA TCCGAAGCTC
CGCCAACAAT ACGATATTTT GGGAATCGAT TTGGATGATG ACGAAGAGGA GCATGCCGAC
AACAATCATC ACGATCATCC TGACGATAAA AAAGATGGGA ATACGGCGCA AGGCATTGTT
CATGAAATCG CTAGTATGGC GTTGACAACT ATCGTACAGC TCGGAGTTCG AACCCGTACG
TACTTGTTTT CGTCAGACGA AGAGACGGTA ACTGACACCA TCGCTCACGC GCTTCATCCC
TCTTCCGTTT CCGGAATTCT TTGGCAGTCA TGCTGGCTGG AGTATCAATC CTCGTTACCC
GGTATCGCTG GACCGTGTAT CCAGCCATAC TTTTTCTAGC ATACATTGCT TTCACTATTC
TGAAACAAGC CCGGTTGCCG GGGCATTCCC TGCTTGATAT GTTACCTCCG TTGCTGATTG
CAACTGGTCT TTTGTGCATG TTCTACGGTC GCGTAGTATC GGTCGGCGAC AGTGATTCGC
CGGATGCCGG TACAACGACC GCACCTTCAT GGACCTGGCT GTTCTGGAGC GGCGAAGTGC
TGGTTATTGC CATGTTCACC TTCAATTCCA TGAGTGCGAT ACCCAAGACT CCCCTTGTTT
TGTCGTTACT CGGTATATTC TCGGCCCTTA CGGCACTTTG GTTTCGCGGA AAGTTTTGGA
ATTACGTAAT TGTCCTCGTC ATGGAAGGCT TGCTTGGTGT ATTCGTGGCC TTGGCCTTCC
CCGTAATGGA ACTCATTCTG GAAGCGGTTC TGAATGAGAA ACTGAAACGG GTGGGCGACA
AGGTTCGTGC GCATCATCGA CAGTTAGAAG CCTACTACGC GGCCAAACTG CAACAAAGGG
ATCACTAAAT TAAGAAGTGG TATCTTTG
 
Protein sequence
MPAASLHAPN DALAELLAPD GYYKYLGVCK PSPAAESSGR SSEMEGPSGS SAKEDTLDED 
TVKKAYRRLS RKHHPDKPGG DADTFRMLNR AQKVLLNPKL RQQYDILGID LDDDEEEHAD
NNHHDHPDDK KDGNTAQGIV HEIASMALTT IVQLGVRTPY IAFTILKQAR LPGHSLLDML
PPLLIATGLL CMFYGRVVSV GDSDSPDAGT TTAPSWTWLF WSGEVLVIAM FTFNSMSAIP
KTPLVLSLLG IFSALTALWF RGKFWNYVIV LVMEGLLGVF VALAFPVMEL ILEAVLNEKL
KRVGDKVRAH HRQLEAYYAA KLQQRDH
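
The sequences above are printed in space-separated blocks of 10 residues. As a minimal sketch, the following Python removes that spacing and translates the first line of the gene sequence so it can be compared with the start of the protein sequence. The record leaves the Translation table field blank, so the standard genetic code is assumed here; the function names are illustrative.

```python
from itertools import product

# Assumption: standard genetic code (the record's "Translation table" is blank).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def clean(seq_text):
    """Remove the spaces and line breaks used for display."""
    return "".join(seq_text.split()).upper()

def translate(dna):
    """Translate complete codons in frame 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First line of the gene sequence and the first two protein blocks, as listed.
first_gene_line = "ATGCCTGCCG CTTCGTTGCA TGCACCGAAC GACGCTTTGG CGGAGCTTCT CGCTCCGGAC"
first_protein_blocks = "MPAASLHAPN DALAELLAPD"

dna = clean(first_gene_line)
print(len(dna))
print(translate(dna) == clean(first_protein_blocks))
```

Translating the full gene sequence in a single frame is not expected to reproduce the listed protein, because the record flags the gene as spliced; any intronic sequence would have to be removed first.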