Gene PHATRDRAFT_40489 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_40489 
Symbol 
ID7198415 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011692 
Strand
Start bp51638 
End bp52825 
Gene Length1188 bp 
Protein Length395 aa 
Translation table 
GC content50% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002184560 
Protein GI219128732 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.0229238 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGACGACA CCCCTAGGTC GGACTTCAGT AAGGCAGCAT TGCTGTCGGC AGTCCGTCAG 
GCCGCTGGTT CGACACCTGT ACGCCGAAAT AAAATGAGAT CTGCACTGGA TGCGAAGACG
GCACCAGATA AGGCTCGAAA ACTAATTGCG CCCACAGTCA AATCGTTTGC GCTTATGAGG
CAGGTTTCAC GCTTGGGAAT GGATGACCCC GTATATACCC TAGCTGATCG TGGGGTACCC
AACCCTGCGA ATAGAATATA CGACGACATG AATGTAGTTG ATGTCCCTGA TGATGTATTG
GAGCAAATGT CGATCGCTTC CGATCCAACA GCAGCTTTTG GAAACGGCAT CGATGTCACA
GTTTTTGAAA AAAGTCTAGG CGACCTGCAG CCTCATTTCA GCATGTCCCA ATTTTCTCAG
ACGGAGTCGT CAGTACCTGT CGTTCCGCAA TCTCCTTCGC CTAGGTCGAT GGCTACAATC
GGATCCAACT CCTATTTGTC GCATATTCAG CAGTCCCAAA TGCCCTCCTC CTTTCGGCGG
TCGAGACCCC AAGAAATTAC AATAAGTACG GAAGATACGA GGATGGAGGA TTTGGGTGCT
TCCATGCCGT CCCTCGACGT CATATCGCTG GATCTTGAGG AAGCAGTCTT TGGTGGCATG
GTCCGCAAGC TAGCACGTAC CGATGATTCC AATACATCAC GAATGAATAA AAGTGCGGAT
GATAGCATGT TGGGCGTACG AAATTCGCGA CGGGGCTGTC GTCGAGGAAA AAGCAGCGTA
TCCAACAGCA TGATGCGGGA GGCCGCCCTG GCTATGGAAA AAGACGGAAA CACAAATCAC
CAATTAATCG CCGATGCTCT CAACAATTCG ATTCAAGACT TGCGCTCCGA AGGCTTGCAA
GTCCGCCATG TGCCTCGGAG AACCAAGAGC AATCAAGAAA AGTACGAGGC CCCAGACCCC
GTTTTTCCTT CCAATAAATC CGTCAACTCT GCTTCCAGCA AATCTCGTAC ACTCCCCTCA
ATTGCTAGTC TTTTCCCGGA AGAAAATGAA ATATTTCCCT CAAAAATTCC ACTCCGTCGC
CGGGTTGGTG TGCGAGTGAT ACAGCGAAAG CACAGCGGGG ACACAGATGA GGCTTTGGTA
GGCAACCCAG ATCTCTTTGT GCGATCTCTC GTGAAGGCAA TACAGTGA
 
Protein sequence
MDDTPRSDFS KAALLSAVRQ AAGSTPVRRN KMRSALDAKT APDKARKLIA PTVKSFALMR 
QVSRLGMDDP VYTLADRGVP NPANRIYDDM NVVDVPDDVL EQMSIASDPT AAFGNGIDVT
VFEKSLGDLQ PHFSMSQFSQ TESSVPVVPQ SPSPRSMATI GSNSYLSHIQ QSQMPSSFRR
SRPQEITIST EDTRMEDLGA SMPSLDVISL DLEEAVFGGM VRKLARTDDS NTSRMNKSAD
DSMLGVRNSR RGCRRGKSSV SNSMMREAAL AMEKDGNTNH QLIADALNNS IQDLRSEGLQ
VRHVPRRTKS NQEKYEAPDP VFPSNKSVNS ASSKSRTLPS IASLFPEENE IFPSKIPLRR
RVGVRVIQRK HSGDTDEALV GNPDLFVRSL VKAIQ