Gene PHATRDRAFT_44286 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_44286 
Symbol 
ID7198003 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011672 
Strand
Start bp127584 
End bp128787 
Gene Length1204 bp 
Protein Length398 aa 
Translation table 
GC content47% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002178170 
Protein GI219114749 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GCAAAGCATG TTTCGTCGCT GCCCAACGCA TGACAAACAA GGTCAAGATC TTTGCATAGT 
CGAGACAAAG AGGTTTCAAG GCATGATTGT ACCGCGACAA GATCAAGCTG TGCAACCGAC
GACTCATGAT GTGATATTTG GCAGAGGTCG GCGGTATGAC AACCATCCCG GTAATGTGCA
ATTCCATCAC ATTGTTAAGA ATTACATATA CCAATATGCG ATGGCAGCGA ATCGGGCCGA
GAAAAATGCT GTTGCGGAGT CCATCGTCAA CAACGTTTGT GTGACTGGGC GCTTTCTGAA
GTATGACGAA TGTTGTGATG GTTGGATTAT AGCCGATCCA GATTCTACCT GGAAAAAGGT
AAGACAAGCT CTACGATATC GCGGAAAAGG CTTTACAACA CAAAGGAGAT ACGAAATAAT
AAACTTCACA GCCTCCGATC TTGAGAATCT TCAAACCGTA TCGCAAACAA ACGCAAAGGA
AAGGGAAAGT ACAAACACGG CCCGTCCCGC AGTCTCACAA AATACAGGCG ATGCATACAT
AAGCGACCTT TTGTCTGACT CCTTTGTTGG CAATTCAGTA TACTATTCTT CATCTACTGG
GAGGGCATAT GGAGAAACAA AAGGGGAAAT GGAATTGTGC CAAGGAACCC CTGTGATTGT
AGAAGAGCGG GGTGCAACGA GCAGTTTCAG TTGTGACACT GCAGAAGACA AATATACCAA
TAAGGACAAG TTCTTGAGTG GAGCACCCCC CAATAAGCGT CTGCGTTTCG AGGTTGACGA
CGATTACCAG CAGTCGTCAT GTAAATATAG ATGTCCATCA TGCAATTGGA ATCAAGAACT
CTCGAAGGCT TACCCAGGAA TCAGCCTCGA GCCTACCCCG ACAGTCCGCT GGCAGCAGTC
CACTACATTG AATCCTCCTG CAGATATTCT CGATGAAGTT GACTTTCTCG ACTTTGACAT
ATCAATGTGG AAAGCGACGG AAGAAGCCTT GACCGTTGGC GACTCAGAGG GGCTACTTTG
CCCAGAGGAT GACTCGGACC TGAAGACAGC AACCTCGCAT TGGTATCCTT TTACAGCTGT
CCATACTGAC TTGGCTGACC AATCTCCTTG CGAACATCTT TTGACGGACG AAGAAATTCT
CCGTGCCATT GATTGTGATT GCGCAGAAAA CGTCCAAAAT GTTTGGGAGA GTGAGACAGG
CTAG
 
Protein sequence
MFRRCPTHDK QGQDLCIVET KRFQGMIVPR QDQAVQPTTH DVIFGRGRRY DNHPGNVQFH 
HIVKNYIYQY AMAANRAEKN AVAESIVNNV CVTGRFLKYD ECCDGWIIAD PDSTWKKVRQ
ALRYRGKGFT TQRRYEIINF TASDLENLQT VSQTNAKERE STNTARPAVS QNTGDAYISD
LLSDSFVGNS VYYSSSTGRA YGETKGEMEL CQGTPVIVEE RGATSSFSCD TAEDKYTNKD
KFLSGAPPNK RLRFEVDDDY QQSSCKYRCP SCNWNQELSK AYPGISLEPT PTVRWQQSTT
LNPPADILDE VDFLDFDISM WKATEEALTV GDSEGLLCPE DDSDLKTATS HWYPFTAVHT
DLADQSPCEH LLTDEEILRA IDCDCAENVQ NVWESETG