Gene PHATRDRAFT_37779 details


Gene Information

Locus tag: PHATRDRAFT_37779
Symbol:
ID: 7202761
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011682
Strand:
Start bp: 47592
End bp: 48662
Gene Length: 1071 bp
Protein Length: 356 aa
Translation table:
GC content: 49%
IMG OID:
Product: predicted protein
Protein accession: XP_002181991
Protein GI: 219123354
COG category:
COG ID:
TIGRFAM ID:
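
The length fields in this record can be cross-checked against the coordinates. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the coding sequence is unspliced and ends in a stop codon (consistent with the "Is gene spliced" field and with the sequence listed in this record). The function names are illustrative only and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose final codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


length = gene_length(47592, 48662)
print(length, protein_length(length))  # 1071 356
```

Both values agree with the Gene Length and Protein Length fields above.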


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value: 0.648541
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGCCGTA TTCCACTTGA GCTCCAAGCC ATCTTTGTTG CTTCGAACAA AGGAGAGGAA 
GTACACAAGC CGCCTTCCAA TAGAGCTGCT TTCAAGGTGT TCAAGAGCAT CTTCGGAGGC
TTTGGGCGAC GTAAGCGATC CACCGAGAAC AAAAGAGAGA ATTTGAAGCG CATTGCAAGC
CAGAGCTCAT TTGCCACAGT CAGTACTCTC GGGGTGGATG ACTTTAGCGG CAGCTCCCAC
AATGAGAACA ACGAATGTGC CCCCGTTACG CCGCACGAAC ACCTGGAAGT TTTATTGAAG
GCACGTGGCT ACTGTACCGA ACGCTATTCT GTTCTTCAAA CCGCATTCTT CAATCGACCT
ACACCCCTTC AACTCGCTTC ATACGATACC AAGCTTATAC AGCTCATCAA GAGCCAGGAT
GAGCAGAAAG TTCGAGAAAT TCTTGCCAGT GGTATCTCAC CTAACGCTTG CAATATTCAT
GGCGAATCTT TGATTCACAA GGCGTGTCGA TTAGGATATC ACCGTCTCGT CCGGGCGTTC
ACCGACTTTG GCGCAGATCT CGCAATCTCT GATGCCCAGG GACGTACGTT GCTACACGAC
ACCTGTTGGG GTGCCCGACC TTCGTTCCAA ACTTTTTCTC TTATCGTTGA TCGCCAACCA
GAACTTCTTT TTCTAGCTGA TTGCCGTGGT GCCTGCCCCC TCGAGTATGT TCGCAAGGAT
CACTATGTTT TCTGGATTGA GTACTTGGAC CAAATAGCGG ACAAATATTG GCCTTCAACT
CAGTCCACAC CCAAACTTTC GTATCTGGTA AAGCAAGAGC CGCACTCAAG ACAGATTGGA
GAACCAGGAA ATGCTCTCTC GTTGGAACTT GCCGCAATGG TTGCGTCGGG AAGACTGAGT
CCAGAAGAAG CTACATACTT GGCAAAAGGA GACGAAGAGG ATTCAGTCAG CGGTGAGGAG
GATTCATTGA GTGACGACGA CGAATCTACG TGGAACGAGG AAGACAACGA AGACGACGAG
CTACTTGCTG ACCTGTGTGG AATTCACAGT CTATCAAGCA TTCCCGTCTA A
 
Protein sequence
MSRIPLELQA IFVASNKGEE VHKPPSNRAA FKVFKSIFGG FGRRKRSTEN KRENLKRIAS 
QSSFATVSTL GVDDFSGSSH NENNECAPVT PHEHLEVLLK ARGYCTERYS VLQTAFFNRP
TPLQLASYDT KLIQLIKSQD EQKVREILAS GISPNACNIH GESLIHKACR LGYHRLVRAF
TDFGADLAIS DAQGRTLLHD TCWGARPSFQ TFSLIVDRQP ELLFLADCRG ACPLEYVRKD
HYVFWIEYLD QIADKYWPST QSTPKLSYLV KQEPHSRQIG EPGNALSLEL AAMVASGRLS
PEEATYLAKG DEEDSVSGEE DSLSDDDEST WNEEDNEDDE LLADLCGIHS LSSIPV
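
The sequences in this record are printed in space-separated groups of 10 characters, so they need to be joined before any downstream use. The following is a minimal sketch of that clean-up, together with a simple GC-fraction calculation. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and how the database itself derives the reported GC content is not stated in the record.

```python
def clean_sequence(block: str) -> str:
    """Join a sequence printed in whitespace-separated groups into one string."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence from this record, used as sample input.
first_line = "ATGAGCCGTA TTCCACTTGA GCTCCAAGCC ATCTTTGTTG CTTCGAACAA AGGAGAGGAA"
seq = clean_sequence(first_line)
print(len(seq), round(gc_fraction(seq), 2))
```

Passing the full gene sequence block through `clean_sequence` in the same way should give a string whose length matches the Gene Length field.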