Gene PHATRDRAFT_33814 details


Gene Information

Locus tag: PHATRDRAFT_33814
Symbol:
ID: 7197858
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011672
Strand:
Start bp: 413171
End bp: 414307
Gene Length: 1137 bp
Protein Length: 378 aa
Translation table:
GC content: 48%
IMG OID:
Product: predicted protein
Protein accession: XP_002178222
Protein GI: 219114853
COG category:
COG ID:
TIGRFAM ID:
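
The length fields above can be cross-checked against the coordinates. The sketch below is illustrative only: it assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS is unspliced and ends in a single stop codon. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS that ends in one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


length = gene_length(413171, 414307)
print(length)                  # 1137
print(protein_length(length))  # 378
```

Under these assumptions the computed values agree with the Gene Length (1137 bp) and Protein Length (378 aa) fields.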


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000658856
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATCGACG AGGAGGAAAA GCTTGCCAAG ACCTCAATGA CGACGTCCAT TGATACCTCC 
GCAGCCAACA AACCAGCGAC CGAATCGCCT GAAATCGAAT CGTTACTGAG CCCCGTGCGA
CAGAAGAGAG AATGGCTAGA AAACAAGTTC AAGGAAGACT TTTTGGCGAA TCGGCCACGA
CTTAGTCCGG GACATGAACT TGTGGAGGCC CGCCACAAAT GGTTACAAGA AGAAGCCCGT
CGGAATCGGG AAGCAGTTAT ACGCCTGAAT GACATTGAAG TCTCGCAAGA TGTCCTGGAA
GCCAAGAAAA AGTGGCTTAC TGAAGACGAA CGGATCGTCC AGGAAATGCG TGTGCATGTG
TTCTCCGAAA ATCCAGCTGG AGATGGCAAC GTTTCGGGTG AACCCGAAGT TGATTTTCGG
GCGTATTCGT CTAAAAGTGA GCGACGGGAT TTGCCCAATG ATAGAGAGAC AGAAACCAGC
TCTAATGAGG ACGAGCGCGA AGATAACTTT GAGGACTGGG GTAATCTGTT GGAAAAGATT
TTTACCGACG AAATGGTCTT TATCGATGAG TCCGCCGATG ATTATATTGA GGTAGCTCAA
TCGAGGGCGG AAACACTAGC TCCCATGCCA GGTGGCATGC CTTCATACCC ACTTTTCGTT
GAAAGCGACA CGTCGGGAAT CTCGAATAAA AATCATTGTG TTTCAGAAAG TTCAGATGTC
CGCAACGAGC GGACGACAGT GGAATGCGAA TTATTTCCGG GTGAATTGGC TTATCCCGAA
AAATCGAGGC GGGATCCTGA AAAGACGGAG CTTTCGTCTG GAGAGACAGC TAATCTGGCA
GTGCTCAAAG CCGAAAAGAT GACATCTGCG AAGAAAAAAG ACCTAGAGTC TGCTGAATCG
TACTTAGCAA TTGCCGATAA AGTCTACTGG GAAAGTATGG CACTTTTGGA TCTCACTAGA
GGTCAAATGC ATGCCTCTCC CGCGAAATTG TCGAAAGGGG AAATTTTGTT GTCGACAGAA
AAGGCAGGAG TGGATAGTAA CAAGATTCGA GTTGTGGAAA AGGAACCCCT TGTTCCGCTC
AACACGGACT TTGGAGTAGT GGAAGCGCGT TGTCTTGAAC GTTGTGTCAT TTCTTAG
 
Protein sequence
MIDEEEKLAK TSMTTSIDTS AANKPATESP EIESLLSPVR QKREWLENKF KEDFLANRPR 
LSPGHELVEA RHKWLQEEAR RNREAVIRLN DIEVSQDVLE AKKKWLTEDE RIVQEMRVHV
FSENPAGDGN VSGEPEVDFR AYSSKSERRD LPNDRETETS SNEDEREDNF EDWGNLLEKI
FTDEMVFIDE SADDYIEVAQ SRAETLAPMP GGMPSYPLFV ESDTSGISNK NHCVSESSDV
RNERTTVECE LFPGELAYPE KSRRDPEKTE LSSGETANLA VLKAEKMTSA KKKDLESAES
YLAIADKVYW ESMALLDLTR GQMHASPAKL SKGEILLSTE KAGVDSNKIR VVEKEPLVPL
NTDFGVVEAR CLERCVIS
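
The sequences above are printed in space-separated blocks of ten, so they need to be joined before use. The following is a minimal sketch of how the gene sequence could be cleaned, its GC fraction computed, and its translation compared with the protein sequence. The Translation table field in this record is empty, so the standard genetic code is an assumption here. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical.

```python
from itertools import product

# Standard genetic code in TCAG order (assumed; the record leaves
# the translation table unspecified).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq_text: str) -> str:
    """Remove the spaces and line breaks used for display; uppercase."""
    return "".join(seq_text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon.

    Raises KeyError if a codon contains a character other than A, C, G, T.
    """
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First row of the gene sequence as displayed in this record.
first_row = clean(
    "ATGATCGACG AGGAGGAAAA GCTTGCCAAG ACCTCAATGA CGACGTCCAT TGATACCTCC"
)
print(translate(first_row))  # MIDEEEKLAKTSMTTSIDTS
```

Passing the full gene sequence to `clean`, then `gc_fraction` and `translate`, allows comparison with the GC content field and the protein sequence listed in this record.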