Gene PHATRDRAFT_41094 details

Gene Information

Locus tag: PHATRDRAFT_41094
Symbol:
ID: 7198889
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011695
Strand:
Start bp: 335619
End bp: 336755
Gene Length: 1137 bp
Protein Length: 378 aa
Translation table:
GC content: 53%
IMG OID:
Product: predicted protein
Protein accession: XP_002185016
Protein GI: 219129691
COG category:
COG ID:
TIGRFAM ID:
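
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based inclusive coordinates and that the CDS ends in a single stop codon (the gene sequence in this record ends in TGA). The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(335619, 336755)
print(length_bp)                  # 1137
print(protein_length(length_bp))  # 378
```

Both results agree with the Gene Length and Protein Length fields in this record.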


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCTGACC TTCCCAACAC TGCGCAAGTG CTGCACGAGG CCATCTTGAC CAACTCTCTC 
GATGTCGTGC AGTCCTATCT TGGCGCTGCC GGTGTGGATC CCAACCAGTC CCCGCCGCTG
TGTCCTCTCG CTCAACTGAA GCGAGCCCTA CGGCGTCCCG ACGCCGTGGC TCACGATGTT
GTCGTCGTCG CCACTCAAGA TGTCTATCGC GTTTCACCGT TGCACGTGGC TATATTCAAT
TGTTACCACA ATCACGGAAG CCAAGAAGAT CCTACTCCCC GCGAGACTGC CTTGGCGATC
GTGCAAGCCT TGGTGGACGC CGGGGCCGAT ACTACACTTG TTGCGTCCCA CATTGCCGTT
GTACAGAAAA GAGATTCTTC ACTCATACGT ATACAAGATC GGGCGCCGAT AGGATTGGCG
CTGCTTTTGA AACAGAACGC CCGCGGTCCG GACGAAGTGA ATATGGTAGA AAGTCTGGAT
GCGGCTATGC AGTGTATCGT GCCGAGAGCG GATGCTAGGG ACGCTTCCGG ACAAGACGAT
GTTATTGAAA CCATGACAGT ACCCGAGTCC TTTGCCCAAA GTTTTGGGTC GCTTCTATTT
TCACAGGAAT TTAGTGACGT CAAGTTTGTC TGCAAGGATG GCACGGTGCT CCACGCCCAT
CAGAATATCC TGGCCGCCGC GAGCTCGTAT TTTCGAACCT ATTTTCAAGG ACCGTGGGGA
ACTCTTCACA CCGACGGTTG CTGGAAAACC GAAATCACCC CCGATGTATT GCGTGCCGTA
CTCATGTTTG TTTACACGGG GAAAGTAGAC GATGATCTTT TGGAAGAAGA GGCAAAAAAC
ATCATTTCGG TGGCTCACGA ATACGAGCTC TTTGATCTGC AATTGCTCGC CCAATCATCG
TGCGTTGCCA ATCTGTGTCC CGAAAACTCG AAAGAAATGC TCCAACTCGC AAAATTGCAC
GAGTCTGATA CGCTTAAAGA CGCCTGTTTT GACTACATCA AAGAAAATAT GGCCGCTGTC
TTGATGCATC CGACCTTTGT TTCTTTGGCC GACGAGGACT CCGCGTTGTG GGCCGAGCTG
AACGAATTCT TGCAAGAGTC CCCCACACAA GCGGGCAGGA AACGATCGCG AAGCTGA
 
Protein sequence
MADLPNTAQV LHEAILTNSL DVVQSYLGAA GVDPNQSPPL CPLAQLKRAL RRPDAVAHDV 
VVVATQDVYR VSPLHVAIFN CYHNHGSQED PTPRETALAI VQALVDAGAD TTLVASHIAV
VQKRDSSLIR IQDRAPIGLA LLLKQNARGP DEVNMVESLD AAMQCIVPRA DARDASGQDD
VIETMTVPES FAQSFGSLLF SQEFSDVKFV CKDGTVLHAH QNILAAASSY FRTYFQGPWG
TLHTDGCWKT EITPDVLRAV LMFVYTGKVD DDLLEEEAKN IISVAHEYEL FDLQLLAQSS
CVANLCPENS KEMLQLAKLH ESDTLKDACF DYIKENMAAV LMHPTFVSLA DEDSALWAEL
NEFLQESPTQ AGRKRSRS
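
The sequences above are displayed in space-separated blocks of 10 characters, which must be removed before any analysis. The following is a minimal sketch of how the gene sequence could be cleaned, translated, and checked for GC content. It assumes the standard genetic code (NCBI translation table 1), since the Translation table field in this record is empty. The helper names `clean`, `gc_content`, and `translate` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code (assumed), codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(blocked: str) -> str:
    """Remove the spaces and line breaks used for display and uppercase the result."""
    return "".join(blocked.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a non-empty sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence in this record.
first_blocks = "ATGGCTGACC TTCCCAACAC TGCGCAAGTG"
print(translate(first_blocks))  # MADLPNTAQV
```

The translated fragment matches the first block of the protein sequence. Passing the full gene sequence to `gc_content` would give a value to compare against the GC content field.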