Gene PHATRDRAFT_49755 details

Gene Information

Locus tag: PHATRDRAFT_49755
Symbol:
ID: 7198342
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011692
Strand:
Start bp: 173995
End bp: 175161
Gene Length: 1167 bp
Protein Length: 368 aa
Translation table:
GC content: 50%
IMG OID:
Product: predicted protein
Protein accession: XP_002184501
Protein GI: 219128609
COG category:
COG ID:
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
AAGGTCATCG GAGACGTAGA ATGACAATTG ATGATGAGGA GACTGACAAA TCATCCACCC 
AGAGTCCTAA GAAGGCGCTC CAGGTATTCA TTCGACGACC ACACAAGAAA GGCGTGCACA
AGTTGGAACA CAAACGTTCC AGCTCGCCTT CAGTAGCTTC CCGTAAGCGC TCGGGACAAT
TGGTGATCAG CAGCAACCAC GGCTTACCTT CCATTGGTCG GAAAGTTTCA GAAATACGGA
AACCTGTACT GACATCGGTG CACAGATCCG AGATCGATAC CAACGATTTG TTCCGAGAGA
ACGACGATTC CGTCTTCCCG GATGTTGCCT GCGTGTCCGA TACTCTCTTG GCAATTCAGA
GCCTAAAGAA CGGCAGTAGA TCCCAAACGA TTGCGATACC TCTTACGCAA GCGCCTGGAA
GACAAGCACA GCAACGAGCA CCGCATTATA TACACGGAAT TCTCGAATGC CAGCTCTATA
TTCTGATAAA GGATGCGAAC AACCATTTCC CATCAGTCGG AGGTACAGTA CCGTCTTTGC
AAGTTAGTAC AGAATTGCCG CAACTACTAC GGGCGAATAA ATTGCGGAGA CTTTCCTCCA
CTACGCAATC CTCCCATCCG CTGACCATCT TGCTGGAAAC GAACGATTAC GTTCGCGCCG
TGTGGGATGC GCACCACCGG TACCCGGGTA ATGCCGCTGC CACCGAATGG TTTCTCAGCA
TTCTCCCCAA ATGTACCGGG CTCTGCATTC CAATAATACA GCTAGAAGAA CTGTACCGTA
ACAGTTCCGT GGAATGCAGC GAGCCCCTGG AGTCCATTCT TAAGCAATTG CAGCAGATGC
AAGTGCTCAT GGCGTCGCAT TCGTCCGGCG TTTTTCAATT GTGGCTGCCA TCTTGGGGTC
TCGTTTTGAA CGCTTGGGAG GCAGCCCGCA GAAAACTGCT TTTGCAGCTC AAGCAGAGCT
CGTTCCAGGA ACGTTCCGTA CAAGCTTTGC AACAAGAGTA TAGCCCAATC GACACGAAGC
TTCTGATTGA CTGGATGGTC GACCAGGGCG AAGTGCAATT CCGAAAACGA CCCGCAGGCG
TCTTTGTCAA ACTCCTGGCT TCTGATGAGT CTTGGTCGAC AAGATAATTC AGAGAGACCA
ACAACTAAAT TTACGTGTTA TGTTAGC
 
Protein sequence
MTIDDEETDK SSTQSPKKAL QVFIRRPHKK GVHKLEHKRS SSPSVASRKR SGQLVISSNH 
GLPSIGRKVS EIRKPVLTSV HRSEIDTNDL FRENDDSVFP DVACVSDTLL AIQSLKNGSR
SQTIAIPLTQ APGRQAQQRA PHYIHGILEC QLYILIKDAN NHFPSVGGTV PSLQVSTELP
QLLRANKLRR LSSTTQSSHP LTILLETNDY VRAVWDAHHR YPGNAAATEW FLSILPKCTG
LCIPIIQLEE LYRNSSVECS EPLESILKQL QQMQVLMASH SSGVFQLWLP SWGLVLNAWE
AARRKLLLQL KQSSFQERSV QALQQEYSPI DTKLLIDWMV DQGEVQFRKR PAGVFVKLLA
SDESWSTR
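
The gene length (1167 bp) exceeds what 368 residues plus a stop codon require (1107 bp), so the displayed gene sequence appears to include about 60 bp outside the coding region. The following Python sketch checks the arithmetic and translates the first line of the gene sequence from its first ATG. It assumes the standard genetic code, because the Translation table field of this record is empty; the function name `translate_from_first_atg` is illustrative and not part of any database tool.

```python
from itertools import product

# Standard genetic code (assumed; the record leaves "Translation table" blank).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate_from_first_atg(dna):
    """Return (0-based ATG offset, peptide) for the first ATG in dna.

    Translation stops at the first stop codon or when fewer than
    three bases remain. Spaces in the input are ignored.
    """
    dna = dna.replace(" ", "").upper()
    start = dna.find("ATG")
    if start == -1:
        return -1, ""
    peptide = []
    for i in range(start, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        peptide.append(aa)
    return start, "".join(peptide)


# Coordinates and lengths as given in the record.
gene_length = 175161 - 173995 + 1   # inclusive coordinates
cds_length = (368 + 1) * 3          # 368 codons plus one stop codon
flanking = gene_length - cds_length

# First line of the gene sequence as displayed above.
first_line = "AAGGTCATCG GAGACGTAGA ATGACAATTG ATGATGAGGA GACTGACAAA TCATCCACCC"
start, peptide = translate_from_first_atg(first_line)

print(gene_length, cds_length, flanking)
print(start, peptide)
```

The translated fragment can be compared with the first residues of the protein sequence above (MTIDDEETDK SST). Pasting the full gene sequence into `first_line` extends the same check to the whole protein.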