Gene PHATRDRAFT_32852 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_32852 
Symbol 
ID7197477 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011670 
Strand
Start bp942755 
End bp944006 
Gene Length1252 bp 
Protein Length291 aa 
Translation table 
GC content48% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002178039 
Protein GI219112575 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.0416343 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTTTCC GTTCCAACGG CTTGCCAAGA CATCGAAACG GAGTCTTCTT AATAAGTAGT 
TTGTCTCTGT TACAGTAATA TACACGCCGG TCGGATTGCT TCCCGTCACG AGCTGTGCGA
TGGACGAAAC GACGTAGCTG CAGTAACTGT GAAGGGCCCA GCCTCCTTAC GGAAGTATTT
GTGTTTGCAG TATTATACGA CCCTACACAT ACCCGCCCCT CTCATCCTTT ATGGCTGAAT
GAACCGAATG AAGTAGTAGT CAGGCGATGA TCACGCCGTA CACTTGACTC CAGTACGTTC
TCGCGCAGCG ATCGGTTCCC AAGCTGCGTC TACACCGACA CCTCTTTGGC CTCGGAAGTC
CGGATTCGCC ATGAGCAAAC GGCTATTACC GACGAAAGAA GATGATTTCT GGACTTTGGC
ATTGGACGAA AGATTGAAGA ATACGCAGAT CCTTCCTGCG GGTGAAGGTG TTGACTATTT
TAATGCCACA ACGCGCTCCA TGTTGTATAA TATACCGTAT GGGGAAAGCA TGATAGTAAC
CTTGCCCCTC TCGGACCTCC CCATGATTGA CGGCGCCTGG AGTCCAACTG GTTCCCAAGC
ATGGTATGCT TCCGCATTAC TATCTGCCAT TCTTCTACAA GAATCGGACG AGCGTATCGT
AGGGATATTG TGTAGGTCAG AGTCACTGTC AATTCTTGAG TTGGGAAGTG GAGCGGTTGG
GCTTTCTGGG ATCGTGTCCA ACTTGCTGTT GAGCCGACGG CCGGGGACCC ATCGTGTCTA
TTTAACAGAT CGCGATCCAA ATATTTTGAA GCAGCTCGAG CAAAATGTCA TGCAATATAA
CGAACACCTA AGAAAGCACT ATCCCGCGAT AAAAGAAGAG CATATGGAGG TCCAAAATTT
GGATTGGAAT GACGGATCGG CATGCTCGCG CTTAAAAGAC TTGGATCTAG TCATTGGATC
TGAGCTGGTC TATACGCTAG AGACAGCCAA AGGCTGCGCA TCTTGTGTAC AAATTCTTCT
TAAAAACAAT CCCAATGCAG TGGTAGTAAT TGTGCAAGTG AAGGATCGAG ACGGTTGGAG
CAATATATTG GTTCCGACAA TGTTACTTTG TGGATACCAA GTATCCGAGG AGAGCATCCC
AATCGGATGT GACGAAATAG CTAGCACTAT GATGCAGCAC AGAGGGATTT TGGATCAATC
CCAGTTTACA GCCTGTTTTA TTTCAACACC GAGAGTAATT GGATCCGACT AG
 
Protein sequence
MSFRSNGLPR HRNGVFLIIR SRAAIGSQAA STPTPLWPRK SGFAMSKRLL PTKEDDFWTL 
ALDERLKNTQ ILPAGEGVDY FNATTRSMLY NIPYGESMIV TLPLSDLPMI DGAWSPTGSQ
AWYASALLSA ILLQESDERI VGILCRSESL SILELGSGAV GLSGIVSNLL LSRRPGTHRV
YLTDRDPNIL KQLEQNVMQY NEHLRKHYPA IKEEHMEVQN LDWNDGSACS RLKDLDLVIG
SELVYTLETA KGCASCVQIL LKNNPNAHRG ILDQSQFTAC FISTPRVIGS D