Gene PHATRDRAFT_37440 details

Gene Information

Locus tag: PHATRDRAFT_37440
Symbol:
ID: 7202359
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011681
Strand:
Start bp: 154002
End bp: 155126
Gene Length: 1125 bp
Protein Length: 271 aa
Translation table:
GC content: 46%
IMG OID:
Product: predicted protein
Protein accession: XP_002181669
Protein GI: 219122681
COG category:
COG ID:
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGTTAATA AAGCCAAGGA ATGGTATCCA AAGATTGGAG GAACCTACAG GCTGATGGAT 
GACGTACGTA CCGTCAGTGA AAACTTATAT TTACGACCAG TGTCACAAAT GCTAGAGAGC
TCTAGCTAGA GTAGACACAC AATTCTTATG TGACTTTATT AGTACCCTCG TGTCAAAACT
CATCCGCGTT TTTGTGGACC AAATACTTTT GTTGCTGCTG CCTATATTAC TTTTCCTCAA
CAGCGAGAGG TCTTGCTATA GCCATGAGGC CCGTTTTGAA GAAGCGAAAA TTCAAGGAGG
TCATCGATTT GTGTTCCGAG GACGAGAGCG TTGAGACCGG AAAGCGCTCG GCATGTTCGA
CCGTAAAGTC GGGGGTGCGC CCAACTGTTC CAATCGACGA CGGTGACTTG ATCGAGATGT
CTTACAAAAA GGATAGAATA AATCACAGAC CGCTTGTCTA TATCGATACA GAAGGGGGGG
ATGACAGTAT CGTGTCAGAA GACTGCTACT ATCCGTTGAT GGAATGGAAT AAGGGGGCGG
CACGGACCTG CGCCGGCGGC GATGAAACCA TCAAACGACC CAGCTTTTAT CTCCTTCATA
TCCAACAACG CGACAAATGG AGCTGTGGCT TTCGCAATTT ACAAATGTTG CTAGTTACTT
TGGTTCCCTC TCTACCACCC GAGCATATTT ATTTTGATAA TGGCAGCTTG AATGGACAAG
CTTTTCGGGT GCCTTCTTTG CAACAGCTCC AAGCTTCTTT GGAACGGTCG TGGCGTGCCG
GATATGATCC CGATGGTGCA CAGCACTACC AGAATCGTAT TTTGGGTAAA AACTCGAAAA
TTGGAGCAGT CGAAGTTTCG ACAACCTTGT AAGTTTGCTG ATCGAGTGGC CGAGGCTACG
ACTACAATTG ATGCTGACCA AGAGATTTTC ACCAATTTAT AAAATCAACA GATCATTCAT
GAGTATTGAC TCGGTCGTGG TACAATTTAT AACTTGCCCA GAGTCTCGGG GCAAGTTGGG
GCCTTTTTGC TCTACGTACT TTGGAAAGGA AGCGGACTGC TGTCCCTTTT GCCGGACCGT
CGCGATCCCA TCGTGTTTAA CTATTGCCAA GCAGGTTGTC AATAG
 
Protein sequence
MVNKAKEWYP KIGGTYRLMD DVPRGLAIAM RPVLKKRKFK EVIDLCSEDE SVETGKRSAC 
STVKSGVRPT VPIDDGDLIE MSYKKDRINH RPLVYIDTEG GDDSIVSEDC YYPLMEWNKG
AARTCAGGDE TIKRPSFYLL HIQQRDKWSC GFRNLQMLLV TLVPSLPPEH IYFDNGSLNG
QAFRVPSLQQ LQASLERSWR AGYDPDGAQH YQNRILGKNS KIGAVEVSTT LVSGQVGAFL
LYVLWKGSGL LSLLPDRRDP IVFNYCQAGC Q
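
The listed lengths and GC content can be checked against the sequences with a short script. The following is a minimal Python sketch, not part of the original record. The helper names (`join_blocks`, `gc_percent`, `span_length`) are hypothetical, and it assumes that the Start bp and End bp values are 1-based inclusive coordinates, which is consistent with the listed gene length of 1125 bp.

```python
def join_blocks(text: str) -> str:
    """Remove the spaces and line breaks used to lay a sequence out in 10-letter blocks."""
    return "".join(text.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string (0.0 for an empty string)."""
    if not dna:
        return 0.0
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive start and end coordinates."""
    return end_bp - start_bp + 1


# Coordinates as listed in the record (assumed 1-based, inclusive).
print(span_length(154002, 155126))  # 1125

# First line of the gene sequence, copied from the record with its block spacing.
first_line = "ATGGTTAATA AAGCCAAGGA ATGGTATCCA AAGATTGGAG GAACCTACAG GCTGATGGAT"
dna = join_blocks(first_line)
print(len(dna))          # 60
print(gc_percent(dna))   # GC percentage of this first line only
```

Passing the full gene sequence block to `join_blocks` and then to `gc_percent` gives a value that can be compared with the listed GC content. The same `join_blocks` call on the protein block gives a string whose length can be compared with the listed protein length.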