Gene PHATRDRAFT_43135 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_43135 
Symbol 
ID7196899 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011669 
Strand
Start bp2125408 
End bp2127685 
Gene Length2278 bp 
Protein Length587 aa 
Translation table 
GC content50% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002177451 
Protein GI219111399 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CAACGATCCC TGGGCAAGGA CGTGTGTGTC CATATTGATG TATTCGTGTG CCCAGACGAC 
AGAGAACGAT CATTGCGAGT CCAAGACGAC TACGGTAAAC AATTCCGACA AAGCGACACA
ATGAAAGTCA CGTAATGTCA GGGTGGAGGT TCCGCTGAAG GTAAGAAGAC GACGTGTGCG
GGAAAGATGC CATGGCTGAG TCCCCTGGAA ACAATGAAAG AGGGATGTTA CGTGTAGGTG
ATTAAATACG TTTTTAGATG ACGAGTCGGA GAGATCGTTT CCTGCGTTAT GCAGAGAAAT
ACTCGGTGTC GATCAAGAAC GAGGCATTTG TATGATACAC GTGGACGTTA CGATTGGTTG
GCGACAGGGG AGTGTTCAGA AGCCTGTCCC TCACGACCTT GTTGCTTTTA TCTCATTCGT
CAGTATCCAA ATCCGCCACC ATGTGGAAGA CCATCAGTTT TTTGGTTTGC GTCTCGATAC
TATACAGCAG TACGCTACCG CAAATACAAG GCTGGACCGG GGAGACAAAG CTGATGCGTC
GACCTGTGTC CGCACGGAGG CGTCGTGCAT GTGTTCCTCC TGGCCATAAC CACCACCACC
GCGATGCGTA CTCGCTGTGT TTGACGAATG CGGCGGACGA GTATAATGCC GACGCGTCGG
CATCTAAACG TCGACGGGTA CGTGGTCCCA TTCGCCGCGC CGCGAGTCCC ATCGCCGCAC
AAGCACAAGA GCGGGTGGTA TCTGCACGTG CCCGTCACGA ACAGGCCACA CAAGATCCCA
CGCTATTGAC CACTTACCGC TTTGACGAAC GCGCTGATCT TCATCCCGGC ACGAAACGTG
CCGTAACCGA GGTCATGGGA TTTCAACAAA TGACAGAAAT TCAGTACAAG ACATTCAATG
CAGCTTTGGA AGGCAAGAGT GTGTTAGGAC GAGCACGAAC CGGAACGGGC AAAACGTTAG
CCTTTTTACT ACCCGCTATC GAACGTTTAA TGTTCATGGA TGTGTCCGTA TACCGAGCTG
ACCGGAACGT TGGTATCCTC ATTGTTGCAC CAACACGAGA GCTGGCTATG CAAATCGGGA
GTGAAGCTTC GCGGTTGCTC ACGTTCGAAT CCAAATGGAG TGTTTTGACG CTCTATGGTG
GAACAAAAAT ACAGCGCGAT GTTGCCCTTC TTAACAGACA AATCCCTACC ATTTTGGTGG
CTACCCCAGG TAGACTTCTG GATCACCTGG AAGACACGAG ACTCCGTGGA CGAAAGTTCA
GCGACGTTGT TGGAGAGACG CCAATTGTGG TCCTGGACGA AACCGATCGG CTATTGGAAG
GCTTTGCTAA GGATACGAGA CGAATTCTTT CGTTTCTGCC AAGACCAGAA AAGCGTCAAA
CGCTACTGTT CTCGGCGACG GTACCCACAC GTCTAAAGCG CATTTTGGAC GAGATCCTAC
CGGCAGATTA TGTGGAAGTC GACTGTGTCG GGAACAACGA CAGTTCAAAA CAAACAAACA
AGAGGGTAAC GCAATCGTAT ACACTTTTGC CTGCTATGGA CTCATACGTG TCATATTTGG
TATCTATCAC CAAGCAAGCC ATGGAGGAAG AGAAGGATTA CAAAATCGTT GTGTTCTTTC
CTGCAGCACG ACTTGTTCGT TTTTTCACCC GTTTTTTCAA CGTCGGACTC GGAATACCGG
TATTGGAGAT GCATTCCAGA ATGTCCCAGT CGGCACGAAC CCGCATTAAT TCATCTTTTC
GCAACGCAAA GCGAGGAGTT CTGTTTACGA GTGATGTTTC CGCCCGCGGT GTTGATTTTC
CGGACGTAAC TCTCGTCGTG CAGGTAAGAG TCTTTCCCTT CCTTCGATTC CCCTTCCAAT
AGGGAATCAT TCTCACTGGT CGTTCTGTTT TCTCTAGTAT GGTGCCCCAA GCAACAAGGA
ACTTTACATA CACCGTCTCG GACGCACGGG GCGCGCTGGA CGGGAGGGTA AGGGGTTGCT
TGTACTGTTG CCGTTTGAAA AGAAGGCCTT GAAAGAGATT GATTTACAGC GATTGATATG
CGTTAATATT GAGGATCACA AGGATCTCAT GGACAAAGTT GATTTCGCCC AGAATCTGGT
TCGCAGCGGA CATCCCTTGT TGACGCCCAA TGCTGAAGCC GCATATCTTG CATTTGTTGC
ATACTACATG ACCAGCAAGG GAATGGGGTC CCGTGATGAC GTGGTGGATG CGGCGAAAGT
TTTTGCACAA ATAATCGGTC TACCCAAACT TCCTGATTTG TGGGGAAAAT TGCAGTAG
 
Protein sequence
MWKTISFLVC VSILYSSTLP QIQGWTGETK LMRRPVSARR RRACVPPGHN HHHRDAYSLC 
LTNAADEYNA DASASKRRRV RGPIRRAASP IAAQAQERVV SARARHEQAT QDPTLLTTYR
FDERADLHPG TKRAVTEVMG FQQMTEIQYK TFNAALEGKS VLGRARTGTG KTLAFLLPAI
ERLMFMDVSV YRADRNVGIL IVAPTRELAM QIGSEASRLL TFESKWSVLT LYGGTKIQRD
VALLNRQIPT ILVATPGRLL DHLEDTRLRG RKFSDVVGET PIVVLDETDR LLEGFAKDTR
RILSFLPRPE KRQTLLFSAT VPTRLKRILD EILPADYVEV DCVGNNDSSK QTNKRVTQSY
TLLPAMDSYV SYLVSITKQA MEEEKDYKIV VFFPAARLVR FFTRFFNVGL GIPVLEMHSR
MSQSARTRIN SSFRNAKRGV LFTSDVSARG VDFPDVTLVV QYGAPSNKEL YIHRLGRTGR
AGREGKGLLV LLPFEKKALK EIDLQRLICV NIEDHKDLMD KVDFAQNLVR SGHPLLTPNA
EAAYLAFVAY YMTSKGMGSR DDVVDAAKVF AQIIGLPKLP DLWGKLQ