Gene PHATRDRAFT_36165 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_36165 
Symbol 
ID7201301 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011677 
Strand
Start bp706577 
End bp707923 
Gene Length1347 bp 
Protein Length448 aa 
Translation table 
GC content49% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002180694 
Protein GI219119886 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGAAAGG CTCTGGAGCG GCGCATACAG ACCGGAAAAC TAAGACAGCT TCCCTCCCTT 
GGTACTTTCA CAAACGATTC TACGAAGCAT GCGTTGCCCA CAGGGAAAGT GGTTGATTTT
TCTAGCAACG ACTATCTGGG ATTGGCGCAC AGTACGACTC AACACAAGCT GGTTCAAAGG
ACTTACAACG ACCTCTTCGC AGAACGACCA CACGACGAAA CCGAAGCGCC CTACGCAACA
CGGGCCGTAC TCGGCGCAAC CGGATCGCGA TTACTATCCG GTGATAGTGG TATGTTTCAT
GATCTCGAGT CAAAATTGGC TCTACTGCAC CGTCGAGAAG CCGCACTATT GTGTAATTCG
GGATACGATG CCAACCTGAC CATGGTGTCC TGTTTGCCAT GTCGCATCAT TGTCTATGAT
GAATACATTC ATAACTCACT GCATATGGGG ATTCGCTTGT GGCAACAACA GTCATTGACA
AACAGTGCGG CAGACCAACA AAGCACTTTG CGTAAACAAA CGTTTTCCTT TCGTCACAAC
AATGTCGAAG ATTTGCGAAG CGTTTTGGAC TCGATTGTAC AAGATGCTCC TGAAATCGTC
ATTTTAGCTG AAAGCGTCTA CAGTATGGAC GGTGATGTCG CGCCGCTACA CTCCTTGCTT
GATGTGGCCT TAGAGTGCAA TGCCAGTGTC GTCGTAGACG AAGCACACGG TTTGGGTGTT
TTCGGTTTTC GGGGCTTGGG TGTACTATCG AAGGAACATC AAACACTCAA CAGCCACCCT
GCGTTACTTG CTTCCATCTA TACCTTCGGA AAGGCTGCTG GTTGTCATGG GGCTGTTATC
TGTGGCAGTA CAATCTTGAA ATCTTACCTT TTAAACTTTG GATACCCTGT AATTTATTCC
ACATCCTTGC CGATGCATTC GCTCGTATCC ATCGATTGCG CCTACGACAC GATGGCGAGT
ACTCGCGGCG ATTCGTTGCG CACTCATCTA TTTCAACTGG TGCAGGTGTT TCGATCACTG
CTGTTATCGG CACTGAATCT TCATGGAGCC TCTCGCACCG ACCTTGCTTT GTCACCGTCA
ACATCACCAA TCCAGGCGTT GCTTATTCCA GGCAACGCGA CTTGTGCCGC CATATGCGAC
ACCGTTCACC AGCTGTCACG TCAGCAGTTG CGTTTGTATC CCATCAAGTC TCCGACCGTG
CCAGTCGGTC AAGAGCGTAT TCGTATCGTT TTACACTCAC ACAATTGCAC TTCAGAAGTG
CAGTGGTTGG TACAACTTCT GACTCAAGCT CTGCAATCGC ATGGCTTGTT GAAGACAGGT
CCCAGCTCTA TACTCGCTAA GCTATAG
 
Protein sequence
MRKALERRIQ TGKLRQLPSL GTFTNDSTKH ALPTGKVVDF SSNDYLGLAH STTQHKLVQR 
TYNDLFAERP HDETEAPYAT RAVLGATGSR LLSGDSGMFH DLESKLALLH RREAALLCNS
GYDANLTMVS CLPCRIIVYD EYIHNSLHMG IRLWQQQSLT NSAADQQSTL RKQTFSFRHN
NVEDLRSVLD SIVQDAPEIV ILAESVYSMD GDVAPLHSLL DVALECNASV VVDEAHGLGV
FGFRGLGVLS KEHQTLNSHP ALLASIYTFG KAAGCHGAVI CGSTILKSYL LNFGYPVIYS
TSLPMHSLVS IDCAYDTMAS TRGDSLRTHL FQLVQVFRSL LLSALNLHGA SRTDLALSPS
TSPIQALLIP GNATCAAICD TVHQLSRQQL RLYPIKSPTV PVGQERIRIV LHSHNCTSEV
QWLVQLLTQA LQSHGLLKTG PSSILAKL