Gene PHATRDRAFT_45954 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_45954 
Symbol 
ID7200831 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011676 
Strand
Start bp765913 
End bp769235 
Gene Length3323 bp 
Protein Length1041 aa 
Translation table 
GC content50% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002180113 
Protein GI219118691 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.340544 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTACATTG ATTGTGGCCG TGGATACAGG ACAGTGGTCA ACGAACCAAT ACAAAATAGT 
CCAGAATTCG TGCTGACCAC CGCTAATGTA AGTGGATCTG AAATCAATTC TGGGTACGCG
ACATCTCTAG GATATCTGTT CTGAGTCCAA AATTTACTTA CATCAGGGCC ACTCCCAGTA
TAATCGCTAT GTTTTTCGGG TTCCCAGAAT TTCAAGATGA TACGGCTCGA AAAGGCGTGT
ACCAGTTACT GTATACGGTG TGTAAAACTC TGCGCTGACA ACAGTCAACA GAGCGACGAT
CCCGTCGTAT GTTCATGTTC CTCTTTGGAG ACGAGGGATG TATTACTTCG ATCATGACGA
GCAATAGCAA TAGCAATACC AATACCACTA CTTCTACTCC TGCCACTTCC GCCTCTATCC
CGCTAGACCT TACGAGTAGC AGTAGTGCTC CCCGCAGTCA CAATCGTCGC TACTTGTGTG
CGGCCCCACG TCCGGATCAG TACGACGAGC ACGCGTCGCC AGTGTTGCAA CGTCTGGGTC
ATACCCGTGC ATCGCGGAAG ACTTTGCTGG AAGCCACGTT GCGGGCCTTG CAAACGCGAC
ACAATCTGGA ACGGAAAACT GGGCCGGGTA GAAAGTCGAA CGCTGAGCAC GCCATTCATT
TTTACGAATA TTTATTGGAG TTGGTAGAAA ATGGCGAGGA GGAGGGTGAG GACATCGGGG
ACGCCGACTC TATTTCGAGC GCTAGTAATA AGGATGAAGC AGAGCACGAG AGAAATGTAG
AGTCTGTAAA CAAAAATACG CTCAAGGACG AGAATAACGC GACCGAAGCA CAAGAGGATT
CTGTCAAATC GGAATTAGAC GAAGCCTCTT CAATGGACAC GGCAACCTTT GTGAAAGCTC
ACAACGACCT TTGCGAAGTG TGTGACGAAG CCGGCGATTT GCTCATGTGT GAAACCTGTA
ATCTTGTCTT TCACGTTGCT TGTGTCCGTC CTGCCTTGGA AACGTTGCCG GAGCAAGATT
ACAAGTGCGC GTATTGTGTC CTGTCGACAG AACCGAAGAA TAGTAGACCA CGTAAGCAGG
CCGCCGCGGC AGTTCGGCTT ATGGCGCGTT TGCGTAATCA GTTTCAACGA AACAAAAGAC
GTGGACGAAA CGATGCAGGA GACAGTAAAC GCGACAACGA GGACGAGGAC CAAATCGTCA
GCAGTGATCT GGGCGACAAG CCTTCCGAAG CATCAGGCAA CGAGGAAATA TCGAAGTTAG
AGCCATCGGC CGGAACGGAT GAGGAGAAAA GCAGTGAAGA AGAGAAAGAC AAAACGAACG
ATACTAAAGA CACAACGAAA GAAATAGCGG ACCCACGGAA GGTGACAGAA GACAGAACGA
ATATAGAGGA AGCCACACCG CTTGCTTCCA ACAAAAGGCG TAAGCTGGAG CTCTACAGGA
TTACTGATGC CTTTTCGACT CCTGAAAATA TGGAAGACGG ACGATCTAAA CGCAATCGTA
AGCAACCAAT GCTTTACGAC CCTCAGGCGG GACCGGCTCG CAAGTGGCAA TCGGACGAAC
CTAAGTACTG GAATTCTGAC TCACAGTCAG ACAATTCGAT TTCTGGTGAA AGTGAAACCC
GCGATGCTGA CGAAAAAGAT GCATTTGGTA CTGTCGTGAA AAAGGTAGGA TCGGCGCGCA
AAGAAAACGG TGAGATACAC TGCAGCTTTT GTGAAGACGA CCCTTTTATC GAGCTATGCT
GCTTCTGTGG CTGTCGTGTA TGTTTTGGGA AGCATCATCA ATCAAAACTG TTGTTATGCG
ACGAATGTGA CGACGAATAC CACACTTTCT GCCTTAGCCC ACCGCTTAAA TCACTGCCAG
CGTCCAATGC AGAATGGTTC TGCCCTTCAT GCTCCGTTAG CCAACAAAGA CGACAGATCA
CTACGAAATC GCTGTCGTCA CGTATCGGGA CCCGGGGTGC CATGAGCAAG TCCCCAAGTC
CAACACGTAT ATCCTCTCGC CAAGCCGCGA CCAAAGCGTC GCACTCGACG TCTTTGACCG
AACACGTTAA GCGCGGTCCA GGGCGTCCGC GAAGCAAGGA TCGAATTCTT ACAGTGGCCG
TAGGCAAGAA GCGTGGACGA CCGCCCAAGT CAGCATCGCA AGGCAGCCCA GACAAGAAAC
GTGCAAAGAC AGAGCCGACA ACGAAACTTT CTTCACAGAC ACCAAAATCG GATGACGGGA
AAGCACGGAG TAAGTCTTTG GTTGGCGCCG TCTCCGATCA TGACCCGGTC ACAGACTATA
TCAGCGCCAC GGCTCCAATA CAAGAAGTGA AAATTAGCCG AAGTGGACGT CCAGTCAAGC
GGGGCAGTTT TCACGACGAA ATTGAACAGC GCGAGCAGCA TCTGAGATCT GACCGGTCTC
ATCCAATATC ACCGTCGAAA GCTATCACAA ACCAGACATC ATCTACTCCT GCCACCATCC
CAGCTGAAAT TATTGAGGCC TCAACCATAG CTGATTCTGA ATCTGATTCT TTGAAGGCCA
CCAAAACGAC TTCTGTGTTA CAGGGTAGTC AACAGGCTTC CGCAAACGCA ACTGCACTGG
AAAATGCCTC TCCTTCTCCT GTATTTTCGG AACCAGCGTC CCCGCCAGCG CCAATGAACA
ATAAAGCCAG CAAGGTGTCA GTGAAAGTTT CTGTCTCGAA TCCCACCCCA AATGTGCAAC
CATCGGTTCG AATTAACACT GTTCCTTCTA TTGCCACTAT TCCTCCCATG GTAGAGCCCA
CACCCGTGTC CGTGCCCCCT ACAGCGATGA AGCCAGTGTC AGCTCCGATG TCTTCTTCCA
CAAGCACTTC GTTGCCAACA ACGGGACAAT CTACACTTCT ATCACCCTCT AAAATATCGG
ATCCCGCTCC TGCAACTAAC AATGTTGAGC CCGCTGCCGT TATTTCAACT GCAGCCATAA
CTATGGCCAA AGCAGCTGTA CCACAGCCGC TTACAGCTTC TGCCAGCGTA CCACCGCCTA
TCGCCCAAAA CAAAGAAGTC AAGGTGCCTC GCCGCAAACC TGGAGCGCGA GAGTGCATGC
AAATCTCTCG GCGATTCGGT GTTAGAGCCA TTCCTCAAAA GTACATGGAT ATTATGACGG
ATTATTGCAA GCGCGGAAAA GTTGAACATT TGATTCGGAT GCGCGAGCGA TTGGACGATC
ACTCCCGCTT TCTAGAAGCA CAGTTGGCAG GATTGGAGGC CTTGGTCCTG GAAAAGGGAG
AATCAAGTGT TGTGGTTCCT TCCATGCCGT CAGGTCCTGA TCGCAAGCTA GAACGCACTT
TGGGGACAGA GTATGATTTG TAA
 
Protein sequence
MYIDCGRGYR TVVNEPIQNS PEFVLTTANS TERRSRRMFM FLFGDEGCIT SIMTSNSNSN 
TNTTTSTPAT SASIPLDLTS SSSAPRSHNR RYLCAAPRPD QYDEHASPVL QRLGHTRASR
KTLLEATLRA LQTRHNLERK TGPGRKSNAE HAIHFYEYLL ELVENGEEEG EDIGDADSIS
SASNKDEAEH ERNVESVNKN TLKDENNATE AQEDSVKSEL DEASSMDTAT FVKAHNDLCE
VCDEAGDLLM CETCNLVFHV ACVRPALETL PEQDYKCAYC VLSTEPKNSR PRKQAAAAVR
LMARLRNQFQ RNKRRGRNDA GDSKRDNEDE DQIVSSDLGD KPSEASGNEE ISKLEPSAGT
DEEKSSEEEK DKTNDTKDTT KEIADPRKVT EDRTNIEEAT PLASNKRRKL ELYRITDAFS
TPENMEDGRS KRNRKQPMLY DPQAGPARKW QSDEPKYWNS DSQSDNSISG ESETRDADEK
DAFGTVVKKV GSARKENGEI HCSFCEDDPF IELCCFCGCR VCFGKHHQSK LLLCDECDDE
YHTFCLSPPL KSLPASNAEW FCPSCSVSQQ RRQITTKSLS SRIGTRGAMS KSPSPTRISS
RQAATKASHS TSLTEHVKRG PGRPRSKDRI LTVAVGKKRG RPPKSASQGS PDKKRAKTEP
TTKLSSQTPK SDDGKARSKS LVGAVSDHDP VTDYISATAP IQEVKISRSG RPVKRGSFHD
EIEQREQHLR SDRSHPISPS KAITNQTSST PATIPAEIIE ASTIADSESD SLKATKTTSV
LQGSQQASAN ATALENASPS PVFSEPASPP APMNNKASKV SVKVSVSNPT PNVQPSVRIN
TVPSIATIPP MVEPTPVSVP PTAMKPVSAP MSSSTSTSLP TTGQSTLLSP SKISDPAPAT
NNVEPAAVIS TAAITMAKAA VPQPLTASAS VPPPIAQNKE VKVPRRKPGA RECMQISRRF
GVRAIPQKYM DIMTDYCKRG KVEHLIRMRE RLDDHSRFLE AQLAGLEALV LEKGESSVVV
PSMPSGPDRK LERTLGTEYD L