Gene PHATRDRAFT_46097 details


Gene Information

Locus tag: PHATRDRAFT_46097
Symbol:
ID: 7201437
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011677
Strand:
Start bp: 262409
End bp: 265438
Gene Length: 3030 bp
Protein Length: 980 aa
Translation table:
GC content: 50%
IMG OID:
Product: predicted protein
Protein accession: XP_002180604
Protein GI: 219119700
COG category:
COG ID:
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
CTTGGCGTTG GTGAATTTAA TCCAAGCTAC GCCTTGGAAT GAACGACGAC CGCAAAGAAG 
CCGAGGATTT GGCGGATGTC AGCTACGAAG TCCCCATTGA TTCGGGAGAA GTCGTGGATC
GGGCACGGAT TCTACCTTCG CACGAGACGT ACGGTTCTAC GGATAACCGC TCGACGCGGC
AACGCAAATG GATCGACGAC GGTGTGAATC AGAAATCTGC CAGTACCGGA AAACTGCGAT
ACCCGAGCAG TCTTGATTTT GAGAGAGTCA TCAATGATTA TTCCATTCAA GCGACCAAGG
ATCGAGTGCT CGTGCAAGAA TTAGAAGCTG ATAGGAATCC ACCAGATCCA TCGGAGACTG
ATCGACACGC GCTTCTGCAC GCACCCGATG GACTCTTGAC CTACAATTCG TTTGAAGAAC
CCAGTCGAGA CGATTCGTTC CGTCCGCCAC TACCCCCTCC GCCTCCACCT CCTCCCCCTA
TGGCGTACTT GAAACGGAGA CCCAAAAAGT CACCACTGGG GTACACTGGA CGGACTGCTA
CACGATGGCT ACTGACCAAT GCCACTGGGC TCATGACGGG GTTAATATCC ATCATGATTG
TTAGTGCAAC GGATTTCATT CAGACGTGGC GATCACATAC TATAGACTAC TTATGGAAGA
ACGACAAGAA CCATCACCGA TTAACGACTG TGTTTATTCT TTACGCGTCC GTCAATCTCT
CTCTTGCTCT GGCGTCATCG GCTCTTTGTC TATTCTTGGC TCCAGAAGCT GCCGGATCAG
GTATCCCCGA AATCAAAGCT TATTTGAATG GGGTGCGAGT CAAACGCTTC ACTTCCGTGC
AACTCTTCTT TGTCAAAATT GTTGCCACGA TTCTTTCGGT ATCGTCGGGT CTCGCGATTG
GACCAGAAGG ACCTCTGGTA CACATCGGTG CTATTCTAGG CGCGAGTTGT ACCAAGCTTT
CTAGTCTCAT GCTCAGGGTC CTTCCCAAAT CTTGGTCAAC TCATTTGTGG TCGTTCGTCA
CAATGGATCT TTCTCACTTT TCAACGGACG GAGAACGTCG TGATCTCGTT AGTATCGGAG
CGGCTGCTGG CTTTGCAGCT GCCTTTGGTG CACCCATCGG AGGTCTACTC TTCACCGTCG
AAGAGGCTTC AACATATTTT GATCAAAGCA TGTTCCTGAA GACTCTCTCG GCGACGGCGC
TAGCGACATT CTGCTTGGCT GTACATCATG GTGATTTGAG CCATTACAGT ATCATTTCTC
TGGGTGATTT CGAATCATCC GACTCCAATA TTTTCGTGAA TCGAGTCGAG CAAGTGCCAC
TCTATTTTAT TGTCGCTATC GCTGGGGGGA TCCTGGGAGG ACTTTTTTGT CGATTCTGGG
AGTTTCTGCA GCGATCTCGA CAGCGTCTCA AGCAACGTCG CTGGTCGTAC GAACTGCTTG
AAGTAGCCTT TGTTAGCTTG CTTACGTCGT CGGTGACATA CTTTGCACCC TTCATGAGCT
TCGCTTGCCG GGCGGTAGCT CCCACCGACG ACATCGTTTC CGAAAAGAGC CTTTTCGACC
CTTGGATGTC GCACGCGCAT CAGTTCGACT GCCCCACAGG GTCAGTGAAT GAGCTCGGAA
CGATCTTTTT CGGCTCACGC GACGACGCTA TCGGCACAAT CTTAAGTGAC CCTTCGCAAT
TTGACCCGAG GACATTATGG ACGGTTGGCA TACTATTCTT TCCTCTTATG ATACTGACCC
TTGGTGTGAA CATTCCATCC GGAATATTTA TGCCAACGGT ACTGATTGGC TGCTCACTCG
GTGGCGCAGC CGGTCTCGCC TTTCAAAACT GGATCAGCGA GGATCTGTCG CCATCCACGT
TCGCCTTGCT AGGTGCTGCT GCTCTCTTGG CTGGTATTCA ACGATCTACC GTCAGTCTTT
GTGTGATTCT CGTTGAAGGC ACGGGACAAA CCAAAGTGTT GATTCCCGTT ATCATTACGG
TTGTGGTCGC GCGCTACGTA GGAAATTTGG TCAGCAAGCA TGGCTTGTAC GAAACTGCCA
TTGAAATCAA CCAGTATCCA TTTCTCGATC ACGAGCCCAA GAAGCGCTAC GATATATTCC
AGGTTGGAGA AATAATGAGC ACACCGGCAG TGACATTGGG CCCGCGGGAA CGGGCGCACA
CCCTTGTCAA GCTTTTGCGT GACTCTGGGC ATCACGGCTT TCCTGTGACA GAAAAAGACA
CGGGAAAATT TCTCGGGCTT GTACGACGGG ACCAAATTGT TGCTTTACTG GAATGTGGGA
TCTTTGAAGA CGAGCATGAA TGGGATGATG ATTCATCTAC TGGAACCAGT TCCATGCCGG
GGACGCCTTC TACTGAATGG ACGCCAAAGC CAGGAATCGG AAAGTCATCG CTGATGCATT
TGGCTTTCCA TATTCCAGAT GACCGCTACG ACTACTTGAC GGATAATCAG GGTGCAATCG
AAGCAGTAGA AAATATTAAC AAAATGATGG TTGAAGACGA GTTCGACGCA AACGCTTGGC
TTGTATCAAT TCGACGGAGT CGAGAACACT TGGCCGGTCT GGAAAATAAC GAAGAGGATT
CGGCTTGCCC TCACATTGTG GTTGGAGACG ATACACTACC ACCAATTTCA CAGAACCGCC
GATACATTCC TAAAGGCACA CTCGGGAGCA CCCGAGCAGC TGTTTCCCAG GGCCGCTTTG
CTACGGTGAC TACCAATTCG AAAGGCGATG TCTACGTTCA ATGGCTGAAT CCAAGCTGCA
AGCGCAAATG GGTCCATGTT GCCGCCGTCA TGAATCGTGG CACGTACTGT GTGACAGAGA
CGACTCCTTT GAGCAACGCC CATTTTCTCT TCACCTCTCT TGGATTGCGC CATCTAGTGG
TGCTTGGCGG CAAAAGAGGA GGCACGGTTG TTGGTGTTGT CACACGCATC AATCTTCTCA
AAGATTTTAT TCAGGAGCGC ACAGGATGTA AGTTTTATTG AGGGGCGTCA CCTTATTTAT
AGCATCAATT TAGACTCTCT ATGACATACC
 
Protein sequence
MNDDRKEAED LADVSYEVPI DSGEVVDRAR ILPSHETYGS TDNRSTRQRK WIDDGVNQKS 
ASTGKLRYPS SLDFERVIND YSIQATKDRV LVQELEADRN PPDPSETDRH ALLHAPDGLL
TYNSFEEPSR DDSFRPPLPP PPPPPPPMAY LKRRPKKSPL GYTGRTATRW LLTNATGLMT
GLISIMIVSA TDFIQTWRSH TIDYLWKNDK NHHRLTTVFI LYASVNLSLA LASSALCLFL
APEAAGSGIP EIKAYLNGVR VKRFTSVQLF FVKIVATILS VSSGLAIGPE GPLVHIGAIL
GASCTKLSSL MLRVLPKSWS THLWSFVTMD LSHFSTDGER RDLVSIGAAA GFAAAFGAPI
GGLLFTVEEA STYFDQSMFL KTLSATALAT FCLAVHHGDL SHYSIISLGD FESSDSNIFV
NRVEQVPLYF IVAIAGGILG GLFCRFWEFL QRSRQRLKQR RWSYELLEVA FVSLLTSSVT
YFAPFMSFAC RAVAPTDDIV SEKSLFDPWM SHAHQFDCPT GSVNELGTIF FGSRDDAIGT
ILSDPSQFDP RTLWTVGILF FPLMILTLGV NIPSGIFMPT VLIGCSLGGA AGLAFQNWIS
EDLSPSTFAL LGAAALLAGI QRSTVSLCVI LVEGTGQTKV LIPVIITVVV ARYVGNLVSK
HGLYETAIEI NQYPFLDHEP KKRYDIFQVG EIMSTPAVTL GPRERAHTLV KLLRDSGHHG
FPVTEKDTGK FLGLVRRDQI VALLECGIFE DEHEWDDDSS TGTSSMPGTP STEWTPKPGI
GKSSLMHLAF HIPDDRYDYL TDNQGAIEAV ENINKMMVED EFDANAWLVS IRRSREHLAG
LENNEEDSAC PHIVVGDDTL PPISQNRRYI PKGTLGSTRA AVSQGRFATV TTNSKGDVYV
QWLNPSCKRK WVHVAAVMNR GTYCVTETTP LSNAHFLFTS LGLRHLVVLG GKRGGTVVGV
VTRINLLKDF IQERTGCKFY
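
Consistency check (sketch)

The Gene Length and GC content fields can be cross-checked against the coordinates and the gene sequence listed in this record. The Python sketch below is illustrative and not part of the source record. It assumes that Start bp and End bp are 1-based inclusive coordinates, which agrees with the listed values (265438 - 262409 + 1 = 3030). The function names are hypothetical helpers introduced only for this example.

```python
def clean_sequence(block_text: str) -> str:
    """Join a sequence printed in space-separated 10-character blocks
    (possibly spread over several lines) into one uppercase string."""
    return "".join(block_text.split()).upper()


def length_from_coordinates(start_bp: int, end_bp: int) -> int:
    """Feature length, ASSUMING 1-based inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = sequence.upper()
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence as printed in this record.
first_line = "CTTGGCGTTG GTGAATTTAA TCCAAGCTAC GCCTTGGAAT GAACGACGAC CGCAAAGAAG"

print(length_from_coordinates(262409, 265438))  # 3030, matches Gene Length
print(len(clean_sequence(first_line)))          # 60 bases per printed line
print(round(gc_percent(clean_sequence(first_line)), 1))
```

Passing the full Gene sequence block to clean_sequence and then to gc_percent gives a value that can be compared with the GC content field, which the record reports only as a rounded percentage.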