Gene PHATRDRAFT_46516 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_46516 
Symbol 
ID7201672 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011678 
Strand
Start bp576336 
End bp579674 
Gene Length3339 bp 
Protein Length1051 aa 
Translation table 
GC content51% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002180860 
Protein GI219120234 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
AATCGCCCCA CCACTCCGTG AAGACACTGC AAGTTGCACG TCCGACAGTT TTTGGCTAAC 
CATACGTCAA TCACAGCATG GTTCGCAAAC GCTTGGACGA CCGGGTGCGT GCACTGTTGG
AGCGCTCCGT CGTGACGGGC CAGCGATCCA TGCTGGTCCT GGTCGGCGAT CACGGCAAGG
ATCAAGTGCC CAATCTGCAC CAAATATTGA CGAAATGTTC CGTACAGGCC CGTCCCAAGG
TGCTCTGGTG CTACAAAAAG GAACTCGGCT TTTCGACCCA CCGCAAAAAG CGCATGAAGA
AACTCAAACG CGACAAATCC CGGGGATTGG TGGGTGGGGA AGCCGACCAG GCCGATAACT
TTGAACTCTT TGTCAGTCAA ACCGACATTA CCTGGTGTTA TTACAAGGAC AGTCACCGCG
TGCTGGGAAC CACCGTTGGT GTGCTCGTCT TGCAAGACTT TGAAGCCTTG ACACCCAATC
TCATGGCCCG TACCATCGAA ACCGTGGCCG GCGGTGGCTT GGTCATTTTC TTGCTCCGGA
CGGTCAAGTC GCTCAAACAG TTGTACGCCA TGAGTATGGA CGTACACGCC CGGTACCGCA
CAGAGTCCGC TGGTGACCTG GTCCCACGCT TCAACGAGCG ATTCATTTTG AGTTTGGGAA
AATGTCCCAA CTGCTTGGTG TGTGACGACG AACTCAACGT ACTACCGGTG AGTCGCAAAG
CACTGAACGA CTTGTCGCCA AACGCCGGTT GGTCCAAGGG TGATGCCGGC GAGGTAATTG
TGCAGGATAC ACCGGAGCAA CGGGATTTGA AGGAAATTCA AGAAGCACTC CTGGATACAC
CCCACGTTGG TGTGCTAGTG GAGCTAACGA AAACACTGGA TCAGGCCAAG GCCTTGTTGG
TGTTTTTGGA AGCCTGTTCG GAAAAGACAC TCAAATCGAC GGTAGCCATG ACGGCCGCTC
GTGGTCGGGG AAAGTCGGCA GCCATGGGAT TGTGTCTCGC TGGCGCCATT TCGCTGGGAT
ACTCAACTAT CTGTGTGACC GCCCCGGAAC CGGAAAACTT GGTCAGCGTC TTTGACTTTC
TATGCCGCGG TCTCAAGGCA CTCAAATATC AAGAACACAT GGACTATAGT GTAACGTACA
ATTCAGCCAG TGGTCGCGAA CAGACCAAGT GTATCACGGC CATCAATGTA CATCGTAGTC
ACCGGCAGGT GATCCAATAC GTTGATCCAG CGGAAACGGA CAAGTTTACG AGCGCCGAAA
TTGTGGCCAT TGACGAAGCT GCAGCAATTC CGTTGCCCGT TGTGCGAGCT CTCATGAGCC
ACCCAGATCG CTTGACTTTT TTGAGTTCGA CTATTAACGG ATACGAAGGT ACCGGTCGTG
CTCTTAGTCT CAAACTCATT AAGGAACTGC GGGATGCCAA AGGAGGACGG CACGCCGAGA
TGCAGGCCGC CTCTTCCGCA GCAAACTCTA TCGTTGGAGC AAAATCGAAA AAGGGCGAGG
CCAAAGTTCA TGAACAACGG TGGGCCGCAG CAGCGGCTGC AATTCTGGAA GCCAGCGAAG
GTTCTGATAA GCTGTTTGGG CCATTGAGAG AAATCGAACT GCTGACGCCC ATTCGATATG
CACACGGCGA TTCAGTCGAA GCATGGCTCA ATAAGCTCCT TTGTTTGGAC TGTGGATCAG
CGTCTAACTT GAAACTGAAC GGAGGGGCTC CTGCTCCAGG CGATTGTGAA CTTTACAGCG
TAGATCGTGA CGCTCTTTTT TCTTTCCACA AGTTATCCGA AGCTTTCTTG CAGAAGGTCA
TGGGACTTTA CACGAGTGCT CATTACAAGA ATTCACCCAA CGATTTGCAG ATGCTCTCTG
ATGCTCCAGC TCATTCACTT TTTGTACTGC TTTCGCCGTC GGCTGAACAA GATGCAAATT
CTCTCCCGGA TGTTCTTACC GTTGTTCAAG TGGCCCTAGA AGGACGTATC TCTAGAAAGG
CTGTCGAAGC GCAGCTTGCC CGAGGACATC GCTCGGCCGG CGATTTGATT CCTTGGACAA
TTTCTCAGCA ATTTGGTGAC TCTAAATTTG CTCAGTTGAG TGGAGCGCGA ATTGTGCGTG
TTGCTGTCCA CCCTTCGGTG CAAGGAATGG GATACGGGTC CAGGGCGATC GAACTTCTTT
ACCGATTCTA CAACGAAGAA ATGGTTTCGC TCGTCAATGA CGAAGGTAAC GATGATGCTG
ATTCTGACGC AGAGCGCAAT GGAGAAGAAG AAAGCGACAA TGATGAGCCG ACGACATCTG
GAATTGGAAT TTTGGGTGAA AACTTGAGGC CCCGCAAGGA ACTTCCACCT CTTTTGCTTC
CTTTGACCGA AGTGGATATG CCGAGACTTG ATTGGGTTGG GACATCGTTT GGGCTAACTC
TTCAACTTCA CAAATTTTGG AGCCGTAGCG GAATGCGGAT GCTATATTTG AGACAAACCA
AGAATGAGCT TACGGGTGAA CATTCATCCA TTATGGTCCG TGCTCTACCA AGGCGAAGTG
GTGTCGATGA CTCTTGGCTT TACGCGTATC TGAGTGATGC TAGGCGACGA TTCACCACTC
TCTTTAGCGG GCCTTTTCGC CACTTGGACG TTAGGCTTGC TCTTTCCGTG TTCGATAATA
TGGATGTGCC AAGCAACACC ACCGAAGCTA AGCAACGCGC AGGAGCTTTG GCAGGCACTC
TTACCTTCAA GGAACTCGAC TACTTCTTGA CACCATATGA CTTGAAGCGC CTTGAATTAT
ACGGACGAAA TTTATGTGAT CATCACCTTG TAATGGATCT ACTACCAATA ATTGGGCGAT
TGTACTTCAC TGGGCGTTTT GGATCTGACT TCAACTTATC TAGCGTCCAA GCCGCGCTCT
TCTGTGGGAT TGGACTACAG AACAAGAGCG TCGACATTTT GACGAGAGAG CTTGGTCTGC
CAACCAATCA AGTCCTTGCA ATGTTCAATA AAGCAGTGCG AAAAATGTCC ATTGCCTTGA
ACTCTGTCGT TGAGGAGAAA GAGAAAGAGA GTCTCTTAAC TGGTGAGAAA CGAAGCAGAA
TTGAAGAAAG TGCCGAACAG ATGCGTCATG TTTCTCGGCA AACTTTGGAT GAAGATGCCG
AGCAAGCTGG CCAGGAAGCG ATCGCAACGC TCAGGGCGAA CGAGATGGCG AACCATCTAC
CGGAGCTTGC ACACGATACT GAAATGCTGA AGTACGTCGT CAAGGGATCT GATAAACAGT
GGGAAAAGGT CCTCCAAGAC AAAGATGTAA GTGGAACAGG CACTGTGCAA ATTTCGGAGG
TACGAGAAAA GAGAAAAATT GTTGACGATG ACGACATAG
 
Protein sequence
MVRKRLDDRV RALLERSVVT GQRSMLVLVG DHGKDQVPNL HQILTKCSVQ ARPKVLWCYK 
KELGFSTHRK KRMKKLKRDK SRGLVGGEAD QADNFELFVS QTDITWCYYK DSHRVLGTTV
GVLVLQDFEA LTPNLMARTI ETVAGGGLVI FLLRTVKSLK QLYAMSMDVH ARYRTESAGD
LVPRFNERFI LSLGKCPNCL VCDDELNVLP VSRKALNDLS PNAGWSKGDA GEVIVQDTPE
QRDLKEIQEA LLDTPHVGVL VELTKTLDQA KALLVFLEAC SEKTLKSTVA MTAARGRGKS
AAMGLCLAGA ISLGYSTICV TAPEPENLVS VFDFLCRGLK ALKYQEHMDY SVTYNSASGR
EQTKCITAIN VHRSHRQVIQ YVDPAETDKF TSAEIVAIDE AAAIPLPVVR ALMSHPDRLT
FLSSTINGYE GTGRALRGRH AEMQAASSAA NSIVGAKSKK GEAKVHEQRW AAAAAAILEA
SEEIELLTPI RYAHGDSVEA WLNKLLCLDC GSASNLKLNG GAPAPGDCEL YSVDRDALFS
FHKLSEAFLQ KVMGLYTSAH YKNSPNDLQM LSDAPAHSLF VLLSPSAEQD ANSLPDVLTV
VQVALEGRIS RKAVEAQLAR GHRSAGDLIP WTISQQFGDS KFAQLSGARI VRVAVHPSVQ
GMGYGSRAIE LLYRFYNEEM VSLVNDEGND DADSDAERNG EEESDNDEPT TSGIGILGEN
LRPRKELPPL LLPLTEVDMP RLDWVGTSFG LTLQLHKFWS RSGMRMLYLR QTKNELTGEH
SSIMVRALPR RSGVDDSWLY AYLSDARRRF TTLFSGPFRH LDVRLALSVF DNMDVPSNTT
EAKQRAGALA GTLTFKELDY FLTPYDLKRL ELYGRNLCDH HLVMDLLPII GRLYFTGRFG
SDFNLSSVQA ALFCGIGLQN KSVDILTREL GLPTNQVLAM FNKAVRKMSI ALNSVVEEKE
KESLLTGEKR SRIEESAEQM RHVSRQTLDE DAEQAGQEAI ATLRANEMAN HLPELAHDTE
MLKYVVKGSD KQWEKVLQDK DKREKLLTMT T