Gene PHATRDRAFT_54143 details

Gene Information

Locus tag: PHATRDRAFT_54143
Symbol:
ID: 7197024
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011670
Strand:
Start bp: 1331261
End bp: 1336799
Gene Length: 5539 bp
Protein Length: 1695 aa
Translation table:
GC content: 53%
IMG OID:
Product: predicted protein
Protein accession: XP_002178120
Protein GI: 219112737
COG category:
COG ID:
TIGRFAM ID:
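
The coordinates and lengths in this record can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (which is consistent with the stated 5539 bp) and that the coding region ends with a single stop codon; the function names are illustrative only and do not come from the source database.

```python
# Values copied from the gene record above.
START_BP = 1331261
END_BP = 1336799
PROTEIN_LENGTH_AA = 1695


def inclusive_length(start: int, end: int) -> int:
    """Length of an interval whose start and end are both included (assumption)."""
    return end - start + 1


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode a protein, optionally counting one stop codon."""
    return 3 * (protein_aa + (1 if include_stop else 0))


gene_length = inclusive_length(START_BP, END_BP)
cds_length = coding_length(PROTEIN_LENGTH_AA)

print(gene_length)               # 5539
print(cds_length)                # 5088
print(gene_length - cds_length)  # 451
```

Under these assumptions, roughly 451 bp of the gene span are not accounted for by the protein, which fits a record marked as spliced: the remainder would be intron and possibly untranslated sequence.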


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACGACGA CGACGACGGT ACCGCATGCC ACGGTTCCCG CCGTCCCCGC AGCGCGCCTC 
GTGGGTCAGG AAGCCCGAAT GGTCCTCACA TCCTTACGGG GTGGACCCCC GTACGTGGCA
CGCGGAACGC TAGCGCAAGA CTTGTTGGAT CTACGGGATC GGCTCCCACA GTTGCGGACT
ACGACGAACA CCACGCTCGC TACCGGTAGC CCCACGACGC CGGTGGGTGA GACTACCCAC
ACCCACACCA ACACCAGCAT TAGTGTACAC CCCAATGCGT CCGATACAGC CGTGGACGAT
ACGAACGACG ACCGTTCGTA CGATTTTGTC CGTCCCTTTC TACAAGTCGT CACGGATCCA
CGAGCGGCAG GTCCCCACAC ACTCGTCGCC TTGCGATCTC TCCACCGCAT GCTGCTCAAC
AAATCCTTGT TTGTCCTTTA CGATCATCAA CATCAGCAAC TACAACACCC ACCACCACCA
TTGAAATATC TGCAGCAAAA CCACAGTGCG GTGTTGGCTT CGATTGTCAA GGCCGTCTTG
ACGTGTCAGT TTGAACAGAC GGATGCCGGG GCGGACGAAG CCGTAGAAAT GGCCGTCGCC
GAAGTCCTCG GACAAGTCGT CGCGCTACTC CCGTCGCTCG GGCTACCGGA AACCGTGGAC
CCTCGTCACG CCGCTATCAC TCCCGAGACG TTAGCGGAAA TCTTCCACGC CGTCTTTGTC
ACGCGCACCA GCAGTGCCCT GGCCAATTCC CCGGCACTGG TCCTGCGCTT GGAAGATATA
CTCCTCCAAA TGACCCAACA CGTCTTTCGG CCCGGCTCGC CACCGAACGA AGCAACCGCC
ACGCCCAATC GAAAAGTGTG GCTGGGACGA TGTCAAGCCG TTTTGGAGTT CTGGACGCAT
CCGCTCCTGC ACACGCCCCT GGTCGGTGGG GATGGATTGG ACGAATCTAC TCGGGAAGAT
CAGCGTCTTT ACGACGCCAC CCGGGTACTC TGCTTGCGAG CGGTACGCAC AGCCCTCCAA
ACCGGATGGG CCGAAGCCTC AATCGCTACC AGTTACGATA TGGACGACGA GGAAGACGAC
GACGAACACT ACCAAAGCCT AATCAGTATC ATTCAGGACG ATTTGTGTTT GTCCTTGCTC
ATGACTGGTC AGGCCATTTG GGCTTACCAC GATGCCCACA CCAATATCTC TCCCGGATTC
GTATCCCTCG AAGTCTTGTC GGAAATTTGT GCCACCCTCA CGACGCTTTG GAACACCCTC
CCACTACGCA CCGTGTTAAT TGCGCAGTTT GAAACCATCT GGACCGGCTT TTACACCCGG
GCTCTCGTGC TCTTGCGCAA ACGCCACCCA CCCACTAATT CCTTGTCCTT CAACGCCAAC
TTGACCTTTG ACGCCGAAGT GGAAATTATT CTCGAATCGC TCGTGGACGT TCTCTGTCTC
CACGATCACG TCCGCTCTAT TGCCGACGGA GACGGCGGAG CGCTGGAGAC AATGTTTGCC
TACTACGATT GTCATTTACG TCGCTCCGAC GTGGCCGTCG GGCTCATGGT GGAACTCTGT
CGTTGTTGTG GCGGGGCCGT GGATCCGGAC GGGGAGACGC TCGTCCTAAC CCCGTCCACC
AGCTTTTTGG CGCGGCCCCC CACTTGTGAG GATTCGGTGC ATTCGGATAC TTCCGATACG
GCGACACCTC CCGATACGGC CGTATCGTCT CCCATGGTAC AAGTCGATCA CGTATGGCGA
CCGGTTCCGC CCCATCTCAA GGAACTGTGC GCACAAGCTC TCATGGGAGG AATGAAATGC
TTGTTCCGCG ACGACAAGGC CTCGGCCGAA ACCCTGCTGG AACGATCACG GCGCAAACGG
TCCATAATGA GTCGACAACT GAAAGACAGT TTCGAGGAGG CCTTGTCCGA GACGATCCCC
AACACAAACG TTGCCGAATT ATCGGTGTCA CCCTCCTGCA CGCACGTGCT CCGAGACGTG
AAAACGAAAA AGCGTCTCAT GCGCAAGGCG GCGCGCATCT TTAACCACAA GGCCTCCCGG
GGCATCGAAT TTTTGCTCGA TGCCGGGCTG GTGGCCGACC CCGTTACTCC CATGAGTGTC
GCTACCTTTT TGCGCAACGG CATTGTGGTC GGTCTGGACA AGAAAGCGGT CGGCGCGTAT
TTAGGCGAAG CCGGTAAGGC TCCCATTGCC GGCAAGTCGC CACTGTCCTG GGAACGCGAC
TGGTTTCATA AGGATGTCCT ACAGAGTTAT TGTGGACTGT TTCGATTCGA AGGACAGTCC
TTGTTGGACG GGCTGCGCAT GTTCTTGGCG GCGTTTCGTT TGCCGGGGGA GGCTCAACAA
ATTGATCGCA TTCTACAAGC CTTTTCCGAT TCGTGTGGAC AGGTCTGCGA AGAATCAGCC
GACGGCCGTC TCCAGTTGTT CTCGGAAGAC CCTAAGCGGG CAAGTGACGC GGCCTATCTA
CTTTCTTTCA GCATCATCAT GCTGAATACC GATCGACACA ACACCAATAT CCGGGAAGAC
CGCAAAATGA GTGCCGCTGA CTTTGTCAAG AACAACACGG ACTACGGACG TGACATTACC
GAAAAGGGAA AGGAATTTCC GAGCGAATTT CTGGAAGGAA TTTACCACAG CATCAATGAT
GAAGAAATTC GTACGGAAGG GGAAGGAGCG GACGGCGCCA TGACTGTAGA ACGGTGGAAA
GATGTCCTCC GTGGCTCCAC CGAAGAAGCC GAAGATGAGT TTCTGCCCTC CTTGCACGAT
GCCGAAGACC TGACCGAACT AGTTCTGGAA CACGTGTGGA AGCCAATCAT GTCGTCCATT
GGAGCTTTCT GGGGGATGCC TCGTGTAGCA GACGATGAAC CCCTGTCGCC AAGCGATCCG
GCACAAAACG GCATGCTTGG GGTACAAGGT GCTCGTCTCG GAATGGACAT GGCGTTAGAA
ATGCTGCACG GAGTCCGGAA GTTGGGTCGT ATCGATATCT TCCGTAAGAT TTTTTCTTGG
ATCTGTGACT ATACCGGTCT AATTGGGGAT TACTCGGTAG ATGCTGTGGA ACGCACCTGG
TCGCTGACCA ACTCGGTCGA AGCACAGAGT GCCGTTGTTG CTGCGATTCG TACGGCGCTG
GACGCTGGCG AGGACCTGAA CGGAGACGGA TGGAAGCGAC TCTGGTCCAT CCTTTTCGAA
ATGCGCGACT TGAAGCTTCT TGCGTACGGT GGACCTTCTG CCAAGTCAAG TCTACTCCAC
GAATCGGATC CGGACATCCT CGACGAGAGC GCTCGTCGAG ATTGGACAAT TGTCTCGTTA
AGGGTGATAT GGATTTCTTC AATCGCCCTC GTAAGGAAAA AAAGTCAACT ATGAGCAGTT
CCGTGTTCGG TGCTTTCGGG CGAGCGCTCT TTGGAGCGGA CACGGAAAAC GATGACGAAA
GATCTGCACA ACTGGATAGC CCAAGTCGAA GAGCACCCGT AAGCTCGGTT CACGGCAAAG
AAGACCTTGT CGTGTGGGAC GATTATGCAC CTAGTGACGA CGAAGAGGAG CCACAATCTG
TGGAAGAGTG TGATGATTTA TCGAGCGAAA TGGAAGGGCT CAGTCCAGGC GCCGAGTTTG
AAAACCTTTT GATTAGGGAA AGCCTAGGCA TGAGTCGTCA ATTGGACTTA CCAGTCACTG
GCCTGGAACG AATGGACGAA GCCAGGCGGC ATCTCGTGTC TCCACGCGCT CGCGTCCGTG
GCCGACTGAC AAATGCATGC AACTTTAAGG CACTTGTTTC GGACAGTCGA TTTCTCAACG
ACGCGGGAAT TCGTGTCCTC TTGCAAGCGT TGGCGGAGCT CATCGCAGGC ATGAGTCGGT
CGACGAGACT TGCTGAAGCT CCACCGCTTC CTCCCCCAAG CGGAGGTCTC GAACGCAGCT
CTAGTAGTGA TTCCATCGCA ACCCCGGTTT TCTTGCCGAC TAGTGGGTAT CTTCCGATTT
CCCCGGCTTC CGAAGCGTTT GCTGAAGTTC TCATATGCGA AATTGCTTTG AAGAATAGAG
ACAGGTTGAA GATGCTTTGG AAAGATGTTC TGCAAGACCA TTACCTGAGC TCTTTGACGA
GTATTCTTGT CAATCCAGTC GAAGGTGCTA GTACCGCAGT TCCCCAACCA GATCCAGGTC
TCGAGAAGCG AGTCACGGGA TTGCTTCGCA TCAGTATCTG CGCTGTGCAG CGCGACGAGC
TTTCCAACGA AATTTTGTCT GCATGGAAAT ACTTGCTTCC TATAAGCGAT GAACAACGAG
CGTCGTCGCC TTTGCGTGTG CTCGACAAGC ACATTGGTGA AGGATTGTGG AGAACCGCAT
CTTCGGTTGA TGGCCTTCAT TCACTCAACG CCGATGGCTG GGAAGGTTTG ATGTCGCTTT
TAAAGTGGTG CGCAAAATGT GGCGGTATGT CAAAGCCTGT CATCTCGCAC GGCAGTCAGG
TGTCGGCGCC TCTTCCTGAA AACGATCCAG CATACCAAGG ATATCGAACA GCGCATCTGA
TACTTAACAC AGAAGACTTG GATAAACGTG TCCCTTGCTC TATTGTGGAT GCTCTTAAAG
CATTGGTAGA AGCAGGTCAA AACCGCGCCT ATCCACAGCT GAGTATCGCA TCTTTGGATT
TGCTTCACAC GCTGCACGAG AAAAAAATAA ATTCGTTGCA AACGGAATCA TTTTCCGACG
AAAACGCTGC TTTGTTCTGG TCCGGATGCT GGCGAGAGAC CGTTGCAGTG ATGGCGGAAG
CGGCTGAGCT GTCTTCCGAT ACGGTGCGTT CAGCTTTCCC TCTATGTTTT GTTTTAGTGC
TTGACTAAAC CTGTTTTCTT TCATTTTAGA ATGTTCGACA ACATTCATTG TCAATGCTGA
CTGATTTATT TTTGGAAAAG CGAAAGACTG CAATACCAGT AGCACACGTT GCTGGTGTCC
TCAGTGAAAT ATGCGTCCCA CTGGCAGGGC GCTGCATCTT ACGCCTTCAA ATGGGTGATG
ATTCGATTGA GAATTCAGAC GCGTTGATGA TTGAGTTTGA GCTGTGCATC AGTCTTATCT
TCAAGCCCTT GCGACACCAT CTCAACACGG GTATGTCGGC GATTTCCGAC GGAAACCTTT
CATCCATTTG GAAGTCGGTG TTATCTGTTC TCGAAGAACT GCTGCGCGAA GACAGCCCTT
CGCTGGACAG CAACGAAGGT CAACCTTCGT TACCGGTGAA TCTGAAAGCT ACAATGAATC
AACTTGTTAA CGAACATCTT CAGAATGCCA TATCAGTACT TATTGCCGCC GGGGTCTTGC
TGTCGGAAGG CTACTCCAAA GCTTCAGAGG ACATTTCGTT TATAACCTGG GAATCTGTTG
GTCGAATGGG GATTCCTGAA AGTGCCGTTG TGGAATGGCG ACAGCAAGCT TTGCATGAAT
CATAATAAAT TGTATCGATT TATCTCAATC AAATCAAAAA CTGTCCCTTT ACGGATCAAG
CCACTATGTA TTTGCAATAT CCAAGTAGTA TGCAACTGGG ACTAATGTAG TGAATATTTA
AAAGAAGCGC AAGTTTCGC
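
The sequence above is printed in blocks of 10 bases, 60 per line, so the spaces and line breaks need to be removed before any analysis. The following is a minimal sketch of how the GC content could be recomputed from such a listing; the helper names are hypothetical, and only the first line of the gene sequence is used as sample input. A single line will not necessarily match the 53% reported for the whole gene.

```python
def clean_sequence(blocked: str) -> str:
    """Remove all whitespace from a blocked sequence listing and upper-case it."""
    return "".join(blocked.split()).upper()


def gc_fraction(blocked: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean_sequence(blocked)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = seq.count("G") + seq.count("C")
    return gc_count / len(seq)


# First line of the gene sequence, copied from the listing above.
first_line = "ATGACGACGA CGACGACGGT ACCGCATGCC ACGGTTCCCG CCGTCCCCGC AGCGCGCCTC"

print(len(clean_sequence(first_line)))   # number of bases on the line
print(f"{gc_fraction(first_line):.1%}")  # GC content of this line only
```

To check the reported figure, the same function would be applied to the full 5539 bp listing rather than to one line.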
 
Protein sequence
MTTTTTVPHA TVPAVPAARL VGQEARMVLT SLRGGPPYVA RGTLAQDLLD LRDRLPQLRT 
TTNTTLATGS PTTPVGETTH THTNTSISVH PNASDTAVDD TNDDRSYDFV RPFLQVVTDP
RAAGPHTLVA LRSLHRMLLN KSLFVLYDHQ HQQLQHPPPP LKYLQQNHSA VLASIVKAVL
TCQFEQTDAG ADEAVEMAVA EVLGQVVALL PSLGLPETVD PRHAAITPET LAEIFHAVFV
TRTSSALANS PALVLRLEDI LLQMTQHVFR PGSPPNEATA TPNRKVWLGR CQAVLEFWTH
PLLHTPLVGG DGLDESTRED QRLYDATRVL CLRAVRTALQ TGWAEASIAT SYDMDDEEDD
DEHYQSLISI IQDDLCLSLL MTGQAIWAYH DAHTNISPGF VSLEVLSEIC ATLTTLWNTL
PLRTVLIAQF ETIWTGFYTR ALVLLRKRHP PTNSLSFNAN LTFDAEVEII LESLVDVLCL
HDHVRSIADG DGGALETMFA YYDCHLRRSD VAVGLMVELC RCCGGAVDPD GETLVLTPST
SFLARPPTCE DSVHSDTSDT ATPPDTAVSS PMVQVDHVWR PVPPHLKELC AQALMGGMKC
LFRDDKASAE TLLERSRRKR SIMSRQLKDS FEEALSETIP NTNVAELSVS PSCTHVLRDV
KTKKRLMRKA ARIFNHKASR GIEFLLDAGL VADPVTPMSV ATFLRNGIVV GLDKKAVGAY
LGEAGKAPIA GKSPLSWERD WFHKDVLQSY CGLFRFEGQS LLDGLRMFLA AFRLPGEAQQ
IDRILQAFSD SCGQVCEESA DGRLQLFSED PKRASDAAYL LSFSIIMLNT DRHNTNIRED
RKMSAADFVK NNTDYGRDIT EKGKEFPSEF LEGIYHSIND EEIRTEGEGA DGAMTVERWK
DVLRGSTEEA EDEFLPSLHD AEDLTELVLE HVWKPIMSSI GAFWGMPRVA DDEPLSPSDP
AQNGMLGVQG ARLGMDMALE MLHGVRKLGR IDIFRKIFSW ICDYTGLIGD YSVDAVERTW
SLTNSVEAQS AVVAAIRTAL DAGEDLNGDG WKRLWSILFE MRDLKLLAYG GPSAKSSLLH
ESDPDILDES ARRDWTIVSP SRRAPVSSVH GKEDLVVWDD YAPSDDEEEP QSVEECDDLS
SEMEGLSPGA EFENLLIRES LGMSRQLDLP VTGLERMDEA RRHLVSPRAR VRGRLTNACN
FKALVSDSRF LNDAGIRVLL QALAELIAGM SRSTRLAEAP PLPPPSGGLE RSSSSDSIAT
PVFLPTSGYL PISPASEAFA EVLICEIALK NRDRLKMLWK DVLQDHYLSS LTSILVNPVE
GASTAVPQPD PGLEKRVTGL LRISICAVQR DELSNEILSA WKYLLPISDE QRASSPLRVL
DKHIGEGLWR TASSVDGLHS LNADGWEGLM SLLKWCAKCG EDLDKRVPCS IVDALKALVE
AGQNRAYPQL SIASLDLLHT LHEKKINSLQ TESFSDENAA LFWSGCWRET VAVMAEAAEL
SSDTNVRQHS LSMLTDLFLE KRKTAIPVAH VAGVLSEICV PLAGRCILRL QMGDDSIENS
DALMIEFELC ISLIFKPLRH HLNTGMSAIS DGNLSSIWKS VLSVLEELLR EDSPSLDSNE
GQPSLPVNLK ATMNQLVNEH LQNAISVLIA AGVLLSEGYS KASEDISFIT WESVGRMGIP
ESAVVEWRQQ ALHES
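
The start of the protein sequence can be checked against the start of the gene sequence by translating the first few codons. The sketch below assumes the standard genetic code (the Translation table field in this record is blank, so this is an assumption) and builds the codon table by hand rather than relying on any library. Because the gene is marked as spliced, translating the entire genomic sequence in one frame is not expected to reproduce the full protein; only the opening stretch of the first exon is compared here.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G (assumed table).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(blocked: str) -> str:
    """Translate a DNA string in frame 1, ignoring whitespace and any trailing partial codon."""
    seq = "".join(blocked.split()).upper()
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First three 10-base blocks of the gene sequence and the first
# ten residues of the protein sequence, both copied from the record.
gene_prefix = "ATGACGACGA CGACGACGGT ACCGCATGCC"
protein_prefix = "MTTTTTVPHA"

print(translate(gene_prefix))
print(translate(gene_prefix) == protein_prefix)
```

A fuller comparison would need the exon coordinates, which this record does not list.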