Gene PHATRDRAFT_41359 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_41359 
Symbol 
ID7199214 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011697 
Strand
Start bp267845 
End bp271091 
Gene Length3247 bp 
Protein Length1009 aa 
Translation table 
GC content49% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002185300 
Protein GI219130288 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCAAAG ACGCTCCTTC CCACACAGTA TCCGCAGCTG AAGCTGCCGT AAATAAGACG 
CATTTCAGGA ACAACACGGC TAAGCACGCT TCGGTGGATG CGGATGCCGT CAGTCACGTT
TCTAACTTGA CTGCGGATGA TGCTAGCACC GTTGTTTCGT TTTCCGATCG GGTCACGTCC
AGTATTGGAT CCGTCTTGGG ACAAGCCAAG CGCATCTTGG AAGGCGAAGC ACCCATTCTA
GAAGATGAGA CTGATGGTGA CGAAGTTGCC GAGAGTCCGT CCAACTTGGT AGTCACGTTG
GAACAGGAAA TCGGCAAGTC GCGGAGTGCC CGTTCCCTTG GTGAGGATAA CAACAAGGGA
GAAAGCGTGT CCACAGCACG CATGCGTGTC GCCAAGGACT TTTTACGGGA CGAACGCTCG
GAGATGACAT TTTCCCGTCG AATCGCTCTG GCCTTGTCTC ACAAGTCCTG GTACAACCCG
CGTGCGAAGG AGGAACCAGA GTTCGCCCCC AATGAGATAG AAACACCGGA AGCCTCCACA
AGGATCGATT CGCAACATTC GATGCCGGTG CTGCCCGTTG GGGGGCCCAA CATTGAGGCC
TATCCCTTTA CCCACAGCCG TAGAGAAAAC CCGAGTCTGG GAAAAGCCTG GGCTTGTACG
TATGGTACAC CTTTCGGTCA TCCAATGACA ATGCCCGAGA CACCTTGACA TATGGCTACT
CACACCTTGC AATTTTTGTC TTTCTGGCAG ATTTTGAGCA TGTTGCCATG CCTCGCTATG
TGGTGGAAGC CAAACTGGAC CAGCGCCGAA AAAACATTCT GCATCGCATC GTTCGGAAAT
TCCAAAAAGC CGACAAACAA CTTCAACGGG CGGAGCCAGG CGAAAAATAT TTGCCAACTA
AACTATATGG ACCCATCTGC ACACCGCACA AACAGCTTGG TGACTGGGGT CTTGGCTTTG
GTCTCTATTT TTCGACGCTC AGGGCAATCA CTGTACTGAC CTTCTGTGCT GGTTTGCTCA
ATATACCCAA CTTAATTTAC TTTTCTTCCG AATCCTATAG CAGCGGCCAA GACGGCGTGA
TTCCGCTGCT GCAGGGGTCC GCGATTTGTA CCGATACGCG GTGGGTGCCG TGTCCAAATT
GCACGTCAGG TGATTTTGAA GCTACTCGAT TCGCTTACGG AACCAACGAT GCTGGTCTCA
ACGTGACCTT TGTGTTGCGG AACACCTGCG AGGGTGCCAC AATAGAGCAA GGCTTTACCA
ATTACGCGTC CCTGATGCTT ATCATGTTGG GCACAGTGTT TTTGAATCGC TATCTGAAAC
GCATGGAGGT TGCCTTTGAT GAAGACGAAC AAACGGCACA AGATTATTCG ATTGTGATCG
GAAATCCACC GGGTGACGCG ACCGATCCCG ACGAATGGCG AATTTTCTTC CACGATTGCT
TCGATGGTGC CAAGGTGACA GCACTGACGG TGGCCGTGGA CAATGACTTG TTAGTCCGAT
CGTTGGTGGA GCGCCGCGAA AAACTGCGAG AAATTGAGAT GATGGTTGAG CCGGGCACTT
CGCTGGATAC GCTCACTTTG GCTGGTATCG CTGCCAAGCA GGAACGGGAA CGTAGCGTGT
GGGGTCGTTG GAAATCAATG ATTATTCCAG GCATTCCGGA ACTGTTCAGC CGCACGGTCG
TCCTGACAGC CAAAGTCCAA GGACTGGCGC AACAAGACTA TCCAGCCACA AATGTATTCG
TGACCTTCGA AACCGAAGCT GATCAGCGCC GCGTGCTAAG TGCCTTGTCG GTTGGTAGCT
GGGACGTTCA GCGCAATCGA CAGAGCGCCA TCGCTGACCC CAAACATTTG TTTCGCAGTG
AGCTCGTCTT GTCGGTACAC GAGCCGGATG AACCCAACAC TGTTCGCTGG CAAGACTTGA
ACGAAAAGTT CAAAGATCGA CTCAAGCAGC AATGCCTTAC CACTCTTTGT ACCTTGACAG
CCATCATTCT AATTGCCTTC GTCATTTTTC TTGTCAACGA GCAGAGCATA ACGTTTTCGG
CGTTTGCGAT TGCCATTTTC AATAGCATCT TTCCTCTTTT TGCCAAACTG CTGACTGGCA
TGGAGGCTCA TTCGTCGGAA GGTGGAAAGC AGAGGTCACT TTACTTTAAA ATTGCGGTCT
TTCGGTGGGT GAACACGGCG GTCGTGATTA CAATCATCAC TCCCTTCACG TCAACCTTGA
CAGACGGTGG CTTGGTGAAT CAGATTTATG CTCTGTTCTT CGCCGAGATT GTTACAACAA
ATGCAATTCA GTTGCTGGAT CCTGTTGGAC ATTTTCAACG CCACTTTTTA GCGCCGCGGG
CAAAGACACA AGATGCTATG AATCTTTGTA TGCAGGGACA GCAGGTTGAG CTTGCTGAAA
GGTAAGAATG CTTGTTTTCT GGTCGTTTAT CTATACTTTT CATCTCGACT AAATTTCCCC
TTGATTTTTG CCTACAGATA CACAAATATG ACCAAGGTTT TATTCTTGGC ACTGTGGTAT
TGTGCCATTT TCCCTGGAGC CTTTTTCTTG TGCTCCTTCG CTCTTCTTAT CAACTACTTC
ACTGATCGGT TCAGTCTTAT GCGAACATGG AAGCGTGCTC CTCAGCTTGG AGGAAAAATT
TCATCTTTCA GCAGACGCTA CTTTTTTTCA TTGTCTATCG TAGCAATGGC GCTCGTATCG
TCCTATTACT GGTCAGCCTT CCCATTCGAC AATGTTTGCT CCACCGAATT ACCGGTAAAC
ACGTCATTTG TTGGTGTTTG GAACATAACA GGGTTCGCCA AAAATGATGA AAAGGAACCC
ACTTTCCAGC TATCTTTAGT GGAGGATGCG GATACTTCCT TCTTTTTCTG TGTACAAGAC
TTTTTTCGCT ACGAGGCCGA GGAACAAGCG TTTCCCTTTA TACCAAAGTT TCAACGCAGC
GGAGAGGAGT GGATGACCAG CGATCAAGAG ACTTTGACGG CTGTCTATGG TTGGACTGTC
GTCGCCGTCG CTGCTCTTGT TCTACTCAAG TTCATCCATG GTTGGTTCAG CAGTATTATG
AAAATGTTCC GGGGGACTTA TAAGCCTTGC GGCGATGACC AGACTATCAA TTTTAGCGAT
GTCCCGTCTA TTTCGGCCTA CGTCCCACAG GTCGTAAGCA ACCTTTTTTC GTATCCCTTG
CTGGCTTGCA ATTTTCAAGG AATTGACGAA GATCTTATGG ATTGGAGTGA CCCGGACCGA
CCTATAG
 
Protein sequence
MTKDAPSHTV SAAEAAVNKT HFRNNTAKHA SVDADAVSHV SNLTADDAST VVSFSDRVTS 
SIGSVLGQAK RILEGEAPIL EDETDGDEVA ESPSNLVVTL EQEIGKSRSA RSLGEDNNKG
ESVSTARMRV AKDFLRDERS EMTFSRRIAL ALSHKSWYNP RAKEEPEFAP NEIETPEAST
RIDSQHSMPV LPVGGPNIEA YPFTHSRREN PSLGKAWAYF EHVAMPRYVV EAKLDQRRKN
ILHRIVRKFQ KADKQLQRAE PGEKYLPTKL YGPICTPHKQ LGDWGLGFGL YFSTLRAITV
LTFCAGLLNI PNLIYFSSES YSSGQDGVIP LLQGSAICTD TRWVPCPNCT SGDFEATRFA
YGTNDAGLNV TFVLRNTCEG ATIEQGFTNY ASLMLIMLGT VFLNRYLKRM EVAFDEDEQT
AQDYSIVIGN PPGDATDPDE WRIFFHDCFD GAKVTALTVA VDNDLLVRSL VERREKLREI
EMMVEPGTSL DTLTLAGIAA KQERERSVWG RWKSMIIPGI PELFSRTVVL TAKVQGLAQQ
DYPATNVFVT FETEADQRRV LSALSVGSWD VQRNRQSAIA DPKHLFRSEL VLSVHEPDEP
NTVRWQDLNE KFKDRLKQQC LTTLCTLTAI ILIAFVIFLV NEQSITFSAF AIAIFNSIFP
LFAKLLTGME AHSSEGGKQR SLYFKIAVFR WVNTAVVITI ITPFTSTLTD GGLVNQIYAL
FFAEIVTTNA IQLLDPVGHF QRHFLAPRAK TQDAMNLCMQ GQQVELAERY TNMTKVLFLA
LWYCAIFPGA FFLCSFALLI NYFTDRFSLM RTWKRAPQLG GKISSFSRRY FFSLSIVAMA
LVSSYYWSAF PFDNVCSTEL PVNTSFVGVW NITGFAKNDE KEPTFQLSLV EDADTSFFFC
VQDFFRYEAE EQAFPFIPKF QRSGEEWMTS DQETLTAVYG WTVVAVAALV LLKFIHGWFS
SIMKMFRGTY KPCGDDQTIN FSDVPSISAY VPQVELTKIL WIGVTRTDL