Gene PHATRDRAFT_46341 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_46341 
Symbol 
ID7201613 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011678 
Strand
Start bp43919 
End bp46897 
Gene Length2979 bp 
Protein Length948 aa 
Translation table 
GC content55% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002180936 
Protein GI219120394 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGACAAGC CCAACCCAAC TCGGAATTAT CCGCGGCAGT TTAAGAATCC CGAGCTGGAT 
CCCAAGACAC ACCAAGCCAG CACTGCTCCT CTACCCTTAT CGAATGCTCC CGTCGTTGCT
ACTACTACCG CTACTACTAA TAACACTATT ACTACTAGTC GAGCCGTGAC GGGGGTACCA
GAATCCACCA GTGTTACTTC TGACGATCAT CGCAACGCGC CCCAACACAA GCTGACGGTG
AATGCAACCA CACAATCGAC GCAAGAATCA GAACTCGTCA CGGTGGAACT CTCGCATCCG
TACGCACCGT CGAATGAAAT CCGTGAGACT GTGGTAGAAG GCGATCGAGA CGCATCGGTA
CGACGCAGAC CATCAACGGA CTCGTCCTTG TCCGAACCGC CGCCGTACAC GAGTTCGGAC
GAACGGGAGA CAGCCGGCAA GACCGGACCG GCTTTGCGTC ACAAAGCATC ACCACGTCGA
CTCGCACAGC GCCGTCGTCG ACCTAGACCA AACGTTACGA CTGTCGTGGA GCCGTCGATC
ATCCAGTTGC GTCCAGCGCA GCGTCACGAC CAATCGACGG CCTGGTCCCC CGTAGACGAT
GACACCGTCA CGGGTAACTG GAGTCTCGAC GACGATTTCG ACCATTTGCA AAATTCGCAA
CGACGAGCTG ACCAACGTGG GTCCATGGAC AATCCCCGCT CCCGCCTCCC GTCAATCGTG
TTGTCGGCTC AGGATTTGGC CCTCTGTCAG CGTCTGGATC AGGACTACGA ACGTGCTTTG
GAATCACTAC AAGTCGGCTA CAGCGCTCGC TACTATTCCG TACGACAATC CGCACTGTGC
AGCGTCATCT TCATGCTCGT ACTCCTGACC CTCGGGACTA TTTTCTTTTT GCGCCAAGCT
CCCTTTTGGA GTCTGGAAGA GGCCCTACTT TTTAGCGTAT ACACCATTAC GACGGTAGGC
TACGGGCACT TGCAGCATCC CGAAACAGCC GCCTTTCAAC TTTACACCGT CGCGTACATT
TTTGTCGGTA TTGCCACCCT GACCATCATG GTGGCGCAAG TCTACCAATG CGTGGCCTTG
GAAGCGGCTC GGGCGCAACA CGCGGCCGCC GACGAAGGCA ACCGTCGCAA CGCACAAATG
CCGGACCGGG ACGGCATTGT CCGTGACCAG CGCCACAACC AGGACCCCCC GAGCCTCCAC
CACCACGAGA CACGGAGCGA GAGTTTCTCG TCCGATATGG TGTGGGAATA TTCCAGCTCG
GTCTGGGATT CCGTGACGGC CATGCTACGG CGAGCCTACC GCTACTTTCG CCAAGACGAA
TTCGGTCGGA GCTTGTCGGT CATTTTCCCC ATGACTGGAC TCGTTCTCAT TGGAGCCGTT
GTCATTGGCG TGTTGGAATC CTGGACCTGG CCGGAAGCCC TCTATTTTGC CGTCGTGTCG
CTCACCACGG TGGGCTTTGG CGACTACTAT CCCACCAACC CAGCCGCCAT TTGGTTCTGT
ACCTTGTGGT TGCCCTTTTC GGTGGGGTTC ATGAGCGTCT TCTTGGCCAA GGTGGCAGCC
TTTTACATTC GACTGTCCGA CACCAACATT TCCCGGATTG AACGAGCCCT ACGACAAGAC
CTGGTACAGA CCAAGCGCCA GGCGGCGCGT GAACGGCAAG CCGCTCTCGC GCGGGCGATG
AGGGGACAAC AATTGCGTGA TATCGAGAAC GACCACGGTC ACAATGGTGA GAGTCACGAT
TTGGCTTTGA AGGAATCAAT TGCGCTAGCG AAAGAAACAG TGTCACAACA TCGACGACGG
AGGCGAGGTT TTGATACCGT CCCTACGCAA TCAGTCGAAG CCGGAAAGGC GGTGGTCTAT
AAGGACAATG ACGGCGAAGA CTGTGAGGAT TCGACCGCCG ATAATAGCGC CGGTTTGTCG
GTCGACTCGG AGTCTCGTCG ACACCTGTTT GGATCTCCGG AAGCTCAAGA GGATCCGGCG
GAGACTCGTC GTGAGCTTGT CCTGCGCAAC AGTTTGGCGT ACAGTACTCA CTCCGAGGGA
GATGAGCTGG TCGACGAGGA TCAGGAGCTG GAGGATGGTC GATCCGAAAC GAGTGACGGC
ACCGTGTCCC GCCCTCGCGG CTCGACTCTG TCTACAATGA AAGACGTCTT ACGTACCGTA
CACGGAAGTG ATAGGGATGT GCGGTACGGT CCCGATTCGG AATTCTTGTC CGTAACGTCG
AAGCAGCCAC TCCACGCACA ACACCACGCG CTGCGCCGTC GATCCCAAAG CCTGCTAAAA
CCGTCCTTTG CCTTGCGCGC GTTGGTACAG GAACGGTTTG CCGAAATAAT TGCGACCGAC
ATTGCCGGTT ATCAGAACGC GATTGAAATA AAAGACAACT CCATGACCGT CACCATCCTT
CGTCTCAAGG CCGTGGCCGA CAAATGGTGT GTCCCCAGAC GCGCCCGTAA GGCATTTCGG
GCCGTCGCCT TTGAAACACT ATATTTCGTG GGCGAGCACG ATTTAATTGT CGAGGGCGCC
GATGCCTTGT TTGCCTTGTC ACCGTTGGAA TTTCACAGCC TGTTTGCGCC GCTCGTGGCC
GCTCTAGGAG ATGCTACGAC CATGGAAACG TGGCTGGAAC AAACGCAAGT CTTGGCGGAT
GTGGACTTGA TCAGTCGCGA TGAGAGAGTG TCACAAAGCA TGCAAGAGCA ACGAAGTCGA
CGACGGCCGA GGCGTTTGGG TCGAAACGAC TGGGGGGATG TTGAAGAAGA TGGAATCATT
AAAGGGGAGG AAAGGCTATA CCGAACTACT GACAAGTCAA CACGGGGTAA TGCCGCGATT
CCAGGAAGCG AGTTACATCT GACGTGACGG TAGCCTGTTC AAAGTCTACG TACAGGTTCT
TTCAATTATG GAATGGTCGT TGTGGCAATT TTTTCTTATT CCGATCCATG GATCAGTTTG
CCAAACCGTC CATAGGAAAA CGAAATACCC ATTCAAAAA
 
Protein sequence
MDKPNPTRNY PRQFKNPELD PKTHQASTAP LPLSNAPVVA TTTATTNNTI TTSRAVTGVP 
ESTSVTSDDH RNAPQHKLTV NATTQSTQES ELVTVELSHP YAPSNEIRET VVEGDRDASV
RRRPSTDSSL SEPPPYTSSD ERETAGKTGP ALRHKASPRR LAQRRRRPRP NVTTVVEPSI
IQLRPAQRHD QSTAWSPVDD DTVTGNWSLD DDFDHLQNSQ RRADQRGSMD NPRSRLPSIV
LSAQDLALCQ RLDQDYERAL ESLQVGYSAR YYSVRQSALC SVIFMLVLLT LGTIFFLRQA
PFWSLEEALL FSVYTITTVG YGHLQHPETA AFQLYTVAYI FVGIATLTIM VAQVYQCVAL
EAARAQHAAA DEGNRRNAQM PDRDGIVRDQ RHNQDPPSLH HHETRSESFS SDMVWEYSSS
VWDSVTAMLR RAYRYFRQDE FGRSLSVIFP MTGLVLIGAV VIGVLESWTW PEALYFAVVS
LTTVGFGDYY PTNPAAIWFC TLWLPFSVGF MSVFLAKVAA FYIRLSDTNI SRIERALRQD
LVQTKRQAAR ERQAALARAM RGQQLRDIEN DHGHNGESHD LALKESIALA KETVSQHRRR
RRGFDTVPTQ SVEAGKAVVY KDNDGEDCED STADNSAGLS VDSESRRHLF GSPEAQEDPA
ETRRELVLRN SLAYSTHSEG DELVDEDQEL EDGRSETSDG TVSRPRGSTL STMKDVLRTV
HGSDRDVRYG PDSEFLSVTS KQPLHAQHHA LRRRSQSLLK PSFALRALVQ ERFAEIIATD
IAGYQNAIEI KDNSMTVTIL RLKAVADKWC VPRRARKAFR AVAFETLYFV GEHDLIVEGA
DALFALSPLE FHSLFAPLVA ALGDATTMET WLEQTQVLAD VDLISRDERV SQSMQEQRSR
RRPRRLGRND WGDVEEDGII KGEERLYRTT DKSTRGNAAI PGSELHLT