Gene PHATRDRAFT_42778 details

Gene Information

Locus tag: PHATRDRAFT_42778
Symbol:
ID: 7196400
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011669
Strand:
Start bp: 1087639
End bp: 1090192
Gene Length: 2554 bp
Protein Length: 675 aa
Translation table:
GC content: 48%
IMG OID:
Product: predicted protein
Protein accession: XP_002176720
Protein GI: 219109935
COG category:
COG ID:
TIGRFAM ID:
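
The reported gene length is consistent with the start and end coordinates if they are read as 1-based and inclusive. The record does not state the coordinate convention, so that is an assumption here, and `span_length` is a hypothetical helper name used only for illustration. A minimal sketch of the arithmetic:

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based, inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


# Coordinates taken from the Start bp / End bp fields of this record.
gene_length = span_length(1087639, 1090192)
print(gene_length)  # 2554
```

Under a 0-based, half-open convention the same span would be computed as `end - start` with no `+ 1`, so the convention should be confirmed before reusing these coordinates elsewhere.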


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00214847
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
CATCAATCGT CGCAAAACGA AGTCATCATC CATGGCTACA CTGCTACCTA TAGAGATACA 
GCAAGACTTC TATACTCCTC CCTACTAAGC GATGGAGACA AGAGCAGCAA ATCATGTTGC
TGCGGAGGAG CACAGGTGCG ACTTGGCAAA AAGTTCGCGA ATGAAAAAGG CAAGATCTGT
CGATTTTGGA CGATTTCTTC GGTCCAGTCT TCGGAATCCG GCTGATAGCT CCACCGTTGT
AGCATCGGCC TTCAGCCAAA GAGCTGCAAG GGACAACTTT TCTAACTGGT AAGTCATTGC
TGTAGAATCA GATATGGCTT TTGATCGCCC TGCCGTAAAG GACGAACTTG CGATCCGCTC
TATTTCCGTG GTTGATGGTC TTGGTGTCAC TGTCAGTGTG GAATAGTCGA CTGAGTCAAA
ATGGTGTTTC ACCATTTTTT TTCAAACCGG CCGGGCGGTG ACGTGTTTTT CGTTACAATC
GCCAGATCAA GCTCGTGCTC TGCAGAGACT AGCTTATGTT TCGACGGTGT CCGCCCAGTC
AAGATAAAGG TTCTCTTTGT CTCTGGTCAA CTTCGTAAAT TTTGTGTCCC GGAGGCGTAC
ACACAGGAAC AGCGATACCA CGCGCTTTTA CCGCGCCCTG ACAGTGAACA CATATTGCAT
TTCTAACTCG TTTTCTTTGT AACTCTTACT TCAATGCGGT TAAACCACAG TACCTCTGTT
TTCGCGCCAT CCCTGTCAGA ACAGTGCTCC GCAACAGCTA CTACAGACAA TTTGTTACTA
CGAAAAGGCT GGCCGAACGG AACCGCCATT TCCGCCGCTT CTTCATTTCC CCCGTCGGCG
TCCAGCTTCC CCATCGGAAC GTGGACTCTT GGCAAGCTTT CTGTCTTAGT TTTAGCGCTA
GCATCTTTGA CAGATTTAGC ACTTTCTCAC ATATACCGAC TAGAATCACC AGCAGACTTC
TATGACGCTC TGTCCGCCGA TGTTATTCGA CCGCCCTTTC GGTATAAGTC GAGTACATCC
ATTGCACGCA CACGGTCTAG CCTTGGTTCT GCTGCGATTA GTTCTGTCAC AGAATTTCTT
TCCCCAATCC TTCCCTTTAC TGGCGGGTTG GATCTGAGAA GGGAAGATTA TTGGACAACT
TCAGGAAGCT GGCTAGAAAC ATTGGAGTCC TTGACGAGAC AAATCCAAGA GGCGCTGTTT
TCGTCGGACG ATGACCGGAC AAGTCTACTC AGCAGTATAT CGCTTATTCG TAGCGGAAGT
CTCTCCAACA AGATGTCGCC GCGGGTCCAC ACTAAGGGAA GAAATACGGG CACTTTTACG
AAGCATGTGT CATCTATTTC AGCTCAGAAA CCGTTCTTCA GCGAAGATGA GATCGCTGAG
CTTTCTCTCG GCGAGGTAGC GCAGGCCTTC CGATATGCGT CAGAAAGCTC GTCCACAGAT
TTCAACGAGG ATAAGTTTTT GAACAGCTTG ACCACCCGAG TGCGCAGAAT GATTCTCTCC
ATCAGGGAAG CTGTCTCCGA GTCGCGCGGC ATCGATGTCG AAGATGCCTG TGTTTTGACG
AAGAATCATC GAGTGAATGG AAGCGTCGAT GCGCTGAAAT TTTCGGCCGC TATGAGAATA
TTTGCGGAAT GGCGTATCCT CCGGCAGGTT CCAGAAGGAT ACAAGGGATA TGCTGTTGGA
ATGAATCTTG GTCATAAGGA TGTCGTCCAA AACGTTGCAA AGATTGAACA GGCAGTTCAT
TCTTGGCTAG ACAATCAACG AGACTTGCGT TCGCTATCAG AAATCGAATC GCAAACCGGC
TGCCCTATAA TTGAATTGAG ATCCTCGAAT CTTTGCTCGC CAACGATTCG AGAGCTAATG
CAAGACGAAG TTGACATGGA CATTCATCCA ACAAATCGTC TACCTCGCCT GAAAGAAAAA
ACCGCGGCGA TGGGCATTCT TTGGGTGAGA CGACAGCTAC ATTACCAAAC AGGTGTTTTT
GGCAATCTGT TAGTCGTACC GGAAAGCTTT CCGACAACAG AGAGAGCAGT GGCGTCGGCC
TATAAGGAGG TTTACGACAA GTATCATGGT TGGGCTGTAC AAAAGATTTT CAGCTACTCT
TTTCAATCAG CACCAAAGGC AGAAGAAATC TACCAGCATA TGAACCCAGA ACGGCTGAAA
GAAGTCAAGG CTGCCGCTGA TGAATTGGTG TTGCATTTTG ACTCTGAATC ACGATGTTCA
GCCAAAGGAA TGAACATTTC GCCTAAGGGC AATCTGATTG ATGTCATTTT ACTGAACGCT
AGTAGGGAAT TCGAGAATCT TGTAGAGGCT TTTCTACAGC TTGTCAATTC TGGCATGGCA
TCACCAGGTT CTGACGTTCG AGGTGGCGGC TGCAACATTC ACAGCTCGGA TGAAAATGAC
AGAGAATCGT TCATTGCCAA GGAAATGATC AAGGATGCAC ACAAGCACAT TGAGTTTTAC
CTAGAGGTTG TCCGTCCACT CTTGGATGAC CTTGCCCTAG TTTTTGATGA GCTCAATATG
GACGATCCCA CCAAGGTTTA AGTCCACTTA ATGT
 
Protein sequence
METRAANHVA AEEHRCDLAK SSRMKKARSV DFGRFLRSSL RNPADSSTVV ASAFSQRAAR 
DNFSNCTSVF APSLSEQCSA TATTDNLLLR KGWPNGTAIS AASSFPPSAS SFPIGTWTLG
KLSVLVLALA SLTDLALSHI YRLESPADFY DALSADVIRP PFRYKSSTSI ARTRSSLGSA
AISSVTEFLS PILPFTGGLD LRREDYWTTS GSWLETLESL TRQIQEALFS SDDDRTSLLS
SISLIRSGSL SNKMSPRVHT KGRNTGTFTK HVSSISAQKP FFSEDEIAEL SLGEVAQAFR
YASESSSTDF NEDKFLNSLT TRVRRMILSI REAVSESRGI DVEDACVLTK NHRVNGSVDA
LKFSAAMRIF AEWRILRQVP EGYKGYAVGM NLGHKDVVQN VAKIEQAVHS WLDNQRDLRS
LSEIESQTGC PIIELRSSNL CSPTIRELMQ DEVDMDIHPT NRLPRLKEKT AAMGILWVRR
QLHYQTGVFG NLLVVPESFP TTERAVASAY KEVYDKYHGW AVQKIFSYSF QSAPKAEEIY
QHMNPERLKE VKAAADELVL HFDSESRCSA KGMNISPKGN LIDVILLNAS REFENLVEAF
LQLVNSGMAS PGSDVRGGGC NIHSSDENDR ESFIAKEMIK DAHKHIEFYL EVVRPLLDDL
ALVFDELNMD DPTKV
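
The sequences above are printed in space-separated blocks of 10 characters, so the spaces and line breaks need to be stripped before computing anything such as length or GC content. The sketch below shows one way to do that. The helper names `clean_sequence` and `gc_percent` are hypothetical, and the way the database computed its own GC content figure is not documented in this record.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(blocked.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string (0.0 for empty input)."""
    if not dna:
        return 0.0
    gc = sum(1 for base in dna if base in "GC")
    return 100.0 * gc / len(dna)


# First three blocks of the gene sequence, used here only as a short example.
example = "CATCAATCGT CGCAAAACGA\nAGTCATCATC"
seq = clean_sequence(example)
print(len(seq), round(gc_percent(seq), 1))
```

Running the same two functions over the full gene sequence gives a value to compare with the GC content field, and `len(clean_sequence(...))` can be checked against the Gene Length and Protein Length fields.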